🚀 Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference

Paper: Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference • 2604.07394

Models:
QQTang1223/full_streaming_Llama-3.1-8B-Instruct • Text Generation • 8B
QQTang1223/full_xattn_Qwen3-8B • Text Generation • 8B
QQTang1223/full_xattn_Qwen3-4B • Text Generation • 4B