Knowledge Engineer Group @ Tsinghua University

university

https://keg.cs.tsinghua.edu.cn/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

NeoZ123 updated a collection 24 days ago

NeoZ123 updated a model 24 days ago

THU-KEG/DeepDive-30B-A3B-C-GRPO

NeoZ123 updated a model 24 days ago

THU-KEG/DeepDive-4B-C-GRPO

View all activity

Papers

WildReward: Learning Reward Models from In-the-Wild Human Interactions

DeepPrune: Parallel Scaling without Inter-trace Redundancy

View all Papers

updated a collection 24 days ago

CaRR & C-GRPO

Data and models for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards". • 6 items • Updated 24 days ago • 1

updated 4 models 24 days ago

THU-KEG/DeepDive-30B-A3B-C-GRPO

31B • Updated 24 days ago • 15

THU-KEG/DeepDive-4B-C-GRPO

4B • Updated 24 days ago • 12

THU-KEG/DeepDive-30B-A3B-SFT

31B • Updated 24 days ago • 13

THU-KEG/DeepDive-4B-SFT

4B • Updated 24 days ago • 40

updated a dataset 24 days ago

THU-KEG/CaRR-DeepDive

Preview • Updated 24 days ago • 341 • 1

authored a paper about 1 month ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 53

submitted a paper to Daily Papers about 1 month ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 53

updated a collection about 1 month ago

CaRR & C-GRPO

Data and models for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards". • 6 items • Updated 24 days ago • 1

published 4 models about 1 month ago

THU-KEG/DeepDive-30B-A3B-C-GRPO

31B • Updated 24 days ago • 15

THU-KEG/DeepDive-30B-A3B-SFT

31B • Updated 24 days ago • 13

THU-KEG/DeepDive-4B-C-GRPO

4B • Updated 24 days ago • 12

THU-KEG/DeepDive-4B-SFT

4B • Updated 24 days ago • 40

submitted a paper to Daily Papers about 1 month ago

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published Mar 6 • 93

updated a collection about 2 months ago

WildReward

Learning Reward Models from In-the-Wild Interactions • 4 items • Updated Mar 2 • 2

updated a model about 2 months ago

THU-KEG/WildReward-8B

Text Classification • 8B • Updated Feb 26 • 18 • 3