LoopRPT: Reinforcement Pre-Training for Looped Language Models Paper • 2603.19714 • Published 27 days ago • 15
LoopRPT: Reinforcement Pre-Training for Looped Language Models Paper • 2603.19714 • Published 27 days ago • 15
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 119
allenai/mid-training-OpenMathReasoning-rewrite-teacher-student-lecture-filtered Viewer • Updated Jul 6, 2025 • 291k • 22 • 3