view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 12 days ago • 839
DFlash Collection Block Diffusion for Flash Speculative Decoding • 13 items • Updated 9 days ago • 55
Can LLMs Learn to Reason Robustly under Noisy Supervision? Paper • 2604.03993 • Published 9 days ago • 42
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published 14 days ago • 12
Meta-Harness: End-to-End Optimization of Model Harnesses Paper • 2603.28052 • Published 15 days ago • 17
AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents Paper • 2604.02947 • Published 11 days ago • 19
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory Paper • 2604.01007 • Published 12 days ago • 31
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published 10 days ago • 33
HippoCamp: Benchmarking Contextual Agents on Personal Computers Paper • 2604.01221 • Published 13 days ago • 28
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 13 days ago • 31
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 13 days ago • 36
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Paper • 2604.01658 • Published 12 days ago • 54
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 12 days ago • 465