-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 34 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11
Collections
Discover the best community collections!
Collections including paper arxiv:2601.16206
-
SWE-Universe: Scale Real-World Verifiable Environments to Millions
Paper • 2602.02361 • Published • 60 -
LongCodeZip: Compress Long Context for Code Language Models
Paper • 2510.00446 • Published • 108 -
Code2World: A GUI World Model via Renderable Code Generation
Paper • 2602.09856 • Published • 202 -
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
Paper • 2601.11868 • Published • 35
-
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
Paper • 2601.15876 • Published • 92 -
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
Learning to Discover at Test Time
Paper • 2601.16175 • Published • 44
-
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding
Paper • 2601.14724 • Published • 75 -
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
Paper • 2601.21590 • Published • 14
-
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
Paper • 2601.08955 • Published • 13 -
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines
Paper • 2601.09465 • Published • 42 -
MAXS: Meta-Adaptive Exploration with LLM Agents
Paper • 2601.09259 • Published • 96 -
Toward Efficient Agents: Memory, Tool learning, and Planning
Paper • 2601.14192 • Published • 57
-
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Guidelines to Prompt Large Language Models for Code Generation: An Empirical Characterization
Paper • 2601.13118 • Published • 1 -
SWE-Universe: Scale Real-World Verifiable Environments to Millions
Paper • 2602.02361 • Published • 60 -
Think Anywhere in Code Generation
Paper • 2603.29957 • Published • 25
-
The Smol Training Playbook
📚3.11kThe secrets to building world-class LLMs
-
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
Paper • 2601.15876 • Published • 92 -
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
Paper • 2510.08697 • Published • 39
-
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
Paper • 2601.10527 • Published • 26 -
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
Paper • 2601.10657 • Published • 20 -
TranslateGemma Technical Report
Paper • 2601.09012 • Published • 20 -
Recursive Language Models
Paper • 2512.24601 • Published • 94
-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 34 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11
-
SWE-Universe: Scale Real-World Verifiable Environments to Millions
Paper • 2602.02361 • Published • 60 -
LongCodeZip: Compress Long Context for Code Language Models
Paper • 2510.00446 • Published • 108 -
Code2World: A GUI World Model via Renderable Code Generation
Paper • 2602.09856 • Published • 202 -
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
Paper • 2601.11868 • Published • 35
-
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
Paper • 2601.15876 • Published • 92 -
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
Learning to Discover at Test Time
Paper • 2601.16175 • Published • 44
-
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Guidelines to Prompt Large Language Models for Code Generation: An Empirical Characterization
Paper • 2601.13118 • Published • 1 -
SWE-Universe: Scale Real-World Verifiable Environments to Millions
Paper • 2602.02361 • Published • 60 -
Think Anywhere in Code Generation
Paper • 2603.29957 • Published • 25
-
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding
Paper • 2601.14724 • Published • 75 -
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
Paper • 2601.21590 • Published • 14
-
The Smol Training Playbook
📚3.11kThe secrets to building world-class LLMs
-
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
Paper • 2601.15876 • Published • 92 -
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
Paper • 2510.08697 • Published • 39
-
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
Paper • 2601.08955 • Published • 13 -
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines
Paper • 2601.09465 • Published • 42 -
MAXS: Meta-Adaptive Exploration with LLM Agents
Paper • 2601.09259 • Published • 96 -
Toward Efficient Agents: Memory, Tool learning, and Planning
Paper • 2601.14192 • Published • 57
-
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
Paper • 2601.10527 • Published • 26 -
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
Paper • 2601.10657 • Published • 20 -
TranslateGemma Technical Report
Paper • 2601.09012 • Published • 20 -
Recursive Language Models
Paper • 2512.24601 • Published • 94