ithinkimrishi's Collections: To read
HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video
Paper
• 2510.05560
• Published • 8
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Paper
• 2510.06217
• Published • 67
Less is More: Recursive Reasoning with Tiny Networks
Paper
• 2510.04871
• Published • 513
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper
• 2509.26328
• Published • 58
CoDA: Coding LM via Diffusion Adaptation
Paper
• 2510.03270
• Published • 43
MemMamba: Rethinking Memory Patterns in State Space Model
Paper
• 2510.03279
• Published • 74
33B • Updated • 5.74k
• 265
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
Paper
• 2510.08673
• Published • 127
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
Paper
• 2510.05684
• Published • 146
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
Paper
• 2510.20822
• Published • 41
DeepAgent: A General Reasoning Agent with Scalable Toolsets
Paper
• 2510.21618
• Published • 103
Video-As-Prompt: Unified Semantic Control for Video Generation
Paper
• 2510.20888
• Published • 50
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation
Paper
• 2510.21583
• Published • 31
WorldGrow: Generating Infinite 3D World
Paper
• 2510.21682
• Published • 42
Paper
• 2510.18212
• Published • 36
Visual Diffusion Models are Geometric Solvers
Paper
• 2510.21697
• Published • 20
RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging
Paper
• 2510.20479
• Published • 12