何子豪's picture

3 6

何子豪

jacobmitchell

AI & ML interests

Research on LLM agents and evaluation. Mostly focused on experiments.

Recent Activity

upvoted a paper about 11 hours ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

liked a model 3 days ago

tencent/HY-Embodied-0.5

liked a dataset 4 days ago

Evan7017/self-align-curated-data

View all activity

Organizations

None yet

upvoted a paper about 11 hours ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published 9 days ago • 105

upvoted a paper 6 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 14 days ago • 357

upvoted a paper 15 days ago

CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions

Paper • 2603.26174 • Published 21 days ago • 5