LMMs-Lab

community
https://www.lmms-lab.com/
lmmslab
EvolvingLMMs-Lab

AI & ML interests

Feeling and building multimodal intelligence.

Recent Activity

THUdyh authored a paper 3 days ago
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding
Jingkang authored a paper 4 days ago
Sparse Mixture-of-Experts are Domain Generalizable Learners
Jingkang authored a paper 4 days ago
Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Members: Bo Li, Pu Fanyi, Zhang Peiyuan, Zhang Yuanhan, Chunyuan Li, Haotian Liu, kcz, Kairui, Nguyen Quang Trung, Pham Ba Cong, Jinming Wu, Yingluo Li, Devin Thang, Jingkang Yang, Zihao Deng, Yezhen Wang, Xinyu Huang, Xiyao Wang, Gao Yiming, Jinghao Guo, Do Duc Anh, yiyexy, wkzhang, xiangan, Haiwen Diao, JiankangDeng, Zhongang Cai, yl-1993, wangyubo, YANG Zhitao, Zuhao Yang, Yuwei Niu, Yuhao Dong

lmms-lab's Papers (4)
Submitted by Yujiao Shen
69

A Simple Baseline for Streaming Video Understanding

lmms-lab LMMs-Lab
83 6
Submitted by yiyexy
52

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

lmms-lab LMMs-Lab
323 4
Submitted by Zuhao Yang
189

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

lmms-lab LMMs-Lab
217 7
Submitted by kcz
96

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

lmms-lab LMMs-Lab
156 3