A collection of models and demos linked to papers presented at CVPR 2025.
Piotr Skalski PRO
SkalskiP
AI & ML interests
Computer Vision | Multimodality
Recent Activity
liked a model 3 days ago
tiiuae/Falcon-Perception liked a model 23 days ago
PaddlePaddle/PaddleOCR-VL-1.5 liked a model 29 days ago
allenai/MolmoPoint-GUI-8BOrganizations
Zero-Shot Detection and Segmentation
Demos of projects focused on zero-shot detection and segmentation.
- Runtime errorAgentsFeatured121
SAM And MetaCLIP
π121 - Runtime errorAgentsFeatured396
Grounded Segment Anything
π396 - Runtime errorAgentsFeatured424
Kosmos 2
π»424Describe and highlight entities in images
- Runtime errorAgentsFeatured491
YOLO World
π₯491Detect objects in images or videos
LMMs - Large Multimodal Models
Demos of LMM projects.
- Runtime errorAgentsFeatured428
LLaVA
π₯428Chat with an AI assistant using text and images
- Running on CPU UpgradeAgents166
CogVLM
π166Answer questions about uploaded images using natural language
- Runtime errorAgentsFeatured886
MiniGPT-4
π886 - Runtime errorAgentsFeatured308
Fuyu Multimodal
π308
CVPR 2025
A collection of models and demos linked to papers presented at CVPR 2025.
Zero-Shot Detection and Segmentation
Demos of projects focused on zero-shot detection and segmentation.
- Runtime errorAgentsFeatured121
SAM And MetaCLIP
π121 - Runtime errorAgentsFeatured396
Grounded Segment Anything
π396 - Runtime errorAgentsFeatured424
Kosmos 2
π»424Describe and highlight entities in images
- Runtime errorAgentsFeatured491
YOLO World
π₯491Detect objects in images or videos
OpenAI Vision API
Demos of projects using the OpenAI Vision API.
LMMs - Large Multimodal Models
Demos of LMM projects.
- Runtime errorAgentsFeatured428
LLaVA
π₯428Chat with an AI assistant using text and images
- Running on CPU UpgradeAgents166
CogVLM
π166Answer questions about uploaded images using natural language
- Runtime errorAgentsFeatured886
MiniGPT-4
π886 - Runtime errorAgentsFeatured308
Fuyu Multimodal
π308