baidu/Qianfan-OCR
Image-Text-to-Text • 5B • Updated • 41.7k • 1.13k
Qianfan-VL model series. The models are primarily domain-enhanced vision-language models targeting enterprise-level multimodal understanding scenarios.
Domain-Enhanced Universal Vision-Language Models