SigLino: Vision Foundation Models (SigLIP2 + DINOv3)
Collection
Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. ⢠6 items ⢠Updated ⢠16
Fine Tuning MetaCLIP 2 for Image Classification on Downstream Tasks demonstrates the step by step finetuning using CIFAR10 and is also flexible for adapting to other datasets. For more details, check out the linked blog below. š¤āļø