Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CodeGoat24 's Collections
UnifiedReward 2.0 Qwen3.5 Models
UnifiedReward Flex
Pref-GRPO & UniGenBench
UnifiedReward Edit Models
UnifiedReward 2.0 Qwen3VL Models
UnifiedReward 2.0 Qwen2.5VL Models
UnifiedReward 1.0 Qwen2.5VL Models
UnifiedReward 1.0 Qwen2.5 Models GGUF
UnifiedReward 1.0 LLaVA Model
UnifiedReward Training Data

UnifiedReward 1.0 Qwen2.5 Models GGUF

updated Feb 7
Upvote
2

  • Unified Reward Model for Multimodal Understanding and Generation

    Paper • 2503.05236 • Published Mar 7, 2025 • 124

  • Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

    Paper • 2505.03318 • Published May 6, 2025 • 94

  • mradermacher/UnifiedReward-qwen-32b-i1-GGUF

    33B • Updated Jul 10, 2025 • 195 • 1

  • mradermacher/UnifiedReward-Think-qwen-7b-i1-GGUF

    8B • Updated Jul 10, 2025 • 273

  • mradermacher/UnifiedReward-Think-qwen-7b-GGUF

    8B • Updated Jul 31, 2025 • 810

  • mradermacher/UnifiedReward-qwen-7b-i1-GGUF

    8B • Updated Jul 10, 2025 • 259 • 1

  • mradermacher/UnifiedReward-qwen-7b-GGUF

    8B • Updated Jul 31, 2025 • 17 • 1

  • mradermacher/UnifiedReward-qwen-3b-GGUF

    3B • Updated Jul 31, 2025 • 62

  • mradermacher/UnifiedReward-qwen-32b-GGUF

    33B • Updated Jul 31, 2025 • 287
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs