Active filters: redhat
BCCard/Qwen3-32B-FP8-Dynamic
Text Generation
• 33B • Updated • 4
• 1
BCCard/Qwen3-30B-A3B-FP8-Dynamic
Text Generation
• 31B • Updated • 31k
Text Generation
• 15B • Updated • 92
• 1
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8
Image-Text-to-Text
• 402B • Updated • 181
• 2
RedHatTraining/AI296-m3diterraneo-hotels
8B • Updated • 60
• 1
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 104B • Updated • 832
• 13
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16
Image-Text-to-Text
• 59B • Updated • 367
• 1
Image-Text-to-Text
• 109B • Updated • 3
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 605
• 12
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 293
RedHatAI/SmolLM3-3B-quantized.w4a16
0.9B • Updated • 24
• 1
Text-to-Image
• Updated • 5
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
• 24B • Updated • 38
• 4
RedHatAI/Devstral-Small-2507-quantized.w8a8
Text Generation
• 24B • Updated • 67
• 1
RedHatAI/Devstral-Small-2507-quantized.w4a16
Text Generation
• 4B • Updated • 24
• 2
RedHatAI/Qwen3-14B-speculator.eagle3
Text Generation
• 1B • Updated • 5.76k
RedHatAI/Qwen3-32B-speculator.eagle3
Text Generation
• 2B • Updated • 920
• 8
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
• 2B • Updated • 3.32k
• 1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
• 1.0B • Updated • 24.1k
• 2
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
• 1B • Updated • 74.1k
• 28
RedHatAI/gpt-oss-20b-speculator.eagle3
Text Generation
• 0.9B • Updated • 20k
• 8
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
• 1B • Updated • 824
ChibuUkachi/Qwen3-4B-Instruct-2507.w4a16
Text Generation
• 1B • Updated • 5
RedHatAI/Qwen3-4B-Thinking-2507-quantized.w4a16
Text Generation
• 4B • Updated • 254
RedHatAI/Qwen3-4B-Instruct-2507-quantized.w4a16
Text Generation
• 4B • Updated • 147
RedHatAI/Qwen3-30B-A3B-Thinking-2507-quantized.w4a16
Text Generation
• 5B • Updated • 106
RedHatAI/Qwen3-30B-A3B-Instruct-2507-quantized.w4a16
Text Generation
• 5B • Updated • 1.52k
• 1
RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16
Text Generation
• 12B • Updated • 314
• 3
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
• 0.5B • Updated • 829
• 2
RedHatAI/Qwen3-Next-80B-A3B-Thinking-quantized.w4a16
Text Generation
• Updated • 36