Active filters: cuda
Text Generation
• 8B • Updated • 15.4k
• 732
ussoewwin/Flash-Attention-2_for_Windows
Updated • 116
Multilingual-Multimodal-NLP/IndustrialCoder
Text Generation
• 32B • Updated • 195
• 67
Text Generation
• 4B • Updated • 9.13k
• 51
prism-ml/bonsai-image-ternary-4B-gemlite-2bit
Text-to-Image
• Updated • 2.7k
• 122
thad0ctor/torch2.12-cu133-cp312-wheels
Text-to-Speech
• Updated • 279
• 6
Sumitc13/flash-attn-windows-wheels
prism-ml/bonsai-image-binary-4B-gemlite-1bit
Text-to-Image
• Updated • 221
• 42
koreallmdev/qwen2-5-14b-korean-coding-assistant-lora
Text Generation
• Updated • 1
• 1
koreallmdev/qwen2-5-14b-korean-coding-assistant-gguf
15B • Updated • 1
Text Generation
• Updated • 30
• 23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
• Updated • 15
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3
Text Generation
• 8B • Updated • 17
• 3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF
8B • Updated • 682
• 2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF
8B • Updated • 222
• 2
ValiantLabs/Qwen3-1.7B-ShiningValiant3
Text Generation
• 2B • Updated • 22
• 5
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF
2B • Updated • 74
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF
2B • Updated • 233
ValiantLabs/Qwen3-4B-ShiningValiant3
Text Generation
• 4B • Updated • 54
• 7
sequelbox/Qwen3-8B-PlumEsper
Text Generation
• 8B • Updated • 2
sequelbox/Qwen3-4B-PlumEsper
Text Generation
• 4B • Updated • 6
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-GGUF
3B • Updated • 168
• 1
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-GGUF
2B • Updated • 107
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-GGUF
2B • Updated • 63
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-i1-GGUF
2B • Updated • 142
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-i1-GGUF
2B • Updated • 69
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-i1-GGUF
3B • Updated • 114
• 1