Models that run well on RTX 5090
leon-se/gemma-3-27b-it-FP8-Dynamic
27B • 29.7k downloads • 20 likes
Note Runs well on vLLM, but will not fit in a standard sglang setup. Very good model for its size.
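A rough way to see why a 27B model is a tight fit on this card is to estimate the weight memory alone. The sketch below is back-of-the-envelope arithmetic, not a measurement: it assumes 1 byte per parameter for FP8 and 2 bytes for FP16/BF16, treats the RTX 5090's 32 GB of VRAM as 32 GiB for simplicity, and ignores KV cache, activations, and runtime overhead, all of which differ between serving engines. The helper name `weight_gib` is hypothetical.

```python
def weight_gib(params_billion: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights only, in GiB."""
    return params_billion * 1e9 * bytes_per_param / 2**30


# Assumption: 32 GB card, treated as 32 GiB to keep the arithmetic simple.
VRAM_GIB = 32.0

for label, bytes_per_param in [("FP16/BF16", 2.0), ("FP8", 1.0)]:
    need = weight_gib(27, bytes_per_param)
    headroom = VRAM_GIB - need
    # Negative headroom means the weights alone exceed the card.
    print(f"{label}: weights ~{need:.1f} GiB, headroom ~{headroom:.1f} GiB")
```

Whatever headroom remains after the weights has to cover the KV cache and engine overhead, so small differences in default memory settings between engines can decide whether the model loads at all.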
Text Generation • 15B • 3 downloads • 1 like
Note Very fast with sglang; can be sped up further by using easiest-ai-shawn/Phi-4-EAGLE3-sharegpt-unfiltered as the draft model.
Text Generation • 5B • 96.2k downloads • 35 likes
Note Very good at tool calling and instruction following, but prone to unexpected hallucinations.
Image-Text-to-Text • 6B • 25.9k downloads • 37 likes
Note Good balance between size and ease of running.
Image-Text-to-Text • 12B • 1.35M downloads • 643 likes
Note On the smaller side; I recommend larger models.