Models that run well on RTX 5090
leon-se/gemma-3-27b-it-FP8-Dynamic
27B • Updated • 6.08k • 20
Note: Runs well on vLLM, but will not fit with a standard sglang setup. Very good model for its size.
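Why a 27B FP8 model is a tight fit on this card can be sketched with rough arithmetic. The snippet below is only a back-of-the-envelope estimate: it assumes a 32 GB RTX 5090, roughly 27 billion parameters, and exactly one byte per parameter for FP8. It ignores the KV cache, activations, and any layers a dynamic FP8 checkpoint may keep in higher precision, so the real headroom is smaller than the figure it prints.

```python
# Rough weight-memory estimate; ignores KV cache, activations, and runtime overhead.
GIB = 1024 ** 3


def weight_gib(params_billion: float, bytes_per_param: float) -> float:
    """Approximate size of the model weights in GiB."""
    return params_billion * 1e9 * bytes_per_param / GIB


# Assumption: RTX 5090 with 32 GB of VRAM (treated as 32 GiB for simplicity).
VRAM_GIB = 32

fp8 = weight_gib(27, 1)   # FP8: 1 byte per parameter
bf16 = weight_gib(27, 2)  # BF16: 2 bytes per parameter

print(f"FP8 weights:  {fp8:.1f} GiB, headroom {VRAM_GIB - fp8:.1f} GiB")
print(f"BF16 weights: {bf16:.1f} GiB, fits in VRAM: {bf16 < VRAM_GIB}")
```

Whatever headroom remains after the weights has to cover the KV cache and runtime buffers, which is one plausible reason a serving stack with different default memory settings might fail to load the same model.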
Text Generation • 15B • Updated • 1 • 1
Note: Very fast with sglang; can be sped up even more with the draft model easiest-ai-shawn/Phi-4-EAGLE3-sharegpt-unfiltered.
Text Generation • 5B • Updated • 137k • 45
Note: Very good at tool calling and instruction following, but prone to unexpected hallucinations.
Image-Text-to-Text • 6B • Updated • 23.9k • 39
Note: Good balance of size and ease of running.
Image-Text-to-Text • Updated • 2.15M • 687
Note: On the smaller side; I recommend larger models.