TinyChat: Large Language Model on the Edge

hanlab.mit.edu

2 points by enduku 2 years ago · 1 comment

enduku (OP) 2 years ago

TinyChat is an efficient, lightweight, Python-native serving framework for 4-bit LLMs quantized with AWQ. It delivers a 2.3x generation speedup on an RTX 4090.

Code: https://github.com/mit-han-lab/llm-awq/tree/main/tinychat
