Settings

Theme

PostgresML Adds GPTQ and GGML Quantized LLM Support for HuggingFace Transformers

postgresml.org

4 points by montanalow 3 years ago · 1 comment

Reader

montanalowOP 3 years ago

Quantization allows PostgresML to fit larger models in less RAM. These algorithms perform inference significantly faster on NVIDIA, Apple and Intel hardware. Half-precision floating point and quantized optimizations are now available for your favorite LLMs downloaded from Huggingface.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection