Settings

Theme

Practical Llama 3 inference in Java

github.com

4 points by mukel 2 years ago · 2 comments

Reader

mukelOP 2 years ago

Llama3.java: featuring .GGUF file format support, Q8_0 and Q4_0 quantizations, fast matrix/vector multiplication routines using Java's Vector API; served by a simple CLI with a --chat mode to interact with the Llama 3 models.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection