Settings

Theme

Deploying Llama3 70B on AWS – GPU Requirement, Cost and Step-by-Step Guide

slashml.com

3 points by JJneid 2 years ago · 6 comments

Reader

rini17 2 years ago

Note that quantized versions of llama3 70B can be ran on CPU on much cheaper server. I am personally using it via llama.cpp on bare metal 6-core Xeon CPU with 128G RAM for ~50 euro monthly.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection