Rail-Only: A Low-Cost High-Performance Network for Training LLMs with T Params

arxiv.org

2 points by edelsohn a year ago · 1 comment

teleforce a year ago

See this HN post by Meta on a similar subject [1].

There is also a previous paper by the same team from Meta and MIT, but for models with billions instead of trillions of parameters [2].

[1] A RoCE network for distributed AI training at scale:

https://news.ycombinator.com/item?id=41162664

[2] Optimized Network Architectures for Training Large Language Models With Billions of Parameters [PDF]:

https://people.csail.mit.edu/ghobadi/papers/rail_llm_hotnets...
