Sophia: A Scalable Second-Order Optimizer for Language Model Pre-Training arxiv.org 4 points by Anon84 2 days ago · 1 comment Reader PiP Save 66y66 a day ago ggtttttttttyy