Settings

Theme

Model training diary/journal for LLMs?

1 points by nalzok 2 years ago · 1 comment · 1 min read


About half a year ago, some big tech company released an open-source LLM. What makes that model special is that they made available a model training diary/journal recording everything their engineers did to babysit the training process, e.g. "on day 143, the training loss plateaued, so we decreased the learning rate further". I think it was in a shared Google Doc.

Can you remind me of the name of the company/model?

nalzokOP 2 years ago

Nevermind, I figured it out: https://github.com/facebookresearch/metaseq/blob/main/projec...

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection