Pretraining Under Infinite Compute
arxiv.org
"Our results show that simple algorithmic improvements can enable significantly more data-efficient pre-training in a compute-rich future."