Settings

Theme

Apple open sources a framework for training text-to-image models

github.com

4 points by jlukecarlson a year ago · 1 comment

Reader

jlukecarlsonOP a year ago

I'm one of the contributors to this repo, so feel free to comment with any feedback!

The key research contribution from the related paper is that with a moderate amount of data (eg. 12M image pairs from CC12M) and a moderate amount of compute (single node of 8 A-100 GPUs for example) anyone can train a good text to image model using the unique multi scale nested u-net pipeline.

Hope this can help level the playing field for researchers everywhere.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection