DiffusionBlocks: Training Neural Networks One Block at a Time

4 points by sebg 12 hours ago · 1 comment

billconan 10 hours ago

I do not understand.

How is this different from building smaller transformer layers, where each layer just denoises less?
