DiffusionBlocks: Block-Wise Neural Network Training
arxiv.orgThe same Sakana that couldn't validate experiments
https://techcrunch.com/2025/02/21/sakana-walks-back-claims-t...
Other teams did a better job and provided code
This is unrelated. They both use the word "block", but what they are referring to differs
"Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models"
Yes and? The paper I linked is about network weights, not the type of generative model