About time to get back to Dreamcast cluster project. When I left off, I had forked llama2.c to do inference in pipeline parallel w/accel matmul using SH4 vector ops. Here it is running on 2 DCs. With only 32MB RAM, very tiny model outputs gibberish. 8 DCs will do TinyStories. https://t.co/htic9xVDtZ

1 min read Original article ↗

Post

About time to get back to Dreamcast cluster project. When I left off, I had forked llama2.c to do inference in pipeline parallel w/accel matmul using SH4 vector ops. Here it is running on 2 DCs. With only 32MB RAM, very tiny model outputs gibberish. 8 DCs will do TinyStories.

Don't miss what's happening

People on X are the first to know.

Log inSign up