Tiny hackable CUDA language model implementation

github.com

39 points by markusheimerl 3 days ago · 3 comments

yobbo 7 hours ago

Looks very nice, but I can't find numerical gradient checks, which are helpful for verifying that the backward pass is correct:

https://github.com/markusheimerl/gpt/blob/main/transformer/a...
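For anyone unfamiliar with the idea, here is a minimal sketch of a numerical gradient check. It is not taken from the linked repository: the toy tanh model and the names `loss`, `analytic_grad`, `numerical_grad`, and `max_relative_error` are all made up for illustration. The point is only to compare a hand-written backward pass against central finite differences of the forward pass.

```python
import math

def loss(w, x, y):
    # Hypothetical toy model: prediction = tanh(w . x), squared error against y.
    z = sum(wi * xi for wi, xi in zip(w, x))
    return 0.5 * (math.tanh(z) - y) ** 2

def analytic_grad(w, x, y):
    # Hand-derived backward pass for the toy loss above.
    z = sum(wi * xi for wi, xi in zip(w, x))
    p = math.tanh(z)
    dz = (p - y) * (1.0 - p * p)  # dL/dz via the chain rule through tanh
    return [dz * xi for xi in x]

def numerical_grad(f, w, eps=1e-5):
    # Central differences: (f(w + eps*e_i) - f(w - eps*e_i)) / (2*eps).
    grad = []
    for i in range(len(w)):
        w_plus = list(w)
        w_plus[i] += eps
        w_minus = list(w)
        w_minus[i] -= eps
        grad.append((f(w_plus) - f(w_minus)) / (2.0 * eps))
    return grad

def max_relative_error(a, b, tiny=1e-12):
    # Largest per-parameter relative difference between the two gradients.
    return max(abs(ai - bi) / max(abs(ai) + abs(bi), tiny)
               for ai, bi in zip(a, b))

w = [0.3, -0.7, 1.1]
x = [1.0, 2.0, -0.5]
y = 0.25

num = numerical_grad(lambda v: loss(v, x, y), w)
ana = analytic_grad(w, x, y)
err = max_relative_error(num, ana)
print("max relative error:", err)
```

In double precision the two gradients should agree to many digits for a function this smooth. In a single-precision CUDA implementation the achievable agreement is much looser, so the step size and tolerance would need tuning, and checking a random subset of parameters is usually enough.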
