Settings

Theme

Show HN: gpt-tfjs – GPT model in TensorFlow.js

github.com

4 points by zemlyansky 2 years ago · 0 comments · 1 min read

Reader

The model architecture follows minGPT as much as possible. It actually uses minGPT for JS tests generation (gradients, predictions). The main advantage of having the model implemented in TensorFlow.js is the ability to perform training/fine-tuning, for example, in a browser using WebGPU or in Node.js.

Examples in the `projects` folder include: - sorting (basic example) - loading GPT-2 weights - training on large texts using streams

Feedback is really welcome! There also an open PR on porting the model to Typescript, which has some unresolved issues

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection