Settings

Theme

Show HN: Skeletoken, a Package for Editing Tokenizers

github.com

1 points by stephantul 3 months ago · 0 comments · 1 min read

Reader

Hello!

I work on Hugging Face tokenizers a lot in my day job. Editing tokenizers, e.g., adding or removing tokens is super painful. This is why I wrote a library for working with the format.

It contains many useful tools for working with tokenizers, checking them, making them lowercased, etc.

There’s still loads of features to add and probably bugs to iron out, but I’ve been using it and it seems to work well!

Please let me know what you think, Stéphan

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection