Settings

Theme

Show HN: FlashTokenizer – 10x faster C++ tokenizer for Python

github.com

5 points by springkim 9 months ago · 0 comments · 1 min read

Reader

I built a tokenizer in C++ with a Python binding that outperforms HuggingFace tokenizers by 10x on large inputs. It's optimized for minimal memory usage and latency.

Benchmarks and comparison included in README. Would love feedback or contributions!

No comments yet.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection