cubie
- Karma
- 151
- Created
- 2 years ago
Recent Submissions
- 1. ▲ A Replacement for BERT (huggingface.co)
- 2. ▲ Training and Finetuning Embedding Models with Sentence Transformers v3 (huggingface.co)
- 3. ▲ Embedding Quantization: 25-45x retrieval speedup, 32x or 4x less memory usage (huggingface.co)
- 4. ▲ Attention Sinks in LLMs for endless fluency (huggingface.co)