I made a convenient API service to chunk files for RAG
In my experience developing RAG-based applications, I was surprised to find that there aren't any simple, reliable ways to chunk files.
I ended up implementing my own chunking system that includes deep positioning data like page index and bounding box coordinates for every chunk.
You can try it out for free here (no account/api key required):
https://filechipper.com
Would any of you be interested in something like this? Let me know! Sorry, but it is really strange how haphazard it is. Am I doing something wrong? It's missing more or less all the titles
The order of the text is casual
The system chunks at random points in the text https://mlops.community/wp-content/uploads/2023/07/survey-re...