Show HN: Generate a 1M-document RAG eval dataset from a single prompt

alexjacobs08.github.io

2 points by tacoooooooo a month ago · 1 comment

Reader

Standard benchmarks (like BEIR/MS MARCO) are great, but they are likely already in distribution for foundation models training sets, and crucially, they lack the complex, structured metadata needed to test real-world filtering scenarios (e.g., "Find docs from region X, between dates Y and Z, with tag A").

datasetFactory is an orchestrated LLM pipeline that turns a single natural language prompt into a (potentially) massive, structured evaluation dataset.

Settings

Show HN: Generate a 1M-document RAG eval dataset from a single prompt

Keyboard Shortcuts