Show HN: An Embedding-Based Search Service over ShowHN, AskHN, GitHub, More
payperrun.com
Hey folks, Elias here. Excited to unveil my latest project.
Why I Built This: Traditional keyword search isn't cutting it, so I've used LLM embeddings to provide more nuanced, relevant results.
How It Works: The service runs LLM-embedding similarity over curated datasets to return semantically similar results. No need to iterate over keywords any more.
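For anyone curious what "embedding similarity" means in practice, here is a minimal sketch in Python. It is not the service's actual code: the three-dimensional vectors are made up by hand to stand in for real LLM embeddings, and the names `cosine_similarity`, `search`, and `CORPUS` are hypothetical. Cosine similarity is assumed as the metric because it is a common choice; the post does not say which one is used.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query_vec, corpus, top_k=2):
    """Return the top_k (title, score) pairs, most similar first."""
    scored = [
        (title, cosine_similarity(query_vec, vec))
        for title, vec in corpus.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy vectors (assumption): a real system would get these from an
# embedding model and they would have hundreds or thousands of dimensions.
CORPUS = {
    "Show HN: Semantic search for startups": [0.9, 0.1, 0.0],
    "Ask HN: Best keyboard for programming?": [0.0, 0.2, 0.9],
    "Show HN: Vector database in Rust": [0.7, 0.6, 0.1],
}

# Pretend this is the embedding of the user's query.
query = [1.0, 0.2, 0.0]
results = search(query, CORPUS)
for title, score in results:
    print(f"{score:.3f}  {title}")
```

The point is that ranking happens on vector closeness rather than shared words, so a query can match a post that uses none of the same keywords.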
Current Datasets:
- YC Companies
- Show HN Posts
- Ask HN Posts
- ProductHunt Startups
- GitHub Top 200k Repos
Use Cases:
- Validate a product idea's existence
- Check if someone already Asked HN something
- Have fun - search random terms and see what pops up
Want to see other datasets? Got one in mind? What would you use it for? (I'm even thinking of adding all PG's tweets - so feel free to be creative.)
Eager for your feedback and ideas. Hope you find this useful and fun!

I like the section on how it works: https://payperrun.com/%3E/search?display=How%20this%20servic... The vector search is using https://lancedb.com/ and OpenAI embeddings. This will be really useful for research within the ingested domains! Awesome.

Thank you for the feedback! If there are any other domains you'd like to see, let me know :)

I'm thinking Crunchbase would give you a comprehensive view into Silicon Valley (and adjacent) company data, but the data is behind an API.