Ask HN: what type of sources should I use for my novel unsearch engine
Yes you read that right we are building an un-search engine. We are very different from existing search engines. However we are in a dilemma to choose data sources to build out our MVP. Our search is most effective and value clear when multiple heterogeneous data are indexed (obviously we have anything but a Google data farm indexing power).
We are trying to pick a domain or source that is manageable scale for a small startup, and yet shows the ability to "weave" interdomain knowledge. The obvious ones are wikipedia, wikibooks etc. but would the community have any suggestions on on other sources? news perhaps? Thanks in advance! Seems like you're building a semantic search engine. Have a look at some of these: https://en.wikipedia.org/wiki/List_of_online_databases Judicial opinions. I'm not really clear on what your product does from this description, though. Thanks, are these available freely online? and what might the user base be? Presumably legal firms etc. I am trying to make it more general public oriented.