Ask HN: How to use ML in information retrieval

2 points by kevinfat 2 years ago · 0 comments · 1 min read

In the information retrieval space I am aware of software such as Lucene, Solr, Elasticsearch, Sphinx, and Manticore. As far as I know, these are all based on inverted indices plus language-specific components such as custom stemmers. This is rather annoying, because improving results means handcrafting customizations for each language.
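To make concrete what I mean by handcrafting, here is a toy sketch of an inverted index with a stemmer in front of it. This is not how Lucene or any of the engines above implement it; `naive_stem`, `build_index`, and `search` are made-up names, and the suffix list is an arbitrary English-only assumption. The suffix list is exactly the kind of per-language rule I would like to avoid writing.

```python
from collections import defaultdict

def naive_stem(token):
    # Hypothetical, deliberately crude English-only stemmer:
    # strip the first matching suffix if at least 3 characters remain.
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token

def build_index(docs):
    # Inverted index: stemmed term -> set of ids of documents containing it.
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for token in text.lower().split():
            index[naive_stem(token)].add(doc_id)
    return index

def search(index, query):
    # AND semantics: return ids of documents containing every stemmed query term.
    postings = [index.get(naive_stem(t), set()) for t in query.lower().split()]
    return set.intersection(*postings) if postings else set()

docs = [
    "indexing documents quickly",
    "the index stores postings",
    "ranking results",
]
index = build_index(docs)
print(search(index, "indexed"))
```

The query "indexed" only finds the documents containing "indexing" and "index" because the stemmer maps all three to the same term. Every other language would need its own rules.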

Is there a way to apply ML to this problem? I am envisioning the following inputs:

* the smallish dataset to be queried on

* a much bigger corpus for teaching the system about the language so we don't need to handcraft customizations for the language such as a stemmer

* queries and labeled best results from the dataset to learn a ranking from
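As a rough illustration of the third item, here is a minimal sketch of learning a ranking from labeled queries. It is a plain pairwise perceptron, not a neural network, and it uses character trigrams as a language-agnostic stand-in for a stemmer. It does not use the bigger unlabeled corpus at all. The documents, queries, and function names are all invented for the example.

```python
from collections import defaultdict

def trigrams(text):
    # Character trigrams of the padded, lowercased text; no language-specific rules.
    padded = f" {text.lower()} "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}

def shared(query, doc):
    # Features for a (query, document) pair: the trigrams they have in common.
    return trigrams(query) & trigrams(doc)

def score(weights, grams):
    return sum(weights[g] for g in grams)

def train(docs, labeled, epochs=5):
    # Pairwise perceptron: whenever the labeled best document does not
    # strictly outscore another document, shift weight toward its features.
    weights = defaultdict(float)
    for _ in range(epochs):
        for query, best in labeled:
            good = shared(query, docs[best])
            for other, doc in enumerate(docs):
                if other == best:
                    continue
                bad = shared(query, doc)
                if score(weights, good) <= score(weights, bad):
                    for g in good:
                        weights[g] += 1.0
                    for g in bad:
                        weights[g] -= 1.0
    return weights

def rank(weights, docs, query):
    # Document ids ordered from highest to lowest learned score.
    return sorted(range(len(docs)),
                  key=lambda i: -score(weights, shared(query, docs[i])))

# Invented toy data: (query, index of the labeled best document).
docs = [
    "how to train neural rankers",
    "stemming rules for german",
    "cooking pasta at home",
]
labeled = [("neural ranking", 0), ("german stemmer", 1), ("pasta cooking", 2)]
weights = train(docs, labeled)
print(rank(weights, docs, "german stemmer"))
```

What I am asking is whether a neural model can replace the trigram features and the linear scorer here, ideally one that first learns about the language from the bigger corpus.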

Is any of that possible with neural networks, and if so, where would one start to learn about what works? Ideally we want something we can train on ever bigger data without having to handcraft feature vectors.
