Ask HN: Stemmers in Ruby, any good? Or should I just go ahead and write my own?

2 points by arindam_ 12 years ago · 2 comments · 1 min read


The most popular option seems to be https://github.com/aurelian/ruby-stemmer, but it's kind of outdated and gives poor-ish results: "why" becomes "whi", "people" becomes "peopl", and there are many other incorrect ones.

I was hoping for something like Solr's reduction to stems that I could use in my project.

Thinking of going for a full port of Porter's stemming. Thoughts?
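
As a reference point for such a port, here is a minimal sketch of a single rule, step 1c of the Snowball English (Porter2) algorithm, as I understand it. The method name `step_1c` and the `VOWELS` constant are made up for illustration, and the real algorithm has more steps and some special handling of "y" that is left out here. The rule suggests that "whi" may be what the algorithm produces by design rather than a bug in the gem.

```ruby
# Simplified sketch of Porter2 step 1c only (not a full stemmer):
# a trailing "y" becomes "i" when the letter before it is a non-vowel
# and that letter is not the first letter of the word.
VOWELS = %w[a e i o u y].freeze

def step_1c(word)
  # Words of one or two letters are left alone ("by" stays "by").
  return word unless word.length > 2 && word.end_with?("y")

  before = word[-2]
  # "say" stays "say" because the "y" follows a vowel.
  return word if VOWELS.include?(before)

  word[0..-2] + "i"
end

p %w[why say cry by].map { |w| step_1c(w) }
# => ["whi", "say", "cri", "by"]
```

If that reading is right, a straight port would give the same "whi", so the stems are probably better treated as index keys than as readable words.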

danso 12 years ago

Thanks for asking this, I'd also be interested in knowing...

Did you try the uea-stemmer? Also pretty old: https://github.com/ealdent/uea-stemmer

Also, there's the treat gem, which is an all-in-one package...it uses both the stemmer you mentioned and the uea one...so maybe that's it for Rubyists.

https://github.com/louismullie/treat

boyter 12 years ago

https://github.com/raypereda/stemmify

Not a Ruby guy, so no idea if it's any good, but it is the implementation linked from http://tartarus.org/~martin/PorterStemmer/
