Settings

Theme

Ask HN: Stanford NLP vs OpenNLP vs NLTK?

2 points by dquail 12 years ago · 4 comments · 1 min read


Anyone suggest Stanford NLP / OpenNLP / NLTK ?

We have a bit of experience with Stanford NLP but are wondering if others have surpassed it.

We're looking to read email documents to, among other things, identify implicit tasks (and their attributes - owner, priority, date).

agibsonccc 12 years ago

Stanford NLP has most of what you'd need out of the box. NLTK is way more popular though.Keep in mind the licenses (GPL vs Apache for NLTK) unless you've already bought the commercial license for Stanford.

I've built my own full stack http://www.alchemyapi.com/ using UIMA[1] and cleartk as a base though.

The option of the different JVM scale out frameworks like akka,hadoop,storm etc have been amazing.

I would recommend NLTK for simpler tasks though.

We're working on an OSS Watson over at[3]:

that uses those components though. It's fairly easy to use different kinds of machine learning models for what you're looking for. Shoot me an email (in my profile) if you have any specific questions.

[1] http://uima.apache.org/ [2] http://cleartk.googlecode.com/ [3] https://github.com/SolrSherlock/

codyguy 12 years ago

Hi, I am working on a new NLP engine that might be of use to you. It's a work in progress but promising and is customizable. Are you open to using a paid component? Please shoot an email about your needs to mail [at] thetruebot [dot] com

denzil_correa 12 years ago

I would suggest NLTK. Python has inherent advantages for text processing tasks.

  • dquailOP 12 years ago

    cool. The rest of the system is Django so that'd be ideal. I'm not against a polyglot ... but prefer not having that level of complexity.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection