
Entity Extraction Library [closed]

I’m looking for a library that does text analysis and extracts entities.

The type or classification of an entity is not critical; what matters is identifying that something is worth extracting. The universe of entities in this case is unbounded: it is not limited to a fixed dictionary.

A couple of web services seem to do this (NERD lets you compare their results, which is pretty useful: http://nerd.eurecom.fr/documentation), but I’m looking for a local library, not a remotely hosted service. I’d prefer Java or .NET, but if it’s a good library I’ll learn whatever language it’s written in.

There are a few older threads on similar topics; I was hoping to find new developments in this area and/or libraries built on top of lower-level NLP libraries:

  • Entity Extraction/Recognition with free tools while feeding Lucene Index
  • Lucene Entity Extraction
  • How do I do Entity Extraction in Lucene
  • https://stackoverflow.com/questions/tagged/named-entity-extraction
  • https://stackoverflow.com/questions/tagged/named-entity-recognition

Does anyone know of a good library that does a decent job?

hi1869695 asked Oct 06 '22 19:10


2 Answers

I've researched, but never used, the following hosted entity identification services:

OpenCalais

  • Documentation: http://www.opencalais.com/documentation/calais-web-service-api
  • Java API: http://code.google.com/p/j-calais/

AlchemyAPI

  • Website: http://www.alchemyapi.com/
  • Java API: http://www.alchemyapi.com/tools/
cmbaron answered Oct 10 '22 03:10


If you are comfortable with Perl, there are several part-of-speech taggers available; Lingua::TreeTagger and Lingua::BrillTagger come to mind (found via Google).
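
As a rough illustration of how part-of-speech tagger output could feed entity identification without a fixed dictionary, here is a minimal Java sketch (Java being the asker's preferred language). It groups consecutive proper-noun tokens into candidate entities. The class name `CandidateEntities`, the method `extract`, and the hand-written tags are hypothetical and are not part of either Perl module. The sketch assumes Penn Treebank-style tags (`NNP`, `NNPS`); the actual tagset depends on the tagger, so the prefix check may need adjusting.

```java
import java.util.ArrayList;
import java.util.List;

public class CandidateEntities {

    /**
     * Groups runs of consecutive proper-noun tokens into candidate entities.
     * Assumes Penn Treebank-style tags, where proper nouns start with "NNP".
     */
    public static List<String> extract(String[] tokens, String[] tags) {
        if (tokens.length != tags.length) {
            throw new IllegalArgumentException("tokens and tags must have the same length");
        }
        List<String> entities = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < tokens.length; i++) {
            if (tags[i].startsWith("NNP")) {
                // Extend the current run of proper nouns.
                if (current.length() > 0) {
                    current.append(' ');
                }
                current.append(tokens[i]);
            } else if (current.length() > 0) {
                // A non-proper-noun token closes the current run.
                entities.add(current.toString());
                current.setLength(0);
            }
        }
        // Flush a run that reaches the end of the input.
        if (current.length() > 0) {
            entities.add(current.toString());
        }
        return entities;
    }

    public static void main(String[] args) {
        // Tags are written by hand here; in practice they would come from a tagger.
        String[] tokens = {"Barack", "Obama", "visited", "Paris", "."};
        String[] tags = {"NNP", "NNP", "VBD", "NNP", "."};
        System.out.println(extract(tokens, tags)); // prints [Barack Obama, Paris]
    }
}
```

This heuristic only finds capitalized names that the tagger marks as proper nouns, so it is a starting point for candidate spotting, not a replacement for a trained entity recognizer.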

Mark Leighton Fisher answered Oct 10 '22 03:10