Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

NLP programming tools using PHP?

Tags:

php

lucene

nlp

Since big web applications came into existence, searching for data (and doing it lightning fast and accurate) has been one of the most important problems in web applications. For a while, I've worked using Lucene.NET, which is a C# port of the Lucene project.

I also work using PHP using Zend Framework's Lucene API, which brings me to my question. Most times for providing good indexing we need to perform some NLP tools like tokenizing, lemmatizing, and many more, the question is:

Do you know of any good NLP programming framework/toolset using PHP?

PS: I'm very aware of the Zend API for Lucene, but indexing data properly is not just storing and relying in Lucene, you need to perform some extra tasks, like those above.

like image 412
David Conde Avatar asked Dec 16 '10 05:12

David Conde


People also ask

Can I use Java for NLP?

Java can be applied to a wide range of processes in machine learning and data science, including data export and import, data cleaning, deep learning, statistical analysis, NLP, ML, and data visualization.


2 Answers

I would suggest that you look at Solr, which is a best practice implementation of Lucene. Solr uses a REST based API that also has a very good PHP client. This will allow you to leverage the power of Lucene without needing to perform any of the low level programming to get the NLP power that you want. Also, you would probably want to grab the trunk version of Solr as the NLP development is very active right now and new capabilities are being added every day.

like image 178
Paige Cook Avatar answered Oct 03 '22 09:10

Paige Cook


Zend has a full port of lucene to PHP. See docs here.

  • Lucene has tokenizers
  • Lucene has a porter stemmer
  • Lucene has snowball
  • Lucene can tie in with wordnet
like image 25
Xodarap Avatar answered Oct 03 '22 11:10

Xodarap