Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to configure Apache Tika with apache Solr 1.4.1

I want to index a large number of pdf documents.

I have found a reference showing that it could be done using Apache Tika but unfortunately I cannot find any reference that describes I could configure Apache Tika in Solr 1.4.1.

Once configured I do have it configured, how can I send documents to Solr directly without using curl?

I am using solrnet for indexing.

like image 686
Ahsan Iqbal Avatar asked Oct 05 '10 13:10

Ahsan Iqbal


2 Answers

See ExtractingRequestHandler

like image 117
Pascal Dimassimo Avatar answered Nov 06 '22 19:11

Pascal Dimassimo


Support for ExtractingRequestHandler in SolrNet is not yet complete. You can either finish implementing it, or work around it and craft your own HttpWebRequests.

like image 44
Mauricio Scheffer Avatar answered Nov 06 '22 19:11

Mauricio Scheffer