I am using solrj 1.4. My solrj doesn't index properly the documents in utf-16 encoding. I guess when it tries to convert to unicode, it replaces the problematic utf-16 surrogate keys with unicode replaceable character U+FFFD. Can anyone guide me on how to configure solrj 1.4 to index/search for utf-16 documents as well as utf-8 ?
The Solr index is in utf-8 (Why don't International Characters Work). In order to be able to search using other encodings you can always perform the translation in your software interfacing Solr.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With