Does anyone here have experience with writing custom FTS3 (the full-text-search extension) tokenizers? I'm looking for a tokenizer that will ignore HTML tags.
Thanks.
I have no direct experience, but a web search for "sqlite3 registerTokenizer" turned up two existing tokenizers that you could use as a starting point: a Snowball tokenizer and a MeCab tokenizer.
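In case it helps, here is a rough sketch of the scanning logic such a tokenizer would need. It is written in Python only to keep it short; a real FTS3 tokenizer is written in C against the `sqlite3_tokenizer_module` interface, where this loop would live in the `xNext` callback. The function name `html_tokens` is made up for this example, and the rules are assumptions on my part: anything between `<` and the next `>` is treated as markup, tokens are runs of alphanumeric characters, and tokens are lowercased. It does not handle comments, entities, script or style contents, or a `>` inside an attribute value.

```python
def html_tokens(text):
    """Yield (token, start, end, position) for alphanumeric runs outside <...> tags.

    Hypothetical helper for illustration; not part of SQLite or any library.
    Offsets are character offsets here, whereas FTS3 expects byte offsets.
    """
    i, n, position = 0, len(text), 0
    while i < n:
        ch = text[i]
        if ch == "<":
            # Skip everything up to and including the closing '>'.
            close = text.find(">", i + 1)
            if close == -1:
                break  # unterminated tag: treat the rest of the input as markup
            i = close + 1
        elif ch.isalnum():
            # Collect one token: a maximal run of alphanumeric characters.
            start = i
            while i < n and text[i].isalnum():
                i += 1
            yield text[start:i].lower(), start, i, position
            position += 1
        else:
            # Whitespace or punctuation between tokens.
            i += 1


if __name__ == "__main__":
    for tok in html_tokens("<p>Hello <b>big</b> world</p>"):
        print(tok)
```

The start and end offsets are kept relative to the original text, tags included, since as far as I know FTS3 uses them for functions like `offsets()` and `snippet()`. That is the main reason to skip tags inside the tokenizer rather than stripping them before inserting.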