Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Sample Database for Full Text Searching

I am looking to do some benchmarking on Full Text Search indexes in PostgreSQL, SQLServer and Lucene.

Any ideas on where to find a good big sample database to perform queries against?

Thanks a lot in advance.

like image 959
Pablo Santa Cruz Avatar asked Oct 24 '25 18:10

Pablo Santa Cruz


1 Answers

I think the great source would be wikipedia's database dump, since they contains really great amount of text. They are available here: http://dumps.wikimedia.org/

You could also try usenet archive, but there's harder to pick target language and the quality of language used is also lower.

like image 63
Danubian Sailor Avatar answered Oct 27 '25 01:10

Danubian Sailor



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!