Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Where can I find a huge amount of text files? [duplicate]

Possible Duplicate:
Looking for dataset to test FULLTEXT style searches on

I am recently on to a project of Data Mining, for which I need 100 GB of plain text for testing. I am tired of searching the net the whole day. Someone please help me out by providing the links, where can I download such text files?

like image 309
Sri Avatar asked Feb 07 '12 07:02

Sri


People also ask

How do I find duplicates in a text file?

To start your duplicate search, go to File -> Find Duplicates or click the Find Duplicates button on the main toolbar. The Find Duplicates dialog will open, as shown below. The Find Duplicates dialog is intuitive and easy to use. The first step is to specify which folders should be searched for duplicates.


1 Answers

What type of text are you searching for? Conversational, articles, books - or a good spread of everything?

Project Gutenberg might be a good start: http://www.gutenberg.org/

Wikipedia also allows you to download an archive of articles: http://en.wikipedia.org/wiki/Wikipedia:Database_download

like image 163
Jordan Avatar answered Sep 24 '22 00:09

Jordan