Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Sample data for Hadoop [duplicate]

Tags:

hadoop

For education purpose I am looking for a large set of data. Data from social networks could be interesting but difficult to obtain. Data from scientific experiments could lead to write very difficult algorithm to have interesting results. Does any one have an idea how / where can I generate / find a large interesting data set ?

like image 721
Kartoch Avatar asked Mar 15 '13 15:03

Kartoch


Video Answer


2 Answers

Here are some public data sets I have gathered over time

http://wiki.gephi.org/index.php/Datasets
Download large data for Hadoop
http://datamob.org/datasets
http://konect.uni-koblenz.de/
http://snap.stanford.edu/data/
http://archive.ics.uci.edu/ml/
https://bitly.com/bundles/hmason/1
http://www.inside-r.org/howto/finding-data-internet
http://goo.gl/Jecp6
http://ftp3.ncdc.noaa.gov/pub/data/noaa/1990/
http://data.cityofsantacruz.com/
http://bitly.com/bundles/hmason/1

like image 151
Praveen Sripati Avatar answered Sep 19 '22 03:09

Praveen Sripati


Here Amazon has a list of some huge public datasets you may try out : http://aws.amazon.com/publicdatasets/

like image 44
Amar Avatar answered Sep 20 '22 03:09

Amar