Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Ask HN: big data sets to play around with?
3 points by zxcvvcxz on June 25, 2012 | hide | past | favorite | 5 comments
I want to gain some experience programming some machine learning techniques with large data sets. Something for fun like trying to predict the stock market, etc.

Does anyone know of some relatively accessible sets of large data that one could get a hold of for free? Anything like past financial history, to tweets or facebook posts, whatever.

Cheers



http://aws.amazon.com/publicdatasets/ is a good place to start


Just go to this Quora question, you'll find tons of answers to this question:

http://www.quora.com/Data/Where-can-I-get-large-datasets-ope...

And don't forget http://commoncrawl.org


Infochimps hosts lots of free data sets:

http://www.infochimps.com/search?view=list&price_categor...


http://buzzdata.com/content/

They have TONS of free open data sets that you can play aound with.


Why not try joining in a competition while you're learning?

http://www.kaggle.com/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: