Dataset Collections

  • Kaggle - a leading platform for predictive modeling competitions.

  • UCI MLR - UC Irvine Machine Learning Repository

  • google.com/publicdata - public data maintained by Google

  • Freebase - A community-curated database of well-known people, places, and things

  • mldata.org - a machine learning repository for uploading and finding datasets

  • Infochimps - a huge collection of large datasets

  • Amazon Web Services - Public Data Sets on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications.

  • Databib - a searchable catalog / registry / directory / bibliography of research data repositories.

  • figshare - an online digital repository where researchers can preserve and share their research outputs, including figures, datasets, images, and videos.

  • reddit r/datasets - datasets shared on reddit

  • datahub - the free, powerful data management platform from the Open Knowledge Foundation

  • Quandl - a search engine for numerical data

  • enigma - a search engine for public records published by governments, companies and organizations.



Specialized Datasets

[back to top]