Add ImageNet
Add Stanford Cars
### Datasets from Recommendations for Engineers - Pawel Cislo
- 50 Best Free Datasets for ML
- 100,000 Faces Generated by AI
- Academic Torrents
- ApolloScape <— RGB videos with high-resolution image sequences and per-pixel annotation, plus survey-grade dense 3D points with semantic segmentation
- Awesome Public Datasets
- AWS Public Datasets
- Berkeley DeepDrive BDD100k <— 100,000 HD video sequences covering more than 1,100 hours of driving across different times of day, weather conditions, and driving scenarios. The video sequences also include GPS locations, IMU data, and timestamps
- Caffe2
- Common Voice <— dataset of voices that everyone can use to train speech-enabled applications
- CORGIS <— datasets for beginners
- DataHub <— easiest way to find, share and publish datasets online
- Datasets for machine learning <— huge list (CV/NLP/Audio)
- Datasets for mind reading (dataset is still available here with a CRCNS account) <— fancy, huh?
- FiveThirtyEight <— economics, sports, politics
- Goodbooks-10k <— new dataset for book recommendations
- Google BigQuery <— public datasets from Google
- Google Dataset Search <— search engine of datasets from Google
- Grouplens datasets
- HealthData <— high value health data
- How readers browse Wikipedia
- Kaggle datasets
- List of lists with datasets
- LVIS <— dataset for large vocabulary instance segmentation
- Mapillary Vistas Dataset <— street-level imagery dataset with pixel‑accurate and instance‑specific human annotations for understanding street scenes around the world
- Mathematics Dataset <— generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty
- Million Song Dataset
- nuScenes <— large-scale autonomous driving dataset
- Papers With Code Datasets <— ML datasets with lots of filtering options
- Quandl <— financial data directly into Python
- Quantopian Datasets
- Tencent ML-Images <— largest multi-label image database; its ResNet-101 model reaches 80.73% top-1 accuracy on ImageNet
- VQA <— 200k+ images and over a million questions (with answers) about those images
- easy-VQA <— simpler version of the original VQA dataset
- World Bank Open Data <— economic data
- YouTube 8M <— dataset of YouTube videos