Data Scientist Dataset Finder Blog

Friday, 17 April 2020

Open Datasets

Recent Additions
  • The UZH-FPV Drone Racing Dataset: High-speed, Aggressive 6DoF Trajectories for State Estimation and Drone Racing
  • Hotels-50K: A Global Hotel Recognition Dataset Code
  • North Korean Missile Test Database (Yes, this is a thing…)
  • Flickr-Faces-HQ Dataset (FFHQ): A high-quality image dataset of human faces
  • Two New Evaluation Data-Sets for Low-Resource Machine Translation: Nepali–English and Sinhala–English
  • MIMIC-CXR: A large publicly available database of labeled chest radiographs
  • Open Source Biometric Recognition Data
  • Google Audioset: An expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos.
  • Uber 2B trip data: Slow rollout of access to ride data for 2Bn trips.
  • Yelp Open Dataset: The Yelp dataset is a subset of Yelp businesses, reviews, and user data for use in NLP.
  • Core50: A new Dataset and Benchmark for Continuous Object Recognition
  • Data Portals
  • Open Data Monitor
  • Quandl Data Portal
  • Mut1ny Face/Head segmentation dataset
  • Awesome Public Datasets on Github
  • Head CT scan dataset: CQ500 dataset of 491 scans
  • Open Datasets at OpenML.org
  • WaPo: How to Download and Use the DEA’s Pain Pill Database
  • The Korean Question Answering Dataset
  • Chess Dataset
  • NLP datasets
at 03:14
Share

No comments:

Post a Comment

‹
›
Home
View web version
Powered by Blogger.