These Datasets Are Just Waiting for Your Next Creative Project.
Artificial Intelligence isn’t just for scary algorithms ready to take over our lives – it can also be fun to play with, as we learned when we taught the computer to generate Lifehacker titles . But you can’t play until you have good datasets to start with.
Luckily, Gengo AI has put together several datasets for your next project, whether it’s a silly Twitter bot or your next self-driving car. (Actually – check out this database of street sign video clips , for example.)
Could your next project use 20,000 dog images ? Airline tweets already classified based on whether the tweet is positive, negative, or neutral? (Surprise: They’re mostly negative.) You might be able to do something interesting with 5 million reviews on Yelp . Or perhaps you could feed these Jeopardy questions and answers to a neural network and ask it to build a brand new Jeopardy game for you.
The overview also includes several data repositories, each of which is a gold mine in its own right:
- Kaggle has a vast array of datasets including superheroes , cryptocurrency markets, and chest x-rays .
- Data.gov collects datasets from US government agencies, including food recalls , bank complaints, and how much hospitals charge for the most common procedures .
- The UK Data Center is the UK ‘s central repository of social, economic and demographic datasets.
These are all free datasets available for any project you can dream of. You might want to use some of this data to analyze social issues and make the world a better place. Or just messing around with bots, that’s okay too.
50 Best Free Machine Learning Datasets | Gengo AI