Consider the verb “removing.” As a human, you understand the different ways that word can be used—and you know that visually, a scene is going to look different depending on what is being removed from what. Pulling a piece of honeycomb from a larger chunk looks different from a tarp being pulled away from a field, or a screen protector being separated from a smartphone. But you get it: in all those examples, something is being removed.
Computers and artificial intelligence systems, though, need to be taught what actions like these look like. To help accomplish that, IBM recently published a large new dataset of three-second video clips meant to help researchers train machine learning systems by giving them visual examples of action verbs like “aiming,” “diving,” and “weeding.” Exploring it (the car video above and the bee video below come from the dataset and illustrate “removing”) provides a strange tour of the sausage-making that goes into machine learning. Under “winking,” viewers can see a clip of Jon Hamm as Don Draper giving a wink, as well as a moment from The Simpsons; there’s plenty more where that came from. Check out a portion of the dataset here; in total it spans more than 300 verbs and about a million videos.
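For researchers poking around, the mechanics are mundane: a big pile of short, labeled clips. The sketch below shows one way such a collection might be indexed, assuming, purely as an illustration, that the clips are stored in folders named after their verb labels; the real download may be organized differently, and the paths and file extension here are hypothetical.

```python
from pathlib import Path
from collections import Counter

# Hypothetical root directory for the downloaded clips; adjust to the
# actual location and layout of the dataset on your machine.
DATASET_ROOT = Path("moments_in_time/training")

def index_clips(root: Path) -> list[tuple[str, Path]]:
    """Collect (action_label, clip_path) pairs, assuming one folder per verb."""
    pairs = []
    for label_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        for clip in label_dir.glob("*.mp4"):
            pairs.append((label_dir.name, clip))
    return pairs

if __name__ == "__main__":
    clips = index_clips(DATASET_ROOT)
    counts = Counter(label for label, _ in clips)
    print(f"{len(counts)} action labels, {len(clips)} clips indexed")
    # e.g., how many three-second examples of "removing" are available
    print("removing:", counts.get("removing", 0))
```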
Teaching computers to understand actions in videos is tougher than getting them to understand images. “Videos are harder because the problem that we are dealing with is one step higher in terms of complexity if we compare it to object recognition,” says Dan Gutfreund, a researcher at a joint IBM-MIT laboratory. “Because objects are objects; a hot dog is a hot dog.” Understanding the verb “opening,” meanwhile, is tricky, he says, because a dog opening its mouth and a person opening a door are going to look very different.
The dataset is not the first that researchers have created to help machines understand images or videos. ImageNet has been important in teaching computers to identify pictures, and several video datasets already exist: one is called Kinetics, another focuses on sports, and still another, from the University of Central Florida, contains actions like “basketball dunk.”
But Gutfreund says that one of the strengths of their new dataset is its focus on what he calls “atomic actions.” Those include basics from “attacking” to “yawning.” And breaking things down into atomic actions, he says, is better for machine learning than focusing on more complex actions, like showing someone changing a tire or tying a necktie.
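Framed that way, each three-second clip carries exactly one atomic-action label, which turns training into an ordinary multi-class classification problem. Here is a minimal, hypothetical sketch of that setup in PyTorch, assuming the clips have already been pooled into fixed-length feature vectors by some video backbone; the feature size and class count are illustrative, not taken from the dataset.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

NUM_ACTIONS = 339   # illustrative; one class per atomic verb label
FEATURE_DIM = 2048  # illustrative; size of pooled per-clip features

# A plain linear classifier head over pooled clip features: each
# three-second clip gets exactly one atomic-action label.
head = nn.Linear(FEATURE_DIM, NUM_ACTIONS)

# Dummy batch standing in for features from a video backbone.
features = torch.randn(8, FEATURE_DIM)
labels = torch.randint(0, NUM_ACTIONS, (8,))

logits = head(features)                 # (8, NUM_ACTIONS) scores
loss = F.cross_entropy(logits, labels)  # standard multi-class objective
print(loss.item())
```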
Ultimately, he says, he hopes the dataset will help computer models understand simple actions as easily as we humans can.