Rethinking The Turing Test

At a competition in June, a chatbot named Eugene duped a group of human judges into believing it was a Ukrainian teenager. The judges hailed it as the first time a machine passed the Turing Test—that hallowed measure of artificial intelligence (AI) proposed by computer scientist Alan Turing in 1950.

Eugene’s victory was short-lived. Within days, AI researchers had dismissed the chatbot’s achievement as a collection of canned responses. Then they took the Turing Test itself to task. Conceived of as a kind of existential parlor game, the test asks a human and a machine to respond to questions from remote interrogators. A computer mistaken for a person would prove that it had developed the capacity to mimic our own thought processes.

That all sounds good enough, but “people are easy to deceive,” says Ernie Davis, a computer scientist at New York University. “We’re used to the safe assumption that whoever is talking to us is actually an intelligent person.” So human officiants will likely give the computer the benefit of the doubt. Additionally, chatbots often mask their lack of reasoning by coming across as merely scatterbrained. For example, futurist Ray Kurzweil once asked Eugene, “If I have two marbles in a bowl and I add two more, how many marbles are in the bowl now?” “Not too many,” wrote Eugene. “I can’t tell you the exact number; I forgot it. If I’m not mistaken, you still didn’t tell me where you live.”

“We’re used to the safe assumption that whoever is talking to us is actually an intelligent person.”

In that way, the Turing Test doesn’t foster the development of machines with adaptive, human-level smarts. Instead, it exposes our own gullibility, and spawns programs whose greatest innovation is the tactical use of snarky non-sequiturs and manipulative charm.

The harsh criticism of AI’s most famous benchmark comes at a moment when interest and investment in the field are spiking. Google recently acquired AI firm DeepMind for $400 million, and IBM is investing $1 billion in its Watson system, the former Jeopardy! winner that’s now unraveling the genetics of brain cancer. Even the late Alan Turing is getting the Hollywood treatment this fall, as the subject of the biopic The Imitation Game. Some might say the field of AI doesn’t need the Turing Test anymore. We should just let machines grow smarter on their own inhuman terms.

That would be a mistake. The genius of the Turing Test is that it captured the public imagination and drove innovation. So why not build a new one better suited to the task of proving true artificial intelligence. “Maybe rather than looking at one big hurdle, we should try to understand how to make a bunch of small steps that lead us along the path to something useful,” says Noah Goodman, a cognitive scientist at Stanford University. Machines should have to tackle a range of tasks that emphasize nimble, on-the-spot thinking. Can it describe a video after seeing it for the first time, respond to direct questions with direct answers, and recognize nuances in language? Far more than a gimmick, such a system would finally demonstrate, in Turing’s words, “a machine that thinks.” Eugene was nowhere close.

This article originally appeared in the October 2014 issue of Popular Science.

Rethinking The Turing Test

“We’re used to the safe assumption that whoever is talking to us is actually an intelligent person.”

AI trained on AI churns out gibberish garbage AI trained on AI churns out gibberish garbage

Everyone is judging AI by these tests, but experts say they’re close to meaningless Everyone is judging AI by these tests, but experts say they’re close to meaningless

How Smart Is Your Artificial Intelligence? How Smart Is Your Artificial Intelligence?

Is A Simulated Brain Conscious? Is A Simulated Brain Conscious?

Writer/Director Alex Garland Discusses His Latest AI-Inspired Film ‘Ex Machina’ Writer/Director Alex Garland Discusses His Latest AI-Inspired Film ‘Ex Machina’

How To Create Super-Intelligent Machines That Won’t Kill Us How To Create Super-Intelligent Machines That Won’t Kill Us

Google and NASA Have A New Quantum Computer Google and NASA Have A New Quantum Computer

Google’s New AI Plays Atari Games As Well As You Can, Or Better Google’s New AI Plays Atari Games As Well As You Can, Or Better

Can A Human Fall In Love With A Computer? Can A Human Fall In Love With A Computer?

How Facebook’s New Machine Brain Will Learn All About You From Your Photos How Facebook’s New Machine Brain Will Learn All About You From Your Photos

Lie Like A Lady: The Profoundly Weird, Gender-Specific Roots Of The Turing Test Lie Like A Lady: The Profoundly Weird, Gender-Specific Roots Of The Turing Test

Solve for Standing Ovation: Should AI Researchers Bother Building a TED-Bot? Solve for Standing Ovation: Should AI Researchers Bother Building a TED-Bot?

The Machines Vs. Mitt Romney: How Artificial Intelligence Is Parsing Political Rhetoric The Machines Vs. Mitt Romney: How Artificial Intelligence Is Parsing Political Rhetoric

The End Is A.I.: The Singularity Is Sci-Fi’s Faith-Based Initiative The End Is A.I.: The Singularity Is Sci-Fi’s Faith-Based Initiative

Could Desktop Computers Be The Best Home For Virtual Assistants? Could Desktop Computers Be The Best Home For Virtual Assistants?

An Open Letter To Everyone Tricked Into Fearing Artificial Intelligence An Open Letter To Everyone Tricked Into Fearing Artificial Intelligence

The Looming Threat of Artificial Unintelligence The Looming Threat of Artificial Unintelligence

CAPTCHA is Dead, But the AI Winter Lives On CAPTCHA is Dead, But the AI Winter Lives On