The logic behind AI chatbots like ChatGPT is surprisingly basic

CHATBOTS MIGHT APPEAR to be complex conversationalists that respond like real people. But if you take a closer look, they are essentially an advanced version of a program that finishes your sentences by predicting which words will come next. Bard, ChatGPT, and other AI technologies are large language models—a kind of algorithm trained on exercises similar to the Mad Libs-style questions found on elementary school quizzes. More simply put, they are human-written instructions that tell computers how to solve a problem or make a calculation. In this case, the algorithm uses your prompt and any sentences it comes across to auto-complete the answer.

Systems like ChatGPT can use only what they’ve gleaned from the web. “All it’s doing is taking the internet it has access to and then filling in what would come next,” says Rayid Ghani, a professor in the machine learning department at Carnegie Mellon University.

Let’s pretend you plugged this sentence into an AI chatbot: “The cat sat on the ___.” First, the language model would have to know that the missing word needs to be a noun to make grammatical sense. But it can’t be any noun—the cat can’t sit on the “democracy,” for one. So the algorithm scours texts written by humans to get a sense of what cats actually rest on and picks out the most probable answer. In this scenario, it might determine the cat sits on the “laptop” 10 percent of the time, on the “table” 20 percent of the time, and on the “chair” 70 percent of the time. The model would then go with the most likely answer: “chair.”

The system is able to use this prediction process to respond with a full sentence. If you ask a chatbot, “How are you?” it will generate “I’m” based on the “you” from the question and then “good” based on what most people on the web reply when asked how they are.

The way these programs process information and arrive at a decision sort of resembles how the human brain behaves. “As simple as this task [predicting the most likely response] is, it actually requires an incredibly sophisticated knowledge of both how language works and how the world works,” says Yoon Kim, a researcher at MIT’s Computer Science and Artificial Intelligence Laboratory. “You can think of [chatbots] as algorithms with little knobs on them. These knobs basically learn on data that you see out in the wild,” allowing the software to create “probabilities over the entire English vocab.”

The beauty of language models is that researchers don’t have to rigidly define any rules or grammar for them to follow. An AI chatbot implicitly learns how to form sentences that make sense by consuming tokens, which are common sequences of characters grouped together taken from the raw text of books, articles, and websites. All it needs are the patterns and associations it finds among certain words or phrases.

But these tools often spit out answers that are imprecise or incorrect—and that’s partly because of how they were schooled. “Language models are trained on both fiction and nonfiction. They’re trained on every text that’s out on the internet,” says Kim. If MoonPie tweets that its cookies really come from the moon, ChatGPT might incorporate that in a write-up on the product. And if Bard concludes that a cat sat on the democracy after scanning this article, well, you might have to get more used to the idea.

Read more about life in the age of AI:

Or check out all of our PopSci+ stories.

The logic behind AI chatbots like ChatGPT is surprisingly basic

AI trained on AI churns out gibberish garbage AI trained on AI churns out gibberish garbage

Everyone is judging AI by these tests, but experts say they’re close to meaningless Everyone is judging AI by these tests, but experts say they’re close to meaningless

ChatGPT is, scientifically speaking, not funny ChatGPT is, scientifically speaking, not funny

AI ‘pastor’ leaves churchgoers surprised but uninspired AI ‘pastor’ leaves churchgoers surprised but uninspired

Radio host sues ChatGPT developer over allegedly libelous claims Radio host sues ChatGPT developer over allegedly libelous claims

The next version of ChatGPT is live—here’s what’s new The next version of ChatGPT is live—here’s what’s new

No, the AI chatbots (still) aren’t sentient No, the AI chatbots (still) aren’t sentient

Microsoft is betting ChatGPT will make Bing useful Microsoft is betting ChatGPT will make Bing useful

Sounding like an AI chatbot may hurt your credibility Sounding like an AI chatbot may hurt your credibility

Is ChatGPT groundbreaking? These experts say no. Is ChatGPT groundbreaking? These experts say no.

Google’s own upcoming AI chatbot draws from the power of its search engine Google’s own upcoming AI chatbot draws from the power of its search engine

6 ways ChatGPT is actually useful right now 6 ways ChatGPT is actually useful right now

ChatGPT is quietly co-authoring books on Amazon ChatGPT is quietly co-authoring books on Amazon

CEOs are already using ChatGPT to write their emails CEOs are already using ChatGPT to write their emails

Google is helping Wendy’s build an AI drive-thru Google is helping Wendy’s build an AI drive-thru

Microsoft’s take on AI-powered search struggles with accuracy Microsoft’s take on AI-powered search struggles with accuracy

The highlights and lowlights from the Google AI event The highlights and lowlights from the Google AI event

Microsoft changes Bing chatbot restrictions after much AI-generated weirdness Microsoft changes Bing chatbot restrictions after much AI-generated weirdness