Google's AI robot uses language models to write code

Writing working code can be a challenge. Even relatively easy languages like HTML require the coder to understand the specific syntax and available tools. Writing code to control robots is even more involved and often has multiple steps: There’s code to detect objects, code to trigger the actuators that move the robot’s limbs, code to specify when the task is complete, and so on. Something as simple as programming a robot to pick up a yellow block instead of a red one is impossible if you don’t know the coding language the robot runs on.

But Google’s robotics researchers are exploring a way to fix that. They’ve developed a robot that can write its own programming code based on natural language instructions. Instead of having to dive into a robot’s configuration files to change block_target_color from #FF0000 to #FFFF00, you could just type “pick up the yellow block” and the robot would do the rest.

Code as Policies (or CaP for short) is a coding-specific language model developed from Google’s Pathways Language Model (PaLM) to interpret the natural language instructions and turn them into code it can run. Google’s researchers trained the model by giving it examples of instructions (formatted as code comments written by the developers to explain what the code does for anyone reviewing it) and the corresponding code. From that, it was able to take new instructions and “autonomously generate new code that re-composes API calls, synthesizes new functions, and expresses feedback loops to assemble new behaviors at runtime,” Google engineers explained in a blog post published this week, In other words, given a comment-like prompt, it could come up with some probable robot code. Read the preprint of their work here.

To get CaP to write new code for specific tasks, the team provided it with “hints,” like what APIs or tools were available to it, and a few instructions-to-code paired examples. From that, it was able to write new code for new instructions. It does this using “hierarchical code generation” which prompts it to “recursively define new functions, accumulate their own libraries over time, and self-architect a dynamic codebase.” This means that given one set of instructions once, it can develop some code that it can then repurpose for similar instructions later on.

CaP can also use the arithmetic operations and logic of specific languages. For example, a model trained on Python can use the appropriate if/else and for/while loops when needed, and use third-party libraries for additional functionality. It can also turn ambiguous descriptions like “faster” and “to the left” into the precise numerical values necessary to perform the task. And because CaP is built on top of a regular language model, it has a few features unrelated to code—like understanding emojis and non-English languages.

For now, CaP is still very much limited in what it can do. It relies on the language model it is based on to provide context to its instructions. If they don’t make sense or use parameters it doesn’t support, it can’t write code. Similarly, it apparently can only manage a handful of parameters in a single prompt; more complex sequences of actions that require dozens of parameters just aren’t possible. There are also safety concerns: Programming a robot to write its own code is a bit like Skynet. If it thinks the best way to achieve a task is to spin around really fast with its arm extended and there is a human nearby, somebody could get hurt.

Still, it’s incredibly exciting research. With robots, one of the hardest tasks is generalizing their trained behaviors. Programming a robot to play ping-pong, doesn’t make it capable of playing other games like baseball or tennis. Although CaP is still miles away from such broad real world applications, it does allow a robot to perform a wide range of complex robot tasks without task-specific training. That’s a big step in the direction of one day being able to teach a robot that can play one game how to play another—without having to break everything down to new human-written code.

Win the Holidays with PopSci's Gift Guides

25 enchanting images from the Wildlife Photographer of the Year People’s Choice awards 25 enchanting images from the Wildlife Photographer of the Year People’s Choice awards

Are weight-loss drugs contributing to a fall in the obesity rate? Are weight-loss drugs contributing to a fall in the obesity rate?

MIT’s new robot takes orders from your muscles MIT’s new robot takes orders from your muscles

Google’s Robots Are Learning How To Pick Things Up Google’s Robots Are Learning How To Pick Things Up

How Google’s New A.I. Microchips Take A Page From Bitcoin Miners How Google’s New A.I. Microchips Take A Page From Bitcoin Miners

The Instagram For Google’s ‘DeepDream’ Is Finally Here The Instagram For Google’s ‘DeepDream’ Is Finally Here

Facebook Open-Sources The Computers Behind Its Artificial Intelligence Facebook Open-Sources The Computers Behind Its Artificial Intelligence

Google’s New AI Plays Atari Games As Well As You Can, Or Better Google’s New AI Plays Atari Games As Well As You Can, Or Better

The best ways to steal signs in baseball The best ways to steal signs in baseball

Artificial intelligence could improve psychiatric care Artificial intelligence could improve psychiatric care

The military wants their robots to be better listeners The military wants their robots to be better listeners

Work faster in Google Docs and Sheets by creating your own shortcuts Work faster in Google Docs and Sheets by creating your own shortcuts

Google’s new Nest Hub can track your sleep without a wearable or camera Google’s new Nest Hub can track your sleep without a wearable or camera

5 Google search tips for the most accurate results 5 Google search tips for the most accurate results

These three robots can teach kids how to code These three robots can teach kids how to code

Last week in tech: A gaggle of new Google gadgets, more info on the Facebook hack, and robot parkour Last week in tech: A gaggle of new Google gadgets, more info on the Facebook hack, and robot parkour

Roomba’s new robotic vacuum remembers your home’s layout for quicker cleaning Roomba’s new robotic vacuum remembers your home’s layout for quicker cleaning

All the cool new stuff from Google’s 2018 I/O developers conference All the cool new stuff from Google’s 2018 I/O developers conference

Share

Win the Holidays with PopSci's Gift Guides