A look at Adobe's new 2D to 3D image generator

Today at Adobe MAX, the company’s annual creativity conference, Adobe will preview a new technology called “Beyond the Seen” that uses artificial intelligence to extend the boundaries of two-dimensional images and even turn them into immersive three-dimensional scenes. While just a demonstration, it shows how AI image generators designed for specific purposes could have far reaching commercial and artistic applications.

The image generator works by taking a landscape or photograph from inside a building and expanding it into a full 360-degree spherical panorama around the camera. Of course, it can’t know what’s actually behind the camera, so it uses machine learning to create a plausible and seamless environment—whether the input image is of a mountain landscape or the interior of a concert hall. Adobe’s algorithms can also estimate the 3D geometry of the new environment, which enables the view point to be changed, and even for the camera to appear to move around the environment.

While image extension or out-painting isn’t new, Adobe’s AI generator is the first to be built exclusively around it. For example, DALL-E 2 allows users to extend their images in small blocks, while Stable Diffusion requires a work around.

Adobe’s AI image generator is a little different from more general image generators like DALL-E 2 and Stable Diffusion in a couple of key ways. First, it’s trained on a much more limited dataset with a specific purpose in mind. DALL-E 2 and Stable Diffusion were trained on billions of text-image pairs that cover every concept from avocados and Avril Lavigne, to zebras and Zendaya. Adobe’s generator was trained exclusively on a dataset of roughly 250,000 high-resolution 360-degree panoramas. This means it’s great at generating realistic environments from seed images, but it has no text-to-image features (in other words, you can’t enter a text prompt and get a weird result) or any other general generation features. It’s a tool with a specific job. However, the images it outputs are significantly larger.

Users can also turn images into panoramas with the AI tool. *Adobe*

Adobe’s generator currently uses an artificial intelligence technique called a General Adversarial Network, or GAN, and not a diffusion model. GANs work by using two neural networks against each other. The Generator is responsible for creating new outputs, and the Discriminator has to guess whether any image it is presented with is an output from the Generator or an actual image from the training set. As the Generator gets better at creating realistic images, it gets better at fooling the Discriminator, and thus a functioning image generation algorithm is created.

Meanwhile, diffusion models, which DALL-E 2 and Stable Diffusion use, start with random noise and edit it to create a plausible image. Recent research has shown that they can produce more realistic results than GANs. Given that, Gavin Miller, VP and Head of Adobe Research, tells PopSci the algorithm could be adapted to use a diffusion model before it was commercially released.

Although this is still in early development, Adobe has highlighted a couple of potential uses for the technology. While there are the claims about the Metaverse and generating 3D worlds from 2D snapshots, it’s the regular image extension features that are likely to prove valuable first. One example Adobe demonstrated in the demo video is how its algorithm allowed for “specular” (or shiny) rendered objects to be inserted into an image. The AI generator was used to extrapolate what could be behind the camera and above the object in order to create realistic reflections off of that shiny object. This is the kind of thing that would allow architects and interior designers to more easily create accurate-seeming renderings for their projects.

Similarly, it would allow photographers and videographers to expand the background of their images in a more natural way. Miller explained that the content aware tools, which have been in Adobe’s apps like Photoshop since 2010, are able to generate naturalistic texture, while the new generative models are capable of creating both texture and structure.

While there is no word yet on when this technology will be available to the public, revealing it today is all “part of a larger agenda towards more generative technologies,” that Adobe is pursuing, Miller says. It’s always been possible to create 360-degree panoramas with hardware, but soon it will be possible to create realistic seeming ones using just software. And that really could change things—and yes, maybe make it possible for small creators to make metaverse-adjacent experiences.

Win the Holidays with PopSci's Gift Guides

NASA is finishing its first off-world accident report NASA is finishing its first off-world accident report

Fire likely killed a group of Stone Age humans uncovered in Ukraine Fire likely killed a group of Stone Age humans uncovered in Ukraine

Facebook is working on AI tools to fix photos ruined by blinking Facebook is working on AI tools to fix photos ruined by blinking

Let this AI bot turn your words into vaguely-recognizable pictures Let this AI bot turn your words into vaguely-recognizable pictures

Adobe is training AI to be a better photo and video editor than you Adobe is training AI to be a better photo and video editor than you

This AI-powered hearing aid improves as you wear it This AI-powered hearing aid improves as you wear it

Photoshop’s Neural Filters can alter people’s expressions in convincing—and nightmarish—ways Photoshop’s Neural Filters can alter people’s expressions in convincing—and nightmarish—ways

Photoshop will soon use AI to add dramatic skies to your boring photos Photoshop will soon use AI to add dramatic skies to your boring photos

Artificial intelligence creates better, faster MRI scans Artificial intelligence creates better, faster MRI scans

Watch a computer clobber a human pilot in a simulated fighter jet duel Watch a computer clobber a human pilot in a simulated fighter jet duel

The Army’s new tool for analyzing bomb shrapnel could lead to better body armor The Army’s new tool for analyzing bomb shrapnel could lead to better body armor

Microsoft’s new Mesh platform turns your remote coworkers into holograms Microsoft’s new Mesh platform turns your remote coworkers into holograms

The top 10 cities for biking probably aren’t where you think The top 10 cities for biking probably aren’t where you think

Designing spaces with marginalized people in mind makes them better for everyone Designing spaces with marginalized people in mind makes them better for everyone

To move cargo with less mess, these ships unload themselves To move cargo with less mess, these ships unload themselves

4 ways to run Android apps and games on your computer 4 ways to run Android apps and games on your computer

How to shoot great Instagram photos How to shoot great Instagram photos

Spotify has a major audio-quality upgrade coming later this year Spotify has a major audio-quality upgrade coming later this year

Share

Win the Holidays with PopSci's Gift Guides