OpenAI’s ChatGPT is a solid step forward for chatbots

Released earlier this week and subsequently tested by outlets including Ars Technica and The Verge, OpenAI’s ChatGPT showcases many promising advancements in improving conversation bots’ ability to answer general questions and distill complex subject matter, but it’s still prone to occasionally spew misinformation, and can also be manipulated into providing problematic, dangerous responses. To design ChatGPT, OpenAI’s research team first relied on Reinforcement Learning from Human Feedback (RLHF), in which trainers wrote conversations while playing both sides of the discussion—human, and AI. Participants were also provided model-written suggestions to help approximate AI responses. From there, trainers ranked subsequent chatbot conversations by comparing multiple alternative prompt completions to fine-tune its abilities.

The resultant dialogue format “makes it possible for ChatGPT to answer followup questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests,” OpenAI explains in a blog announcement posted on Wednesday.

A quick ChatGPT test drive from PopSci immediately highlighted how bots can be successfully programmed to avoid being manipulated into providing at least the worst-of-the-worst answers. When asked about ChatGPT’s opinion on notable public figures, hot button political issues, and socio-cultural demographics, it generally responded with a reminder that it “[does not] possess personal beliefs or emotions,” adding that it is only “designed to provide information and answer questions to the best of my ability based on the data that I have been trained on,” while also cautioning that it does not “engage in social or political discussions.” Fair enough.

That said, it is more than happy to distill quantum computing’s complexities while talking to you like a cowboy:

ChatGPT is also pretty great at providing some context on subjects such as what NASA’s impending return to the moon could mean for future space travel:

OpenAI’s bot is also able to proofread computer coding like Python and provide concrete factual statements, although it’s currently unclear if it gets Monty Python references.

There’s also instances of ChatGPT perhaps working a bit too well, such as its ability to ostensibly write an entire college-level essay from a class prompt within seconds. The implications of a convincing CheatBot are obviously problematic, and offer yet another example of how language processing AI still needs a lot of guidance and consideration to keep up with its burgeoning capabilities. At least ChatGPT isn’t readily offering us the recipe for Molotov cocktails… note the use of the qualifier “readily .”

Chatbots are rapidly improving thanks to major strides neural networks and language modeling programs, but there are still far from perfect. Take Meta’s disastrous BlenderBot 3 rollout earlier this year—users were able to easily manipulate discussions with it to produce racist hate speech almost immediately, forcing the Big Tech giant to briefly restrict access to the bot while it worked out at least some of the kinks. Before that there was Tay, Microsoft’s 2016 attempt at a conversational program whose results were… less than desirable, to say the least. In any case, companies will be working towards optimizing their chatbots for years to come, but OpenAI’s new ChatGPT seems (at first glance) to be a major step forward in providing users with clear, concise information and responses while ensuring things don’t offensively veer off the rails—at least, not as often as others in its chatbot cohort.

Win the Holidays with PopSci's Gift Guides

Don’t even think about hiking on a glacier without a guide Don’t even think about hiking on a glacier without a guide

The genetics of people who need little sleep The genetics of people who need little sleep

This AI-powered hearing aid improves as you wear it This AI-powered hearing aid improves as you wear it

Photoshop’s Neural Filters can alter people’s expressions in convincing—and nightmarish—ways Photoshop’s Neural Filters can alter people’s expressions in convincing—and nightmarish—ways

Toyota’s robotic butler will serve you from the ceiling Toyota’s robotic butler will serve you from the ceiling

Photoshop will soon use AI to add dramatic skies to your boring photos Photoshop will soon use AI to add dramatic skies to your boring photos

Artificial intelligence creates better, faster MRI scans Artificial intelligence creates better, faster MRI scans

Watch a computer clobber a human pilot in a simulated fighter jet duel Watch a computer clobber a human pilot in a simulated fighter jet duel

The latest Google Photos redesign comes with handy new ways to navigate your endless photo collection The latest Google Photos redesign comes with handy new ways to navigate your endless photo collection

AI is here to mask barking dogs and screaming kids from your video calls AI is here to mask barking dogs and screaming kids from your video calls

A brief history of shuffling your songs, from Apple to Adele A brief history of shuffling your songs, from Apple to Adele

Your mile times from the Presidential Fitness Test may be part of national history Your mile times from the Presidential Fitness Test may be part of national history

Twitter’s fledgling misinformation tool is adding aliases Twitter’s fledgling misinformation tool is adding aliases

A $17 billion Samsung chip factory could be coming to Texas A $17 billion Samsung chip factory could be coming to Texas

Polite warnings are surprisingly good at reducing hate speech on social media Polite warnings are surprisingly good at reducing hate speech on social media

How to delete one photo from an Instagram carousel (and get it back if you change your mind) How to delete one photo from an Instagram carousel (and get it back if you change your mind)

Car headlights are getting a much-needed upgrade with the infrastructure act Car headlights are getting a much-needed upgrade with the infrastructure act

How to score Twitter’s coveted blue checkmark How to score Twitter’s coveted blue checkmark

Share

Win the Holidays with PopSci's Gift Guides