CDC's COVID death tracker got a major software update

In 2020 and 2021, COVID-19 became the third leading cause of death in the US. This May, the country passed the grim milestone of 1 million known COVID deaths. Although fewer people are dying from the virus now than during the height of the Omicron surge this winter or previous waves, new strains have continued to take lives.

As the pandemic drags on, understanding how many people are dying and who is most vulnerable remains crucial for efforts to avert further deaths. To that end, the Centers for Disease Control and Prevention (CDC) recently updated the software it uses to process all of the country’s mortality data. The change, powered by advanced computing techniques like machine learning, could supply health officials and the public with more up-to-date information about the disease.

“Civil registration of births and deaths and understanding causes of death are really key to a functioning health system,” says Emily Smith, an assistant professor of global health at George Washington University. “There are a lot of ways to use this information.”

Tracking the leading causes of death in a community and identifying where those deaths are concentrated helps public health officials direct resources, she adds. During a crisis like the COVID pandemic, having prompt information is particularly crucial. But the national statistics system has been slow to process and post mortality figures. When the US passed a million deaths from the virus earlier this year, the CDC’s tracker was still weeks behind.

“If the data aren’t as timely, then our situational awareness degrades by a week or two or maybe three.”
Robert Anderson, CDC’s National Center for Health Statistics

“Effective epidemic response is getting the right resources—whether that’s drugs or vaccines or prevention programs—to the right people at the right time,” Smith says. “Data helps us do that.”

The CDC upgrade represents an important step forward. “It’s great to see the US moving ahead with this,” Smith notes. “More transparent, faster data is a great advance.”

Coding COVID-19

For decades, the CDC has relied on computers to analyze death certificates and assign four-digit codes to each report based on the underlying cause so they can be tracked by the National Vital Statistics System.

However, only about 70 to 75 percent of the country’s death certificates could be coded automatically; the rest were flagged for review, which means a staff member would have to input the cause of death into the system by hand. “When you’re dealing with 2 to 3 million deaths [every year], 25 to 30 percent of records is quite a substantial number and requires quite a lot of resources,” says Robert Anderson, chief of the Mortality Statistics Branch at the National Center for Health Statistics.

The updated cause of death coding system, known as MedCoder, can handle a greater proportion of these records: It currently codes 85 percent of records automatically, and with continued improvements, “has the potential to code better than 90 percent of records,” Anderson says. “These records can be autocoded in minutes, whereas manual review might take a couple of weeks,” he adds. “It just means more information is available in a more timely fashion.”

MedCoder is more adept than past systems at dealing with variations in the terms that physicians, medical examiners, and coroners use to describe mortalities, Anderson explains. The computer assigns one of 10,000 possible codes for causes of death to a record. For example, when COVID is mentioned on a death certificate, it chooses U07.1. To improve the results, Anderson and his team used machine learning techniques that drew upon a decade’s worth of national death certificate data to train MedCoder to recognize mistakes and other aberrations. So, when a doctor fills out the death certificate with “Coronavirus 2019,” “SARS-CoV-2,” “Delta variant,” or another name for the disease, the computer still codes it as U07.1. “The old system would say, ‘I don’t find that term in the dictionary,’ and kick it out for somebody to look at,” Anderson explains. “[Now] the computer says, ‘Okay, I know what to do with this and what code to assign.”

While installing the upgrades between June 6 to 24, the National Center for Health Statistics paused its processing of death data reported by states and didn’t refresh the COVID surveillance datasets on the National Vital Statistics System’s public page. Counts from weeks earlier in 2022 may temporarily seem low while the system catches up and reprocesses these records, the agency’s website notes.

“Once we get over this backlog here the system is going to function pretty much the way the old system did,” Anderson says. “I don’t want people to worry that the data that we’re putting out now is not comparable to the data we were putting out before. It is comparable; it’s just going to be a little more timely.”

Mortality numbers matter

It’s unusual for death certificates to mention which variant of SARS-CoV-2 afflicted the deceased person. But looking for patterns in more precise mortality data can help health experts understand how dangerous a new strain might be—and whether extra precautions are needed.

“If deaths are rising it increases the urgency,” Anderson says. “If the data aren’t as timely, then our situational awareness degrades by a week or two or maybe three.”

It’s also possible that having speedier data would have allowed the US to recognize that it had reached 1 million COVID-19 deaths sooner. “Having better real-time data hypothetically should matter on a lot of different fronts,” Smith says. “It matters for public perception; it matters for political will.”

Reported deaths tend to lag behind other warning signs such as a rise in positive COVID tests or hospitalizations. However, these measures can be difficult to interpret. An uptick in hospitalizations can indicate that more people are becoming seriously ill, but might not capture the full scope of the problem because not everybody with severe disease has access to hospitals.

“Those are softer outcomes that incorporate both the severity of the disease and other social and economic factors, whereas death is a hard outcome.” Smith says. “Mortality is the ultimate indicator—it’s black and white.”

Coding COVID-19

Mortality numbers matter

Win the Holidays with PopSci's Gift Guides

25 enchanting images from the Wildlife Photographer of the Year People’s Choice awards 25 enchanting images from the Wildlife Photographer of the Year People’s Choice awards

Are weight-loss drugs contributing to a fall in the obesity rate? Are weight-loss drugs contributing to a fall in the obesity rate?

Where are we most likely to catch COVID-19? Where are we most likely to catch COVID-19?

COVID-19 is shortening US life expectancy—especially for people of color COVID-19 is shortening US life expectancy—especially for people of color

2020 is shaping up to be America’s deadliest year in recent memory 2020 is shaping up to be America’s deadliest year in recent memory

You can get COVID-19 and the flu at the same time You can get COVID-19 and the flu at the same time

Scientists may have confirmed you can catch COVID-19 twice Scientists may have confirmed you can catch COVID-19 twice

The current state of the pandemic in five graphs The current state of the pandemic in five graphs

The racial disparities in COVID cases are even more striking than we thought The racial disparities in COVID cases are even more striking than we thought

On surviving—and leaving—prison during a pandemic On surviving—and leaving—prison during a pandemic

Native American Nations are even more vulnerable to COVID-19 Native American Nations are even more vulnerable to COVID-19

For the second time ever, someone was spontaneously cured of HIV For the second time ever, someone was spontaneously cured of HIV

There’s a new Delta variant. Here’s why you shouldn’t panic. There’s a new Delta variant. Here’s why you shouldn’t panic.

Millions of people are vaccinated—less than 8,000 of them have died from the virus Millions of people are vaccinated—less than 8,000 of them have died from the virus

The FDA advisory panel recommends Moderna boosters for certain at-risk groups The FDA advisory panel recommends Moderna boosters for certain at-risk groups

Aspirin has long been prescribed to prevent heart attacks. Now experts say it shouldn’t. Aspirin has long been prescribed to prevent heart attacks. Now experts say it shouldn’t.

A comprehensive guide to coronavirus symptoms A comprehensive guide to coronavirus symptoms

E. coli isn’t always bad—it’s actually an unlikely research hero E. coli isn’t always bad—it’s actually an unlikely research hero

Infographics have helped keep us alive for centuries Infographics have helped keep us alive for centuries

Why a decline in US birth rates could actually help our economy Why a decline in US birth rates could actually help our economy

We’re creeping back up to mid ‘90s-level gun death rates We’re creeping back up to mid ‘90s-level gun death rates

New data shows that Fentanyl kills more people than heroin New data shows that Fentanyl kills more people than heroin

The benchmark for human diversity is based on one man’s genome. A new tool could change that. The benchmark for human diversity is based on one man’s genome. A new tool could change that.

For better sleep, borrow the bedtime routine of a toddler For better sleep, borrow the bedtime routine of a toddler

Eleven gifts for the hypochondriac in your life Eleven gifts for the hypochondriac in your life

These ‘experts’ once said women couldn’t play football. Boy were they wrong. These ‘experts’ once said women couldn’t play football. Boy were they wrong.

Share

Coding COVID-19

Mortality numbers matter

Win the Holidays with PopSci's Gift Guides