AI Scans Audio Recordings to Detect Voice Box Cancer

An acoustic feature of people’s voices, measured by AI from recordings, could indicate a form of throat cancer.

Written byAndrea Lius, PhD
| 2 min read
A girl holds her throat with her left hand. The region where the vocal cords would be is glowing red, indicating some strain, or even voice box cancer, which an AI-based tool may help detect.
Register for free to listen to this article
Listen with Speechify
0:00
2:00
Share

People often “lose their voice” after spending the night cheering for a local sports team or singing along to their favorite songs at a concert. Such overuse can temporarily injure the vocal cords, making people’s voices sound hoarse and strained. But there’s a much more alarming cause that can also alter a person’s voice: laryngeal cancer, which may be fatal if left untreated. Clinicians typically assess this condition using invasive—and at times, unavailable in underserved areas—methods such as endoscopy and biopsy.

In a recent study, researchers found that certain acoustic features could distinguish people with vocal cord lesions from those without based on their voice recordings.1 One of the characteristics that the researchers measured could even differentiate between benign and cancerous lesions. This work, led by Phillip Jenkins, a general surgery resident at Oregon Health and Science University, put forward a non-invasive and more accessible way to diagnose voice disorders. Their findings were published in Frontiers in Digital Health.

This study is part of the Bridge2AI program, a National Institutes of Health consortium, which aims to develop AI models to address key challenges in biomedical research. Jenkins’s team used an existing Bridge2AI-Voice dataset, which contains recordings of study participants reading the Rainbow Passage, a text that speech pathologists commonly use to assess American English speakers.2

Using AI, the researchers extracted acoustic features that have been previously associated with vocal cord pathologies: fundamental frequency, which indicates pitch and intonation; jitter, which measures fluctuations in fundamental frequency and signifies control of the vocal cord; shimmer, which quantifies fluctuations of sound wave amplitudes and may denote the presence of lesions that interfere with vocal cord movement; and harmonic-to-noise ratio (HNR), which can indicate improper closing of the vocal cords.3,4

Continue reading below...

Like this story? Sign up for FREE Cancer updates:

Latest science news storiesTopic-tailored resources and eventsCustomized newsletter content
Subscribe

When the researchers compared these features in 122 individuals with no voice disorders, 13 with benign vocal cord lesions, and 10 with laryngeal cancer, they found significant differences between the HNR standard deviation in people with benign vocal cord lesions and those without voice disorders as well as between those with benign vocal cord lesions and laryngeal cancer. This suggested that among the metrics that the researchers tested, HNR may be the most indicative.

Findings from this study demonstrate the possibility of using the voice to diagnose vocal cord lesions non-invasively. In the future, Jenkins and his colleagues hope to evaluate larger datasets and incorporate more variables, such as the size of vocal cord lesions.

Related Topics

Meet the Author

  • Image of Andrea Lius.

    Andrea Lius is an intern at The Scientist. She earned her PhD in pharmacology from the University of Washington. Besides science, she also enjoys writing short-form creative nonfiction.

    View Full Profile
Share
You might also be interested in...
Loading Next Article...
You might also be interested in...
Loading Next Article...
Illustration of a developing fetus surrounded by a clear fluid with a subtle yellow tinge, representing amniotic fluid.
January 2026

What Is the Amniotic Fluid Composed of?

The liquid world of fetal development provides a rich source of nutrition and protection tailored to meet the needs of the growing fetus.

View this Issue
Human-Relevant In Vitro Models Enable Predictive Drug Discovery

Advancing Drug Discovery with Complex Human In Vitro Models

Stemcell Technologies
Redefining Immunology Through Advanced Technologies

Redefining Immunology Through Advanced Technologies

Ensuring Regulatory Compliance in AAV Manufacturing with Analytical Ultracentrifugation

Ensuring Regulatory Compliance in AAV Manufacturing with Analytical Ultracentrifugation

Beckman Coulter Logo
Conceptual multicolored vector image of cancer research, depicting various biomedical approaches to cancer therapy

Maximizing Cancer Research Model Systems

bioxcell

Products

Refeyn logo

Refeyn named in the Sunday Times 100 Tech list of the UK’s fastest-growing technology companies

Parse Logo

Parse Biosciences and Graph Therapeutics Partner to Build Large Functional Immune Perturbation Atlas

Sino Biological Logo

Sino Biological's Launch of SwiftFluo® TR-FRET Kits Pioneers a New Era in High-Throughout Kinase Inhibitor Screening

SPT Labtech Logo

SPT Labtech enables automated Twist Bioscience NGS library preparation workflows on SPT's firefly platform