AI Scans Audio Recordings to Detect Voice Box Cancer

An acoustic feature of people’s voices, measured by AI from recordings, could indicate a form of throat cancer.

Written byAndrea Lius, PhD
| 2 min read
A girl holds her throat with her left hand. The region where the vocal cords would be is glowing red, indicating some strain, or even voice box cancer, which an AI-based tool may help detect.
Register for free to listen to this article
Listen with Speechify
0:00
2:00
Share

People often “lose their voice” after spending the night cheering for a local sports team or singing along to their favorite songs at a concert. Such overuse can temporarily injure the vocal cords, making people’s voices sound hoarse and strained. But there’s a much more alarming cause that can also alter a person’s voice: laryngeal cancer, which may be fatal if left untreated. Clinicians typically assess this condition using invasive—and at times, unavailable in underserved areas—methods such as endoscopy and biopsy.

In a recent study, researchers found that certain acoustic features could distinguish people with vocal cord lesions from those without based on their voice recordings.1 One of the characteristics that the researchers measured could even differentiate between benign and cancerous lesions. This work, led by Phillip Jenkins, a general surgery resident at Oregon Health and Science University, put forward a non-invasive and more accessible way to diagnose voice disorders. Their findings were published in Frontiers in Digital Health.

Continue reading below...

Like this story? Sign up for FREE Cancer updates:

Latest science news storiesTopic-tailored resources and eventsCustomized newsletter content
Subscribe

This study is part of the Bridge2AI program, a National Institutes of Health consortium, which aims to develop AI models to address key challenges in biomedical research. Jenkins’s team used an existing Bridge2AI-Voice dataset, which contains recordings of study participants reading the Rainbow Passage, a text that speech pathologists commonly use to assess American English speakers.2

Using AI, the researchers extracted acoustic features that have been previously associated with vocal cord pathologies: fundamental frequency, which indicates pitch and intonation; jitter, which measures fluctuations in fundamental frequency and signifies control of the vocal cord; shimmer, which quantifies fluctuations of sound wave amplitudes and may denote the presence of lesions that interfere with vocal cord movement; and harmonic-to-noise ratio (HNR), which can indicate improper closing of the vocal cords.3,4

When the researchers compared these features in 122 individuals with no voice disorders, 13 with benign vocal cord lesions, and 10 with laryngeal cancer, they found significant differences between the HNR standard deviation in people with benign vocal cord lesions and those without voice disorders as well as between those with benign vocal cord lesions and laryngeal cancer. This suggested that among the metrics that the researchers tested, HNR may be the most indicative.

Findings from this study demonstrate the possibility of using the voice to diagnose vocal cord lesions non-invasively. In the future, Jenkins and his colleagues hope to evaluate larger datasets and incorporate more variables, such as the size of vocal cord lesions.

Related Topics

Meet the Author

  • Image of Andrea Lius.

    Andrea Lius is an intern at The Scientist. She earned her PhD in pharmacology from the University of Washington. Besides science, she also enjoys writing short-form creative nonfiction.

    View Full Profile
Share
You might also be interested in...
Loading Next Article...
You might also be interested in...
Loading Next Article...
December digest cover image of a wooden sculpture comprised of multiple wooden neurons that form a seahorse.
December 2025, Issue 1

Wooden Neurons: An Artistic Vision of the Brain

A neurobiologist, who loves the morphology of cells, turns these shapes into works of art made from wood.

View this Issue
Stacks of cell culture dishes, plates, and flasks with pink cell culture medium on a white background.

Driving Innovation with Cell Culture Essentials

Merck
Stacks of cell culture dishes, plates, and flasks with pink cell culture medium on a white background.

Driving Innovation with Cell Culture Essentials

MilliporeSigma purple logo
Abstract wireframe sphere with colorful dots and connecting lines representing the complex cellular and molecular interactions within the tumor microenvironment.

Exploring the Inflammatory Tumor Microenvironment 

Cellecta logo
An image of a DNA sequencing spectrum with a radial blur filter applied.

A Comprehensive Guide to Next-Generation Sequencing

Integra Logo

Products

brandtech logo

BRANDTECH® Scientific Announces Strategic Partnership with Copia Scientific to Strengthen Sales and Service of the BRAND® Liquid Handling Station (LHS) 

Top Innovations 2026 Contest Image

Enter Our 2026 Top Innovations Contest

Biotium Logo

Biotium Expands Tyramide Signal Amplification Portfolio with Brighter and More Stable Dyes for Enhanced Spatial Imaging

Labvantage Logo

LabVantage Solutions Awarded $22.3 Million U.S Customs and Border Protection Contract to Deliver Next-Generation Forensic LIMS