New Database Expands Number of Estimated Human Protein-Coding Genes

Some scientists are not yet convinced that the list is accurate.

| 2 min read

Register for free to listen to this article
Listen with Speechify
0:00
2:00
Share

ISTOCK, BLACKJACK3D

The human genome may contain more protein-coding genes than prior analyses suggested. A study published last month (May 29) on BioRxiv provides an expanded database of approximately 5,000 novel genes—of those, around 1,000 code for proteins, expanding the estimated number of protein-coding genes from around 20,000 to 21,000.

“If people like our gene list, then maybe a couple years from now we’ll be the arbiter of human genes,” study coauthor Steven Salzberg, a computational biologist at Johns Hopkins University, tells Nature.

Salzberg and his colleagues compiled a catalog of human genes and transcripts using data from the Genotype-Tissue Expression (GTEx) project, in which scientists sequenced the RNA from various tissues in hundreds of human subjects. By comparing the sequenced RNA to the human genome, the ...

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to more than 35 years of archives, as well as TS Digest, digital editions of The Scientist, feature stories, and much more!
Already a member? Login Here

Keywords

Meet the Author

  • Diana Kwon

    Diana is a freelance science journalist who covers the life sciences, health, and academic life.
Share
May digest 2025 cover
May 2025, Issue 1

Study Confirms Safety of Genetically Modified T Cells

A long-term study of nearly 800 patients demonstrated a strong safety profile for T cells engineered with viral vectors.

View this Issue
iStock

TaqMan Probe & Assays: Unveil What's Possible Together

Thermo Fisher Logo
Meet Aunty and Tackle Protein Stability Questions in Research and Development

Meet Aunty and Tackle Protein Stability Questions in Research and Development

Unchained Labs
Detecting Residual Cell Line-Derived DNA with Droplet Digital PCR

Detecting Residual Cell Line-Derived DNA with Droplet Digital PCR

Bio-Rad
How technology makes PCR instruments easier to use.

Making Real-Time PCR More Straightforward

Thermo Fisher Logo

Products

fujirebio-square-logo

Fujirebio Receives Marketing Clearance for Lumipulse® G pTau 217/ β-Amyloid 1-42 Plasma Ratio In-Vitro Diagnostic Test

The Scientist Placeholder Image

Biotium Launches New Phalloidin Conjugates with Extended F-actin Staining Stability for Greater Imaging Flexibility

Leica Microsystems Logo

Latest AI software simplifies image analysis and speeds up insights for scientists

BioSkryb Genomics Logo

BioSkryb Genomics and Tecan introduce a single-cell multiomics workflow for sequencing-ready libraries in under ten hours