Technical Bias Widespread in RNA-Seq Datasets

Genes that are exceptionally long or short are overrepresented in some published reports, which can lead to misinterpreted results.

Written byDiana Kwon
| 3 min read
rna-seq rna sequencing bias dataset transcript

Register for free to listen to this article
Listen with Speechify
0:00
3:00
Share

ABOVE: © ISTOCK.COM, SHUOSHU

RNA sequencing is a popular tool among molecular biologists, because it allows them to examine gene expression patterns in DNA. However, the technique is susceptible to experimental artifacts, which can lead to misinterpreted findings. According to a study published last week (November 12) in PLOS Biology, one such bias, which is associated with gene length, is widespread in many published datasets.

Rani Elkon, a bioinformatician at Tel Aviv University in Israel, says that his team was analyzing RNA sequencing (RNA-seq) datasets for a project aimed at infering the co-regulation of genes by examining their co-expression across many different biological conditions when they stumbled upon a puzzling finding: Genes coding for proteins in the ribosome or other translation-related machinery—which are exceptionally short—and genes coding for extracellular matrix proteins such as collagen—which are exceptionally long—kept popping up in their analyses. “In many different datasets, genes that were upregulated ...

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to more than 35 years of archives, as well as TS Digest, digital editions of The Scientist, feature stories, and much more!
Already a member? Login Here

Related Topics

Meet the Author

  • Diana is a freelance science journalist who covers the life sciences, health, and academic life. She’s a regular contributor to The Scientist and her work has appeared in several other publications, including Scientific American, Knowable, and Quanta. Diana was a former intern at The Scientist and she holds a master’s degree in neuroscience from McGill University. She’s currently based in Berlin, Germany.

    View Full Profile
Share
December digest cover image of a wooden sculpture comprised of multiple wooden neurons that form a seahorse.
December 2025, Issue 1

Wooden Neurons: An Artistic Vision of the Brain

A neurobiologist, who loves the morphology of cells, turns these shapes into works of art made from wood.

View this Issue
Alzheimer: Phosphorylation of Tau proteins leads to disintegration of microtubuli in a neuron axon stock photo

Advancing Alzheimer’s Disease Detection with Brain-Derived pTau217 Assays

Alamar Biosciences logo
Stacks of cell culture dishes, plates, and flasks with pink cell culture medium on a white background.

Driving Innovation with Cell Culture Essentials

Merck
Stacks of cell culture dishes, plates, and flasks with pink cell culture medium on a white background.

Driving Innovation with Cell Culture Essentials

MilliporeSigma purple logo
Human iPSC-derived Models for Brain Disease Research

Human iPSC-derived Models for Neurodegenerative Disease Research

Fujifilm

Products

Beckman Logo

Beckman Coulter Life Sciences Introduces the Biomek i3 Benchtop Liquid Handler, a Small but Mighty Addition to its Portfolio of Automated Workstations

brandtech logo

BRANDTECH® Scientific Announces Strategic Partnership with Copia Scientific to Strengthen Sales and Service of the BRAND® Liquid Handling Station (LHS) 

Top Innovations 2026 Contest Image

Enter Our 2026 Top Innovations Contest

Biotium Logo

Biotium Expands Tyramide Signal Amplification Portfolio with Brighter and More Stable Dyes for Enhanced Spatial Imaging