One Million Genomic Datasets

Publicly accessible databases now store nearly 1 million gene-expression datasets, giving researchers a robust resource for discovery.

Written byBob Grant
| 1 min read

Register for free to listen to this article
Listen with Speechify
0:00
1:00
Share

With more and more researchers conducting experiments that tease apart the functions of the thousands of genes that make up the genomes of mice, rats, and humans, the number of gene-expression datasets deposited in publicly accessible databases will soon reach 1,000,000, according to an analysis done by Nature. Adding together the number of datasets in the two major public data repositories, the National Center for Biotechnology Information's Gene Expression Omnibus and the gene-expression database at the European Bioinformatics Institute, the milestone should be reached within the next month.

"Some time in the next few weeks, the number of deposited data sets will top one million," Monya Baker wrote in Nature last week.

Gene-expression data can help scientists test preliminary hypotheses about which genes may contribute to the development of certain diseases, leading to potential drug targets. For example, a researcher could comb public data from several studies of people or ...

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to more than 35 years of archives, as well as TS Digest, digital editions of The Scientist, feature stories, and much more!
Already a member? Login Here

Related Topics

Meet the Author

  • From 2017 to 2022, Bob Grant was Editor in Chief of The Scientist, where he started in 2007 as a Staff Writer. Before joining the team, he worked as a reporter at Audubon and earned a master’s degree in science journalism from New York University. In his previous life, he pursued a career in science, getting a bachelor’s degree in wildlife biology from Montana State University and a master’s degree in marine biology from the College of Charleston in South Carolina. Bob edited Reading Frames and other sections of the magazine.

    View Full Profile
Share
Illustration of a developing fetus surrounded by a clear fluid with a subtle yellow tinge, representing amniotic fluid.
January 2026, Issue 1

What Is the Amniotic Fluid Composed of?

The liquid world of fetal development provides a rich source of nutrition and protection tailored to meet the needs of the growing fetus.

View this Issue
Skip the Wait for Protein Stability Data with Aunty

Skip the Wait for Protein Stability Data with Aunty

Unchained Labs
Graphic of three DNA helices in various colors

An Automated DNA-to-Data Framework for Production-Scale Sequencing

illumina
Exploring Cellular Organization with Spatial Proteomics

Exploring Cellular Organization with Spatial Proteomics

Abstract illustration of spheres with multiple layers, representing endoderm, ectoderm, and mesoderm derived organoids

Organoid Origins and How to Grow Them

Thermo Fisher Logo

Products

Brandtech Logo

BRANDTECH Scientific Introduces the Transferpette® pro Micropipette: A New Twist on Comfort and Control

Biotium Logo

Biotium Launches GlycoLiner™ Cell Surface Glycoprotein Labeling Kits for Rapid and Selective Cell Surface Imaging

Colorful abstract spiral dot pattern on a black background

Thermo Scientific X and S Series General Purpose Centrifuges

Thermo Fisher Logo
Abstract background with red and blue laser lights

VANTAstar Flexible microplate reader with simplified workflows

BMG LABTECH