Save our Data!

Here's how to prevent critical biological data repositories from disappearing into the ether

Written byThe Plant Genome Database Working Group
| 5 min read

Register for free to listen to this article
Listen with Speechify
0:00
5:00
Share

The public research sector has invested hundreds of millions of dollars in grants to generate large-scale biological data sets, most notably in the field of genomics. These large data sets include genome sequences, gene expression array results, extensive surveys of sequence variation within populations, and findings from protein-protein interaction studies. Such data sets are housed in many online repositories, ranging in size and scope from small single-organism databases, to large multiorganism databases such as the comprehensive GenBank sequence database.

The problem is that the majority of these databases were established by the initiative of individual researchers, and their longevity is constrained by the continuing enthusiasm of their founders and their prospects for long-term funding. In other cases, online databases were established by individual researchers, who, in the absence of an obvious repository for the data they were generating, released their data through their own Web or FTP sites. When the ...

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to digital editions of The Scientist, as well as TS Digest, feature stories, more than 35 years of archives, and much more!
Already a member? Login Here
Image of a woman with her hands across her stomach. She has a look of discomfort on her face. There is a blown up image of her stomach next to her and it has colorful butterflies and gut bacteria all swarming within the gut.
November 2025, Issue 1

Why Do We Feel Butterflies in the Stomach?

These fluttering sensations are the brain’s reaction to certain emotions, which can be amplified or soothed by the gut’s own “bugs".

View this Issue
Olga Anczukow and Ryan Englander discuss how transcriptome splicing affects immune system function in lung cancer.

Long-Read RNA Sequencing Reveals a Regulatory Role for Splicing in Immunotherapy Responses

Pacific Biosciences logo
Research Roundtable: The Evolving World of Spatial Biology

Research Roundtable: The Evolving World of Spatial Biology

Conceptual cartoon image of gene editing technology

Exploring the State of the Art in Gene Editing Techniques

Bio-Rad
Conceptual image of a doctor holding a brain puzzle, representing Alzheimer's disease diagnosis.

Simplifying Early Alzheimer’s Disease Diagnosis with Blood Testing

fujirebio logo

Products

Eppendorf Logo

Research on rewiring neural circuit in fruit flies wins 2025 Eppendorf & Science Prize

Evident Logo

EVIDENT's New FLUOVIEW FV5000 Redefines the Boundaries of Confocal and Multiphoton Imaging

Evident Logo

EVIDENT Launches Sixth Annual Image of the Year Contest

10x Genomics Logo

10x Genomics Launches the Next Generation of Chromium Flex to Empower Scientists to Massively Scale Single Cell Research