Predicting promoters

Finding the beginning of genes within genomic sequence presents a formidable challenge to projects to annotate the human genome sequence. In the Advanced Online Publication of Nature Genetics Ramana Davuluri and colleagues at Cold Spring Harbor Laboratory, in New York describe a bioinformatic strategy to predict gene promoters and first exons (Nat Genet 2001, DOI: 10.1038/ng780).They developed a new program, called FirstEF, that attempts to predict the starts of genes. They collected over two th

| 1 min read

Register for free to listen to this article
Listen with Speechify
0:00
1:00
Share

Finding the beginning of genes within genomic sequence presents a formidable challenge to projects to annotate the human genome sequence. In the Advanced Online Publication of Nature Genetics Ramana Davuluri and colleagues at Cold Spring Harbor Laboratory, in New York describe a bioinformatic strategy to predict gene promoters and first exons (Nat Genet 2001, DOI: 10.1038/ng780).

They developed a new program, called FirstEF, that attempts to predict the starts of genes. They collected over two thousand first-exons to use as a training dataset, and characterized those that were associated with a CpG island. FirstEF is designed to recognize CpG islands, promoter regions and first splice-donor sites.

The program could predict 86% of all first exons with about 17% false positives (92% of CpG-related first-exons and 74% of non-CpG exons). FirstEF gave a similar performance when tested against the finished sequences for human chromosomes 21 and 22.

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to more than 35 years of archives, as well as TS Digest, digital editions of The Scientist, feature stories, and much more!
Already a member? Login Here

Meet the Author

  • Jonathan Weitzman

    This person does not yet have a bio.
Share
May digest 2025 cover
May 2025, Issue 1

Study Confirms Safety of Genetically Modified T Cells

A long-term study of nearly 800 patients demonstrated a strong safety profile for T cells engineered with viral vectors.

View this Issue
iStock

TaqMan Probe & Assays: Unveil What's Possible Together

Thermo Fisher Logo
Meet Aunty and Tackle Protein Stability Questions in Research and Development

Meet Aunty and Tackle Protein Stability Questions in Research and Development

Unchained Labs
Detecting Residual Cell Line-Derived DNA with Droplet Digital PCR

Detecting Residual Cell Line-Derived DNA with Droplet Digital PCR

Bio-Rad
How technology makes PCR instruments easier to use.

Making Real-Time PCR More Straightforward

Thermo Fisher Logo

Products

The Scientist Placeholder Image

Biotium Launches New Phalloidin Conjugates with Extended F-actin Staining Stability for Greater Imaging Flexibility

Leica Microsystems Logo

Latest AI software simplifies image analysis and speeds up insights for scientists

BioSkryb Genomics Logo

BioSkryb Genomics and Tecan introduce a single-cell multiomics workflow for sequencing-ready libraries in under ten hours

iStock

Agilent BioTek Cytation C10 Confocal Imaging Reader

agilent technologies logo