Predicting promoters

Finding the beginning of genes within genomic sequence presents a formidable challenge to projects to annotate the human genome sequence. In the Advanced Online Publication of Nature Genetics Ramana Davuluri and colleagues at Cold Spring Harbor Laboratory, in New York describe a bioinformatic strategy to predict gene promoters and first exons (Nat Genet 2001, DOI: 10.1038/ng780).They developed a new program, called FirstEF, that attempts to predict the starts of genes. They collected over two th

Written byJonathan Weitzman
| 1 min read

Register for free to listen to this article
Listen with Speechify
0:00
1:00
Share

Finding the beginning of genes within genomic sequence presents a formidable challenge to projects to annotate the human genome sequence. In the Advanced Online Publication of Nature Genetics Ramana Davuluri and colleagues at Cold Spring Harbor Laboratory, in New York describe a bioinformatic strategy to predict gene promoters and first exons (Nat Genet 2001, DOI: 10.1038/ng780).

They developed a new program, called FirstEF, that attempts to predict the starts of genes. They collected over two thousand first-exons to use as a training dataset, and characterized those that were associated with a CpG island. FirstEF is designed to recognize CpG islands, promoter regions and first splice-donor sites.

The program could predict 86% of all first exons with about 17% false positives (92% of CpG-related first-exons and 74% of non-CpG exons). FirstEF gave a similar performance when tested against the finished sequences for human chromosomes 21 and 22.

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to more than 35 years of archives, as well as TS Digest, digital editions of The Scientist, feature stories, and much more!
Already a member? Login Here

Meet the Author

Share
Illustration of a developing fetus surrounded by a clear fluid with a subtle yellow tinge, representing amniotic fluid.
January 2026, Issue 1

What Is the Amniotic Fluid Composed of?

The liquid world of fetal development provides a rich source of nutrition and protection tailored to meet the needs of the growing fetus.

View this Issue
Skip the Wait for Protein Stability Data with Aunty

Skip the Wait for Protein Stability Data with Aunty

Unchained Labs
Graphic of three DNA helices in various colors

An Automated DNA-to-Data Framework for Production-Scale Sequencing

illumina
Exploring Cellular Organization with Spatial Proteomics

Exploring Cellular Organization with Spatial Proteomics

Abstract illustration of spheres with multiple layers, representing endoderm, ectoderm, and mesoderm derived organoids

Organoid Origins and How to Grow Them

Thermo Fisher Logo

Products

nuclera logo

Nuclera eProtein Discovery System installed at leading Universities in Taiwan

Brandtech Logo

BRANDTECH Scientific Introduces the Transferpette® pro Micropipette: A New Twist on Comfort and Control

Biotium Logo

Biotium Launches GlycoLiner™ Cell Surface Glycoprotein Labeling Kits for Rapid and Selective Cell Surface Imaging

Colorful abstract spiral dot pattern on a black background

Thermo Scientific X and S Series General Purpose Centrifuges

Thermo Fisher Logo