Gene Finding with Hidden Markov Models

made history in 1995 when it became the first free-living organism to have its genome completely sequenced.

| 8 min read

Register for free to listen to this article
Listen with Speechify
0:00
8:00
Share

Haemophilus influenzae made history in 1995 when it became the first free-living organism to have its genome completely sequenced. In the decade since, some 180 or so organisms have followed suit.

For every one of these genomes, the sequence is only the beginning. The challenge for the computational biologists charged with making sense of the data: to find the gene sequences hidden within those strings, billions of bases long, of As, Cs, Gs, and Ts. The genome annotation strategies these computer scientists cum biologists have developed clearly have come a long way. The most recent iteration (version 4.0) of the Drosophila genome annotation, for instance, updated only 25 predictions out of 13,472 protein-coding genes.

But improvements can still be made. "If they were 100% reliable, then they would have been run on the April 2003 complete human sequence and that would've been it. Those would have been your genes," says ...

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to digital editions of The Scientist, as well as TS Digest, feature stories, more than 35 years of archives, and much more!
Already a member? Login Here

Keywords

Meet the Author

  • Karen Heyman

    This person does not yet have a bio.

Published In

Share
A greyscale image of cells dividing.
March 2025, Issue 1

How Do Embryos Know How Fast to Develop

In mammals, intracellular clocks begin to tick within days of fertilization.

View this Issue
Discover the history, mechanics, and potential of PCR.

Become a PCR Pro

Integra Logo
3D rendered cross section of influenza viruses, showing surface proteins on the outside and single stranded RNA inside the virus

Genetic Insights Break Infectious Pathogen Barriers

Thermo Fisher Logo
A photo of sample storage boxes in an ultra-low temperature freezer.

Navigating Cold Storage Solutions

PHCbi logo 
The Immunology of the Brain

The Immunology of the Brain

Products

Sapio Sciences

Sapio Sciences Makes AI-Native Drug Discovery Seamless with NVIDIA BioNeMo

DeNovix Logo

New DeNovix Helium Nano Volume Spectrophotometer

Olink Logo

Olink® Reveal: Accessible NGS-based proteomics for every lab

Olink logo
Zymo Logo

Zymo Research Launches the Quick-16S™ Full-Length Library Prep Kit