An international consortium of scientists aiming to sequence every transcript encoded by the mouse genome has analysed 21,076 so far.

Estimates of the number of genes in the mammalian genome range from 30,000 to 200,000. The problem is one of identifying which of the sequences in the billions of base pairs that make up the genome actually code for protein.

Instead of sequencing all 109 bp in the mouse genome, an international consortium of scientists has been sequencing a large bank of cDNAs prepared from various mouse tissues and developmental stages. The scientists, co-ordinated by Yoshihide Hayashizaki of the RIKEN Genomic Sciences Centre in Japan, report the characterization of the first 21,076 of these cDNA clones in the 8 February Nature (Nature 2001, 409:685-690).

The consortium found, for example, more than 100 new genes that represent metabolic enzymes. Ten novel orthologues of genes implicated in human disease...

Interested in reading more?

Become a Member of

Receive full access to more than 35 years of archives, as well as TS Digest, digital editions of The Scientist, feature stories, and much more!
Already a member?