<figcaption> Credit: © DAVID SCHARF/SCIENCE PHOTO LIBRARY</figcaption>

Genomic quirks make Candida albicans a troublesome creature in the lab. Unlike Saccharomyces cerevisiae, which was sequenced in 1996, C. albicans has no known haploid or homozygous form. Using whole genome shotgun sequencing is tricky, because multiple alleles confound standard assembly software. Moreover, the C. albicans genome is full of repeated sequences and recently diverged gene families.

In a 2004 paper, a team of researchers led by Ron Davis of the Stanford Genome Technology Center presented the first complete sequence of the pathogen's diploid genome.1 "The analysis was challenging," says study coauthor Nancy Federspiel. "It was hard to tell apart the allelic differences from the sequencing errors." Because their sequence assembly software assumed single-copy sequence, the researchers were not surprised when the sum of the contigs exceeded the genome size by 20%. They had to perform a pairwise comparison of the contigs...

1. T. Jones et al., "The diploid genome sequence of Candida albicans," Proc Nat Acad Sci, 101:7329-34, 2004 (cited in 80 papers).

Interested in reading more?

Magaizne Cover

Become a Member of

Receive full access to digital editions of The Scientist, as well as TS Digest, feature stories, more than 35 years of archives, and much more!
Already a member?