For every one of these genomes, the sequence is only the beginning. The challenge for the computational biologists charged with making sense of the data is to find the gene sequences hidden within those strings of As, Cs, Gs, and Ts, billions of bases long. The genome annotation strategies these computer scientists-cum-biologists have developed have clearly come a long way. The most recent iteration (version 4.0) of the
But improvements can still be made. "If they were 100% reliable, then they would have been run on the April 2003 complete human sequence and that would've been it. Those would have been your genes," says ...