ABOVE: © ISTOCK.COM, NOBI_PRIZUE
The human reference genome is a DNA blueprint used as a standard for comparison in basic research and clinical settings. Despite improvements in its accuracy and completeness over the years, it still has limitations that can lead to erroneous findings.
In the current version of the reference, called GRCh38 or Build 38, 93 percent of the sequence comes from just 11 individuals and 70 percent from just one man, resulting in a lack of diversity and at least 300 million missing letters of DNA. In addition, a small percentage of the genes in the reference genome are represented by alleles that are not the most common forms of the genes.
To address these issues, some scientists are developing a new reference, called the pangenome or graph genome, that contains a vast collection of genomes representing all possible DNA sequences for any given locus. ...