Only a small fraction of the thousands of described genetically inherited diseases have been linked to a specific gene. In an Advanced Online Publication in
Their data-mining system is based on 'fuzzy set theory', which allows inferences to be drawn from the complex scientific literature. They integrated information from multiple databases to establish relationships between Medical Subject Headings (MeSH terms) for diseases or drugs and Gene Ontology terms. After a series of computational steps they defined a 'score' for known genes in the RefSeq database, and then used that score to rank the candidate genes in a given disease-associated region. When the approach was tested against known disease-linked genes, the score could predict promising candidate genes.
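The general idea of chaining fuzzy associations into a gene score can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the term names, gene names and weights are invented, max-min composition is a standard way of combining fuzzy relations but is not confirmed as the authors' method, and averaging over a gene's annotations is a hypothetical choice of scoring rule.

```python
from typing import Dict, List, Tuple

# A fuzzy relation maps each source item to target items with a membership in [0, 1].
Relation = Dict[str, Dict[str, float]]


def max_min_compose(r1: Relation, r2: Relation) -> Relation:
    """Standard max-min composition of two fuzzy relations (assumed here, not from the source)."""
    out: Relation = {}
    for source, mids in r1.items():
        row: Dict[str, float] = {}
        for mid, w1 in mids.items():
            for target, w2 in r2.get(mid, {}).items():
                # Strength of one path is its weakest link; keep the strongest path.
                row[target] = max(row.get(target, 0.0), min(w1, w2))
        out[source] = row
    return out


def score_gene(go_terms: List[str], disease_to_go: Dict[str, float]) -> float:
    """Hypothetical scoring rule: mean disease membership over the gene's annotation terms."""
    if not go_terms:
        return 0.0
    return sum(disease_to_go.get(term, 0.0) for term in go_terms) / len(go_terms)


def rank_candidates(
    disease: str,
    candidates: List[str],
    disease_to_mesh: Relation,
    mesh_to_go: Relation,
    gene_to_go: Dict[str, List[str]],
) -> List[Tuple[str, float]]:
    """Rank the candidate genes in a region by descending score."""
    disease_to_go = max_min_compose(disease_to_mesh, mesh_to_go).get(disease, {})
    scored = [(g, score_gene(gene_to_go.get(g, []), disease_to_go)) for g in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Invented toy data; none of these names or weights come from a real database.
disease_to_mesh = {"disease_X": {"mesh_A": 0.9, "mesh_B": 0.4}}
mesh_to_go = {
    "mesh_A": {"go_1": 0.8, "go_2": 0.3},
    "mesh_B": {"go_2": 0.7, "go_3": 0.5},
}
gene_to_go = {"gene1": ["go_1"], "gene2": ["go_2", "go_3"], "gene3": []}

ranking = rank_candidates(
    "disease_X", ["gene3", "gene2", "gene1"], disease_to_mesh, mesh_to_go, gene_to_go
)
for gene, score in ranking:
    print(f"{gene}: {score:.2f}")
```

In this toy case the gene annotated with the term most strongly tied to the disease ranks first, and a gene with no annotations scores zero, which mirrors how a score of this kind would be used to prioritize the genes in a mapped region.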
This type of strategy may be useful for prioritizing...