Opinion: Text Mining in the Clinic

Despite increasing use of electronic medical records, much patient data remains in text form, requiring text-mining techniques to make full use of patient information.

Written byMin Song

| 3 min read

Save for Later

Patient records in the patient administration department at Naval Medical Center San DiegoWIKIMEDIA, AMANDA L. KILPATRICKThe increased use of electronic medical records (EMRs) is supporting widespread data-mining efforts to uncover trends in health, disease, and treatment response data. But a significant chunk of information in EMRs remains stored as text, unusable by conventional data-mining methods. These semi-structured or unstructured data include clinical notes, certain categories of test results such as echocardiograms and radiology reports, and other important documentation. To take full advantage of EMRs, we need to utilize both data- and text-mining techniques to explore patient outcomes.

Text and data mining have much in common; underlying each is the assumption that knowledge lies buried in a scattered mass of information. But whereas data mining predominately relies on statistical methods to uncover trends in structured data, text-mining techniques seek to make sense of information that is unstructured, such as a doctor’s scribbles on a patient’s chart. For example, much of the available clinical data are in narrative form as a result of transcription of dictations, direct entry by providers, or use of speech-recognition applications. This “free-text” form is convenient to express concepts and events, but is difficult to search, summarize, and analyze. Fortunately, text-mining techniques can help code these data for analysis.

Text mining normally requires a pre-processing phase such as spell checking, ...

Opinion: Text Mining in the Clinic

The Scientist ARCHIVES

Become a Member of

Related Topics

Meet the Author

Min Song

Related Research Resources

A Stubborn Gene, a Failed Experiment, and a New Path

Research Resources

Podcasts

Webinars

Videos

Infographics

eBooks

Advancing Drug Discovery with Complex Human In Vitro Models

Redefining Immunology Through Advanced Technologies

Ensuring Regulatory Compliance in AAV Manufacturing with Analytical Ultracentrifugation

Maximizing Cancer Research Model Systems

Products

Product News

Sino Biological Pioneers Life Sciences Innovation with High-Quality Bioreagents on Inside Business Today with Bill and Guiliana Rancic

Sino Biological Expands Research Reagent Portfolio to Support Global Nipah Virus Vaccine and Diagnostic Development

Beckman Coulter Life Sciences Partners with Automata to Accelerate AI-Ready Laboratory Automation

Refeyn named in the Sunday Times 100 Tech list of the UK’s fastest-growing technology companies