Disputes Over Text-Mining

Computer programs that trawl research papers can reveal important large-scale patterns and facilitate further research, but publishers are wary.

Written by Dan Cossins

FLICKR, ROBERT CUDMORE

Researchers are increasingly keen to use computer programs that scour the text of thousands of scientific papers, a method known as text-mining, but publishers tend to block such programs. The resulting disagreements are coming to a head, reported Nature, with the European Union set to rule on the legality of text-mining, and researchers and publishers discussing the terms by which the method can be used.

“Data- and text-mining techniques . . . could hold the key to the next medical breakthrough, if only we freed them from their current legal tangle,” Neelie Kroes, vice-president of the European Commission, told a Brussels intellectual-property summit last September, according to Nature.

Indeed, text-mining of the scientific literature has already proven useful. For example, Raul Rodriguez-Esteban, a computational biologist at drug company Boehringer Ingelheim in Connecticut, told Nature that he used the method to search roughly 23,000 articles to identify hundreds of proteins that ameliorate multiple sclerosis in a mouse model. He then identified other proteins that interacted with them to find potential drug targets.

But it can take years to negotiate agreements with publishers to trawl their content, if permission is granted at ...
