Can Publication Records Predict Future PIs?

Researchers present a tool that uses a scientist’s PubMed data to estimate the probability of becoming a principal investigator in academia.

Written byTracy Vence
| 3 min read

Register for free to listen to this article
Listen with Speechify
0:00
3:00
Share

FLICKR, MOONLIGHTBULBThe odds of a scientist becoming an academic principal investigator (PI) can be predicted with publication data, according to Lucas Carey from Spain’s Pompeu Fabra University and his colleagues. The team developed an online tool, dubbed PIPredictor, which uses a machine-learning approach to analyze a user’s PubMed data and that has already churned out more than 800 career-success estimates to date. Carey and his colleagues describe their tool in Current Biology today (June 2).

“We show that becoming a research professor is highly predictable, [and] we analyze the features that are predictive” of success, Carey told The Scientist in an e-mail. The algorithm can predict who might become a PI and how long it could take for them to do so with an area under the curve (AUC), a measurement of accuracy, of 0.83 and 0.38, respectively.

While he was a postdoc in Eran Segal’s lab at the Weizmann Institute of Science in Rehovot, Israel, Carey and two then-PhD students, David van Dijk and Ohad Manor, decided to apply the machine learning-based approaches they were using to predict molecular mechanisms from gene expression data to guess who among them might one day become a PI. “There was quite a ...

Interested in reading more?

Become a Member of

The Scientist Logo
Receive full access to more than 35 years of archives, as well as TS Digest, digital editions of The Scientist, feature stories, and much more!
Already a member? Login Here

Related Topics

Meet the Author

Share
Illustration of a developing fetus surrounded by a clear fluid with a subtle yellow tinge, representing amniotic fluid.
January 2026, Issue 1

What Is the Amniotic Fluid Composed of?

The liquid world of fetal development provides a rich source of nutrition and protection tailored to meet the needs of the growing fetus.

View this Issue
Skip the Wait for Protein Stability Data with Aunty

Skip the Wait for Protein Stability Data with Aunty

Unchained Labs
Graphic of three DNA helices in various colors

An Automated DNA-to-Data Framework for Production-Scale Sequencing

illumina
Exploring Cellular Organization with Spatial Proteomics

Exploring Cellular Organization with Spatial Proteomics

Abstract illustration of spheres with multiple layers, representing endoderm, ectoderm, and mesoderm derived organoids

Organoid Origins and How to Grow Them

Thermo Fisher Logo

Products

Brandtech Logo

BRANDTECH Scientific Introduces the Transferpette® pro Micropipette: A New Twist on Comfort and Control

Biotium Logo

Biotium Launches GlycoLiner™ Cell Surface Glycoprotein Labeling Kits for Rapid and Selective Cell Surface Imaging

Colorful abstract spiral dot pattern on a black background

Thermo Scientific X and S Series General Purpose Centrifuges

Thermo Fisher Logo
Abstract background with red and blue laser lights

VANTAstar Flexible microplate reader with simplified workflows

BMG LABTECH