Low-quality papers based on public health data are flooding the scientific literature

2 min read Original article ↗
  • NEWS

The appearance of thousands of formulaic biomedical studies has been linked to the rise of text-generating AI tools.

Data from five large open-access health databases are being used to generate thousands of poor-quality, formulaic papers, an analysis has found. Its authors say that the surge in publications could indicate the exploitation of these databases by people using large language models (LLMs) to mass-produce scholarly articles, or even by paper mills — companies that churn out papers to order.

Access options

Access Nature and 54 other Nature Portfolio journals

Get Nature+, our best-value online-access subscription

$32.99 / 30 days

cancel any time

Subscribe to this journal

Receive 51 print issues and online access

$199.00 per year

only $3.90 per issue

Rent or buy this article

Prices vary by article type

from$1.95

to$39.95

Prices may be subject to local taxes which are calculated during checkout

doi: https://doi.org/10.1038/d41586-025-02241-2

References

Subjects

Latest on: