Abstract:
Purposes Aiming at the problem of the spread of incorrect knowledge caused by the lag in retracting academic papers, this study aims to shorten the identification cycle of abnormal paper status and maintain the reliability and integrity of the academic communication system through AI technique. Methods Integrate the metadata and peer reviews contained in 12,098 papers from authoritative platforms such as PubPeer and PubMed as data support, develop a hybrid enhanced detection framework for paper retraction risk, and make decisions through retrieval-enhanced generation and expert-enhanced generation results. Findings This framework can accurately and effectively identify retracted papers. The verification accuracy rate of the retracted paper status reaches 91.91%, and the recall rate reaches 73.72%. Conclusions This study confirms the technical feasibility of large language models in academic early warning research and academic integrity maintenance. The application of the hybrid enhanced detection framework can provide a practical retraction early warning plan for publishing institutions. The open sharing of the hybrid enhanced detection framework will promote the standardization process of research on using large language models to handle retraction risk detection, help build an active monitoring scientific research integrity management system, improve supervision efficiency, and maintain the healthy development of the scientific research ecosystem.
Key words:
Retracted papers,
Retraction risk detection,
Large language models,
Research integrity
摘要:
目的 针对学术论文撤稿滞后现象导致的错误知识传播问题,运用人工智能技术缩短论文异常状态识别周期,维护学术交流系统的可靠性与完整性。 方法 整合PubPeer、 PubMed等权威平台12098篇论文所包含的元数据及其同行评议作为数据支撑,开发论文撤销风险混合增强检测框架,通过检索增强生成和专家增强生成结果进行决策。 结果 框架可准确、有效地识别撤销论文,撤销论文状态验证准确率达91.91%,召回率达73.72%。 结论 大语言模型在学术预警研究及学术诚信维护中具有可行性,应用混合增强检测框架能够为出版机构提供可操作的撤稿预警方案,助力构建主动监测的科研诚信管理体系,提高监管效率,维护科研生态健康发展。
关键词:
撤销论文,
撤稿风险检测,
大语言模型,
科研诚信
ZHAO Xiansong, CHEN Xiaohui, LU Ye, YANG Ming, LIN Yuan. Research on intelligent early warning of academic paper retraction risks: Hybrid enhanced detection framework based on large language models[J]. Chinese Journal of Scientific and Technical Periodicals, 2025, 36(11): 1478-1486.
赵显嵩, 陈晓晖, 陆晔, 杨茗, 林原. 学术撤稿风险智能预警研究:基于大语言模型的混合增强检测框架[J]. 中国科技期刊研究, 2025, 36(11): 1478-1486.