Chinese Journal of Scientific and Technical Periodicals ›› 2025, Vol. 36 ›› Issue (11): 1499-1507. doi: 10.11946/cjstp.202507230883

Previous Articles     Next Articles

Risk of copyright and compliance authorization in the reuse of full-text big data in scientific journals

ZOU Qiang1,8,9()(), JIANG Xia2,8, WANG Linhui3,9, ZHANG Hui4,8, LI Feng5,8, NI Ming3,9, WU Minshu6,8, ZHANG Xiufeng7,8,*()()   

  1. 1)Editorial Office of Journal of Clinical Pediatrics,Xinhua Hospital of Shanghai Jiao Tong University School of Medicine,1665 Kongjiang Road,Yangpu District,Shanghai 200092,China
    2)Editorial Office of Journal of Shanghai Jiao Tong University,1954 Huashan Road,Xuhui District,Shanghai 200030,China
    3)Department of Journal Management,Fudan University Shanghai Cancer Center,270 Dong’an Road,Xuhui District,Shanghai 200032,China
    4)Shanghai Key Laboratory of Forensic Medicine,Key Laboratory of Forensic Science,Ministry of Justice,Shanghai Forensic Service Platform,Academy of Forensic Science,Periodical Center,1347 West Guangfu Road,Putuo District,Shanghai 200063,China
    5)Editorial Office of Journal of Tongji University (Medical Sciences),1239 Siping Road,Yangpu District,Shanghai 200092,China
    6)Editorial Office of Acta Pharmacologica Sinica,294 Taiyuan Road,Xuhui District,Shanghai 200031,China
    7)Editorial Office of Fudan University Journal of Medical Sciences,138 Yixueyuan,Xuhui District,Shanghai 200032,China
    8)Committee on Publication Ethics of Shanghai Society for Scientific & Technical Periodicals,390 Qinghe Road,Jiading District,Shanghai 201800,China
    9)Committee on Biomedical Journals of Shanghai Society for Scientific & Technical Periodicals,390 Qinghe Road,Jiading District,Shanghai 201800,China
  • Received:2025-07-23 Online:2025-11-25 Published:2025-12-10

科技期刊全文大数据再利用著作权风险与合规授权

邹强1,8,9()(), 蒋霞2,8, 王琳辉3,9, 张慧4,8, 李锋5,8, 倪明3,9, 吴民淑6,8, 张秀峰7,8,*()()   

  1. 1)上海交通大学医学院附属新华医院,《临床儿科杂志》编辑部,上海市杨浦区控江路1665号 200092
    2)《上海交通大学学报》编辑部,上海市徐汇区华山路1954号 200030
    3)复旦大学附属肿瘤医院期刊管理办公室,上海市徐汇区东安路270号 200032
    4)司法鉴定科学研究院期刊中心,上海市法医学重点实验室,司法部司法鉴定重点实验室,上海市司法鉴定专业技术服务平台,上海市普陀区光复西路1347号 200063
    5)《同济大学学报(医学版)》编辑部,上海市杨浦区四平路1239号 200092
    6)《中国药理学报》编辑部,上海市徐汇区太原路294号 200031
    7)《复旦学报(医学版)》编辑部,上海市徐汇区医学院路138号 200032
    8)上海市科技期刊学会出版伦理工作委员会,上海市嘉定区清河路390号 201800
    9)上海市科技期刊学会生物医学期刊专业委员会,上海市嘉定区清河路390号 201800
  • 通讯作者: 张秀峰(ORCID:0000-0002-5290-367X),硕士,副编审,上海市科技期刊学会出版伦理工作委员会主任委员,E-mail:
  • 作者简介:
    邹强(ORCID:0000-0002-7668-5910),编辑,E-mail:
    蒋霞,博士,副编审,上海市科技期刊学会出版伦理工作委员会副主任委员;
    王琳辉,硕士,副编审,上海市科技期刊学会生物医学期刊专委会主任委员;
    张 慧,硕士,编审,司法鉴定科学研究院期刊中心副主任,《法庭科学研究(英文)》编辑部主任;
    李 锋,博士,副编审,上海市科技期刊学会出版伦理工作委员会副主任委员;
    倪 明,硕士,副编审,复旦大学附属肿瘤医院期刊管理办公室主任,上海市科技期刊学会副理事长;
    吴民淑,硕士,编审,上海市科技期刊学会副理事长。
    作者贡献声明: 邹 强:论文撰写、资料收集、焦点小组访谈、论文修订; 蒋 霞,王琳辉,张 慧,李 锋,倪 明,吴民淑:参与资料收集、焦点小组访谈、论文修订; 张秀峰:研究方案的策划与组织协调;资料收集、焦点小组访谈、论文修订。

Abstract:

Purposes To clarify the copyright risks triggered by text and data mining (TDM) and large language models (LLM) technologies in the commercial reuse of full-text big data in scientific journals, and to propose actionable compliance solutions based on this analysis. Methods A literature review and case analysis were conducted, issueed by focus group interviews, to extensively collect issues and cases concerning the reuse of scientific journal copyrights in the era of big data. Differences between China’s Copyright Law and the copyright legislation of major global economies (the United States, Japan, the European Union, and the United Kingdom) regarding TDM and LLM usage were examined. The study analyzes the copyright risks associated with the reuse of full-text big data in Chinese scientific journals and discuss and generate compliance authorization recommendations for journals to address potential risks. Findings In light of domestic scientific journal practices, five major copyright risks in the reuse of full-text big data are identified. Against the backdrop of open science, we recommend that journals and authors give priority to “copyright licensing (exclusive/non-exclusive)” over traditional “copyright assignment”clauses. When entering into licensing agreements with databases, journals must ensure they have legally obtained the necessary copyright authorization from authors and clearly define the scope of authorization in the agreement to prevent unauthorized sublicensing to databases beyond the granted rights. We recommend renegotiating copyright-licence agreements among authors, journals, and full-text databases, so that a clear and compliant authorization chain of “authors-journals-databases” should be established. At the same time, we recommend that the government take the lead in establishing a national-level green open storage platform similar to PubMed Central, to provide the necessary infrastructure for the compliant reuse of big data. Conclusions Journals should re-sign copyright agreements with authors and databases that comply with the reuse of full-text big data, improve the rules for journal data security and copyright usage, and encourage scholars to make their research findings more openly accessible. This study provides strategies for balancing copyright protection and data reuse in the era of big data for scientific journals, proposing a set of practical and feasible compliance measures. These measures ensure that journals fully utilize big data resources while respecting authors’ rights, promoting the open sharing and innovative application of academic information.

Key words: Journal copyright, Text and data mining, Large language model, AI-generated content, Big data reuse

摘要:

目的 厘清大数据时代文本与数据挖掘(TDM)与大语言模型(LLM)技术在科技期刊商业性全文大数据再利用场景中所引发著作权风险,并据此提出可操作的合规授权策略。 方法 采用文献复习法、案例分析法,并结合焦点小组访谈,广泛收集大数据时代科技期刊著作权再利用问题和案例,对比我国《中华人民共和国著作权法》与世界主要经济体(美国、日本、欧盟、英国)版权法在TDM/LLM著作权使用政策的差异,分析我国科技期刊全文大数据再利用存在的著作权风险,讨论并产生期刊如何应对潜在风险的合规授权建议。 结果 结合国内科技期刊的实践,指出当前全文大数据再利用中的5项著作权风险。在开放科学背景下,建议期刊与作者优先采用“著作权许可(专有/非专有)”以替代传统的“著作权转让”条款。建议期刊与数据库签订许可协议时,必须确保期刊已合法取得作者的著作权授权,并在协议中清晰界定授权范围,以避免超出授权权限向数据库进行非法转授权。通过期刊与作者及全文数据库重新签订著作权许可协议,明确构建“作者-期刊-数据库”的合规授权链条。同时,建议政府主导建立类似 PubMed Central 的国家级绿色开放仓储平台,为大数据的合规再利用提供必要的基础设施。 结论 期刊须与作者和数据库重新签署符合全文大数据再利用的著作权协议,完善期刊数据安全和著作权使用规则,鼓励学者将科研成果进一步开放获取。本研究为科技期刊在大数据时代的著作权保护与数据再利用之间提供了平衡的策略,提出了一套切实可行的合规授权方案。在确保期刊尊重作者权利的同时,能够充分利用大数据资源,推动学术信息的开放共享与创新应用。

关键词: 期刊著作权, 文本与数据挖掘, 大语言模型, 人工智能生成内容, 大数据再利用