20
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Fighting reviewer fatigue or amplifying bias? Considerations and recommendations for use of ChatGPT and other large language models in scholarly peer review

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          The emergence of systems based on large language models (LLMs) such as OpenAI’s ChatGPT has created a range of discussions in scholarly circles. Since LLMs generate grammatically correct and mostly relevant (yet sometimes outright wrong, irrelevant or biased) outputs in response to provided prompts, using them in various writing tasks including writing peer review reports could result in improved productivity. Given the significance of peer reviews in the existing scholarly publication landscape, exploring challenges and opportunities of using LLMs in peer review seems urgent. After the generation of the first scholarly outputs with LLMs, we anticipate that peer review reports too would be generated with the help of these systems. However, there are currently no guidelines on how these systems should be used in review tasks.

          Methods

          To investigate the potential impact of using LLMs on the peer review process, we used five core themes within discussions about peer review suggested by Tennant and Ross-Hellauer. These include 1) reviewers’ role, 2) editors’ role, 3) functions and quality of peer reviews, 4) reproducibility, and 5) the social and epistemic functions of peer reviews. We provide a small-scale exploration of ChatGPT’s performance regarding identified issues.

          Results

          LLMs have the potential to substantially alter the role of both peer reviewers and editors. Through supporting both actors in efficiently writing constructive reports or decision letters, LLMs can facilitate higher quality review and address issues of review shortage. However, the fundamental opacity of LLMs’ training data, inner workings, data handling, and development processes raise concerns about potential biases, confidentiality and the reproducibility of review reports. Additionally, as editorial work has a prominent function in defining and shaping epistemic communities, as well as negotiating normative frameworks within such communities, partly outsourcing this work to LLMs might have unforeseen consequences for social and epistemic relations within academia. Regarding performance, we identified major enhancements in a short period and expect LLMs to continue developing.

          Conclusions

          We believe that LLMs are likely to have a profound impact on academia and scholarly communication. While potentially beneficial to the scholarly communication system, many uncertainties remain and their use is not without risks. In particular, concerns about the amplification of existing biases and inequalities in access to appropriate infrastructure warrant further attention. For the moment, we recommend that if LLMs are used to write scholarly reviews and decision letters, reviewers and editors should disclose their use and accept full responsibility for data security and confidentiality, and their reports’ accuracy, tone, reasoning and originality.

          Supplementary Information

          The online version contains supplementary material available at 10.1186/s41073-023-00133-5.

          Related collections

          Most cited references19

          • Record: found
          • Abstract: found
          • Article: not found

          ChatGPT is fun, but not an author

          In less than 2 months, the artificial intelligence (AI) program ChatGPT has become a cultural sensation. It is freely accessible through a web portal created by the tool’s developer, OpenAI. The program—which automatically creates text based on written prompts—is so popular that it’s likely to be “at capacity right now” if you attempt to use it. When you do get through, ChatGPT provides endless entertainment. I asked it to rewrite the first scene of the classic American play Death of a Salesman , but to feature Princess Elsa from the animated movie Frozen as the main character instead of Willy Loman. The output was an amusing conversation in which Elsa—who has come home from a tough day of selling—is told by her son Happy, “Come on, Mom. You’re Elsa from Frozen . You have ice powers and you’re a queen. You’re unstoppable.” Mash-ups like this are certainly fun, but there are serious implications for generative AI programs like ChatGPT in science and academia.
            • Record: found
            • Abstract: not found
            • Article: not found

            What ChatGPT and generative AI mean for science

              • Record: found
              • Abstract: not found
              • Article: not found

              Peer Review: A Flawed Process at the Heart of Science and Journals

                Author and article information

                Contributors
                mohammad.hosseini@northwestern.edu
                Journal
                Res Integr Peer Rev
                Res Integr Peer Rev
                Research Integrity and Peer Review
                BioMed Central (London )
                2058-8615
                18 May 2023
                18 May 2023
                2023
                : 8
                : 4
                Affiliations
                [1 ]GRID grid.16753.36, ISNI 0000 0001 2299 3507, Feinberg School of Medicine, , Northwestern University, ; 420 E. Superior Street, Chicago, IL 60611 USA
                [2 ]GRID grid.7048.b, ISNI 0000 0001 1956 2722, Danish Centre for Studies in Research and Research Policy, , Aarhus University, ; Bartholins Alle 7, 8000 Aarhus C Aarhus, Denmark
                Author information
                http://orcid.org/0000-0002-2385-985X
                http://orcid.org/0000-0003-0406-6261
                Article
                133
                10.1186/s41073-023-00133-5
                10191680
                37198671
                c4fd1d37-79e6-44e4-a832-2be00b3bcb63
                © The Author(s) 2023

                Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

                History
                : 14 February 2023
                : 19 April 2023
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100006108, National Center for Advancing Translational Sciences;
                Award ID: UL1TR001422
                Award Recipient :
                Categories
                Research
                Custom metadata
                © BioMed Central Ltd., part of Springer Nature 2023

                peer review,academic writing,large language models,chagpt,editorial practices,generative ai

                Comments

                Comment on this article

                Related Documents Log