2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Capabilities of GPT-4o and Gemini 1.5 Pro in Gram stain and bacterial shape identification

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Aim: Assessing the visual accuracy of two large language models (LLMs) in microbial classification.

          Materials & methods: GPT-4o and Gemini 1.5 Pro were evaluated in distinguishing Gram-positive from Gram-negative bacteria and classifying them as cocci or bacilli using 80 Gram stain images from a labeled database.

          Results: GPT-4o achieved 100% accuracy in identifying simultaneously Gram stain and shape for Clostridium perfringens, Pseudomonas aeruginosa and Staphylococcus aureus. Gemini 1.5 Pro showed more variability for similar bacteria (45, 100 and 95%, respectively). Both LLMs failed to identify both Gram stain and bacterial shape for Neisseria gonorrhoeae. Cumulative accuracy plots indicated that GPT-4o consistently performed equally or better in every identification, except for Neisseria gonorrhoeae's shape.

          Conclusion: These results suggest that these LLMs in their unprimed state are not ready to be implemented in clinical practice and highlight the need for more research with larger datasets to improve LLMs' effectiveness in clinical microbiology.

          Plain Language Summary

          This study looked at how well large language models (LLMs) could identify different types of bacteria using images, without having any specific training in this area beforehand.

          We tested two LLMs with image analysis capabilities, GPT-4o and Gemini 1.5 Pro. These models were asked to determine whether bacteria were Gram-positive or Gram-negative and whether they were round (cocci) or rod-shaped (bacilli). We used 80 images of four stained bacteria from a labeled database as a reference for this test.

          GPT-4o was more accurate in identifying both the Gram stain and shape of the bacteria compared with Gemini 1.5 Pro. GPT-4o had excellent accuracy in correctly classifying the Gram stain and bacterial shape of Clostridium perfringens, Pseudomonas aeruginosa and Staphylococcus aureus. Gemini 1.5 Pro had mixed results for these bacteria. However, both models struggled with Neisseria gonorrhoeae, failing to correctly identify its Gram stain and shape.

          The study shows that while these LLMs have potential, they are not ready to be implemented in clinical practice. More research and larger datasets are needed to improve their accuracy in clinical microbiology.

          Abstract

          Article highlights
          • Large language models (LLMs) are advanced artificial intelligence models, able to generate human-like text, with sophisticated natural language processing capabilities. They are trained on vast amounts of data and use deep learning techniques to understand complex inputs and produce language.

          • Recent studies have shown the potential of LLMs in medical image analysis across various fields like pathology and ophthalmology, demonstrating their ability to interpret complex medical visual data.

          • Clinical decisions on infection management such as initial antibiotic choice often rely on Gram stain results. Thus, it is crucial to ensure these tests are conducted and interpreted accurately.

          Materials & methods
          • Two LLMs were used in this study: Open AI's generative pretrained transformer (GPT), version 4 Omni (GPT-4o) and Google's Gemini version 1.5 Pro.

          • To the best of our knowledge, this study represents the first known accuracy analysis of the latest and most advanced visual LLMs, GPT-4o and Gemini 1.5 Pro, in the domain of Gram stain and bacterial shape identification.

          • A publicly available database of bacterial Gram stains was used. 80 bacterial samples were divided evenly among four bacteria representing all possible combinations of Gram Stain and bacterial shape.

          Results
          • GPT-4o correctly identified both the Gram stain and bacterial shape simultaneously with higher accuracy then Gemini 1.5 Pro (75 vs. 60%, respectively).

          • When examining the performance by specific bacteria, GPT-4o achieved 100% accuracy in identifying both the Gram stain and shape correctly for Clostridium perfringens, Pseudomonas aeruginosa and Staphylococcus aureus. Gemini 1.5 Pro, on the other hand, showed more variability in its performance for the same bacteria (45, 100 and 95%, respectively). However, both LLMs failed to correctly identify the Gram stain and bacterial shape in all cases (0% accuracy) with Neisseria gonorrhoeae.

          • Cumulative accuracy plots indicated that GPT-4o consistently performed equally or better in every identification, except for Neisseria gonorrhoeae‘s shape.

          Discussion
          • The results from this study provide valuable insights into the potential and limitations of these LLMs in microbial classification tasks, demonstrating their potential for microbial classification tasks without prior domain-specific training.

          • The results suggest that these LLMs in their unprimed state are not ready to be implemented in clinical practice.

          • The results underscore the need for further research with larger and more diverse datasets, as well as offline clinical samples, to better understand and enhance the capabilities of these LLMs in clinical microbiology.

          Related collections

          Most cited references21

          • Record: found
          • Abstract: found
          • Article: not found

          Staphylococcus aureus infections: epidemiology, pathophysiology, clinical manifestations, and management.

          Staphylococcus aureus is a major human pathogen that causes a wide range of clinical infections. It is a leading cause of bacteremia and infective endocarditis as well as osteoarticular, skin and soft tissue, pleuropulmonary, and device-related infections. This review comprehensively covers the epidemiology, pathophysiology, clinical manifestations, and management of each of these clinical entities. The past 2 decades have witnessed two clear shifts in the epidemiology of S. aureus infections: first, a growing number of health care-associated infections, particularly seen in infective endocarditis and prosthetic device infections, and second, an epidemic of community-associated skin and soft tissue infections driven by strains with certain virulence factors and resistance to β-lactam antibiotics. In reviewing the literature to support management strategies for these clinical manifestations, we also highlight the paucity of high-quality evidence for many key clinical questions.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The future landscape of large language models in medicine

            Large language models (LLMs) are artificial intelligence (AI) tools specifically trained to process and generate text. LLMs attracted substantial public attention after OpenAI’s ChatGPT was made publicly available in November 2022. LLMs can often answer questions, summarize, paraphrase and translate text on a level that is nearly indistinguishable from human capabilities. The possibility to actively interact with models like ChatGPT makes LLMs attractive tools in various fields, including medicine. While these models have the potential to democratize medical knowledge and facilitate access to healthcare, they could equally distribute misinformation and exacerbate scientific misconduct due to a lack of accountability and transparency. In this article, we provide a systematic and comprehensive overview of the potentials and limitations of LLMs in clinical practice, medical research and medical education.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Deep learning approach to bacterial colony classification

              In microbiology it is diagnostically useful to recognize various genera and species of bacteria. It can be achieved using computer-aided methods, which make the recognition processes more automatic and thus significantly reduce the time necessary for the classification. Moreover, in case of diagnostic uncertainty (the misleading similarity in shape or structure of bacterial cells), such methods can minimize the risk of incorrect recognition. In this article, we apply the state of the art method for texture analysis to classify genera and species of bacteria. This method uses deep Convolutional Neural Networks to obtain image descriptors, which are then encoded and classified with Support Vector Machine or Random Forest. To evaluate this approach and to make it comparable with other approaches, we provide a new dataset of images. DIBaS dataset (Digital Image of Bacterial Species) contains 660 images with 33 different genera and species of bacteria.
                Bookmark

                Author and article information

                Journal
                Future Microbiol
                Future Microbiol
                Future Microbiology
                Taylor & Francis
                1746-0913
                1746-0921
                29 July 2024
                2024
                29 July 2024
                : 19
                : 15
                : 1283-1292
                Affiliations
                [a ]Department of Internal Medicine , Cleveland Clinic , Cleveland, OH 44195, USA
                [b ]Department of Infectious Diseases , Cleveland Clinic , Cleveland, OH 44195, USA
                Author notes
                [* ]CONTACT: Tel.: +1(216)-630-6274; joyahindy@ 123456gmail.com
                [‡]

                Authors contributed equally

                Author information
                https://orcid.org/0000-0002-3672-8488
                https://orcid.org/0000-0002-4125-8054
                Article
                2381967
                10.1080/17460913.2024.2381967
                11486216
                39069960
                878e5b5c-3380-4778-9ccf-8a90afb4088c
                © 2024 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group

                This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License ( http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent.

                History
                Page count
                Figures: 6, Tables: 1, Pages: 10
                Categories
                Rapid Communication
                Rapid Communication

                bacteriology,gemini pro 1.5,gpt-4o,gram-negative bacteria,gram-positive bacteria,gram stain,large language model,staphylococcus aureus

                Comments

                Comment on this article