10
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Bayesian reconstruction of transmission within outbreaks using genomic variants

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Pathogen genome sequencing can reveal details of transmission histories and is a powerful tool in the fight against infectious disease. In particular, within-host pathogen genomic variants identified through heterozygous nucleotide base calls are a potential source of information to identify linked cases and infer direction and time of transmission. However, using such data effectively to model disease transmission presents a number of challenges, including differentiating genuine variants from those observed due to sequencing error, as well as the specification of a realistic model for within-host pathogen population dynamics. Here we propose a new Bayesian approach to transmission inference, BadTrIP (BAyesian epiDemiological TRansmission Inference from Polymorphisms), that explicitly models evolution of pathogen populations in an outbreak, transmission (including transmission bottlenecks), and sequencing error. BadTrIP enables the inference of host-to-host transmission from pathogen sequencing data and epidemiological data. By assuming that genomic variants are unlinked, our method does not require the computationally intensive and unreliable reconstruction of individual haplotypes. Using simulations we show that BadTrIP is robust in most scenarios and can accurately infer transmission events by efficiently combining information from genetic and epidemiological sources; thanks to its realistic model of pathogen evolution and the inclusion of epidemiological data, BadTrIP is also more accurate than existing approaches. BadTrIP is distributed as an open source package ( https://bitbucket.org/nicofmay/badtrip) for the phylogenetic software BEAST2. We apply our method to reconstruct transmission history at the early stages of the 2014 Ebola outbreak, showcasing the power of within-host genomic variants to reconstruct transmission events.

          Author summary

          We present a new tool to reconstruct transmission events within outbreaks. Our approach makes use of pathogen genetic information, notably genetic variants at low frequency within host that are usually discarded, and combines it with epidemiological information of host exposure to infection. This leads to accurate reconstruction of transmission even in cases where abundant within-host pathogen genetic variation and weak transmission bottlenecks (multiple pathogen units colonising a new host at transmission) would otherwise make inference difficult due to the transmission history differing from the pathogen evolution history inferred from pathogen isolets. Also, the use of within-host pathogen genomic variants increases the resolution of the reconstruction of the transmission tree even in scenarios with limited within-outbreak pathogen genetic diversity: within-host pathogen populations that appear identical at the level of consensus sequences can be discriminated using within-host variants. Our Bayesian approach provides a measure of the confidence in different possible transmission histories, and is published as open source software. We show with simulations and with an analysis of the beginning of the 2014 Ebola outbreak that our approach is applicable in many scenarios, improves our understanding of transmission dynamics, and will contribute to finding and limiting sources and routes of transmission, and therefore preventing the spread of infectious disease.

          Related collections

          Most cited references32

          • Record: found
          • Abstract: found
          • Article: not found

          Performance comparison of benchtop high-throughput sequencing platforms.

          Three benchtop high-throughput sequencing instruments are now available. The 454 GS Junior (Roche), MiSeq (Illumina) and Ion Torrent PGM (Life Technologies) are laser-printer sized and offer modest set-up and running costs. Each instrument can generate data required for a draft bacterial genome sequence in days, making them attractive for identifying and characterizing pathogens in the clinical setting. We compared the performance of these instruments by sequencing an isolate of Escherichia coli O104:H4, which caused an outbreak of food poisoning in Germany in 2011. The MiSeq had the highest throughput per run (1.6 Gb/run, 60 Mb/h) and lowest error rates. The 454 GS Junior generated the longest reads (up to 600 bases) and most contiguous assemblies but had the lowest throughput (70 Mb/run, 9 Mb/h). Run in 100-bp mode, the Ion Torrent PGM had the highest throughput (80–100 Mb/h). Unlike the MiSeq, the Ion Torrent PGM and 454 GS Junior both produced homopolymer-associated indel errors (1.5 and 0.38 errors per 100 bases, respectively).
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Transforming clinical microbiology with bacterial genome sequencing.

            Whole-genome sequencing of bacteria has recently emerged as a cost-effective and convenient approach for addressing many microbiological questions. Here, we review the current status of clinical microbiology and how it has already begun to be transformed by using next-generation sequencing. We focus on three essential tasks: identifying the species of an isolate, testing its properties, such as resistance to antibiotics and virulence, and monitoring the emergence and spread of bacterial pathogens. We predict that the application of next-generation sequencing will soon be sufficiently fast, accurate and cheap to be used in routine clinical microbiology practice, where it could replace many complex current techniques with a single, more efficient workflow.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Diverse sources of C. difficile infection identified on whole-genome sequencing.

              It has been thought that Clostridium difficile infection is transmitted predominantly within health care settings. However, endemic spread has hampered identification of precise sources of infection and the assessment of the efficacy of interventions. From September 2007 through March 2011, we performed whole-genome sequencing on isolates obtained from all symptomatic patients with C. difficile infection identified in health care settings or in the community in Oxfordshire, United Kingdom. We compared single-nucleotide variants (SNVs) between the isolates, using C. difficile evolution rates estimated on the basis of the first and last samples obtained from each of 145 patients, with 0 to 2 SNVs expected between transmitted isolates obtained less than 124 days apart, on the basis of a 95% prediction interval. We then identified plausible epidemiologic links among genetically related cases from data on hospital admissions and community location. Of 1250 C. difficile cases that were evaluated, 1223 (98%) were successfully sequenced. In a comparison of 957 samples obtained from April 2008 through March 2011 with those obtained from September 2007 onward, a total of 333 isolates (35%) had no more than 2 SNVs from at least 1 earlier case, and 428 isolates (45%) had more than 10 SNVs from all previous cases. Reductions in incidence over time were similar in the two groups, a finding that suggests an effect of interventions targeting the transition from exposure to disease. Of the 333 patients with no more than 2 SNVs (consistent with transmission), 126 patients (38%) had close hospital contact with another patient, and 120 patients (36%) had no hospital or community contact with another patient. Distinct subtypes of infection continued to be identified throughout the study, which suggests a considerable reservoir of C. difficile. Over a 3-year period, 45% of C. difficile cases in Oxfordshire were genetically distinct from all previous cases. Genetically diverse sources, in addition to symptomatic patients, play a major part in C. difficile transmission. (Funded by the U.K. Clinical Research Collaboration Translational Infection Research Initiative and others.).
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Data curationRole: ResourcesRole: Writing – review & editing
                Role: SupervisionRole: Writing – review & editing
                Role: SupervisionRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS Comput Biol
                PLoS Comput. Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, CA USA )
                1553-734X
                1553-7358
                April 2018
                18 April 2018
                : 14
                : 4
                : e1006117
                Affiliations
                [1 ] Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
                [2 ] Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey, United States of America
                [3 ] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
                Duke University, UNITED STATES
                Author notes

                The authors have declared that no competing interests exist.

                Author information
                http://orcid.org/0000-0002-1776-8564
                http://orcid.org/0000-0002-4508-7969
                Article
                PCOMPBIOL-D-17-01856
                10.1371/journal.pcbi.1006117
                5927459
                29668677
                bdec05b3-60ff-475b-96dd-d5216bb56cef
                © 2018 De Maio et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 9 November 2017
                : 3 April 2018
                Page count
                Figures: 5, Tables: 0, Pages: 23
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/100004440, Wellcome Trust;
                Award ID: 101237/Z/13/Z
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/501100004211, Oxford Martin School, University of Oxford;
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000865, Bill and Melinda Gates Foundation;
                Award ID: OPP1091919
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/501100002141, Public Health England;
                Award Recipient :
                Daniel J. Wilson was supported by a Sir Henry Dale Fellowship, jointly funded by the Wellcome Trust ( www.wellcome.ac.uk/) and the Royal Society ( https://royalsociety.org/grant101237/Z/13/Z). CJW was supported by the Bill and Melinda Gates Foundation ( https://www.gatesfoundation.org/grantOPP1091919). NS is currently funded through a Public Health England ( https://www.gov.uk/government/organisations/public-health-england)/University of Oxford ( http://www.ox.ac.uk/) Clinical Lectureship. N.D.M. was supported by the Oxford Martin School ( http://www.oxfordmartin.ox.ac.uk/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Medicine and Health Sciences
                Epidemiology
                Genetic Epidemiology
                Medicine and Health Sciences
                Pathology and Laboratory Medicine
                Pathogens
                Biology and Life Sciences
                Molecular Biology
                Molecular Biology Techniques
                Sequencing Techniques
                Nucleotide Sequencing
                Research and Analysis Methods
                Molecular Biology Techniques
                Sequencing Techniques
                Nucleotide Sequencing
                Biology and Life Sciences
                Evolutionary Biology
                Evolutionary Systematics
                Phylogenetics
                Phylogenetic Analysis
                Biology and Life Sciences
                Taxonomy
                Evolutionary Systematics
                Phylogenetics
                Phylogenetic Analysis
                Computer and Information Sciences
                Data Management
                Taxonomy
                Evolutionary Systematics
                Phylogenetics
                Phylogenetic Analysis
                Biology and life sciences
                Genetics
                DNA
                DNA recombination
                Biology and life sciences
                Biochemistry
                Nucleic acids
                DNA
                DNA recombination
                Biology and Life Sciences
                Evolutionary Biology
                Evolutionary Genetics
                Medicine and Health Sciences
                Epidemiology
                Biology and Life Sciences
                Evolutionary Biology
                Population Genetics
                Biology and Life Sciences
                Genetics
                Population Genetics
                Biology and Life Sciences
                Population Biology
                Population Genetics
                Custom metadata
                vor-update-to-uncorrected-proof
                2018-04-30
                All relevant data are within the paper and its Supporting Information files.

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article