74
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An Evolutionary Model-Based Algorithm for Accurate Phylogenetic Breakpoint Mapping and Subtype Prediction in HIV-1

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Genetically diverse pathogens (such as Human Immunodeficiency virus type 1, HIV-1) are frequently stratified into phylogenetically or immunologically defined subtypes for classification purposes. Computational identification of such subtypes is helpful in surveillance, epidemiological analysis and detection of novel variants, e.g., circulating recombinant forms in HIV-1. A number of conceptually and technically different techniques have been proposed for determining the subtype of a query sequence, but there is not a universally optimal approach. We present a model-based phylogenetic method for automatically subtyping an HIV-1 (or other viral or bacterial) sequence, mapping the location of breakpoints and assigning parental sequences in recombinant strains as well as computing confidence levels for the inferred quantities. Our Subtype Classification Using Evolutionary ALgorithms (SCUEAL) procedure is shown to perform very well in a variety of simulation scenarios, runs in parallel when multiple sequences are being screened, and matches or exceeds the performance of existing approaches on typical empirical cases. We applied SCUEAL to all available polymerase (pol) sequences from two large databases, the Stanford Drug Resistance database and the UK HIV Drug Resistance Database. Comparing with subtypes which had previously been assigned revealed that a minor but substantial (≈5%) fraction of pure subtype sequences may in fact be within- or inter-subtype recombinants. A free implementation of SCUEAL is provided as a module for the HyPhy package and the Datamonkey web server. Our method is especially useful when an accurate automatic classification of an unknown strain is desired, and is positioned to complement and extend faster but less accurate methods. Given the increasingly frequent use of HIV subtype information in studies focusing on the effect of subtype on treatment, clinical outcome, pathogenicity and vaccine design, the importance of accurate, robust and extensible subtyping procedures is clear.

          Author Summary

          There are nine different subtypes of the main group of HIV-1, each originating as a distinct subepidemic of HIV-1. The distribution of subtypes is often unique to a given geographic region of the world and constitutes a useful epidemiological and surveillance resource. The effects of viral subtype on disease progression, treatment outcome and vaccine design are being actively researched, and the importance of accurate subtyping procedures is clear. In HIV-1, subtype assignment is complicated by frequent recombination among co-circulating strains, creating new genetic mosaics or recombinant forms: 43 have been characterized to date, and many more likely exist. We present an automated phylogenetic method (SCUEAL) to accurately characterize both simple and complex HIV-1 mosaics. Using computer simulations and biological data we demonstrate that SCUEAL performs very well under various conditions, especially when some of the existing classification procedures fail. Furthermore, we show that a small, but noticeable proportion of subtype characterization stored in public databases may be incomplete or incorrect. The computational technique introduced here should provide a much more accurate characterization of HIV-1 strains, especially novel recombinants, and lead to new insights into molecular history, epidemiology and geographical distribution of the virus.

          Related collections

          Most cited references50

          • Record: found
          • Abstract: found
          • Article: not found

          A modified bootscan algorithm for automated identification of recombinant sequences and recombination breakpoints.

          We have developed a modified BOOTSCAN algorithm that may be used to screen nucleotide sequence alignments for evidence of recombination without prior identification of nonrecombinant reference sequences. The algorithm is fast and includes a Bonferroni corrected statistical test of recombination to circumvent the multiple testing problems encountered when using the BOOTSCAN method to explore alignments for evidence of recombination. Using both simulated and real datasets we demonstrate that the modified algorithm is more powerful than other phylogenetic recombination detection methods and performs almost as well as one of the best substitution distribution recombination detection methods.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Identification of breakpoints in intergenotypic recombinants of HIV type 1 by bootscanning.

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              HIV-1 nomenclature proposal.

                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Comput Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, USA )
                1553-734X
                1553-7358
                November 2009
                November 2009
                26 November 2009
                : 5
                : 11
                : e1000581
                Affiliations
                [1 ]Department of Medicine, University of California San Diego, La Jolla, California, United States of America
                [2 ]Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, Spain
                [3 ]Monogram Biosciences, South San Francisco, California, United States of America
                [4 ]Department of Pathology, University of California San Diego, La Jolla, California, United States of America
                [5 ]Health Protection Agency East of England Regional Epidemiology Unit, Cambridge, United Kingdom
                [6 ]Medical Research Council Clinical Trials Unit, London, United Kingdom
                [7 ]School of Medicine, University of Swansea, Swansea, United Kingdom
                [8 ]Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
                [9 ]Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
                Imperial College London, United Kingdom
                Author notes

                Conceived and designed the experiments: SLKP DP AJLB SDWF. Performed the experiments: SLKP. Analyzed the data: SLKP DP ES GH. Contributed reagents/materials/analysis tools: SLKP DP ES CC AFYP GH EF MBG SDWF. Wrote the paper: SLKP DP AFYP SDWF.

                Article
                09-PLCB-RA-0151R4
                10.1371/journal.pcbi.1000581
                2776870
                19956739
                216401e9-15ce-4ffa-9ac2-1785b78fc6c2
                Kosakovsky Pond et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                History
                : 10 February 2009
                : 28 October 2009
                Page count
                Pages: 21
                Categories
                Research Article
                Computational Biology/Comparative Sequence Analysis
                Computational Biology/Evolutionary Modeling
                Computer Science/Applications
                Mathematics/Statistics
                Virology/Immunodeficiency Viruses

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article