135
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Microbial community resemblance methods differ in their ability to detect biologically relevant patterns

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The development of high-throughput sequencing methods allows for the characterization of microbial communities in a wide range of environments on an unprecedented scale. However, insight into microbial community composition is limited by our ability to detect patterns in this flood of sequences. Here we compare the performance of 51 analysis techniques using real and simulated bacterial 16S rRNA pyrosequencing datasets containing either clustered samples or samples arrayed across environmental gradients. We find that many diversity patterns are evident with severely undersampled communities, and that methods vary widely in their ability to detect gradients and clusters. Chi-squared distances and Pearson correlation distances perform especially well for detecting gradients, while Gower and Canberra distances perform especially well for detecting clusters. These results also provide a basis for understanding tradeoffs between number of samples and depth of coverage, tradeoffs which are important to consider when designing studies to characterize microbial communities.

          Related collections

          Most cited references20

          • Record: found
          • Abstract: found
          • Article: not found

          Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex.

          We constructed error-correcting DNA barcodes that allow one run of a massively parallel pyrosequencer to process up to 1,544 samples simultaneously. Using these barcodes we processed bacterial 16S rRNA gene sequences representing microbial communities in 286 environmental samples, corrected 92% of sample assignment errors, and thus characterized nearly as many 16S rRNA genes as have been sequenced to date by Sanger sequencing.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Fast UniFrac: Facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data

            Next-generation sequencing techniques, and PhyloChip, have made simultaneous phylogenetic analyses of hundreds of microbial communities possible. Insight into community structure has been limited by the inability to integrate and visualize such vast datasets. Fast UniFrac overcomes these issues, allowing integration of larger numbers of sequences and samples into a single analysis. Its new array-based implementation offers orders of magnitude improvements over the original version. New 3D visualization of principal coordinates analysis (PCoA) results, with the option to view multiple coordinate axes simultaneously, provides a powerful way to quickly identify patterns that relate vast numbers of microbial communities. We demonstrate the potential of Fast UniFrac using examples from three data types: Sanger-sequencing studies of diverse free-living and animal-associated bacterial assemblages and from the gut of obese humans as they diet, pyrosequencing data integrated from studies of the human hand and gut, and PhyloChip data from a study of citrus pathogens. We show that a Fast UniFrac analysis using a reference tree recaptures patterns that could not be detected without considering phylogenetic relationships and that Fast UniFrac, coupled with BLAST-based sequence assignment, can be used to quickly analyze pyrosequencing runs containing hundreds of thousands of sequences, revealing patterns relating human and gut samples. Finally, we show that the application of Fast UniFrac to PhyloChip data could identify well-defined subcategories associated with infection. Together, these case studies point the way towards a broad range of applications and demonstrate some of the new features of Fast UniFrac.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Measuring Biological Diversity

                Bookmark

                Author and article information

                Journal
                101215604
                32338
                Nat Methods
                Nature methods
                1548-7091
                1548-7105
                9 August 2010
                5 September 2010
                October 2010
                1 April 2011
                : 7
                : 10
                : 813-819
                Affiliations
                [1 ] Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, Colorado, USA
                [2 ] Department of Pathology, Yale University School of Medicine, New Haven, Connecticut, USA
                [3 ] Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado, USA
                [4 ] Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, Colorado, USA
                [5 ] Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, Colorado, USA
                [6 ] Howard Hughes Medical Institute, University of Colorado, Boulder, Colorado, USA
                Author notes
                [* ]correspondence to rob.knight@ 123456colorado.edu
                Article
                nihpa226887
                10.1038/nmeth.1499
                2948603
                20818378
                45b62f58-e65d-46e4-87eb-336ea052168c

                Users may view, print, copy, download and text and data- mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms

                History
                Funding
                Funded by: National Human Genome Research Institute : NHGRI
                Funded by: National Institute of Diabetes and Digestive and Kidney Diseases : NIDDK
                Funded by: Howard Hughes Medical Institute
                Award ID: R01 HG004872-03 ||HG
                Funded by: National Human Genome Research Institute : NHGRI
                Funded by: National Institute of Diabetes and Digestive and Kidney Diseases : NIDDK
                Funded by: Howard Hughes Medical Institute
                Award ID: P01 DK078669-030003 ||DK
                Funded by: National Human Genome Research Institute : NHGRI
                Funded by: National Institute of Diabetes and Digestive and Kidney Diseases : NIDDK
                Funded by: Howard Hughes Medical Institute
                Award ID: ||HHMI_
                Categories
                Article

                Life sciences
                beta-diversity,community analysis,ordination,clustering,pyrosequencing,high-throughput
                Life sciences
                beta-diversity, community analysis, ordination, clustering, pyrosequencing, high-throughput

                Comments

                Comment on this article