20
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Localization of adaptive variants in human genomes using averaged one-dependence estimation

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Statistical methods for identifying adaptive mutations from population genetic data face several obstacles: assessing the significance of genomic outliers, integrating correlated measures of selection into one analytic framework, and distinguishing adaptive variants from hitchhiking neutral variants. Here, we introduce SWIF(r), a probabilistic method that detects selective sweeps by learning the distributions of multiple selection statistics under different evolutionary scenarios and calculating the posterior probability of a sweep at each genomic site. SWIF(r) is trained using simulations from a user-specified demographic model and explicitly models the joint distributions of selection statistics, thereby increasing its power to both identify regions undergoing sweeps and localize adaptive mutations. Using array and exome data from 45 ‡Khomani San hunter-gatherers of southern Africa, we identify an enrichment of adaptive signals in genes associated with metabolism and obesity. SWIF(r) provides a transparent probabilistic framework for localizing beneficial mutations that is extensible to a variety of evolutionary scenarios.

          Abstract

          Selective sweeps are events in which beneficial mutations spread rapidly through a population. Here, Sugden et al. develop SWIF(r), a probabilistic classification framework for detecting and localizing selective sweeps, and apply it to genomic data from the ‡Khomani San.

          Related collections

          Most cited references75

          • Record: found
          • Abstract: not found
          • Article: not found

          Scikit-learn : machine learning in Python

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase.

            We present a statistical model for patterns of genetic variation in samples of unrelated individuals from natural populations. This model is based on the idea that, over short regions, haplotypes in a population tend to cluster into groups of similar haplotypes. To capture the fact that, because of recombination, this clustering tends to be local in nature, our model allows cluster memberships to change continuously along the chromosome according to a hidden Markov model. This approach is flexible, allowing for both "block-like" patterns of linkage disequilibrium (LD) and gradual decline in LD with distance. The resulting model is also fast and, as a result, is practicable for large data sets (e.g., thousands of individuals typed at hundreds of thousands of markers). We illustrate the utility of the model by applying it to dense single-nucleotide-polymorphism genotype data for the tasks of imputing missing genotypes and estimating haplotypic phase. For imputing missing genotypes, methods based on this model are as accurate or more accurate than existing methods. For haplotype estimation, the point estimates are slightly less accurate than those from the best existing methods (e.g., for unrelated Centre d'Etude du Polymorphisme Humain individuals from the HapMap project, switch error was 0.055 for our method vs. 0.051 for PHASE) but require a small fraction of the computational cost. In addition, we demonstrate that the model accurately reflects uncertainty in its estimates, in that probabilities computed using the model are approximately well calibrated. The methods described in this article are implemented in a software package, fastPHASE, which is available from the Stephens Lab Web site.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Genomic scans for selective sweeps using SNP data.

              Detecting selective sweeps from genomic SNP data is complicated by the intricate ascertainment schemes used to discover SNPs, and by the confounding influence of the underlying complex demographics and varying mutation and recombination rates. Current methods for detecting selective sweeps have little or no robustness to the demographic assumptions and varying recombination rates, and provide no method for correcting for ascertainment biases. Here, we present several new tests aimed at detecting selective sweeps from genomic SNP data. Using extensive simulations, we show that a new parametric test, based on composite likelihood, has a high power to detect selective sweeps and is surprisingly robust to assumptions regarding recombination rates and demography (i.e., has low Type I error). Our new test also provides estimates of the location of the selective sweep(s) and the magnitude of the selection coefficient. To illustrate the method, we apply our approach to data from the Seattle SNP project and to Chromosome 2 data from the HapMap project. In Chromosome 2, the most extreme signal is found in the lactase gene, which previously has been shown to be undergoing positive selection. Evidence for selective sweeps is also found in many other regions, including genes known to be associated with disease risk such as DPP10 and COL4A3.
                Bookmark

                Author and article information

                Contributors
                lauren_alpert@brown.edu
                sramachandran@brown.edu
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                19 February 2018
                19 February 2018
                2018
                : 9
                : 703
                Affiliations
                [1 ]ISNI 0000 0004 1936 9094, GRID grid.40263.33, Center for Computational Molecular Biology, , Brown University, ; Providence, RI 02912 USA
                [2 ]ISNI 0000 0004 1936 9094, GRID grid.40263.33, Department of Ecology and Evolutionary Biology, , Brown University, ; Providence, RI 02912 USA
                [3 ]ISNI 0000 0001 2216 9681, GRID grid.36425.36, Department of Ecology and Evolution, , Stony Brook University, ; Stony Brook, NY 11794 USA
                [4 ]ISNI 0000 0004 1936 9094, GRID grid.40263.33, Division of Applied Mathematics, , Brown University, ; Providence, RI 02912 USA
                [5 ]ISNI 0000 0004 1936 9094, GRID grid.40263.33, Department of Molecular Biology, Cell Biology, and Biochemistry, , Brown University, ; Providence, RI 02912 USA
                Author information
                http://orcid.org/0000-0003-3495-8359
                Article
                3100
                10.1038/s41467-018-03100-7
                5818606
                29459739
                969e54ba-4fc8-4d9d-b525-9b8a1efe93da
                © The Author(s) 2018

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 19 June 2017
                : 19 January 2018
                Categories
                Article
                Custom metadata
                © The Author(s) 2018

                Uncategorized
                Uncategorized

                Comments

                Comment on this article