22
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      HiSSI: high-order SNP-SNP interactions detection based on efficient significant pattern and differential evolution

      research-article
      1 , 1 , 2 , 3 , 1 ,
      BMC Medical Genomics
      BioMed Central
      14th International Symposium on Bioinformatics Research and Applications (ISBRA'18) (ISBRA 2018)
      8-11 June 2018
      Genome-wide association studies, High-order SNP interactions, Statistically significant pattern, Family wise error rate, Differential evolution

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Detecting single nucleotide polymorphism (SNP) interactions is an important and challenging task in genome-wide association studies (GWAS). Various efforts have been devoted to detect SNP interactions. However, the large volume of SNP datasets results in such a big number of high-order SNP combinations that restrict the power of detecting interactions.

          Methods

          In this paper, to combat with this challenge, we propose a two-stage approach (called HiSSI) to detect high-order SNP-SNP interactions. In the screening stage, HiSSI employs a statistically significant pattern that takes into account family wise error rate, to control false positives and to effectively screen two-locus combinations candidate set. In the searching stage, HiSSI applies two different search strategies (exhaustive search and heuristic search based on differential evolution along with χ 2-test) on candidate pairwise SNP combinations to detect high-order SNP interactions.

          Results

          Extensive experiments on simulated datasets are conducted to evaluate HiSSI and recently proposed and related approaches on both two-locus and three-locus disease models. A real genome-wide dataset: breast cancer dataset collected from the Wellcome Trust Case Control Consortium (WTCCC) is also used to test HiSSI.

          Conclusions

          Simulated experiments on both two-locus and three-locus disease models show that HiSSI is more powerful than other related approaches. Real experiment on breast cancer dataset, in which HiSSI detects some significantly two-locus and three-locus interactions associated with breast cancer, again corroborate the effectiveness of HiSSI in high-order SNP-SNP interaction identification.

          Related collections

          Most cited references19

          • Record: found
          • Abstract: found
          • Article: not found

          Genome-wide strategies for detecting multiple loci that influence complex diseases.

          After nearly 10 years of intense academic and commercial research effort, large genome-wide association studies for common complex diseases are now imminent. Although these conditions involve a complex relationship between genotype and phenotype, including interactions between unlinked loci, the prevailing strategies for analysis of such studies focus on the locus-by-locus paradigm. Here we consider analytical methods that explicitly look for statistical interactions between loci. We show first that they are computationally feasible, even for studies of hundreds of thousands of loci, and second that even with a conservative correction for multiple testing, they can be more powerful than traditional analyses under a range of models for interlocus interactions. We also show that plausible variations across populations in allele frequencies among interacting loci can markedly affect the power to detect their marginal effects, which may account in part for the well-known difficulties in replicating association results. These results suggest that searching for interactions among genetic loci can be fruitfully incorporated into analysis strategies for genome-wide association studies.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Bayesian inference of epistatic interactions in case-control studies.

            Epistatic interactions among multiple genetic variants in the human genome may be important in determining individual susceptibility to common diseases. Although some existing computational methods for identifying genetic interactions have been effective for small-scale studies, we here propose a method, denoted 'bayesian epistasis association mapping' (BEAM), for genome-wide case-control studies. BEAM treats the disease-associated markers and their interactions via a bayesian partitioning model and computes, via Markov chain Monte Carlo, the posterior probability that each marker set is associated with the disease. Testing this on an age-related macular degeneration genome-wide association data set, we demonstrate that the method is significantly more powerful than existing approaches and that genome-wide case-control epistasis mapping with many thousands of markers is both computationally and statistically feasible.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Large-scale genotyping identifies 41 new loci associated with breast cancer risk.

              Breast cancer is the most common cancer among women. Common variants at 27 loci have been identified as associated with susceptibility to breast cancer, and these account for ∼9% of the familial risk of the disease. We report here a meta-analysis of 9 genome-wide association studies, including 10,052 breast cancer cases and 12,575 controls of European ancestry, from which we selected 29,807 SNPs for further genotyping. These SNPs were genotyped in 45,290 cases and 41,880 controls of European ancestry from 41 studies in the Breast Cancer Association Consortium (BCAC). The SNPs were genotyped as part of a collaborative genotyping experiment involving four consortia (Collaborative Oncological Gene-environment Study, COGS) and used a custom Illumina iSelect genotyping array, iCOGS, comprising more than 200,000 SNPs. We identified SNPs at 41 new breast cancer susceptibility loci at genome-wide significance (P < 5 × 10(-8)). Further analyses suggest that more than 1,000 additional loci are involved in breast cancer susceptibility.
                Bookmark

                Author and article information

                Contributors
                xiacao@email.swu.edu.cn
                jiel@email.swu.edu.cn
                guomaozu@bucea.edu.cn
                kingjun@swu.edu.cn
                Conference
                BMC Med Genomics
                BMC Med Genomics
                BMC Medical Genomics
                BioMed Central (London )
                1755-8794
                30 December 2019
                30 December 2019
                2019
                : 12
                Issue : Suppl 7 Issue sponsor : Publication of this supplement has not been supported by sponsorship. Information about the source of funding for publication charges can be found in the individual articles. The articles have undergone the journal's standard peer review process for supplements. The Supplement Editors declare that they have no competing interests.
                : 139
                Affiliations
                [1 ]GRID grid.263906.8, College of Computer and Information Science, Southwest University, ; Beibei, Chongqing, 400715 China
                [2 ]ISNI 0000 0000 8646 3057, GRID grid.411629.9, School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, ; Beijing, 100044 China
                [3 ]Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing, 100044 China
                Author information
                http://orcid.org/0000-0002-5890-0365
                Article
                584
                10.1186/s12920-019-0584-6
                6936079
                31888641
                fe7e7d68-04c5-4e9f-ac13-dea3548786ea
                © The Author(s) 2019

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                14th International Symposium on Bioinformatics Research and Applications (ISBRA'18)
                ISBRA 2018
                Beijing, China
                8-11 June 2018
                History
                : 28 August 2019
                : 10 September 2019
                Categories
                Research
                Custom metadata
                © The Author(s) 2019

                Genetics
                genome-wide association studies,high-order snp interactions,statistically significant pattern,family wise error rate,differential evolution

                Comments

                Comment on this article