3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genome-wide risk prediction of common diseases across ancestries in one million people

      brief-report
      1 , 1 , 2 , 3 , 4 , 20 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 8 , 1 , The Biobank Japan Project 10 , FinnGen 22 , 3 , 4 , 11 , 2 , 4 , 11 , 8 , 12 , 8 , 6 , 13 , 14 , 15 , 16 , 21 , 1 , 17 , 18 , 1 , 3 , 4 , 1 , 3 , 19 , 3 , 4 , 5 , 1 , 17 , 19 , 23 ,
      Cell Genomics
      Elsevier, Inc
      polygenic risk score, health disparities, global health, ancestry, precision medicine, risk prediction, coronary artery disease, diabetes, breast cancer, prostate cancer

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Summary

          Polygenic risk scores (PRS) measure genetic disease susceptibility by combining risk effects across the genome. For coronary artery disease (CAD), type 2 diabetes (T2D), and breast and prostate cancer, we performed cross-ancestry evaluation of genome-wide PRSs in six biobanks in Europe, the United States, and Asia. We studied transferability of these highly polygenic, genome-wide PRSs across global ancestries, within European populations with different health-care systems, and local population substructures in a population isolate. All four PRSs had similar accuracy across European and Asian populations, with poorer transferability in the smaller group of individuals of African ancestry. The PRSs had highly similar effect sizes in different populations of European ancestry, and in early- and late-settlement regions with different recent population bottlenecks in Finland. Comparing genome-wide PRSs to PRSs containing a smaller number of variants, the highly polygenic, genome-wide PRSs generally displayed higher effect sizes and better transferability across global ancestries. Our findings indicate that in the populations investigated, the current genome-wide polygenic scores for common diseases have potential for clinical utility within different health-care settings for individuals of European ancestry, but that the utility in individuals of African ancestry is currently much lower.

          Highlights

          • An evaluation of cross-ancestry transferability of polygenic risk scores

          • Four common diseases in four global ancestry groups and across Europe were studied

          • PRS transferability was high across European ancestry and lowest for African ancestry

          • PRS transferability was good across population substructures in Finland

          Abstract

          Combining six biobanks in Europe, the United States, and Asia, Mars et al. evaluated cross-ancestry transferability of polygenic risk scores for four common diseases: coronary artery disease, type 2 diabetes, and breast and prostate cancer. They observed good cross-ancestry transferability between individuals with different European ancestry, but poorer transferability in individuals of African, South Asian, and East Asian ancestry, which highlights the need for diversity in polygenic risk score development for clinical translation.

          Related collections

          Most cited references77

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          A global reference for human genetic variation

          The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Second-generation PLINK: rising to the challenge of larger and richer datasets

            PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format. To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information. The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              The UK Biobank resource with deep phenotyping and genomic data

              The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.
                Bookmark

                Author and article information

                Contributors
                Journal
                Cell Genom
                Cell Genom
                Cell Genomics
                Elsevier, Inc
                2666-979X
                13 April 2022
                13 April 2022
                : 2
                : 4
                : None
                Affiliations
                [1 ]Institute for Molecular Medicine Finland, FIMM, HiLIFE, University of Helsinki, Biomedicum 2U, Tukholmankatu 8, 00290 Helsinki, Finland
                [2 ]Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
                [3 ]Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
                [4 ]Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
                [5 ]Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
                [6 ]Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
                [7 ]Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
                [8 ]K. G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Faculty of Medicine and Health, Norwegian University of Science and Technology, Trondheim, Norway
                [9 ]BioCore - Bioinformatics Core Facility, Norwegian University of Science and Technology, Trondheim, Norway
                [10 ]Institute of Medical Science, The University of Tokyo, Tokyo, Japan
                [11 ]Harvard Medical School, Boston, MA, USA
                [12 ]HUNT Research Center, Department of Public Health and Nursing, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, Trondheim, Norway
                [13 ]Department of Computational Biology and Medical Sciences, Graduate school of Frontier Sciences, the University of Tokyo, Tokyo, Japan
                [14 ]Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
                [15 ]Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
                [16 ]Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
                [17 ]Department of Public Health, University of Helsinki, Helsinki, Finland
                [18 ]Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
                [19 ]Broad Institute of MIT and Harvard, Cambridge, MA, USA
                [20 ]Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
                [21 ]Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
                Author notes
                []Corresponding author samuli.ripatti@ 123456helsinki.fi
                [22]

                Further details can be found in the supplemental information

                [23]

                Lead contact

                Article
                S2666-979X(22)00042-8 100118
                10.1016/j.xgen.2022.100118
                9010308
                35591975
                4a91b39e-d30b-4357-b383-1ba3ba266b93
                © 2022 The Author(s)

                This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

                History
                : 21 January 2021
                : 24 August 2021
                : 18 March 2022
                Categories
                Short Article

                polygenic risk score,health disparities,global health,ancestry,precision medicine,risk prediction,coronary artery disease,diabetes,breast cancer,prostate cancer

                Comments

                Comment on this article