10
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Generalizability of Polygenic Risk Scores for Breast Cancer Among Women With European, African, and Latinx Ancestry

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Key Points

          Question

          How do previously developed breast cancer polygenic risk scores (PRSs) perform in a clinical setting for women of different ancestries?

          Findings

          In this multicenter cohort study linking electronic medical records to genotyping data that including 39 591 women, PRSs were significantly associated with breast cancer risk in women of all ancestries, although the effect sizes were smaller in women with African ancestry.

          Meaning

          Previously developed PRS models for breast cancer risk performed well for women with European and Latinx ancestries in different clinical settings; these results suggest that larger studies are needed to develop and validate PRSs for women with African ancestry.

          Abstract

          This cohort study of multiple US medical centers with electronic medical records linked to genotype data examines the association of the ancestry of populations used to model polygenic risk scores for breast cancer with their ability to estimate cancer risk for women with different racial/ethnic ancestries.

          Abstract

          Importance

          Multiple polygenic risk scores (PRSs) for breast cancer have been developed from large research consortia; however, their generalizability to diverse clinical settings is unknown.

          Objective

          To examine the performance of previously developed breast cancer PRSs in a clinical setting for women of European, African, and Latinx ancestry.

          Design, Setting, and Participants

          This cohort study using the Electronic Medical Records and Genomics (eMERGE) network data set included 39 591 women from 9 contributing medical centers in the US that had electronic medical records (EMR) linked to genotype data. Breast cancer cases and controls were identified through a validated EMR phenotyping algorithm.

          Main Outcomes and Measures

          Multivariable logistic regression was used to assess the association between breast cancer risk and 7 previously developed PRSs, adjusting for age, study site, breast cancer family history, and first 3 ancestry informative principal components.

          Results

          This study included 39 591 women: 33 594 with European, 3801 with African, and 2196 with Latinx ancestry. The mean (SD) age at breast cancer diagnosis was 60.7 (13.0), 58.8 (12.5), and 60.1 (13.0) years for women with European, African, and Latinx ancestry, respectively. PRSs derived from women with European ancestry were associated with breast cancer risk in women with European ancestry (highest odds ratio [OR] per 1-SD increase, 1.46; 95% CI, 1.41-1.51), women with Latinx ancestry (highest OR, 1.31; 95% CI, 1.09-1.58), and women with African ancestry (OR, 1.19; 95% CI, 1.05-1.35). For women with European ancestry, this association with breast cancer risk was largest in the extremes of the PRS distribution, with ORs ranging from 2.19 (95% CI, 1.84-2.53) to 2.48 (95% CI, 1.89-3.25) for the 3 different PRSs examined for those in the highest 1% of the PRS compared with those in the middle quantile. Among women with Latinx and African ancestries at the extremes of the PRS distribution, there were no statistically significant associations.

          Conclusions and Relevance

          This cohort study found that PRS models derived from women with European ancestry for breast cancer risk generalized well for women with European, Latinx, and African ancestries across different clinical settings, although the effect sizes for women with African ancestry were smaller, likely because of differences in risk allele frequencies and linkage disequilibrium patterns. These results highlight the need to improve representation of diverse population groups, particularly women with African ancestry, in genomic research cohorts.

          Related collections

          Most cited references35

          • Record: found
          • Abstract: not found
          • Article: not found

          World Medical Association Declaration of Helsinki: ethical principles for medical research involving human subjects.

          (2013)
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            PLINK: a tool set for whole-genome association and population-based linkage analyses.

            Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Second-generation PLINK: rising to the challenge of larger and richer datasets

              PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format. To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information. The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.
                Bookmark

                Author and article information

                Journal
                JAMA Netw Open
                JAMA Netw Open
                JAMA Netw Open
                JAMA Network Open
                American Medical Association
                2574-3805
                4 August 2021
                August 2021
                4 August 2021
                : 4
                : 8
                : e2119084
                Affiliations
                [1 ]Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, New York
                [2 ]Department of Epidemiology, Columbia University Irving Medical Center, New York, New York
                [3 ]Division of Medical Oncology, Rutgers Cancer Institute of New Jersey, Robert Wood Johnson Medical School, New Brunswick, New Jersey
                [4 ]Department of Pediatrics, Columbia University Irving Medical Center, New York, New York
                [5 ]Department of Medicine, Columbia University Irving Medical Center, New York, New York
                [6 ]National Human Genome Research Institute, Bethesda, Maryland
                [7 ]Department of Medicine, University of Washington, Seattle
                [8 ]Department of Population Health Sciences, Geisinger, Danville, Pennsylvania
                [9 ]Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
                [10 ]Department of Bioethics and Humanities, University of Washington, Seattle
                [11 ]Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts
                [12 ]Kaiser Permanente Washington Health Research Institute, Seattle, Washington
                [13 ]Department of Cardiovascular Medicine, Mayo Clinic, Rochester, Minnesota
                [14 ]Department of Medicine, Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
                Author notes
                Article Information
                Accepted for Publication: May 23, 2021.
                Published: August 4, 2021. doi:10.1001/jamanetworkopen.2021.19084
                Open Access: This is an open access article distributed under the terms of the CC-BY License. © 2021 Liu C et al. JAMA Network Open.
                Corresponding Author: Cong Liu, PhD, Department of Biomedical Informatics, Columbia University, 622 W 168 St, PH-20, Rm 407, New York, NY 10032 ( cl3720@ 123456cumc.columbia.edu ).
                Author Contributions: Dr Weng had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. Drs Liu and Zeinomar contributed equally to this work as first authors. Drs Terry and Weng contributed equally as senior authors.
                Concept and design: Liu, Zeinomar, Kiryluk, Manolio, Weng.
                Acquisition, analysis, or interpretation of data: Liu, Zeinomar, Chung, Kiryluk, Gharavi, Hripcsak, Crew, Shang, Khan, Fasel, Jarvik, Rowley, Justice, Rahm, Fullerton, Smoller, Larson, Crane, Dikilitas, Wiesner, Bick, Terry, Weng.
                Drafting of the manuscript: Liu, Zeinomar, Gharavi, Manolio, Dikilitas.
                Critical revision of the manuscript for important intellectual content: Liu, Zeinomar, Chung, Kiryluk, Hripcsak, Crew, Shang, Khan, Fasel, Manolio, Jarvik, Rowley, Justice, Rahm, Fullerton, Smoller, Larson, Crane, Dikilitas, Wiesner, Bick, Terry, Weng.
                Statistical analysis: Liu, Zeinomar, Kiryluk, Khan, Bick, Terry.
                Obtained funding: Kiryluk, Gharavi, Hripcsak, Jarvik, Rahm, Larson, Weng.
                Administrative, technical, or material support: Liu, Hripcsak, Shang, Jarvik, Rowley, Justice, Rahm, Larson.
                Supervision: Kiryluk, Crew, Jarvik, Bick, Terry, Weng.
                Conflict of Interest Disclosures: Dr Gharavi reported receiving grants from Renal Research Institute and Natera, and reported service on the advisory board for Goldfinch Bio outside the submitted work. Dr Smoller reported receiving an honorarium for an internal seminar from Biogen, Inc outside the submitted work. No other disclosures were reported.
                Funding/Support: The eMERGE Network was initiated and funded by National Human Genome Research Institute (NHGRI) through the following grants: U01HG006828 (Cincinnati Children's Hospital Medical Center and Boston Children's Hospital); U01HG006830 (Children's Hospital of Philadelphia); U01HG006389 (Essentia Institute of Rural Health, Marshfield Clinic Research Foundation, and Pennsylvania State University); U01HG006382 (Geisinger Clinic); U01HG006375 (Group Health Cooperative and the University of Washington); U01HG006379 (Mayo Clinic); U01HG006380 (Icahn School of Medicine at Mount Sinai); U01HG006388 (Northwestern University); U01HG006378 (Vanderbilt University Medical Center); and U01HG006385 (Vanderbilt University Medical Center serving as the coordinating center). This phase of the eMERGE network was initiated and funded by the NHGRI through the following grants: U01HG8657 (Group Health Cooperative/University of Washington); U01HG8685 (Brigham and Women's Hospital); U01HG8672 (Vanderbilt University Medical Center); U01HG6379 (Mayo Clinic); U01HG8679 (Geisinger Clinic); U01HG8680 (Columbia University Health Sciences); U01HG8684 (Children's Hospital of Philadelphia); U01HG8673 (Northwestern University); U01HG8701 (Vanderbilt University Medical Center serving as the Coordinating Center); U01HG8676 (Partners Healthcare and the Broad Institute); U54MD007593 (Meharry Translational Research Center); and U01HG8664 (Baylor College of Medicine). Drs Weng and Liu received additional support from National Library of Medicine/National Human Genomic Research Institute Grant R01LM012895. Dr Zeinomar was supported by the National Institutes of Health (NIH) National Center for Advancing Translational Sciences (NCATS) TL1 Training Program, grant No. TL1TR001875.
                Role of the Funder/Sponsor: The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
                Disclaimer: The contents of this article are solely the responsibility of the authors and do not necessarily represent the official views of the National Institutes of Health.
                Additional Contributions: We would like to thank all the investigators and participants of the electronic Medical Records and Genomics (eMERGE) Network.
                Additional Information: The database of Genotypes and Phenotypes accession number for the imputed genotype data reported in this paper is phs001584 .v1.p1. The clinical data sets generated and analyzed during the current study contained personal health information and hence are not publicly available, but are available from the corresponding authors and the eMERGE consortium through authorized collaborations.
                Article
                zoi210566
                10.1001/jamanetworkopen.2021.19084
                8339934
                34347061
                ddac84a2-783e-4361-93ed-4456b648ed12
                Copyright 2021 Liu C et al. JAMA Network Open.

                This is an open access article distributed under the terms of the CC-BY License.

                History
                : 22 January 2021
                : 23 May 2021
                Categories
                Research
                Original Investigation
                Online Only
                Genetics and Genomics

                Comments

                Comment on this article