Mapping the Genetic Architecture of Gene Expression in Human Liver

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Genetic variants that are associated with common human diseases do not lead directly to disease, but instead act on intermediate, molecular phenotypes that in turn induce changes in higher-order disease traits. Therefore, identifying the molecular phenotypes that vary in response to changes in DNA and that also associate with changes in disease traits has the potential to provide the functional information required to not only identify and validate the susceptibility genes that are directly affected by changes in DNA, but also to understand the molecular networks in which such genes operate and how changes in these networks lead to changes in disease traits. Toward that end, we profiled more than 39,000 transcripts and we genotyped 782,476 unique single nucleotide polymorphisms (SNPs) in more than 400 human liver samples to characterize the genetic architecture of gene expression in the human liver, a metabolically active tissue that is important in a number of common human diseases, including obesity, diabetes, and atherosclerosis. This genome-wide association study of gene expression resulted in the detection of more than 6,000 associations between SNP genotypes and liver gene expression traits, where many of the corresponding genes identified have already been implicated in a number of human diseases. The utility of these data for elucidating the causes of common human diseases is demonstrated by integrating them with genotypic and expression data from other human and mouse populations. This provides much-needed functional support for the candidate susceptibility genes being identified at a growing number of genetic loci that have been identified as key drivers of disease from genome-wide association studies of disease. By using an integrative genomics approach, we highlight how the gene RPS26 and not ERBB3 is supported by our data as the most likely susceptibility gene for a novel type 1 diabetes locus recently identified in a large-scale, genome-wide association study. We also identify SORT1 and CELSR2 as candidate susceptibility genes for a locus recently associated with coronary artery disease and plasma low-density lipoprotein cholesterol levels in the process.

Author Summary

Genome-wide association studies seek to identify regions of the genome in which changes in DNA in a given population are correlated with disease, drug response, or other phenotypes of interest. However, changes in DNA that associate with traits like common human diseases do not lead directly to disease, but instead act on intermediate, molecular phenotypes that in turn induce changes in the higher-order disease traits. Therefore, identifying molecular phenotypes that vary in response to changes in DNA that also associate with changes in disease traits can provide the functional information necessary to not only identify and validate the susceptibility genes directly affected by changes in DNA, but to understand as well the molecular networks in which such genes operate and how changes in these networks lead to changes in disease traits. To enable this type of approach we profiled the expression levels of 39,280 transcripts and genotyped 782,476 SNPs in 427 human liver samples, identifying thousands of DNA variants that strongly associated with liver gene expression. These relationships were then leveraged by integrating them with genotypic and expression data from other human and mouse populations, leading to the direct identification of candidate susceptibility genes corresponding to genetic loci identified as key drivers of disease. Our analysis is able to provide much needed functional support for these candidate susceptibility genes.

Abstract

Identifying changes in DNA that associate with changes in gene expression in human tissues elucidates the genetic architecture of gene expression in human populations and enables the direct identification of functionally supported candidate susceptibility genes in genomic regions associated with disease.

Related collections

Most cited references 35

Record: found
Abstract: found
Article: not found

A genome-wide association study identifies novel risk loci for type 2 diabetes.

Robert Sladek, Ghislain Rocheleau, Johan Rung … (2007)

Type 2 diabetes mellitus results from the interaction of environmental factors with a combination of genetic variants, most of which were hitherto unknown. A systematic search for these variants was recently made possible by the development of high-density arrays that permit the genotyping of hundreds of thousands of polymorphisms. We tested 392,935 single-nucleotide polymorphisms in a French case-control cohort. Markers with the most significant difference in genotype frequencies between cases of type 2 diabetes and controls were fast-tracked for testing in a second cohort. This identified four loci containing variants that confer type 2 diabetes risk, in addition to confirming the known association with the TCF7L2 gene. These loci include a non-synonymous polymorphism in the zinc transporter SLC30A8, which is expressed exclusively in insulin-producing beta-cells, and two linkage disequilibrium blocks that contain genes potentially involved in beta-cell development or function (IDE-KIF11-HHEX and EXT2-ALX4). These associations explain a substantial portion of disease risk and constitute proof of principle for the genome-wide approach to the elucidation of complex genetic traits.

0 comments Cited 759 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Genomewide association analysis of coronary artery disease.

Nilesh J. Samani, Jeanette Erdmann, Alistair S. Hall … (2007)

Modern genotyping platforms permit a systematic search for inherited components of complex diseases. We performed a joint analysis of two genomewide association studies of coronary artery disease. We first identified chromosomal loci that were strongly associated with coronary artery disease in the Wellcome Trust Case Control Consortium (WTCCC) study (which involved 1926 case subjects with coronary artery disease and 2938 controls) and looked for replication in the German MI [Myocardial Infarction] Family Study (which involved 875 case subjects with myocardial infarction and 1644 controls). Data on other single-nucleotide polymorphisms (SNPs) that were significantly associated with coronary artery disease in either study (P 80%) of a true association: chromosomes 1p13.3 (rs599839), 1q41 (rs17465637), 10q11.21 (rs501120), and 15q22.33 (rs17228212). We identified several genetic loci that, individually and in aggregate, substantially affect the risk of development of coronary artery disease. Copyright 2007 Massachusetts Medical Society.

0 comments Cited 527 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Genetics of gene expression and its effect on disease.

Valur Emilsson, Gudmar Thorleifsson, Bin Zhang … (2008)

Common human diseases result from the interplay of many genes and environmental factors. Therefore, a more integrative biology approach is needed to unravel the complexity and causes of such diseases. To elucidate the complexity of common human diseases such as obesity, we have analysed the expression of 23,720 transcripts in large population-based blood and adipose tissue cohorts comprehensively assessed for various phenotypes, including traits related to clinical obesity. In contrast to the blood expression profiles, we observed a marked correlation between gene expression in adipose tissue and obesity-related traits. Genome-wide linkage and association mapping revealed a highly significant genetic component to gene expression traits, including a strong genetic effect of proximal (cis) signals, with 50% of the cis signals overlapping between the two tissues profiled. Here we demonstrate an extensive transcriptional network constructed from the human adipose data that exhibits significant overlap with similar network modules constructed from mouse adipose data. A core network module in humans and mice was identified that is enriched for genes involved in the inflammatory and immune response and has been found to be causally associated to obesity-related traits.

0 comments Cited 512 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

: Role: Academic Editor

Journal

Journal ID (nlm-ta): PLoS Biol

Journal ID (publisher-id): pbio

Journal ID (publisher-id): plbi

Journal ID (pmc): plosbiol

Title: PLoS Biology

Publisher: Public Library of Science (San Francisco, USA )

ISSN (Print): 1544-9173

ISSN (Electronic): 1545-7885

Publication date (Print): May 2008

Publication date (Electronic): 6 May 2008

Volume: 6

Issue: 5

Electronic Location Identifier: e107

Affiliations

[1 ] Rosetta Inpharmatics, Seattle, Washington, United States of America

[2 ] Department of Biostatistics, University of Washington, Seattle, Washington, United States of America

[3 ] Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America

[4 ] Department of Microbiology, Molecular Genetics, and Immunology, University of California Los Angeles, Los Angeles, California, United States of America

[5 ] Department of Medicine, University of California Los Angeles, Los Angeles, California, United States of America

[6 ] Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America

[7 ] Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, California, United States of America

[8 ] Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America

[9 ] Center of Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America

[10 ] Department of Pathology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America

[11 ] Department of Pharmaceutical Sciences, Saint Jude Children's Research Hospital, Memphis, Tennessee, United States of America

[12 ] Drug Metabolism, Merck and Company, West Point, Pennsylvania, United States of America

University of Michigan, United States of America

Author notes

* To whom correspondence should be addressed. E-mail: eric_schadt@ 123456merck.com

Article

Publisher ID: 07-PLBI-RA-4030R3 Serial Item and Contribution ID: plbi-06-05-03

DOI: 10.1371/journal.pbio.0060107

PMC ID: 2365981

PubMed ID: 18462017

SO-VID: c3a908a6-daef-4a53-8146-5daa49948b1e

Copyright © Copyright: © 2008 Schadt et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

History

Date received : 3 December 2007

Date accepted : 18 March 2008

Page count

Pages: 13

Custom metadata

citation Schadt EE, Molony C, Chudin E, Hao K, Yang X, et al. (2008) Mapping the genetic architecture of gene expression in human liver. PLoS Biol 6(5): e107. doi: 10.1371/journal.pbio.0060107

Mapping the Genetic Architecture of Gene Expression in Human Liver

Read this article at

Abstract

Author Summary

Abstract

Related collections

Higher order chromatin architecture

Most cited references 35

A genome-wide association study identifies novel risk loci for type 2 diabetes.

Genomewide association analysis of coronary artery disease.

Genetics of gene expression and its effect on disease.

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Page count

Categories

Custom metadata

Comments

Comment on this article

Similar content 41

Cited by 319

Most referenced authors 2,467