58
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Gene Expression Variability within and between Human Populations and Implications toward Disease Susceptibility

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Variations in gene expression level might lead to phenotypic diversity across individuals or populations. Although many human genes are found to have differential mRNA levels between populations, the extent of gene expression that could vary within and between populations largely remains elusive. To investigate the dynamic range of gene expression, we analyzed the expression variability of ∼18, 000 human genes across individuals within HapMap populations. Although ∼20% of human genes show differentiated mRNA levels between populations, our results show that expression variability of most human genes in one population is not significantly deviant from another population, except for a small fraction that do show substantially higher expression variability in a particular population. By associating expression variability with sequence polymorphism, intriguingly, we found SNPs in the untranslated regions (5′ and 3′UTRs) of these variable genes show consistently elevated population heterozygosity. We performed differential expression analysis on a genome-wide scale, and found substantially reduced expression variability for a large number of genes, prohibiting them from being differentially expressed between populations. Functional analysis revealed that genes with the greatest within-population expression variability are significantly enriched for chemokine signaling in HIV-1 infection, and for HIV-interacting proteins that control viral entry, replication, and propagation. This observation combined with the finding that known human HIV host factors show substantially elevated expression variability, collectively suggest that gene expression variability might explain differential HIV susceptibility across individuals.

          Author Summary

          Many human genes have population-specific expression levels, which are linked to population-specific polymorphisms and copy-number variations. However, it is unclear whether human genes show similar dynamic range of expression between populations. In this work we analyzed HapMap gene expression compendium, and quantified the between-population and within-population expression variability for ∼18,000 human transcripts. We first concluded that the majority of the human genes have similar levels of within-population variability. However, a small fraction (∼4%) does show much higher expression variability in one population, and the deviation is consistently associated with increased SNP heterozygosity in their UTR regulatory regions. We further showed that genes with the greatest within-population expression variability are significantly enriched for chemokine signaling associated with HIV-1 infection. Combined with the finding that human HIV-1 host factors tend to have increased expression variability within populations, our analysis may explain, at least in part, different susceptibility to HIV infection within the human population. This work provides a fresh angle for analyzing gene expression variations in populations.

          Related collections

          Most cited references38

          • Record: found
          • Abstract: found
          • Article: not found

          DAVID: Database for Annotation, Visualization, and Integrated Discovery.

          Functional annotation of differentially expressed genes is a necessary and critical step in the analysis of microarray data. The distributed nature of biological knowledge frequently requires researchers to navigate through numerous web-accessible databases gathering information one gene at a time. A more judicious approach is to provide query-based access to an integrated database that disseminates biologically rich information across large datasets and displays graphic summaries of functional information. Database for Annotation, Visualization, and Integrated Discovery (DAVID; http://www.david.niaid.nih.gov) addresses this need via four web-based analysis modules: 1) Annotation Tool - rapidly appends descriptive data from several public databases to lists of genes; 2) GoCharts - assigns genes to Gene Ontology functional categories based on user selected classifications and term specificity level; 3) KeggCharts - assigns genes to KEGG metabolic processes and enables users to view genes in the context of biochemical pathway maps; and 4) DomainCharts - groups genes according to PFAM conserved protein domains. Analysis results and graphical displays remain dynamically linked to primary data and external data repositories, thereby furnishing in-depth as well as broad-based data coverage. The functionality provided by DAVID accelerates the analysis of genome-scale datasets by facilitating the transition from data collection to biological meaning.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Global variation in copy number in the human genome.

            Copy number variation (CNV) of DNA sequences is functionally significant but has yet to be fully ascertained. We have constructed a first-generation CNV map of the human genome through the study of 270 individuals from four populations with ancestry in Europe, Africa or Asia (the HapMap collection). DNA from these individuals was screened for CNV using two complementary technologies: single-nucleotide polymorphism (SNP) genotyping arrays, and clone-based comparative genomic hybridization. A total of 1,447 copy number variable regions (CNVRs), which can encompass overlapping or adjacent gains or losses, covering 360 megabases (12% of the genome) were identified in these populations. These CNVRs contained hundreds of genes, disease loci, functional elements and segmental duplications. Notably, the CNVRs encompassed more nucleotide content per genome than SNPs, underscoring the importance of CNV in genetic diversity and evolution. The data obtained delineate linkage disequilibrium patterns for many CNVs, and reveal marked variation in copy number among populations. We also demonstrate the utility of this resource for genetic disease studies.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Stochasticity in gene expression: from theories to phenotypes.

              Genetically identical cells exposed to the same environmental conditions can show significant variation in molecular content and marked differences in phenotypic characteristics. This variability is linked to stochasticity in gene expression, which is generally viewed as having detrimental effects on cellular function with potential implications for disease. However, stochasticity in gene expression can also be advantageous. It can provide the flexibility needed by cells to adapt to fluctuating environments or respond to sudden stresses, and a mechanism by which population heterogeneity can be established during cellular differentiation and development.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Comput Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, USA )
                1553-734X
                1553-7358
                August 2010
                August 2010
                26 August 2010
                : 6
                : 8
                : e1000910
                Affiliations
                [1 ]Department of Molecular Genetics, University of Toronto, Toronto, Canada
                [2 ]Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
                [3 ]Banting and Best Department of Medical Research, University of Toronto, Toronto, Canada
                [4 ]Department of Computer Science, University of Toronto, Toronto, Canada
                Tufts University, United States of America
                Author notes

                Conceived and designed the experiments: JL ZZ. Analyzed the data: JL YL TK RM. Wrote the paper: JL ZZ.

                Article
                10-PLCB-RA-1646R3
                10.1371/journal.pcbi.1000910
                2928754
                20865155
                f4d6f570-32c1-4aae-a257-de18d07c9a47
                Li et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                History
                : 10 January 2010
                : 28 July 2010
                Page count
                Pages: 10
                Categories
                Research Article
                Computational Biology/Genomics
                Computational Biology/Molecular Genetics
                Computational Biology/Population Genetics

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article