Variations in gene expression level might lead to phenotypic diversity across individuals or populations. Although many human genes are found to have differential mRNA levels between populations, the extent of gene expression that could vary within and between populations largely remains elusive. To investigate the dynamic range of gene expression, we analyzed the expression variability of ∼18, 000 human genes across individuals within HapMap populations. Although ∼20% of human genes show differentiated mRNA levels between populations, our results show that expression variability of most human genes in one population is not significantly deviant from another population, except for a small fraction that do show substantially higher expression variability in a particular population. By associating expression variability with sequence polymorphism, intriguingly, we found SNPs in the untranslated regions (5′ and 3′UTRs) of these variable genes show consistently elevated population heterozygosity. We performed differential expression analysis on a genome-wide scale, and found substantially reduced expression variability for a large number of genes, prohibiting them from being differentially expressed between populations. Functional analysis revealed that genes with the greatest within-population expression variability are significantly enriched for chemokine signaling in HIV-1 infection, and for HIV-interacting proteins that control viral entry, replication, and propagation. This observation combined with the finding that known human HIV host factors show substantially elevated expression variability, collectively suggest that gene expression variability might explain differential HIV susceptibility across individuals.
Many human genes have population-specific expression levels, which are linked to population-specific polymorphisms and copy-number variations. However, it is unclear whether human genes show similar dynamic range of expression between populations. In this work we analyzed HapMap gene expression compendium, and quantified the between-population and within-population expression variability for ∼18,000 human transcripts. We first concluded that the majority of the human genes have similar levels of within-population variability. However, a small fraction (∼4%) does show much higher expression variability in one population, and the deviation is consistently associated with increased SNP heterozygosity in their UTR regulatory regions. We further showed that genes with the greatest within-population expression variability are significantly enriched for chemokine signaling associated with HIV-1 infection. Combined with the finding that human HIV-1 host factors tend to have increased expression variability within populations, our analysis may explain, at least in part, different susceptibility to HIV infection within the human population. This work provides a fresh angle for analyzing gene expression variations in populations.