32
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable.

          Results

          We present Multiresolution Correlation Analysis (MCA), a method for visually identifying subpopulations based on the local pairwise correlation between covariates, without needing to define an a priori interaction scale. We demonstrate that MCA facilitates the identification of differentially regulated subpopulations in simulated data from a small gene regulatory network, followed by application to previously published single-cell qPCR data from mouse embryonic stem cells. We show that MCA recovers previously identified subpopulations, provides additional insight into the underlying correlation structure, reveals potentially spurious compartmentalizations, and provides insight into novel subpopulations.

          Conclusions

          MCA is a useful method for the identification of subpopulations in low-dimensional expression data, as emerging from qPCR or FACS measurements. With MCA it is possible to investigate the robustness of covariate correlations with respect subpopulations, graphically identify outliers, and identify factors contributing to differential regulation between pairs of covariates. MCA thus provides a framework for investigation of expression correlations for genes of interests and biological hypothesis generation.

          Related collections

          Most cited references26

          • Record: found
          • Abstract: found
          • Article: not found

          Nature, nurture, or chance: stochastic gene expression and its consequences.

          Gene expression is a fundamentally stochastic process, with randomness in transcription and translation leading to cell-to-cell variations in mRNA and protein levels. This variation appears in organisms ranging from microbes to metazoans, and its characteristics depend both on the biophysical parameters governing gene expression and on gene network structure. Stochastic gene expression has important consequences for cellular function, being beneficial in some contexts and harmful in others. These situations include the stress response, metabolism, development, the cell cycle, circadian rhythms, and aging.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Transcriptome-wide noise controls lineage choice in mammalian progenitor cells.

            Phenotypic cell-to-cell variability within clonal populations may be a manifestation of 'gene expression noise', or it may reflect stable phenotypic variants. Such 'non-genetic cell individuality' can arise from the slow fluctuations of protein levels in mammalian cells. These fluctuations produce persistent cell individuality, thereby rendering a clonal population heterogeneous. However, it remains unknown whether this heterogeneity may account for the stochasticity of cell fate decisions in stem cells. Here we show that in clonal populations of mouse haematopoietic progenitor cells, spontaneous 'outlier' cells with either extremely high or low expression levels of the stem cell marker Sca-1 (also known as Ly6a; ref. 9) reconstitute the parental distribution of Sca-1 but do so only after more than one week. This slow relaxation is described by a gaussian mixture model that incorporates noise-driven transitions between discrete subpopulations, suggesting hidden multi-stability within one cell type. Despite clonality, the Sca-1 outliers had distinct transcriptomes. Although their unique gene expression profiles eventually reverted to that of the median cells, revealing an attractor state, they lasted long enough to confer a greatly different proclivity for choosing either the erythroid or the myeloid lineage. Preference in lineage choice was associated with increased expression of lineage-specific transcription factors, such as a >200-fold increase in Gata1 (ref. 10) among the erythroid-prone cells, or a >15-fold increased PU.1 (Sfpi1) (ref. 11) expression among myeloid-prone cells. Thus, clonal heterogeneity of gene expression level is not due to independent noise in the expression of individual genes, but reflects metastable states of a slowly fluctuating transcriptome that is distinct in individual cells and may govern the reversible, stochastic priming of multipotent progenitor cells in cell fate decision.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Mass cytometry: technique for real time single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass spectrometry.

              A novel instrument for real time analysis of individual biological cells or other microparticles is described. The instrument is based on inductively coupled plasma time-of-flight mass spectrometry and comprises a three-aperture plasma-vacuum interface, a dc quadrupole turning optics for decoupling ions from neutral components, an rf quadrupole ion guide discriminating against low-mass dominant plasma ions, a point-to-parallel focusing dc quadrupole doublet, an orthogonal acceleration reflectron analyzer, a discrete dynode fast ion detector, and an 8-bit 1 GHz digitizer. A high spectrum generation frequency of 76.8 kHz provides capability for collecting multiple spectra from each particle-induced transient ion cloud, typically of 200-300 micros duration. It is shown that the transients can be resolved and characterized individually at a peak frequency of 1100 particles per second. Design considerations and optimization data are presented. The figures of merit of the instrument are measured under standard inductively coupled plasma (ICP) operating conditions ( 900 for m/z = 159, the sensitivity with a standard sample introduction system of >1.4 x 10(8) ion counts per second per mg L(-1) of Tb and an abundance sensitivity of (6 x 10(-4))-(1.4 x 10(-3)) (trailing and leading masses, respectively) are shown. The mass range (m/z = 125-215) and abundance sensitivity are sufficient for elemental immunoassay with up to 60 distinct available elemental tags. When 500) can be used, which provides >2.4 x 10(8) cps per mg L(-1) of Tb, at (1.5 x 10(-3))-(5.0 x 10(-3)) abundance sensitivity. The real-time simultaneous detection of multiple isotopes from individual 1.8 microm polystyrene beads labeled with lanthanides is shown. A real time single cell 20 antigen expression assay of model cell lines and leukemia patient samples immuno-labeled with lanthanide-tagged antibodies is presented.
                Bookmark

                Author and article information

                Contributors
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central
                1471-2105
                2014
                11 July 2014
                : 15
                : 240
                Affiliations
                [1 ]Institute of Computational Biology, Helmholtz Zentrum München, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
                [2 ]Department of Mathematics, Technische Universität München, Boltzmannstrasse, 3, 85747 Garching, Germany
                Article
                1471-2105-15-240
                10.1186/1471-2105-15-240
                4227291
                25015590
                102b7504-fa66-4f1e-8607-cca1c8e34ec2
                Copyright © 2014 Feigelman et al.; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 23 April 2014
                : 4 July 2014
                Categories
                Research Article

                Bioinformatics & Computational biology
                multiresolution,correlation,subpopulation identification,qpcr analysis

                Comments

                Comment on this article