8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Computing the Riemannian curvature of image patch and single-cell RNA sequencing data manifolds using extrinsic differential geometry

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Significance

          High-dimensional datasets are becoming increasingly prevalent in many scientific fields. A universal theme connecting these high-dimensional datasets is the ansatz that data points are constrained to lie on nonlinear low-dimensional manifolds, whose structure is dictated by the natural laws governing the data. While tools have been developed for estimating global properties of these data manifolds, estimating the Riemannian curvature, a local property, has not been considered. Computing curvature of data manifolds offers both detailed criteria with which to evaluate models of these complex data (e.g., a Klein bottle model of image patches) and a way to explore detailed geometric features that cannot simply be visualized by the naked eye (e.g., in single-cell RNA-sequencing data).

          Abstract

          Most high-dimensional datasets are thought to be inherently low-dimensional—that is, data points are constrained to lie on a low-dimensional manifold embedded in a high-dimensional ambient space. Here, we study the viability of two approaches from differential geometry to estimate the Riemannian curvature of these low-dimensional manifolds. The intrinsic approach relates curvature to the Laplace–Beltrami operator using the heat-trace expansion and is agnostic to how a manifold is embedded in a high-dimensional space. The extrinsic approach relates the ambient coordinates of a manifold’s embedding to its curvature using the Second Fundamental Form and the Gauss–Codazzi equation. We found that the intrinsic approach fails to accurately estimate the curvature of even a two-dimensional constant-curvature manifold, whereas the extrinsic approach was able to handle more complex toy models, even when confounded by practical constraints like small sample sizes and measurement noise. To test the applicability of the extrinsic approach to real-world data, we computed the curvature of a well-studied manifold of image patches and recapitulated its topological classification as a Klein bottle. Lastly, we applied the extrinsic approach to study single-cell transcriptomic sequencing (scRNAseq) datasets of blood, gastrulation, and brain cells to quantify the Riemannian curvature of scRNAseq manifolds.

          Related collections

          Most cited references61

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Massively parallel digital transcriptional profiling of single cells

          Characterizing the transcriptome of individual cells is fundamental to understanding complex biological systems. We describe a droplet-based system that enables 3′ mRNA counting of tens of thousands of single cells per sample. Cell encapsulation, of up to 8 samples at a time, takes place in ∼6 min, with ∼50% cell capture efficiency. To demonstrate the system's technical performance, we collected transcriptome data from ∼250k single cells across 29 samples. We validated the sensitivity of the system and its ability to detect rare populations using cell lines and synthetic RNAs. We profiled 68k peripheral blood mononuclear cells to demonstrate the system's ability to characterize large immune populations. Finally, we used sequence variation in the transcriptome data to determine host and donor chimerism at single-cell resolution from bone marrow mononuclear cells isolated from transplant patients.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets.

            Cells, the basic units of biological structure and function, vary broadly in type and state. Single-cell genomics can characterize cell identity and function, but limitations of ease and scale have prevented its broad application. Here we describe Drop-seq, a strategy for quickly profiling thousands of individual cells by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together. Drop-seq analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts' cell of origin. We analyzed transcriptomes from 44,808 mouse retinal cells and identified 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes. Drop-seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution. VIDEO ABSTRACT.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Dimensionality reduction for visualizing single-cell data using UMAP

              Advances in single-cell technologies have enabled high-resolution dissection of tissue composition. Several tools for dimensionality reduction are available to analyze the large number of parameters generated in single-cell studies. Recently, a nonlinear dimensionality-reduction technique, uniform manifold approximation and projection (UMAP), was developed for the analysis of any type of high-dimensional data. Here we apply it to biological data, using three well-characterized mass cytometry and single-cell RNA sequencing datasets. Comparing the performance of UMAP with five other tools, we find that UMAP provides the fastest run times, highest reproducibility and the most meaningful organization of cell clusters. The work highlights the use of UMAP for improved visualization and interpretation of single-cell data.
                Bookmark

                Author and article information

                Journal
                Proc Natl Acad Sci U S A
                Proc Natl Acad Sci U S A
                pnas
                PNAS
                Proceedings of the National Academy of Sciences of the United States of America
                National Academy of Sciences
                0027-8424
                1091-6490
                20 July 2021
                16 July 2021
                16 July 2021
                : 118
                : 29
                : e2100473118
                Affiliations
                [1] aHarvard Graduate Program in Biophysics, Harvard University , Boston, MA 02115;
                [2] bDepartment of Data Sciences, Dana-Farber Cancer Institute , Boston, MA 02215;
                [3] cLaboratory of Systems Pharmacology, Harvard Medical School , Boston, MA 02115;
                [4] dDepartment of Systems Biology, Harvard Medical School , Boston, MA 02115;
                [5] eBroad Institute of MIT and Harvard , Cambridge, MA 02142
                Author notes
                2To whom correspondence may be addressed. Email: sahand_hormoz@ 123456hms.harvard.edu .

                Edited by Jonathan S. Weissman, University of California, San Francisco, CA, and approved June 3, 2021 (received for review January 13, 2021)

                Author contributions: D.S., S.W., and S.H. designed research; D.S. and S.W. performed research; D.S. and S.W. contributed new reagents/analytic tools; D.S. and S.W. analyzed data; D.S., S.W., and S.H. wrote the paper; and S.H. supervised research.

                1D.S. and S.W. contributed equally to this work.

                Author information
                http://orcid.org/0000-0002-7858-0979
                http://orcid.org/0000-0002-1178-1143
                http://orcid.org/0000-0002-4384-4428
                Article
                202100473
                10.1073/pnas.2100473118
                8307776
                34272279
                4ac143c7-554b-464e-be4f-04dce8c1b2b0
                Copyright © 2021 the Author(s). Published by PNAS.

                This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).

                History
                Page count
                Pages: 11
                Funding
                Funded by: Gouvernement du Canada | Natural Sciences and Engineering Research Council of Canada (NSERC) 501100000038
                Award ID: PGSD2-517131-2018
                Award Recipient : Duluxan Sritharan
                Funded by: HHS | NIH | National Cancer Institute (NCI) 100000054
                Award ID: U54-CA225088
                Award Recipient : Shu Wang
                Funded by: HHS | NIH | National Institute of General Medical Sciences (NIGMS) 100000057
                Award ID: T32 GM008313
                Award Recipient : Duluxan Sritharan Award Recipient : Shu Wang Award Recipient : Sahand Hormoz
                Funded by: HHS | NIH | National Institute of General Medical Sciences (NIGMS) 100000057
                Award ID: R00GM118910
                Award Recipient : Duluxan Sritharan Award Recipient : Shu Wang Award Recipient : Sahand Hormoz
                Categories
                404
                408
                Biological Sciences
                Biophysics and Computational Biology
                Physical Sciences
                Applied Mathematics

                differential geometry,riemannian curvature,data manifold,laplace-beltrami,single-cell transcriptomics

                Comments

                Comment on this article