6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Trajectories, bifurcations, and pseudo-time in large clinical datasets: applications to myocardial infarction and diabetes data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Large observational clinical datasets are becoming increasingly available for mining associations between various disease traits and administered therapy. These datasets can be considered as representations of the landscape of all possible disease conditions, in which a concrete disease state develops through stereotypical routes, characterized by “points of no return" and “final states" (such as lethal or recovery states). Extracting this information directly from the data remains challenging, especially in the case of synchronic (with a short-term follow-up) observations.

          Results

          Here we suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values, through modeling the geometrical data structure as a bouquet of bifurcating clinical trajectories. The methodology is based on application of elastic principal graphs, which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection, and quantifying the geodesic distances (pseudo-time) in partially ordered sequences of observations. The methodology allows a patient to be positioned on a particular clinical trajectory (pathological scenario) and the degree of progression along it to be characterized with a qualitative estimate of the uncertainty of the prognosis. We developed a tool ClinTrajan for clinical trajectory analysis implemented in the Python programming language. We test the methodology in 2 large publicly available datasets: myocardial infarction complications and readmission of diabetic patients data.

          Conclusions

          Our pseudo-time quantification-based approach makes it possible to apply the methods developed for dynamical disease phenotyping and illness trajectory analysis (diachronic data analysis) to synchronic observational data.

          Related collections

          Most cited references47

          • Record: found
          • Abstract: found
          • Article: not found

          Group-based trajectory modeling in clinical research.

          Group-based trajectory models are increasingly being applied in clinical research to map the developmental course of symptoms and assess heterogeneity in response to clinical interventions. In this review, we provide a nontechnical overview of group-based trajectory and growth mixture modeling alongside a sampling of how these models have been applied in clinical research. We discuss the challenges associated with the application of both types of group-based models and propose a set of preliminary guidelines for applied researchers to follow when reporting model results. Future directions in group-based modeling applications are discussed, including the use of trajectory models to facilitate causal inference when random assignment to treatment condition is not possible.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A comparison of single-cell trajectory inference methods

            Trajectory inference approaches analyze genome-wide omics data from thousands of single cells and computationally infer the order of these cells along developmental trajectories. Although more than 70 trajectory inference tools have already been developed, it is challenging to compare their performance because the input they require and output models they produce vary substantially. Here, we benchmark 45 of these methods on 110 real and 229 synthetic datasets for cellular ordering, topology, scalability and usability. Our results highlight the complementarity of existing tools, and that the choice of method should depend mostly on the dataset dimensions and trajectory topology. Based on these results, we develop a set of guidelines to help users select the best method for their dataset. Our freely available data and evaluation pipeline ( https://benchmark.dynverse.org ) will aid in the development of improved tools designed to analyze increasingly large and complex single-cell datasets.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              An Introduction to Latent Class Growth Analysis and Growth Mixture Modeling

                Bookmark

                Author and article information

                Contributors
                Journal
                Gigascience
                Gigascience
                gigascience
                GigaScience
                Oxford University Press
                2047-217X
                25 November 2020
                November 2020
                25 November 2020
                : 9
                : 11
                : giaa128
                Affiliations
                Prof. V.F. Voino-Yasenetsky Krasnoyarsk State Medical University , 660022 Krasnoyarsk, Russia
                Institut Curie, PSL Research University , F-75005 Paris, France
                INSERM , U900, F-75005 Paris, France
                CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University , 75006 Paris, France
                Institut Curie, PSL Research University , F-75005 Paris, France
                INSERM , U900, F-75005 Paris, France
                CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University , 75006 Paris, France
                Centre for Artificial Intelligence, Data Analytics and Modelling, University of Leicester , LE1 7RH Leicester, UK
                Laboratory of advanced methods for high-dimensional data analysis, Lobachevsky University , 603000 Nizhny Novgorod, Russia
                Prof. V.F. Voino-Yasenetsky Krasnoyarsk State Medical University , 660022 Krasnoyarsk, Russia
                Institut Curie, PSL Research University , F-75005 Paris, France
                INSERM , U900, F-75005 Paris, France
                CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University , 75006 Paris, France
                Centre for Artificial Intelligence, Data Analytics and Modelling, University of Leicester , LE1 7RH Leicester, UK
                Laboratory of advanced methods for high-dimensional data analysis, Lobachevsky University , 603000 Nizhny Novgorod, Russia
                Institut Curie, PSL Research University , F-75005 Paris, France
                INSERM , U900, F-75005 Paris, France
                CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University , 75006 Paris, France
                Author notes
                Correspondence address. Andrei Zinovyev, Institut Curie, 26 rue d'Ulm, Paris, 75248, France, E-mail: andrei.zinovyev@ 123456curie.fr
                Author information
                http://orcid.org/0000-0002-9517-7284
                Article
                giaa128
                10.1093/gigascience/giaa128
                7688475
                33241287
                57036c8d-cff2-4f62-8f72-d262aaaeb4b0
                © The Author(s) 2020. Published by Oxford University Press GigaScience.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 25 June 2020
                : 30 September 2020
                : 22 October 2020
                Page count
                Pages: 20
                Funding
                Funded by: Council on grants of the President of the Russian Federation, DOI 10.13039/501100011051;
                Award ID: 14.Y26.31.0022
                Funded by: Horizon 2020 Framework Programme, DOI 10.13039/100010661;
                Award ID: 826121
                Categories
                Research
                AcademicSubjects/SCI00960
                AcademicSubjects/SCI02254

                clinical data,clinical trajectory,patient disease pathway,dynamical diseases phenotyping,data analysis,principal trees,dimensionality reduction,pseudo-time,myocardial infarction,diabetes

                Comments

                Comment on this article