12
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      SDA: a semi-parametric differential abundance analysis method for metabolomics and proteomics data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Identifying differentially abundant features between different experimental groups is a common goal for many metabolomics and proteomics studies. However, analyzing data from mass spectrometry (MS) is difficult because the data may not be normally distributed and there is often a large fraction of zero values. Although several statistical methods have been proposed, they either require the data normality assumption or are inefficient.

          Results

          We propose a new semi-parametric differential abundance analysis (SDA) method for metabolomics and proteomics data from MS. The method considers a two-part model, a logistic regression for the zero proportion and a semi-parametric log-linear model for the possibly non-normally distributed non-zero values, to characterize data from each feature. A kernel-smoothed likelihood method is developed to estimate model coefficients and a likelihood ratio test is constructed for differential abundant analysis. The method has been implemented into an R package, SDAMS, which is available at https://www.bioconductor.org/packages/release/bioc/html/SDAMS.html.

          Conclusion

          By introducing the two-part semi-parametric model, SDA is able to handle both non-normally distributed data and large fraction of zero values in a MS dataset. It also allows for adjustment of covariates. Simulations and real data analyses demonstrate that SDA outperforms existing methods.

          Related collections

          Most cited references18

          • Record: found
          • Abstract: found
          • Article: not found

          Sphingolipid metabolism in cancer signalling and therapy

          Sphingolipids, including the two central bioactive lipids ceramide and sphingosine-1-phosphate (S1P), have opposing roles in regulating cancer cell death and survival, respectively, and there have been exciting developments in understanding how sphingolipid metabolism and signalling regulate these processes in response to anticancer therapy. Recent studies have provided mechanistic details of the roles of sphingolipids and their downstream targets in the regulation of tumour growth and response to chemotherapy, radiotherapy and/or immunotherapy using innovative molecular, genetic and pharmacological tools to target sphingolipid signalling nodes in cancer cells. For example, structure-function-based studies have provided innovative opportunities to develop mechanism-based anticancer therapeutic strategies to restore anti-proliferative ceramide signalling and/or inhibit pro-survival S1P-S1P receptor (S1PR) signalling. This Review summarizes how ceramide-induced cellular stress mediates cancer cell death through various mechanisms involving the induction of apoptosis, necroptosis and/or mitophagy. Moreover, the metabolism of ceramide for S1P biosynthesis, which is mediated by sphingosine kinase 1 and 2, and its role in influencing cancer cell growth, drug resistance and tumour metastasis through S1PR-dependent or receptor-independent signalling are highlighted. Finally, studies targeting enzymes involved in sphingolipid metabolism and/or signalling and their clinical implications for improving cancer therapeutics are also presented.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies.

            Missing values are a genuine issue in label-free quantitative proteomics. Recent works have surveyed the different statistical methods to conduct imputation and have compared them on real or simulated data sets and recommended a list of missing value imputation methods for proteomics application. Although insightful, these comparisons do not account for two important facts: (i) depending on the proteomics data set, the missingness mechanism may be of different natures and (ii) each imputation method is devoted to a specific type of missingness mechanism. As a result, we believe that the question at stake is not to find the most accurate imputation method in general but instead the most appropriate one. We describe a series of comparisons that support our views: For instance, we show that a supposedly "under-performing" method (i.e., giving baseline average results), if applied at the "appropriate" time in the data-processing pipeline (before or after peptide aggregation) on a data set with the "appropriate" nature of missing values, can outperform a blindly applied, supposedly "better-performing" method (i.e., the reference method from the state-of-the-art). This leads us to formulate few practical guidelines regarding the choice and the application of an imputation method in a proteomics context.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              NOREVA: normalization and evaluation of MS-based metabolomics data

              Abstract Diverse forms of unwanted signal variations in mass spectrometry-based metabolomics data adversely affect the accuracies of metabolic profiling. A variety of normalization methods have been developed for addressing this problem. However, their performances vary greatly and depend heavily on the nature of the studied data. Moreover, given the complexity of the actual data, it is not feasible to assess the performance of methods by single criterion. We therefore developed NOREVA to enable performance evaluation of various normalization methods from multiple perspectives. NOREVA integrated five well-established criteria (each with a distinct underlying theory) to ensure more comprehensive evaluation than any single criterion. It provided the most complete set of the available normalization methods, with unique features of removing overall unwanted variations based on quality control metabolites and allowing quality control samples based correction sequentially followed by data normalization. The originality of NOREVA and the reliability of its algorithms were extensively validated by case studies on five benchmark datasets. In sum, NOREVA is distinguished for its capability of identifying the well performed normalization method by taking multiple criteria into consideration and can be an indispensable complement to other available tools. NOREVA can be freely accessed at http://server.idrb.cqu.edu.cn/noreva/.
                Bookmark

                Author and article information

                Contributors
                chi.wang@uky.edu
                lichenuky@uky.edu
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                17 October 2019
                17 October 2019
                2019
                : 20
                : 501
                Affiliations
                [1 ]ISNI 0000 0004 1936 8438, GRID grid.266539.d, Department of Statistics, University of Kentucky, ; Lexington, 40536 USA
                [2 ]ISNI 0000 0004 1936 8438, GRID grid.266539.d, Markey Cancer Center, University of Kentucky, ; Lexington, 40536 USA
                [3 ]ISNI 0000 0004 1936 8438, GRID grid.266539.d, Center for Environmental and Systems Biochemistry, University of Kentucky, ; Lexington, 40536 USA
                [4 ]ISNI 0000 0004 1936 8438, GRID grid.266539.d, Department of Toxicology and Cancer Biology, University of Kentucky, ; Lexington, 40536 USA
                [5 ]ISNI 0000 0004 1936 8438, GRID grid.266539.d, Department of Medicine, University of Kentucky, ; Lexington, 40536 USA
                [6 ]ISNI 0000 0004 1936 8438, GRID grid.266539.d, Department of Biostatistics, University of Kentucky, ; Lexington, 40536 USA
                Author information
                http://orcid.org/0000-0001-8984-4995
                Article
                3067
                10.1186/s12859-019-3067-z
                6798423
                31623550
                f2422d96-91b5-44e6-ad39-f78021833972
                © The Author(s) 2019

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 19 March 2019
                : 3 September 2019
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: 1R03CA211835
                Funded by: FundRef http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: 5P20GM103436-15
                Funded by: FundRef http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: 1P01CA163223-01A1
                Funded by: FundRef http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: P30CA177558
                Categories
                Methodology Article
                Custom metadata
                © The Author(s) 2019

                Bioinformatics & Computational biology
                differential abundance analysis,metabolomics,proteomics,semi-parametric log-linear model,kernel smoothing

                Comments

                Comment on this article