33
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      mapDIA: Preprocessing and Statistical Analysis of Quantitative Proteomics Data from Data Independent Acquisition Mass Spectrometry

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Data independent acquisition (DIA) mass spectrometry is an emerging technique that offers more complete detection and quantification of peptides and proteins across multiple samples. DIA allows fragment-level quantification, which can be considered as repeated measurements of the abundance of the corresponding peptides and proteins in the downstream statistical analysis. However, few statistical approaches are available for aggregating these complex fragment-level data into peptide- or protein-level statistical summaries. In this work, we describe a software package, mapDIA, for statistical analysis of differential protein expression using DIA fragment-level intensities. The workflow consists of three major steps: intensity normalization, peptide/fragment selection, and statistical analysis. First, mapDIA offers normalization of fragment-level intensities by total intensity sums as well as a novel alternative normalization by local intensity sums in retention time space. Second, mapDIA removes outlier observations and selects peptides/fragments that preserve the major quantitative patterns across all samples for each protein. Last, using the selected fragments and peptides, mapDIA performs model-based statistical significance analysis of protein-level differential expression between specified groups of samples. Using a comprehensive set of simulation datasets, we show that mapDIA detects differentially expressed proteins with accurate control of the false discovery rates. We also describe the analysis procedure in detail using two recently published DIA datasets generated for 14-3-3 β dynamic interaction network and prostate cancer glycoproteome.

          Graphical abstract

          Related collections

          Author and article information

          Journal
          101475056
          34573
          J Proteomics
          J Proteomics
          Journal of proteomics
          1874-3919
          1876-7737
          11 October 2015
          15 September 2015
          3 November 2015
          03 November 2016
          : 129
          : 108-120
          Affiliations
          [1 ]Department of Applied Probability and Statistics, National University of Singapore, Singapore
          [2 ]Saw Swee Hock School of Public Health, National University of Singapore, Singapore
          [3 ]Department of Biostatistics, School of Public Health, Rutgers University, Piscataway, NJ, USA
          [4 ]Department of Pathology, University of Michigan, Ann Arbor, MI, USA
          [5 ]Department of Biology, Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
          [6 ]Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
          [7 ]Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
          [8 ]Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
          Author notes
          [* ]To whom all correspondence should be addressed. hyung won choi@ 123456nuhs.edu.sg
          Article
          PMC4630088 PMC4630088 4630088 nihpa728309
          10.1016/j.jprot.2015.09.013
          4630088
          26381204
          4d64f5e4-de0c-4f83-8d2a-1fe415ceaa7a
          History
          Categories
          Article

          Normalization,Differential expression,Data preprocessing,Data independent acquisition

          Comments

          Comment on this article