6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Interoperable and scalable data analysis with microservices: applications in metabolomics

      research-article
      1 , 2 , 3 , 4 , 5 , 6 , 7 , 7 , 8 , 9 , 8 , 9 , 8 , 9 , 10 , 11 , 2 , 2 , 1 , 7 , 10 , 2 , 7 , 12 , 13 , 14 , 13 , 15 , 10 , 16 , 3 , 4 , 13 , 17 , 18 , 10 , 13 , 8 , 9 , 16 , 11 , 15 , 2 , 19 , 1 , 7
      Bioinformatics
      Oxford University Press

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation

          Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed using the Kubernetes container orchestrator.

          Results

          We developed a Virtual Research Environment (VRE) which facilitates rapid integration of new tools and developing scalable and interoperable workflows for performing metabolomics data analysis. The environment can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry, one nuclear magnetic resonance spectroscopy and one fluxomics study. We showed that the method scales dynamically with increasing availability of computational resources. We demonstrated that the method facilitates interoperability using integration of the major software suites resulting in a turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, statistics and identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science.

          Availability and implementation

          The PhenoMeNal consortium maintains a web portal ( https://portal.phenomenal-h2020.eu) providing a GUI for launching the Virtual Research Environment. The GitHub repository https://github.com/phnmnl/ hosts the source code of all projects.

          Supplementary information

          Supplementary data are available at Bioinformatics online.

          Related collections

          Most cited references36

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences

          Increased reliance on computational approaches in the life sciences has revealed grave concerns about how accessible and reproducible computation-reliant results truly are. Galaxy http://usegalaxy.org, an open web-based platform for genomic research, addresses these problems. Galaxy automatically tracks and manages data provenance and provides support for capturing the context and intent of computational methods. Galaxy Pages are interactive, web-based documents that provide users with a medium to communicate a complete computational analysis.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            MetaboAnalyst 2.0—a comprehensive server for metabolomic data analysis

            First released in 2009, MetaboAnalyst (www.metaboanalyst.ca) was a relatively simple web server designed to facilitate metabolomic data processing and statistical analysis. With continuing advances in metabolomics along with constant user feedback, it became clear that a substantial upgrade to the original server was necessary. MetaboAnalyst 2.0, which is the successor to MetaboAnalyst, represents just such an upgrade. MetaboAnalyst 2.0 now contains dozens of new features and functions including new procedures for data filtering, data editing and data normalization. It also supports multi-group data analysis, two-factor analysis as well as time-series data analysis. These new functions have also been supplemented with: (i) a quality-control module that allows users to evaluate their data quality before conducting any analysis, (ii) a functional enrichment analysis module that allows users to identify biologically meaningful patterns using metabolite set enrichment analysis and (iii) a metabolic pathway analysis module that allows users to perform pathway analysis and visualization for 15 different model organisms. In developing MetaboAnalyst 2.0 we have also substantially improved its graphical presentation tools. All images are now generated using anti-aliasing and are available over a range of resolutions, sizes and formats (PNG, TIFF, PDF, PostScript, or SVG). To improve its performance, MetaboAnalyst 2.0 is now hosted on a much more powerful server with substantially modified code to take advantage the server’s multi-core CPUs for computationally intensive tasks. MetaboAnalyst 2.0 also maintains a collection of 50 or more FAQs and more than a dozen tutorials compiled from user queries and requests. A downloadable version of MetaboAnalyst 2.0, along detailed instructions for local installation is now available as well.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets.

              Liquid chromatography coupled to mass spectrometry is routinely used for metabolomics experiments. In contrast to the fairly routine and automated data acquisition steps, subsequent compound annotation and identification require extensive manual analysis and thus form a major bottleneck in data interpretation. Here we present CAMERA, a Bioconductor package integrating algorithms to extract compound spectra, annotate isotope and adduct peaks, and propose the accurate compound mass even in highly complex data. To evaluate the algorithms, we compared the annotation of CAMERA against a manually defined annotation for a mixture of known compounds spiked into a complex matrix at different concentrations. CAMERA successfully extracted accurate masses for 89.7% and 90.3% of the annotatable compounds in positive and negative ion modes, respectively. Furthermore, we present a novel annotation approach that combines spectral information of data acquired in opposite ion modes to further improve the annotation rate. We demonstrate the utility of CAMERA in two different, easily adoptable plant metabolomics experiments, where the application of CAMERA drastically reduced the amount of manual analysis. © 2011 American Chemical Society
                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                Bioinformatics
                Bioinformatics
                bioinformatics
                Bioinformatics
                Oxford University Press
                1367-4803
                1367-4811
                01 October 2019
                09 March 2019
                09 March 2019
                : 35
                : 19
                : 3752-3760
                Affiliations
                [1 ] Department of Medical Sciences, Clinical Chemistry, Uppsala University , Uppsala, Sweden
                [2 ] European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
                [3 ] Department of Computational Biology, University of Lausanne , Lausanne, Switzerland
                [4 ] Swiss Institute of Bioinformatics , Lausanne, Switzerland
                [5 ] Department of Neuroscience, Uppsala University , Uppsala, Sweden
                [6 ] Department of Information Technology, Uppsala University , Uppsala, Sweden
                [7 ] Department of Pharmaceutical Biosciences, Uppsala University , Uppsala, Sweden
                [8 ] Department of Biochemistry and Molecular Biomedicine, and Institute of Biomedicine (IBUB), Faculty of Biology, Universitat de Barcelona (IBUB) , Barcelona, Spain
                [9 ] Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD) and Metabolomics Node at INB-Bioinfarmatics Platform, Instituto de Salud Carlos III (ISCIII) , Madrid, Spain
                [10 ] Oxford e-Research Centre, Department of Engineering Science, University of Oxford , Oxford, UK
                [11 ] Division of Analytical Biosciences, Leiden Academic Centre for Drug Research, Leiden University, Leiden , The Netherlands
                [12 ] National Bioinformatics Infrastructure Sweden, Uppsala University , Uppsala, Sweden
                [13 ] Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry , Halle, Germany
                [14 ] German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig , Germany
                [15 ] CRS4: Center for Advanced Studies, Research and Development in Sardinia, Distributed Computing Group , Pula, Italy
                [16 ] CEA, LIST, Laboratory for Data Analysis and Systems' Intelligence, MetaboHUB, Gif-sur-Yvette , France
                [17 ] Faculty of Medicine, Department of Surgery & Cancer, Imperial College London , London, UK
                [18 ] International Agency for Research on Cancer , 69372 Lyon CEDEX 08, France
                [19 ] Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University , Jena, Germany
                Author notes
                To whom correspondence should be addressed. E-mail: ola.spjuth@ 123456farmbio.uu.se
                Author information
                http://orcid.org/0000-0002-9050-8697
                Article
                btz160
                10.1093/bioinformatics/btz160
                6761976
                30851093
                b5d7c32d-5e0b-492a-bbcb-bdaf0656e4cd
                © The Author(s) 2019. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 22 August 2018
                : 25 February 2019
                : 08 March 2019
                Page count
                Pages: 9
                Funding
                Funded by: European Commission’s Horizon 2020 programme
                Award ID: 654241
                Funded by: PhenoMeNal
                Funded by: The Swedish Research Council FORMAS
                Funded by: Uppsala Berzelii Technology Centre for Neurodiagnostics
                Funded by: Åke Wiberg Foundation
                Funded by: Nordic e-Infrastructure Collaboration
                Categories
                Original Papers
                Systems Biology

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article