23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      In silico Pathway Activation Network Decomposition Analysis (iPANDA) as a method for biomarker development

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Signalling pathway activation analysis is a powerful approach for extracting biologically relevant features from large-scale transcriptomic and proteomic data. However, modern pathway-based methods often fail to provide stable pathway signatures of a specific phenotype or reliable disease biomarkers. In the present study, we introduce the in silico Pathway Activation Network Decomposition Analysis (iPANDA) as a scalable robust method for biomarker identification using gene expression data. The iPANDA method combines precalculated gene coexpression data with gene importance factors based on the degree of differential gene expression and pathway topology decomposition for obtaining pathway activation scores. Using Microarray Analysis Quality Control (MAQC) data sets and pretreatment data on Taxol-based neoadjuvant breast cancer therapy from multiple sources, we demonstrate that iPANDA provides significant noise reduction in transcriptomic data and identifies highly robust sets of biologically relevant pathway signatures. We successfully apply iPANDA for stratifying breast cancer patients according to their sensitivity to neoadjuvant therapy.

          Abstract

          Pathway analysis aids interpretation of large-scale gene expression data, but existing algorithms fall short of providing robust pathway identification. The method introduced here includes coexpression analysis and gene importance estimation to robustly identify relevant pathways and biomarkers for patient stratification.

          Related collections

          Most cited references25

          • Record: found
          • Abstract: found
          • Article: not found

          Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions.

          Most approaches in predicting protein function from protein-protein interaction data utilize the observation that a protein often share functions with proteins that interacts with it (its level-1 neighbours). However, proteins that interact with the same proteins (i.e. level-2 neighbours) may also have a greater likelihood of sharing similar physical or biochemical characteristics. We speculate that functional similarity between a protein and its neighbours from the two different levels arise from two distinct forms of functional association, and a protein is likely to share functions with its level-1 and/or level-2 neighbours. We are interested in finding out how significant is functional association between level-2 neighbours and how they can be exploited for protein function prediction. We made a statistical study on recent interaction data and observed that functional association between level-2 neighbours is clearly observable. A substantial number of proteins are observed to share functions with level-2 neighbours but not with level-1 neighbours. We develop an algorithm that predicts the functions of a protein in two steps: (1) assign a weight to each of its level-1 and level-2 neighbours by estimating its functional similarity with the protein using the local topology of the interaction network as well as the reliability of experimental sources and (2) scoring each function based on its weighted frequency in these neighbours. Using leave-one-out cross validation, we compare the performance of our method against that of several other existing approaches and show that our method performs relatively well.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data

            We summarise various ways of performing dimensionality reduction on high-dimensional microarray data. Many different feature selection and feature extraction methods exist and they are being widely used. All these methods aim to remove redundant and irrelevant features so that classification of new instances will be more accurate. A popular source of data is microarrays, a biological platform for gathering gene expressions. Analysing microarrays can be difficult due to the size of the data they provide. In addition the complicated relations among the different genes make analysis more difficult and removing excess features can improve the quality of the results. We present some of the most popular methods for selecting significant features and provide a comparison between them. Their advantages and disadvantages are outlined in order to provide a clearer idea of when to use each one of them for saving computational time and resources.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Pathway level analysis of gene expression using singular value decomposition

              Background A promising direction in the analysis of gene expression focuses on the changes in expression of specific predefined sets of genes that are known in advance to be related (e.g., genes coding for proteins involved in cellular pathways or complexes). Such an analysis can reveal features that are not easily visible from the variations in the individual genes and can lead to a picture of expression that is more biologically transparent and accessible to interpretation. In this article, we present a new method of this kind that operates by quantifying the level of 'activity' of each pathway in different samples. The activity levels, which are derived from singular value decompositions, form the basis for statistical comparisons and other applications. Results We demonstrate our approach using expression data from a study of type 2 diabetes and another of the influence of cigarette smoke on gene expression in airway epithelia. A number of interesting pathways are identified in comparisons between smokers and non-smokers including ones related to nicotine metabolism, mucus production, and glutathione metabolism. A comparison with results from the related approach, 'gene-set enrichment analysis', is also provided. Conclusion Our method offers a flexible basis for identifying differentially expressed pathways from gene expression data. The results of a pathway-based analysis can be complementary to those obtained from one more focused on individual genes. A web program PLAGE (Pathway Level Analysis of Gene Expression) for performing the kinds of analyses described here is accessible at .
                Bookmark

                Author and article information

                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group
                2041-1723
                16 November 2016
                2016
                : 7
                : 13427
                Affiliations
                [1 ]Pharmaceutical Artificial Intelligence Department, Insilico Medicine, Inc., Emerging Technology Centers, Johns Hopkins University at Eastern , B301, 1101 33rd Street, Baltimore, Maryland 21218, USA
                [2 ]The Johns Hopkins University, School of Medicine, Department of Otolaryngology, Head and Neck Cancer Research , 1550 Orleans Street, Baltimore, Maryland 21231, USA
                [3 ]Laboratory of Bioinformatics, D. Rogachev Federal Research and Clinical Center for Pediatric Hematology, Oncology and Immunology , Samory Mashela 1, Moscow 117997, Russia
                [4 ]Department of Genetics, Albert Einstein College of Medicine , 1300 Morris Park Avenue, Bronx, New York 10461, USA
                [5 ]BioTime, Inc. , 1010 Atlantic Avenue, Alameda, California 94501, USA
                [6 ]National Research Centre ‘Kurchatov Institute', Centre for Convergence of Nano-, Bio-, Information and Cognitive Sciences and Technologies , 1, Akademika Kurchatova square, Moscow 123182, Russia
                [7 ]Boston University, Department of Biomedical Engineering , 44 Cummington Street, Boston, Massachusetts 02215, USA
                [8 ]Skolkovo Foundation , 5 Nobelya street, Skolkovo Innovation Centre, Mozhajskij region, Moscow 143026, Russia
                [9 ]Nutrition and Metabolic Health group, Nestlé Institute of Health Sciences , CH-1015 Lausanne, Switzerland
                [10 ]Novartis Institutes for BioMedical Research , 250 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA
                [11 ]The Biogerontology Research Foundation , 2354 Chynoweth House, Trevissome Park, Truro TR4 8UN, UK
                Author notes
                Author information
                http://orcid.org/0000-0001-7067-8966
                Article
                ncomms13427
                10.1038/ncomms13427
                5116087
                27848968
                b793fe2a-5101-416f-a3d9-a70eae9584e5
                Copyright © 2016, The Author(s)

                This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

                History
                : 07 March 2016
                : 03 October 2016
                Categories
                Article

                Uncategorized
                Uncategorized

                Comments

                Comment on this article