100
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data

      review-article
      * ,
      Advances in Bioinformatics
      Hindawi Publishing Corporation

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We summarise various ways of performing dimensionality reduction on high-dimensional microarray data. Many different feature selection and feature extraction methods exist and they are being widely used. All these methods aim to remove redundant and irrelevant features so that classification of new instances will be more accurate. A popular source of data is microarrays, a biological platform for gathering gene expressions. Analysing microarrays can be difficult due to the size of the data they provide. In addition the complicated relations among the different genes make analysis more difficult and removing excess features can improve the quality of the results. We present some of the most popular methods for selecting significant features and provide a comparison between them. Their advantages and disadvantages are outlined in order to provide a clearer idea of when to use each one of them for saving computational time and resources.

          Related collections

          Most cited references105

          • Record: found
          • Abstract: found
          • Article: not found

          Cluster analysis and display of genome-wide expression patterns.

          A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression. The output is displayed graphically, conveying the clustering and the underlying expression data simultaneously in a form intuitive for biologists. We have found in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function, and we find a similar tendency in human data. Thus patterns seen in genome-wide expression experiments can be interpreted as indications of the status of cellular processes. Also, coexpression of genes of known function with poorly characterized or novel genes may provide a simple means of gaining leads to the functions of many genes for which information is not available currently.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Network-based classification of breast cancer metastasis

            Mapping the pathways that give rise to metastasis is one of the key challenges of breast cancer research. Recently, several large-scale studies have shed light on this problem through analysis of gene expression profiles to identify markers correlated with metastasis. Here, we apply a protein-network-based approach that identifies markers not as individual genes but as subnetworks extracted from protein interaction databases. The resulting subnetworks provide novel hypotheses for pathways involved in tumor progression. Although genes with known breast cancer mutations are typically not detected through analysis of differential expression, they play a central role in the protein network by interconnecting many differentially expressed genes. We find that the subnetwork markers are more reproducible than individual marker genes selected without network information, and that they achieve higher accuracy in the classification of metastatic versus non-metastatic tumors.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters

              J. C. Dunn (1973)
                Bookmark

                Author and article information

                Journal
                Adv Bioinformatics
                Adv Bioinformatics
                ABI
                Advances in Bioinformatics
                Hindawi Publishing Corporation
                1687-8027
                1687-8035
                2015
                11 June 2015
                : 2015
                : 198363
                Affiliations
                Department of Computing, Imperial College London, London SW7 2AZ, UK
                Author notes

                Academic Editor: Huixiao Hong

                Article
                10.1155/2015/198363
                4480804
                26170834
                bcf22f7e-d7c8-4a37-95ad-9ba28156acb0
                Copyright © 2015 Z. M. Hira and D. F. Gillies.

                This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 25 March 2015
                : 18 May 2015
                Categories
                Review Article

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article