38
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Compensation of feature selection biases accompanied with improved predictive performance for binary classification by using a novel ensemble feature selection approach

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation

          Biomarker discovery methods are essential to identify a minimal subset of features (e.g., serum markers in predictive medicine) that are relevant to develop prediction models with high accuracy. By now, there exist diverse feature selection methods, which either are embedded, combined, or independent of predictive learning algorithms. Many preceding studies showed the defectiveness of single feature selection results, which cause difficulties for professionals in a variety of fields (e.g., medical practitioners) to analyze and interpret the obtained feature subsets. Whereas each of these methods is highly biased, an ensemble feature selection has the advantage to alleviate and compensate for such biases. Concerning the reliability, validity, and reproducibility of these methods, we examined eight different feature selection methods for binary classification datasets and developed an ensemble feature selection system.

          Results

          By using an ensemble of feature selection methods, a quantification of the importance of the features could be obtained. The prediction models that have been trained on the selected features showed improved prediction performance.

          Electronic supplementary material

          The online version of this article (doi:10.1186/s13040-016-0114-4) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references27

          • Record: found
          • Abstract: not found
          • Article: not found

          Wrappers for feature subset selection

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Selection of relevant features and examples in machine learning

              Bookmark
              • Record: found
              • Abstract: not found
              • Book: not found

              Combining Pattern Classifiers

                Bookmark

                Author and article information

                Contributors
                u.neumann@wz-straubing.de
                m.riemenschneider@wz-straubing.de
                jan.sowa@uni-due.de
                theodor.baars@uk-essen.de
                julia.kaelsch@uk-essen.de
                ali.canbac@uni-due.de
                d.heider@wz-straubing.de
                Journal
                BioData Min
                BioData Min
                BioData Mining
                BioMed Central (London )
                1756-0381
                18 November 2016
                18 November 2016
                2016
                : 9
                : 36
                Affiliations
                [1 ]Department of Bioinformatics, Straubing, 94315 Germany
                [2 ]University of Applied Science, Weihenstephan-Triesdorf, Freising, 85354 Germany
                [3 ]Wissenschaftszentrum Weihenstephan, Technische Universität München, Freising, 85354 Germany
                [4 ]Department of Gastroenterology and Hepatology, University Hospital, University Duisburg-Essen, Essen, 45122 Germany
                [5 ]Clinic for Cardiology, West German Heart and Vascular Centre Essen, University Hospital, University Duisburg-Essen, Essen, 45122 Germany
                Article
                114
                10.1186/s13040-016-0114-4
                5116216
                27891179
                271ff378-27f1-451f-9a48-ad378261f5dc
                © The Author(s) 2016

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 23 June 2016
                : 27 October 2016
                Funding
                Funded by: Deichmann Foundation
                Categories
                Research
                Custom metadata
                © The Author(s) 2016

                Bioinformatics & Computational biology
                machine learning,feature selection,ensemble learning,biomarker discovery,random forest

                Comments

                Comment on this article