      Is Open Access

      Feature selection algorithm for high dimensional biomedical data classification based on redundant removal

      proceedings-article
      Proceedings of the 32nd International BCS Human Computer Interaction Conference (HCI)
      Human Computer Interaction Conference
      4 - 6 July 2018
      Feature selection, high dimensional data, KNN, classification accuracy

            Abstract

            High-dimensional biomedical data contain thousands of features, and accurately identifying the main features in these data supports the classification of related data. However, a large number of irrelevant or redundant features usually degrades classification accuracy seriously. To solve this problem, this study proposes a new feature selection algorithm based on redundancy removal. First, two redundancy criteria are determined by vertical relevance and horizontal relevance. Second, an approximate redundancy feature framework based on mutual information (MI) is defined to remove redundant and irrelevant features. Finally, to evaluate the effectiveness of the proposed method, comparison experiments against classic feature selection algorithms are conducted using K-nearest neighbour (KNN) classifiers, and the results show that the algorithm can effectively improve classification accuracy.
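
The abstract does not define the vertical and horizontal relevance criteria, so the following is only a minimal sketch of the general idea of MI-based redundancy removal, not the authors' algorithm. It assumes discrete feature values, treats a feature as irrelevant when its MI with the class label is (near) zero, and treats it as redundant when an already kept feature shares at least as much information with it as it shares with the class (a rule in the spirit of approximate Markov blanket filters). The names `mutual_information`, `select_features`, and `min_relevance` are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px = Counter(xs)
    py = Counter(ys)
    pxy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x,y) * log2(p(x,y) / (p(x) * p(y))), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def select_features(columns, labels, min_relevance=1e-9):
    """Greedy filter (assumed rule, not the paper's exact criteria).

    columns: list of feature columns, each a list of discrete values
    labels:  list of class labels, same length as each column
    Returns the indices of the kept features, most relevant first.
    """
    relevance = [mutual_information(col, labels) for col in columns]
    # Irrelevant features (MI with the class close to zero) are dropped first.
    candidates = [j for j in range(len(columns)) if relevance[j] > min_relevance]
    # Visit the most relevant features first (stable for ties).
    candidates.sort(key=lambda j: relevance[j], reverse=True)

    selected = []
    for j in candidates:
        # Feature j is redundant if a kept feature k tells us at least as much
        # about j as j tells us about the class.
        redundant = any(
            mutual_information(columns[k], columns[j]) >= relevance[j]
            for k in selected
        )
        if not redundant:
            selected.append(j)
    return selected


# Toy data: four classes encoded by two binary features.
labels = [0, 0, 1, 1, 2, 2, 3, 3]
columns = [
    [0, 0, 0, 0, 1, 1, 1, 1],  # feature 0: relevant
    [0, 0, 1, 1, 0, 0, 1, 1],  # feature 1: relevant, complements feature 0
    [0, 0, 0, 0, 1, 1, 1, 1],  # feature 2: exact copy of feature 0 (redundant)
    [0, 1, 0, 1, 0, 1, 0, 1],  # feature 3: independent of the class (irrelevant)
]
print(select_features(columns, labels))  # [0, 1]
```

The kept column indices could then be used to restrict the data passed to a KNN classifier, which is how the abstract describes the evaluation. Continuous features, such as gene expression values, would need to be discretised before this counting-based MI estimate applies.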

            Content

            Author and article information

            Contributors
            Conference
            July 2018
            Pages: 1-5
            Affiliations
            [0001]School of Information Science and Engineering, Lanzhou University, Lanzhou, China
            [0002]School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou, China
            [0003]School of Architecture and Urban Planning, Lanzhou Jiaotong University, Lanzhou, China
            [0004]College of Electronical and Information Engineering, Shaanxi University of Science and Technology, 710021, Xi’an, China
            Article
            10.14236/ewic/HCI2018.230
            fef12549-a000-4e69-8230-47e3cf1cf964
            © Zhang et al. Published by BCS Learning and Development Ltd. Proceedings of British HCI 2018. Belfast, UK.

            This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

            Proceedings of the 32nd International BCS Human Computer Interaction Conference
            HCI
            32
            Belfast, UK
            4 - 6 July 2018
            Electronic Workshops in Computing (eWiC)
            Human Computer Interaction Conference
            History
            Product

            1477-9358 BCS Learning & Development

            Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/HCI2018.230
            Self URI (journal page): https://ewic.bcs.org/
            Categories
            Electronic Workshops in Computing

            Applied computer science, Computer science, Security & Cryptology, Graphics & Multimedia design, General computer science, Human-computer-interaction
            Feature selection, high dimensional data, KNN, classification accuracy

