
      Is Open Access

      Feature Selection: a Useful Preprocessing Step

      proceedings-article
      Proceedings of the 19th Annual BCS-IRSG Colloquium on IR Research (IR)
      IR Research
      8-9 April 1997

            Abstract

            Statistical classification techniques and machine learning methods have been applied to several Information Retrieval (IR) problems: routing, filtering and categorization. Most of these methods are awkward, and sometimes intractable, in high-dimensional feature spaces. To reduce dimensionality, feature selection has been introduced as a preprocessing step. In this paper, we assess to what extent feature selection can be used without causing a loss in effectiveness. This question can be addressed because a couple of recent learners do not require a preprocessing step. On a text categorization task using the Reuters-22173 collection, we give empirical evidence that feature selection is useful. First, the size of the collection index can be drastically reduced without a significant loss in categorization effectiveness. Second, feature selection reduces the time required to build the categorization system automatically.
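To make the idea of feature selection as a preprocessing step concrete, here is a minimal sketch. The abstract does not say which selection criterion the paper uses, so the chi-square statistic below is an assumption chosen for illustration. The toy documents, category names and function names (`chi_square_scores`, `select_features`, `project`) are likewise hypothetical and are not taken from the paper or from the Reuters collection.

```python
from collections import Counter


def chi_square_scores(docs, labels, target):
    """Score each term by the chi-square statistic of its association with
    the category `target`, using document-level presence/absence counts."""
    n = len(docs)
    n_pos = sum(1 for y in labels if y == target)
    n_neg = n - n_pos
    df_pos = Counter()  # docs in the target category that contain the term
    df_neg = Counter()  # docs outside the target category that contain the term
    for tokens, y in zip(docs, labels):
        (df_pos if y == target else df_neg).update(set(tokens))

    scores = {}
    for term in set(df_pos) | set(df_neg):
        a = df_pos[term]   # term present, in category
        b = df_neg[term]   # term present, not in category
        c = n_pos - a      # term absent, in category
        d = n_neg - b      # term absent, not in category
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[term] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


def select_features(docs, labels, target, k):
    """Keep the k highest-scoring terms (ties broken alphabetically)."""
    scores = chi_square_scores(docs, labels, target)
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return set(ranked[:k])


def project(tokens, vocabulary):
    """Reduce a document to the selected terms before it reaches the learner."""
    return [t for t in tokens if t in vocabulary]


# Hypothetical toy corpus: four tokenized documents in two categories.
docs = [
    ["wheat", "price", "rise"],
    ["wheat", "harvest", "report"],
    ["oil", "price", "fall"],
    ["oil", "export", "report"],
]
labels = ["grain", "grain", "crude", "crude"]

vocabulary = select_features(docs, labels, target="grain", k=2)
reduced_docs = [project(tokens, vocabulary) for tokens in docs]
```

In this sketch the index shrinks from eight distinct terms to two, and any learner trained on `reduced_docs` sees a correspondingly smaller feature space. Whether effectiveness is preserved at a given `k` is the empirical question the paper examines.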


            Author and article information

            Contributors
            Isabelle Moulinier
            Conference
            April 1997
            Pages: 1-11
            Affiliations
            LIP6, Université P. et M. Curie, Paris, France
            Article
            10.14236/ewic/IR1997.6
            © Isabelle Moulinier. Published by BCS Learning and Development Ltd. Proceedings of the 19th Annual BCS-IRSG Colloquium on IR Research, Aberdeen, Scotland

            This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

            Electronic Workshops in Computing (eWiC)

            1477-9358 BCS Learning & Development

            Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/IR1997.6
            Self URI (journal page): https://ewic.bcs.org/
            Categories
            Electronic Workshops in Computing

            Applied computer science, Computer science, Security & Cryptology, Graphics & Multimedia design, General computer science, Human-computer interaction
