939
views
0
recommends
+1 Recommend
1 collections
    4
    shares

      Celebrating 65 years of The Computer Journal - free-to-read perspectives - bcs.org/tcj65

      scite_
       
      • Record: found
      • Abstract: found
      • Conference Proceedings: found
      Is Open Access

      Automatic Phrase Recognition and Extraction from Text

      proceedings-article
      ,
      Proceedings of the 19th Annual BCS-IRSG Colloquium on IR Research (IR)
      IR Research
      8-9 April 1997
      Bookmark

            Abstract

            One of the problems facing researchers in the field of Information Retrieval (IR) is that the search criteria used during retrieval (the query) contains terms which are very ambiguous and common. By this we mean that terms can have multiple meanings and occur in a large percentage of the documents in a text collection. Many approaches to addressing this problem have been tried with varying degrees of success. One approach to this problem is to attempt to make the vocabulary used by the IR system less ambiguous by using terms which occur only infrequently. In our case this is achieved through an automatic process of phrase recognition and the incorporation of these phrases into the lexicon of the indexing mechanism used. Unlike previous phrase recognition approaches based on NLP, our work requires no linguistic processing of the text in order to extract phrases but is comparable to what is called ‘statistical phrases’. In this paper we describe experiments where we evaluate our phrase recognition on the TREC-4 and TREC-5 collections.

            Content

            Author and article information

            Contributors
            Conference
            April 1997
            April 1997
            : 1-9
            Affiliations
            [0001]School of Computer Applications,

            Dublin City University,

            Glasnevin, Dublin 9, Ireland.
            Article
            10.14236/ewic/IR1997.3
            da77505a-3d71-4673-8104-677ce46d4a28
            © Fergus Kelledy et al. Published by BCS Learning and Development Ltd. Proceedings of the 19th Annual BCS-IRSG Colloquium on IR Research, Aberdeen, Scotland

            This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

            Proceedings of the 19th Annual BCS-IRSG Colloquium on IR Research
            IR
            19
            Aberdeen, Scotland
            8-9 April 1997
            Electronic Workshops in Computing (eWiC)
            IR Research
            History
            Product

            1477-9358 BCS Learning & Development

            Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/IR1997.3
            Self URI (journal page): https://ewic.bcs.org/
            Categories
            Electronic Workshops in Computing

            Applied computer science,Computer science,Security & Cryptology,Graphics & Multimedia design,General computer science,Human-computer-interaction

            Comments

            Comment on this article