1,514
views
0
recommends
+1 Recommend
1 collections
    0
    shares

      Celebrating 65 years of The Computer Journal - free-to-read perspectives - bcs.org/tcj65

      scite_
       
      • Record: found
      • Abstract: found
      • Conference Proceedings: found
      Is Open Access

      Challenges in Urdu Stemming (A Progress Report)

      proceedings-article
      BCS IRSG Symposium: Future Directions in Information Access 2007 (FDIA)
      Future Directions in Information Access
      28-29 August 2007
      Urdu, stemming, multilingual IR
      Bookmark

            Abstract

            This paper explains the challenges pertaining to Urdu stemming and presents a rule-based prototype with a few rules implemented for Urdu to motivate the intricacies. It shows that Urdu stemming is quite challenging because of Urdu’s diverse nature and because Arabic and Farsi stemmers cannot be used for Urdu. Dictionary-based errorcorrecting schemes used by other stemmers cannot be applied to Urdu because of the lack of machine-readable resources. There has not been any work published regarding Urdu stemming or morphological analysis in the IR community even though interest in Urdu is growing. The goal of this paper is to show the challenges in writing an Urdu stemmer, not to present a stemmer.

            Content

            Author and article information

            Contributors
            Conference
            August 2007
            August 2007
            : 1-6
            Affiliations
            [0001]University of Minnesota

            4-192 EE/CS Building

            200 Union Street SE

            Minneapolis, MN 55455
            Article
            10.14236/ewic/FDIA2007.4
            a580d6f1-a703-49d1-9635-37cb93ae7f15
            © Kashif Riaz. Published by BCS Learning and Development Ltd. BCS IRSG Symposium: Future Directions in Information Access 2007, Glasgow

            This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

            BCS IRSG Symposium: Future Directions in Information Access 2007
            FDIA
            Glasgow
            28-29 August 2007
            Electronic Workshops in Computing (eWiC)
            Future Directions in Information Access
            History
            Product

            1477-9358 BCS Learning & Development

            Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/FDIA2007.4
            Self URI (journal page): https://ewic.bcs.org/
            Categories
            Electronic Workshops in Computing

            Applied computer science,Computer science,Security & Cryptology,Graphics & Multimedia design,General computer science,Human-computer-interaction
            Urdu,stemming,multilingual IR

            Comments

            Comment on this article