Blog
About

121
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Conference Proceedings: found
      Is Open Access

      Automatic Genre Identification: Towards a Flexible Classification Scheme

      BCS IRSG Symposium: Future Directions in Information Access 2007 (FDIA)

      Future Directions in Information Access

      28-29 August 2007

      genre, classification, inference, web pages

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This paper presents an automatic genre classification model that implements a flexible classification scheme, i.e. a scheme capable of performing zero-, one- or multi-genre assignment. I suggest that this scheme is more appropriate for genres on the web, because many web pages have often more than one genre or none at all. The model that I propose relies on the distinction between the concepts of ‘text types’ and ‘genre’, which are both ‘inferred’ and not ‘learned’ from pre-labelled examples. The main drawback of this approach is that it cannot be fully evaluated given the limitations of current genre research. However, I present a partial evaluation that shows that the model performs competitively, and remains stable when re-scaled.

          Related collections

          Most cited references 3

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          A non-projective dependency parser

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The SPIRIT collection: an overview of a large web collection

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Improving retrieval accuracy by weighting document types with clickthrough data

                Bookmark

                Author and article information

                Contributors
                Conference
                August 2007
                August 2007
                : 1-6
                Affiliations
                University of Brighton, Lewes Road, Brighton (UK)
                Article
                10.14236/ewic/FDIA2007.1
                © Marina Santini. Published by BCS Learning and Development Ltd. BCS IRSG Symposium: Future Directions in Information Access 2007, Glasgow

                This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

                BCS IRSG Symposium: Future Directions in Information Access 2007
                FDIA
                Glasgow
                28-29 August 2007
                Electronic Workshops in Computing (eWiC)
                Future Directions in Information Access
                Product
                Product Information: 1477-9358BCS Learning & Development
                Self URI (journal page): https://ewic.bcs.org/
                Categories
                Electronic Workshops in Computing

                Comments

                Comment on this article