
      Author Verification Using a Semantic Space Model

      research-article


          Abstract

          In this work we propose solving the author verification problem with a semantic space model built using Latent Dirichlet Allocation (LDA). We experiment with the corpora used in the author identification tasks at PAN 2014 and PAN 2015. These datasets consist of subsets in four languages: English, Spanish, Dutch, and Greek. Each problem in these corpora consists of one to five known documents written by a single author, plus one unknown document. The task is to predict whether the unknown document was written by the author of the known documents. We processed the documents in the dataset and captured each author's fingerprint by generating a probability distribution over the words in the documents. On the PAN 2015 classification task, we achieved accuracies of 81.6%, 75.4%, 74.1%, and 67.1% on the English, Spanish, Dutch, and Greek subsets, respectively. For the English subset in particular, we surpassed the best result reported in both competitions.
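
          The verification decision described in the abstract can be illustrated with a minimal sketch. It assumes that an LDA model has already produced a topic distribution for every document; the averaging of known documents into a profile, the Hellinger distance, the threshold of 0.3, and all function names and numbers below are illustrative assumptions, not the authors' reported procedure.

```python
import math


def average_distribution(distributions):
    """Average several topic distributions into a single author profile."""
    n_docs = len(distributions)
    n_topics = len(distributions[0])
    return [sum(d[k] for d in distributions) / n_docs for k in range(n_topics)]


def hellinger(p, q):
    """Hellinger distance between two discrete distributions.

    Returns 0.0 for identical distributions and 1.0 for distributions
    with no overlapping support.
    """
    return math.sqrt(
        0.5 * sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(p, q))
    )


def same_author(known_topic_dists, unknown_topic_dist, threshold=0.3):
    """Accept the unknown document if it lies close to the author profile.

    The threshold is a hypothetical value; in practice it would have to be
    tuned on training problems.
    """
    profile = average_distribution(known_topic_dists)
    return hellinger(profile, unknown_topic_dist) <= threshold


# Hypothetical topic distributions over three topics (made-up numbers).
known = [[0.70, 0.20, 0.10], [0.60, 0.30, 0.10]]
close_doc = [0.65, 0.25, 0.10]  # resembles the known documents
far_doc = [0.05, 0.15, 0.80]    # dominated by a different topic

print(same_author(known, close_doc))
print(same_author(known, far_doc))
```

          A distance on probability distributions is used here because LDA outputs are distributions rather than arbitrary vectors; a trained classifier over the same features would be an equally plausible design.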


                Author and article information

                Journal
                cys
                Computación y Sistemas
                Comp. y Sist.
                Centro de Investigación en Computación, IPN (México, DF, Mexico)
                ISSN: 1405-5546
                June 2017
                Volume: 21
                Issue: 2
                Pages: 167-179
                Affiliations
                [1] Instituto Politécnico Nacional, Mexico
                [2] Tecnológico Nacional de México, Estado de México, Mexico
                Article
                S1405-55462017000200167
                10.13053/cys-21-2-2732
                56c631aa-1b41-49e4-97b4-f3d1dff099c0

                This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

                History
                Received: 14 November 2016
                Accepted: 17 March 2017
                Page count
                Figures: 0, Tables: 0, Equations: 0, References: 25, Pages: 13
                Product

                SciELO Mexico


                Keywords: author verification, latent Dirichlet allocation, semantic space model, cross-topic, cross-genre
