35
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Plagiarism Detection using ROUGE and WordNet

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          With the arrival of digital era and Internet, the lack of information control provides an incentive for people to freely use any content available to them. Plagiarism occurs when users fail to credit the original owner for the content referred to, and such behavior leads to violation of intellectual property. Two main approaches to plagiarism detection are fingerprinting and term occurrence; however, one common weakness shared by both approaches, especially fingerprinting, is the incapability to detect modified text plagiarism. This study proposes adoption of ROUGE and WordNet to plagiarism detection. The former includes ngram co-occurrence statistics, skip-bigram, and longest common subsequence (LCS), while the latter acts as a thesaurus and provides semantic information. N-gram co-occurrence statistics can detect verbatim copy and certain sentence modification, skip-bigram and LCS are immune from text modification such as simple addition or deletion of words, and WordNet may handle the problem of word substitution.

          Related collections

          Most cited references2

          • Record: found
          • Abstract: not found
          • Article: not found

          Methods for identifying versioned and plagiarized documents

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Copy detection mechanisms for digital documents

              Bookmark

              Author and article information

              Journal
              22 March 2010
              Article
              1003.4065
              77b0cf59-9902-4ee6-b4da-5c201daa5dd9

              http://arxiv.org/licenses/nonexclusive-distrib/1.0/

              History
              Custom metadata
              Journal of Computing, Volume 2, Issue 3, March 2010
              cs.OH cs.CL

              Comments

              Comment on this article