4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      NUBES: A Corpus of Negation and Uncertainty in Spanish Clinical Texts

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This paper introduces the first version of the NUBes corpus (Negation and Uncertainty annotations in Biomedical texts in Spanish). The corpus is part of an on-going research and currently consists of 29,682 sentences obtained from anonymised health records annotated with negation and uncertainty. The article includes an exhaustive comparison with similar corpora in Spanish, and presents the main annotation and design decisions. Additionally, we perform preliminary experiments using deep learning algorithms to validate the annotated dataset. As far as we know, NUBes is the largest publicly available corpus for negation in Spanish and the first that also incorporates the annotation of speculation cues, scopes, and events.

          Related collections

          Author and article information

          Journal
          02 April 2020
          Article
          2004.01092
          cb659ae5-7719-4477-af09-3949e2f87d5b

          http://creativecommons.org/licenses/by-nc-sa/4.0/

          History
          Custom metadata
          Accepted at the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)
          cs.CL

          Theoretical computer science
          Theoretical computer science

          Comments

          Comment on this article