28
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Portuguese Named Entity Recognition using BERT-CRF

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Recent advances in language representation using neural networks have made it viable to transfer the learned internal states of a trained model to downstream natural language processing tasks, such as named entity recognition (NER) and question answering. It has been shown that the leverage of pre-trained language models improves the overall performance on many tasks and is highly beneficial when labeled data is scarce. In this work, we employ a pre-trained BERT with Conditional Random Fields (CRF) architecture to the NER task on the Portuguese language, combining the transfer capabilities of BERT with the structured predictions of CRF. We explore feature-based and fine-tuning training strategies for the BERT model. Our fine-tuning approach obtains new state-of-the-art results on the HAREM I dataset, improving the F1-score by 3.2 points on the selective scenario (5 NE classes) and by 3.8 points on the total scenario (10 NE classes).

          Related collections

          Most cited references4

          • Record: found
          • Abstract: not found
          • Article: not found

          Named Entity Recognition: Fallacies, challenges and opportunities

            Bookmark
            • Record: found
            • Abstract: not found
            • Book Chapter: not found

            LeNER-Br: A Dataset for Named Entity Recognition in Brazilian Legal Text

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Applying Deep Neural Networks to Named Entity Recognition in Portuguese Texts

                Bookmark

                Author and article information

                Journal
                23 September 2019
                Article
                1909.10649
                f619c167-ee94-4a05-a444-995b7c951d8d

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.CL cs.IR cs.LG

                Theoretical computer science,Information & Library science,Artificial intelligence

                Comments

                Comment on this article