ScienceOpen: research and publishing network

For Researchers

Search
Advanced search

28

views

    

0

recommends

0

shares

Record: found
Abstract: found
Article: found

Is Open Access

Portuguese Named Entity Recognition using BERT-CRF

Preprint

Author(s): Fábio Souza , Rodrigo Nogueira , Roberto Lotufo

Publication date Created: 23 September 2019

Read this article at

ScienceOpen ArXiv

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Recent advances in language representation using neural networks have made it viable to transfer the learned internal states of a trained model to downstream natural language processing tasks, such as named entity recognition (NER) and question answering. It has been shown that the leverage of pre-trained language models improves the overall performance on many tasks and is highly beneficial when labeled data is scarce. In this work, we employ a pre-trained BERT with Conditional Random Fields (CRF) architecture to the NER task on the Portuguese language, combining the transfer capabilities of BERT with the structured predictions of CRF. We explore feature-based and fine-tuning training strategies for the BERT model. Our fine-tuning approach obtains new state-of-the-art results on the HAREM I dataset, improving the F1-score by 3.2 points on the selective scenario (5 NE classes) and by 3.8 points on the total scenario (10 NE classes).

Related collections

Most cited references 4

Record: found
Abstract: not found
Article: not found

Named Entity Recognition: Fallacies, challenges and opportunities

Jorge Morato, Julián Urbano, Mónica Marrero … (2013)

0 comments Cited 20 times – based on 0 reviews      Review now

Record: found
Abstract: not found
Book Chapter: not found

LeNER-Br: A Dataset for Named Entity Recognition in Brazilian Legal Text

Pedro Luz de Araujo, Teófilo de Campos, Renato de Oliveira … (2018)

0 comments Cited 5 times – based on 0 reviews

Record: found
Abstract: not found
Conference Proceedings: not found

Applying Deep Neural Networks to Named Entity Recognition in Portuguese Texts

Ivo Fernandes, Henrique Lopes Cardoso, Eugenio Oliveira (2018)

0 comments Cited 2 times – based on 0 reviews

Author and article information

Journal

Publication date Created: 23 September 2019

Article

ArXiV ID: 1909.10649

SO-VID: f619c167-ee94-4a05-a444-995b7c951d8d

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

History

Custom metadata

Categories cs.CL cs.IR cs.LG

ScienceOpen disciplines: Theoretical computer science,Information & Library science,Artificial intelligence

Data availability:

ScienceOpen disciplines: Theoretical computer science, Information & Library science, Artificial intelligence

Comments

Comment on this article