Character-Aware Neural Language Models

Preprint

Author(s): Yoon Kim, Yacine Jernite, David Sontag, Alexander M. Rush

Publication date Created: 2015-08-26

Abstract

We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word level. Our model employs a convolutional neural network (CNN) and a highway network over characters, whose output is given to a long short-term memory (LSTM) recurrent neural network language model (RNN-LM). On the English Penn Treebank the model is on par with the existing state of the art despite having 60% fewer parameters. On languages with rich morphology (Arabic, Czech, French, German, Spanish, Russian), the model outperforms word-level/morpheme-level LSTM baselines, again with fewer parameters. The results suggest that on many languages, character inputs are sufficient for language modeling. Analysis of word representations obtained from the character composition part of the model reveals that the model is able to encode, from characters only, both semantic and orthographic information.
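
The character composition step described in the abstract (a CNN over characters followed by a highway network, producing a word vector for the LSTM) can be illustrated with a toy example. The following is a minimal pure-Python sketch, not the authors' implementation. Everything specific in it is an assumption made for illustration: the function names, the tiny dimensions, the random weights, the tanh and ReLU nonlinearities, max-over-time pooling, zero-padding of short words, and the negative transform-gate bias. The word-level LSTM that would consume the resulting vector is omitted.

```python
import math
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def char_cnn(char_vecs, filters, biases):
    """Slide each filter over the character embeddings, then max-pool over positions.

    char_vecs: one embedding (list of floats) per character of the word.
    filters:   each filter is a list of `width` weight vectors (assumed layout).
    Returns one feature per filter.
    """
    dim = len(filters[0][0])
    features = []
    for filt, bias in zip(filters, biases):
        width = len(filt)
        # Assumption: zero-pad words shorter than the filter so one window exists.
        pad = [[0.0] * dim for _ in range(max(0, width - len(char_vecs)))]
        padded = char_vecs + pad
        responses = [
            math.tanh(sum(dot(w, c) for w, c in zip(filt, padded[s:s + width])) + bias)
            for s in range(len(padded) - width + 1)
        ]
        features.append(max(responses))  # max-over-time pooling
    return features


def highway(x, w_h, b_h, w_t, b_t):
    """One highway layer: y = t * relu(W_H x + b_H) + (1 - t) * x, t = sigmoid(W_T x + b_T)."""
    out = []
    for i in range(len(x)):
        h = max(0.0, dot(w_h[i], x) + b_h[i])  # candidate transform
        t = sigmoid(dot(w_t[i], x) + b_t[i])   # transform gate in (0, 1)
        out.append(t * h + (1.0 - t) * x[i])   # gate mixes transform and carry
    return out


def make_toy_params(char_dim=4, widths=(1, 2, 3), seed=0):
    """Hypothetical random parameters: one filter per width, lowercase a-z only."""
    rng = random.Random(seed)

    def rand_vec(n):
        return [rng.uniform(-0.5, 0.5) for _ in range(n)]

    char_emb = {c: rand_vec(char_dim) for c in "abcdefghijklmnopqrstuvwxyz"}
    filters = [[rand_vec(char_dim) for _ in range(w)] for w in widths]
    biases = [0.0] * len(widths)
    n = len(widths)
    hw = ([rand_vec(n) for _ in range(n)], [0.0] * n,    # W_H, b_H
          [rand_vec(n) for _ in range(n)], [-2.0] * n)   # W_T, b_T (bias favors carry)
    return char_emb, filters, biases, hw


def encode_word(word, params):
    """Map a word to a vector using only its characters."""
    char_emb, filters, biases, hw = params
    char_vecs = [char_emb[c] for c in word]
    return highway(char_cnn(char_vecs, filters, biases), *hw)


PARAMS = make_toy_params()
word_vec = encode_word("cats", PARAMS)
print(len(word_vec), word_vec)
```

In this sketch the word vector has one entry per filter, so its size is independent of word length. In a full model, a vector like this would replace the word-embedding lookup at the input of the LSTM, while the output layer would still predict over a word vocabulary.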

Author and article information

Publication date (created): 2015-08-26

Publication date (updated): 2015-12-01

arXiv ID: 1508.06615

SO-VID: ee4f774e-79e7-426b-8684-868a01f8be08

License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Comments: AAAI 2016

Categories: cs.CL, cs.NE, stat.ML

ScienceOpen disciplines: Theoretical computer science, Machine learning, Neural & Evolutionary computing
