
      Teaching Syntax by Adversarial Distraction

      Preprint
Juho Kim, Christopher Malon, Asim Kadav


          Abstract

          Existing entailment datasets mainly pose problems which can be answered without attention to grammar or word order. Learning syntax requires comparing examples where different grammar and word order change the desired classification. We introduce several datasets based on synthetic transformations of natural entailment examples in SNLI or FEVER, to teach aspects of grammar and word order. We show that without retraining, popular entailment models are unaware that these syntactic differences change meaning. With retraining, some but not all popular entailment models can learn to compare the syntax properly.
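
          To make the idea concrete, here is a minimal, hypothetical sketch of one such word-order transformation: swapping the subject and object of a premise keeps the bag of words identical while reversing the meaning, so an order-insensitive model will misclassify the pair. The function name and swap rule below are illustrative assumptions, not the authors' generation code.

```python
# Hypothetical sketch of one adversarial "distraction" transformation in the
# spirit of the abstract: swap the subject and object of an SVO premise.
# The function name and the swap rule are illustrative assumptions, not the
# paper's actual data-generation code.

def swap_subject_object(premise: str, subj: str, obj: str):
    """Exchange subj and obj in the premise. The result keeps the same bag
    of words, so a model that ignores word order cannot tell the pair apart."""
    hypothesis = (premise.replace(subj, "<TMP>")
                         .replace(obj, subj)
                         .replace("<TMP>", obj))
    # Reversing the roles changes the meaning: the swapped sentence is no
    # longer entailed by the original.
    return {"premise": premise, "hypothesis": hypothesis,
            "label": "contradiction"}

example = swap_subject_object("The dog chased the cat.", "dog", "cat")
# {'premise': 'The dog chased the cat.',
#  'hypothesis': 'The cat chased the dog.',
#  'label': 'contradiction'}
```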


          Most cited references (11)

          • Neural Machine Translation of Rare Words with Subword Units

          • Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

          • A large annotated corpus for learning natural language inference

              Understanding entailment and contradiction is fundamental to understanding natural language, and inference about entailment and contradiction is a valuable testing ground for the development of semantic representations. However, machine learning research in this area has been dramatically limited by the lack of large-scale resources. To address this, we introduce the Stanford Natural Language Inference corpus, a new, freely available collection of labeled sentence pairs, written by humans doing a novel grounded task based on image captioning. At 570K pairs, it is two orders of magnitude larger than all other resources of its type. This increase in scale allows lexicalized classifiers to outperform some sophisticated existing entailment models, and it allows a neural network-based model to perform competitively on natural language inference benchmarks for the first time.
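
              For orientation, here is a minimal sketch of reading the SNLI sentence pairs described above. It assumes the Hugging Face `datasets` library and its "snli" dataset with premise/hypothesis/label fields; that tooling is an assumption of this sketch, not part of the corpus paper itself.

```python
# Minimal sketch: iterate over SNLI premise/hypothesis pairs.
# Assumes the Hugging Face `datasets` library and its "snli" dataset;
# the corpus itself is the 570K labeled sentence pairs described above.
from datasets import load_dataset

snli = load_dataset("snli", split="train")
label_names = ["entailment", "neutral", "contradiction"]

for example in snli.select(range(5)):
    if example["label"] < 0:
        continue  # label -1 marks pairs without annotator consensus
    print(f'{example["premise"]}  =>  {example["hypothesis"]}'
          f'  [{label_names[example["label"]]}]')
```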

                Author and article information

                Date: 25 October 2018
                arXiv ID: 1810.11067
                License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                Citation: Juho Kim, Christopher Malon, and Asim Kadav. 2018. "Teaching Syntax by Adversarial Distraction." Proceedings of the EMNLP First Workshop on Fact Extraction and Verification.
                Note: To appear at the EMNLP 2018 First Workshop on Fact Extraction and Verification (FEVER).
                Subjects: cs.CL; Theoretical computer science
