4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Rethinking Attribute Representation and Injection for Sentiment Classification

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Text attributes, such as user and product information in product reviews, have been used to improve the performance of sentiment classification models. The de facto standard method is to incorporate them as additional biases in the attention mechanism, and more performance gains are achieved by extending the model architecture. In this paper, we show that the above method is the least effective way to represent and inject attributes. To demonstrate this hypothesis, unlike previous models with complicated architectures, we limit our base model to a simple BiLSTM with attention classifier, and instead focus on how and where the attributes should be incorporated in the model. We propose to represent attributes as chunk-wise importance weight matrices and consider four locations in the model (i.e., embedding, encoding, attention, classifier) to inject attributes. Experiments show that our proposed method achieves significant improvements over the standard approach and that attention mechanism is the worst location to inject attributes, contradicting prior work. We also outperform the state-of-the-art despite our use of a simple base model. Finally, we show that these representations transfer well to other tasks. Model implementation and datasets are released here: https://github.com/rktamplayo/CHIM.

          Related collections

          Most cited references23

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          The Stanford CoreNLP Natural Language Processing Toolkit

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Feature selection, L1 vs. L2 regularization, and rotational invariance

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Joint sentiment/topic model for sentiment analysis

                Bookmark

                Author and article information

                Journal
                26 August 2019
                Article
                1908.09590
                41473599-4f20-497f-9e93-75e48e2d49cf

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                EMNLP 2019
                cs.CL

                Theoretical computer science
                Theoretical computer science

                Comments

                Comment on this article