
      Word Embedding Enrichment for Dictionary Construction: An Example of Incivility in Cantonese

      research-article


          Abstract

          Dictionary-based methods remain valuable for measuring concepts in text, even though supervised machine learning is now widely used in communication research. The present study proposes a semi-automatic and easily implemented method to build and enrich dictionaries based on word embeddings. As an example, we create a dictionary of political incivility that contains vulgarity and name-calling words in Cantonese. The study shows that dictionary-based classification outperforms supervised machine learning methods, including deep neural network models. Furthermore, a small number of random seed words can generate a highly accurate dictionary. However, the uncivil content detected is only weakly correlated with perceived incivility, as we demonstrate in a population-based survey experiment. The strengths and limitations of dictionary-based methods are discussed.
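To make the general idea of embedding-based dictionary enrichment concrete, here is a minimal sketch. It is not the authors' pipeline: the toy 2-D vectors, the English stand-in words, the 0.95 similarity threshold, and the function names `enrich` and `is_uncivil` are all assumptions made for illustration. A real application would presumably load embeddings trained on a Cantonese corpus and have human coders review the candidate words.

```python
import math

# Toy 2-D vectors standing in for trained word embeddings (hypothetical values).
EMBEDDINGS = {
    "idiot": (0.9, 0.1),
    "moron": (0.85, 0.2),
    "fool": (0.8, 0.3),
    "policy": (0.1, 0.9),
    "budget": (0.2, 0.95),
}

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def enrich(seeds, embeddings, threshold=0.95):
    """Return the seeds plus every vocabulary word whose cosine similarity
    to at least one seed meets the (assumed) threshold."""
    known_seeds = [s for s in seeds if s in embeddings]
    expanded = set(seeds)
    for word, vec in embeddings.items():
        if any(cosine(vec, embeddings[s]) >= threshold for s in known_seeds):
            expanded.add(word)
    return expanded

def is_uncivil(tokens, dictionary):
    """Dictionary-based classification: flag a text if any token is listed."""
    return any(token in dictionary for token in tokens)

dictionary = enrich({"idiot"}, EMBEDDINGS)
```

With these toy vectors, a single seed word pulls in its two near neighbours while leaving the unrelated words out; the expanded set can then be used to flag tokenized texts via `is_uncivil`.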


          Author and article information

          Journal
          Computational Communication Research (CCR)
          Amsterdam University Press (Amsterdam)
          ISSN: 2665-9085
          2023; 5(1): 1
          Affiliations
          The Chinese University of Hong Kong
          Department of Journalism, University of Illinois Urbana-Champaign
          Department of Sociology, The University of Southern California
          Article
          DOI: 10.5117/CCR2023.1.10.LIAN
          © The author(s)

          This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

          Categories
          Article

          dictionary construction, swearing, Cantonese, machine learning, political incivility
