6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Chinese Personal Name Disambiguation Based on Clustering

      1 , 2 , 1 , 2
      Wireless Communications and Mobile Computing
      Hindawi Limited

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Personal name disambiguation is a significant issue in natural language processing, which is the basis for many tasks in automatic information processing. This research explores the Chinese personal name disambiguation based on clustering technique. Preprocessing is applied to transform raw corpus into standardized format at the beginning. And then, Chinese word segmentation, part-of-speech tagging, and named entity recognition are accomplished by lexical analysis. Furthermore, we make an effort to extract features that can better disambiguate Chinese personal names. Some rules for identifying target personal names are created to improve the experimental effect. Additionally, many calculation methods of feature weights are implemented such as bool weight, absolute frequency weight, tf-idf weight, and entropy weight. As for clustering algorithm, an agglomerative hierarchical clustering is selected by comparison with other clustering methods. Finally, a labeling approach is employed to bring forward feature words that can represent each cluster. The experiment achieves a good result for five groups of Chinese personal names.

          Related collections

          Most cited references3

          • Record: found
          • Abstract: not found
          • Article: not found

          Finding community structure in social network of Renren

          C Fan (2013)
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Chinese named entity recognition and disambiguation based on multi-stage clustering

            G. Li (2013)
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Aauthor name disambiguation using BP neural networks under missing data

              H. Ke (2018)
                Bookmark

                Author and article information

                Contributors
                Journal
                Wireless Communications and Mobile Computing
                Wireless Communications and Mobile Computing
                Hindawi Limited
                1530-8677
                1530-8669
                May 14 2021
                May 14 2021
                : 2021
                : 1-7
                Affiliations
                [1 ]The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China
                [2 ]Jiangsu Key Laboratory of Media Design and Software Technology, Jiangnan University, Wuxi 214122, China
                Article
                10.1155/2021/3790176
                5f273b87-bf35-45c4-8176-ebec3585fc0d
                © 2021

                https://creativecommons.org/licenses/by/4.0/

                History

                Comments

                Comment on this article