      Is Open Access

      Name Disambiguation Based on Graph Convolutional Network

      Scientific Programming
      Hindawi Limited


          Abstract

          Recently, massive online academic resources have made scientific study and research more convenient. However, author name ambiguity degrades the user experience when retrieving from literature bases. Mainstream name disambiguation approaches extract features from papers and compute similarities for clustering; they fall into two branches: clustering based on attribute features and clustering based on linkage information. Neither branch alone achieves high performance. To improve the efficiency of literature retrieval and provide technical support for the accurate construction of literature bases, we propose a name disambiguation method based on the Graph Convolutional Network (GCN). The GCN-based disambiguation model designed in this paper combines attribute features and linkage information. We first build paper-to-paper graphs, coauthor graphs, and paper-to-author graphs for each reference item of a name. The nodes in the graphs carry attribute features and the edges carry linkage features. The graphs are then fed to a specialized GCN, which outputs a hybrid representation. Finally, we use a hierarchical clustering algorithm to divide the papers into disjoint clusters. The experimental results show that the proposed model achieves an average F1 value of 77.10% on three name disambiguation datasets. To let the model automatically select the appropriate number of convolution layers and adapt to the structure of different local graphs, we improve upon the prior GCN model by adding an attention mechanism. Compared with the original GCN model, this increases the average precision and F1 value by 2.05% and 0.63%, respectively. In addition, we build a bilingual dataset, BAT, which contains various forms of academic achievements and can serve as an alternative dataset in future name disambiguation research.
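          The pipeline outlined in the abstract (graph, then GCN representation, then hierarchical clustering) can be illustrated with a minimal pure-Python sketch. Everything concrete in it is an assumption for illustration and is not taken from the paper: the toy paper-to-paper graph and features, the untrained identity weight matrix, the standard symmetric-normalized propagation rule, cosine distance with average linkage, the 0.5 merge threshold, and all helper names. The paper's model additionally uses coauthor and paper-to-author graphs, edge features, and a specialized GCN, none of which is reproduced here.

```python
import math


def normalized_adjacency(n, edges):
    """Dense D^-1/2 (A + I) D^-1/2 for an undirected graph with n nodes."""
    a = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(x, y):
    """Plain matrix product of two list-of-lists matrices."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*y)]
            for row in x]


def gcn_layer(a_hat, h, w):
    """One propagation step: ReLU(A_hat @ H @ W)."""
    z = matmul(matmul(a_hat, h), w)
    return [[max(0.0, v) for v in row] for row in z]


def cosine_distance(u, v):
    dot = sum(p * q for p, q in zip(u, v))
    nu = math.sqrt(sum(p * p for p in u))
    nv = math.sqrt(sum(q * q for q in v))
    if nu == 0.0 or nv == 0.0:
        return 1.0
    return 1.0 - dot / (nu * nv)


def agglomerative(points, threshold):
    """Average-linkage clustering; stop when the closest pair of
    clusters is farther apart than `threshold`."""
    clusters = [[i] for i in range(len(points))]

    def link(c1, c2):
        total = sum(cosine_distance(points[i], points[j])
                    for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        d, i, j = min((link(clusters[i], clusters[j]), i, j)
                      for i in range(len(clusters))
                      for j in range(i + 1, len(clusters)))
        if d > threshold:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]


# Toy data (assumed): six papers under one ambiguous name; papers 0-2
# and 3-5 are linked among themselves, e.g. through shared coauthors.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5)]
features = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.2],
            [0.0, 1.0], [0.1, 0.9], [0.2, 1.0]]
weights = [[1.0, 0.0], [0.0, 1.0]]  # identity: no training in this sketch

a_hat = normalized_adjacency(len(features), edges)
embeddings = gcn_layer(a_hat, features, weights)
clusters = agglomerative(embeddings, threshold=0.5)
print(clusters)
```

          In this toy setup the propagation step averages each paper's features with those of its linked neighbours, so papers in the same connected group end up with identical embeddings and the clustering separates the two groups. A trained model would learn the weight matrix instead of fixing it to the identity.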


                Author and article information

                Journal: Scientific Programming
                Publisher: Hindawi Limited
                ISSN: 1875-919X, 1058-9244
                Published: May 8 2021
                Volume: 2021
                Pages: 1-11
                Affiliations
                [1] University of Science and Technology Beijing, Beijing 100083, China
                [2] Beihang University, Beijing 100191, China
                [3] China Association for Science and Technology, Beijing 100081, China
                Article
                DOI: 10.1155/2021/5577692
                © 2021

                https://creativecommons.org/licenses/by/4.0/
