39
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Distortive Effects of Initial-Based Name Disambiguation on Measurements of Large-Scale Coauthorship Networks

      Preprint
      ,

      Read this article at

      ScienceOpenPublisherArXiv
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Scholars have often relied on name initials to resolve name ambiguities in large-scale coauthorship network research. This approach bears the risk of incorrectly merging or splitting author identities. The use of initial-based disambiguation has been justified by the assumption that such errors would not affect research findings too much. This paper tests this assumption by analyzing coauthorship networks from five academic fields - biology, computer science, nanoscience, neuroscience, and physics - and an interdisciplinary journal, PNAS. Name instances in datasets of this study were disambiguated based on heuristics gained from previous algorithmic disambiguation solutions. We use disambiguated data as a proxy of ground-truth to test the performance of three types of initial-based disambiguation. Our results show that initial-based disambiguation can misrepresent statistical properties of coauthorship networks: it deflates the number of unique authors, number of component, average shortest paths, clustering coefficient, and assortativity, while it inflates average productivity, density, average coauthor number per author, and largest component size. Also, on average, more than half of top 10 productive or collaborative authors drop off the lists. Asian names were found to account for the majority of misidentification by initial-based disambiguation due to their common surname and given name initials.

          Related collections

          Most cited references6

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Evolution of the social network of scientific collaborations

          The co-authorship network of scientists represents a prototype of complex evolving networks. By mapping the electronic database containing all relevant journals in mathematics and neuro-science for an eight-year period (1991-98), we infer the dynamic and the structural mechanisms that govern the evolution and topology of this complex system. First, empirical measurements allow us to uncover the topological measures that characterize the network at a given moment, as well as the time evolution of these quantities. The results indicate that the network is scale-free, and that the network evolution is governed by preferential attachment, affecting both internal and external links. However, in contrast with most model predictions the average degree increases in time, and the node separation decreases. Second, we propose a simple model that captures the network's time evolution. Third, numerical simulations are used to uncover the behavior of quantities that could not be predicted analytically.
            • Record: found
            • Abstract: not found
            • Article: not found

            Co-Authorship in Management and Organizational Studies: An Empirical and Network Analysis*

              • Record: found
              • Abstract: not found
              • Article: not found

              On the robustness of centrality measures under conditions of imperfect data

                Author and article information

                Journal
                2015-02-22
                Article
                10.1002/asi.23489
                1502.06306
                6d7109f7-1e2f-405a-ac74-a7987bd0a857

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                This is a preprint of an article accepted for publication in Journal of the Association for Information Science and Technology
                cs.DL cs.SI physics.soc-ph

                Social & Information networks,General physics,Information & Library science

                Comments

                Comment on this article

                Related Documents Log