28
views
1
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Quantifying and suppressing ranking bias in a large citation network

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          It is widely recognized that citation counts for papers from different fields cannot be directly compared because different scientific fields adopt different citation practices. Citation counts are also strongly biased by paper age since older papers had more time to attract citations. Various procedures aim at suppressing these biases and give rise to new normalized indicators, such as the relative citation count. We use a large citation dataset from Microsoft Academic Graph and a new statistical framework based on the Mahalanobis distance to show that the rankings by well known indicators, including the relative citation count and Google's PageRank score, are significantly biased by paper field and age. We propose a general normalization procedure motivated by the \(z\)-score which produces much less biased rankings when applied to citation count and PageRank score.

          Related collections

          Most cited references23

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Universality of citation distributions: towards an objective measure of scientific impact

          , , (2008)
          We study the distributions of citations received by a single publication within several disciplines, spanning broad areas of science. We show that the probability that an article is cited \(c\) times has large variations between different disciplines, but all distributions are rescaled on a universal curve when the relative indicator \(c_f=c/c_0\) is considered, where \(c_0\) is the average number of citations per article for the discipline. In addition we show that the same universal behavior occurs when citation distributions of articles published in the same field, but in different years, are compared. These findings provide a strong validation of \(c_f\) as an unbiased indicator for citation performance across disciplines and years. Based on this indicator, we introduce a generalization of the h-index suitable for comparing scientists working in different fields.
            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            An Overview of Microsoft Academic Service (MAS) and Applications

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Relative indicators and relational charts for comparative assessment of publication output and citation impact

                Bookmark

                Author and article information

                Journal
                2017-03-23
                Article
                1703.08071
                ebee9f55-28fe-4280-8fc3-8735adda82c4

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Main text (pp. 1-12) and Appendices (pp. 13-17)
                physics.soc-ph cs.DL cs.IR physics.data-an stat.AP

                General physics,Applications,Information & Library science,Mathematical & Computational physics

                Comments

                Comment on this article