32
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Comparative Study of Efficient Initialization Methods for the K-Means Clustering Algorithm

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of the cluster centers. Numerous initialization methods have been proposed to address this problem. In this paper, we first present an overview of these methods with an emphasis on their computational efficiency. We then compare eight commonly used linear time complexity initialization methods on a large and diverse collection of data sets using various performance criteria. Finally, we analyze the experimental results using non-parametric statistical tests and provide recommendations for practitioners. We demonstrate that popular initialization methods often perform poorly and that there are in fact strong alternatives to these methods.

          Related collections

          Most cited references35

          • Record: found
          • Abstract: not found
          • Article: not found

          An Algorithm for Vector Quantizer Design

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            An efficient k-means clustering algorithm: analysis and implementation

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              LOF

                Bookmark

                Author and article information

                Journal
                10 September 2012
                Article
                10.1016/j.eswa.2012.07.021
                1209.1960
                5da494a8-5e34-40bf-a8b6-d9e631afc714

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Expert Systems with Applications 40 (2013) 200-210
                17 pages, 1 figure, 7 tables
                cs.LG cs.CV

                Comments

                Comment on this article