83
views
0
recommends
+1 Recommend
1 collections
    1
    shares
      • Record: found
      • Abstract: found
      • Article: found

      Greedy Optimization for K-Means-Based Consensus Clustering

      research-article

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Consensus clustering aims to fuse several existing basic partitions into an integrated one; this has been widely recognized as a promising tool for multi-source and heterogeneous data clustering. Owing to robust and high-quality performance over traditional clustering methods, consensus clustering attracts much attention, and much efforts have been devoted to develop this field. In the literature, the K-means-based Consensus Clustering (KCC) transforms the consensus clustering problem into a classical K-means clustering with theoretical supports and shows the advantages over the state-of-the-art methods. Although KCC inherits the merits from K-means, it suffers from the initialization sensitivity. Moreover, the current consensus clustering framework separates the basic partition generation and fusion into two disconnected parts. To solve the above two challenges, a novel clustering algorithm, named Greedy optimization of K-means-based Consensus Clustering (GKCC) is proposed. Inspired by the well-known greedy K-means that aims to solve the sensitivity of K-means initialization, GKCC seamlessly combines greedy K-means and KCC together, achieves the merits inherited by GKCC and overcomes the drawbacks of the precursors. Moreover, a 59-sampling strategy is conducted to provide high-quality basic partitions and accelerate the algorithmic speed. Extensive experiments on 36 benchmark datasets demonstrate the significant advantages of GKCC over KCC and KCC++ in terms of the objective function values and standard deviations and external cluster validity.

          Author and article information

          Journal
          TST
          Tsinghua Science and Technology
          Tsinghua University Press (Xueyan Building, Tsinghua University, Beijing 100084, China )
          1007-0214
          05 April 2018
          : 23
          : 2
          : 184-194 (pp. )
          Affiliations
          [1]∙ Xue Li is with the School of Economics and Management, Tsinghua University, Beijing 100084, China.
          [2]∙ Hongfu Liu is with the Department of Electrical and Computer Engineering, Northeastern University, Boston MI 02115, USA. E-mail: liu.hongf@ 123456husky.neu.edu .
          Author notes
          * To whom correspondence should be addressed. E-mail: lix2.11@ 123456sem.tsinghua.edu.cn .

          Xue Li�received the B.E. degree in 2011 from Beihang University, and earned the PhD degree in 2018 from Tsinghua University.�Her research interests generally focus on optimization and operations research, and supply chain management.

          Hongfu Liu�received the bachelor and master degrees in management information systems from Bei-hang University, in 2011 and 2014, respectively. He is currently pursuing the PhD degree in Northeastern University, Boston. His research interests generally focus on data mining and machine learning, with special interests in ensemble learning.

          Article
          1007-0214-23-2-184
          10.26599/TST.2018.9010063
          611dbc2e-0f33-41b0-a963-2c5004782cb2
          Copyright @ 2018
          History
          : 06 February 2017
          : 02 May 2017
          : 25 April 2017
          Categories
          Regular Article

          Software engineering,Data structures & Algorithms,Applied computer science,Computer science,Artificial intelligence,Hardware architecture
          consensus clustering,K-means,greedy optimization,initialization

          Comments

          Comment on this article