Association Journal of CSIAM
Supervised by Ministry of Education of PRC
Sponsored by Xi'an Jiaotong University
ISSN 1005-3085  CN 61-1269/O1

Chinese Journal of Engineering Mathematics ›› 2024, Vol. 41 ›› Issue (6): 1006-1020.doi: 10.3969/j.issn.1005-3085.2024.06.002

Previous Articles     Next Articles

A Robust Evolutionary Clustering Algorithm Based on Local Structure Self-expression

LI Chunzhong1,  JU Wenliang1,  JING Kaili2,  GUI Yang3   

  1. 1. School of Statistics and Applied Mathematics, Anhui University of Finance and Economics, Bengbu 233000
    2. School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049
    3. School of Mathematics and Physics, University of Science and Technology Beijing, Beijing 100083
  • Received:2024-01-25 Accepted:2024-04-14 Online:2024-12-15 Published:2024-12-15
  • Supported by:
    The Natural Science Foundation of Colleges and Universities in Anhui Province (KJ2021A0481; KJ2021A0473).

Abstract:

Clustering is an unsupervised learning method that measures the similarity and difference between data by analyzing sample features. It utilizes the characteristics of high intra cluster similarity and large inter cluster differences to automate the process of grouping data. It is widely used in fields such as computer vision, text mining, biological information and so on. There is still improvement room in clustering algorithms in terms of robustness, universality, and class number selection, and the effectiveness of the algorithms is largely influenced by the density and manifold of the dataset. This paper proposes a robust evolutionary clustering algorithm based on local structure self-expression. This algorithm uses radial basis functions and adds prior information to obtain local density difference features of the data, constructing a new similarity measure. In this process, the extraction mechanism of local structural features of data and the recognition mechanism of stable classes are integrated, making clustering more robust and universal. Dynamic evolutionary clustering has natural advantages in these two aspects, which can continuously optimize clustering results during the dynamic clustering process, resulting in significant improvements in clustering performance. The new algorithm integrates local and global features through self-expression of the structure information in the dataset, while monitoring the stability of the class during dynamic evolution, in order to obtain better final clustering results. The experimental results on both synthetic and real datasets demonstrate that the clustering performance of the new algorithm is superior.

Key words: clustering, similarity measurement, relative local density, nearest neighbor, self-expression

CLC Number: