A robust EM clustering approach: ROBEM


Öner Y., Bulut H.

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, cilt.50, sa.19, ss.4587-4605, 2021 (SCI-Expanded) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 50 Sayı: 19
  • Basım Tarihi: 2021
  • Doi Numarası: 10.1080/03610926.2020.1722840
  • Dergi Adı: COMMUNICATIONS IN STATISTICS-THEORY AND METHODS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Business Source Elite, Business Source Premier, CAB Abstracts, Compendex, Veterinary Science Database, zbMATH, Civil Engineering Abstracts
  • Sayfa Sayıları: ss.4587-4605
  • Anahtar Kelimeler: Clustering analysis, robust cluster algorithms, EM, spatial EM, TCLUST, trimmed k-means, ROBEM, TRIMMING APPROACH
  • Ondokuz Mayıs Üniversitesi Adresli: Evet

Özet

Cluster analysis is defined as a group of multivariate statistical methods that are used to classify identical, or similar units. As is the case with all other classical statistical methods, classical clustering analysis gives misleading results when there is an outlier in the multivariate data set. To solve this problem many approaches have been proposed. This study focuses on developing a new approach, aiming to make the expectation maximization (EM) clustering algorithm resistant to outliers. We proposed a new robust hybrid clustering algorithm called robust EM (ROBEM) to reach our aim. This algorithm combines the EM clustering algorithm with robust principal component analysis (ROBPCA) algorithm. Spatial EM algorithm was proposed as a robust EM algorithm in the literature, but our simulation results and sample data applications showed that the ROBEM algorithm was more successful than the spatial EM algorithm in terms of outlier detection rate and faulty classification rate. Moreover, the proposed algorithm ROBEM provides similar results to the other well known robust clustering algorithms, such as TCLUST and Trimmed k-Means.