SPARSE k-MEANS WITH ℓ∞/ℓ0 PENALTY FOR HIGH-DIMENSIONAL DATA CLUSTERING
One of the existing sparse clustering approaches, ℓ1-k-means, maximizes the weighted between-cluster sum of squares subject to the ℓ1 penalty. In this paper, we propose a sparse clustering method based on an ℓ∞/ℓ0 penalty, which we call ℓ0-k-means. We design an efficient iterative algorithm for solv...
Saved in:
| Published in | Statistica Sinica Vol. 28; no. 3; pp. 1265 - 1284 |
|---|---|
| Main Authors | , , , |
| Format | Journal Article |
| Language | English |
| Published |
Institute of Statistical Science, Academia Sinica and International Chinese Statistical Association
01.07.2018
|
| Online Access | Get full text |
| ISSN | 1017-0405 1996-8507 |
Cover
| Summary: | One of the existing sparse clustering approaches, ℓ1-k-means, maximizes the weighted between-cluster sum of squares subject to the ℓ1 penalty. In this paper, we propose a sparse clustering method based on an ℓ∞/ℓ0 penalty, which we call ℓ0-k-means. We design an efficient iterative algorithm for solving it. To compare the theoretical properties of ℓ1 and ℓ0-k-means, we show that they can be explained explicitly from a thresholding perspective based on different thresholding functions. Moreover, ℓ1 and ℓ0-k-means are proven to have a screening consistent property under Gaussian mixture models. Experiments on synthetic as well as real data justify the outperforming results of ℓ0 with respect to ℓ1-k-means. |
|---|---|
| ISSN: | 1017-0405 1996-8507 |