Spectral clustering using PCKID – A probabilistic cluster kernel for incomplete data
Permanent lenke
https://hdl.handle.net/10037/13697Dato
2017-05-19Type
Journal articleTidsskriftartikkel
Manuskript
Peer reviewed
Preprint
Sammendrag
In this paper, we propose PCKID, a novel, robust, kernel function for spectral clustering, specifically designed to handle incomplete data. By combining posterior distributions of Gaussian Mixture Models for incomplete data on different scales, we are able to learn a kernel for incomplete data that does not depend on any critical hyperparameters, unlike the commonly used RBF kernel. To evaluate our method, we perform experiments on two real datasets. PCKIDoutperforms the baseline methods for all fractions of missing values and in some cases outperforms the baseline methods with up to 25% points.
Beskrivelse
Manuscript version. The final publication is available at Springer via https://doi.org/10.1007/978-3-319-59126-1_36.