Machine Learning using Principal Manifolds and Mode Seeking
Permanent link
https://hdl.handle.net/10037/9921
Date
2016-10-14
Type
Doctoral thesis
Author
Myhre, Jonas Nordhaug
Abstract
A wide range of machine learning methods have taken advantage of density
estimates and their derivatives, including methodology related to principal manifolds and mode seeking, finding use in a number of real applications.
However, research on improving density derivative
estimation and its practical use has received relatively limited attention.
Also, the fact that the derivatives of a distribution over a point set can provide a statistical framework for manifold learning has not yet been used to its full potential.
The aim of this thesis is to help fill these gaps, and to provide novel machine
learning algorithms and tools based on principal manifolds using density derivatives.
We present three lines of work toward this goal.
The first work presents a fast and exact kernel density derivative estimator.
The method takes advantage of the fact that the derivatives of a multivariate product kernel can be decomposed into a product of univariate differentiations.
By cutting redundant multiplications, we obtain a significant speedup while retaining an exact estimator.
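The factorization mentioned above can be illustrated with a minimal sketch. It is not the estimator from the thesis: it assumes a Gaussian product kernel with one shared bandwidth `h`, covers only first-order derivatives, and the names `gauss` and `kde_and_gradient` are hypothetical. For the Gaussian, K'(u) = -(u / h^2) K(u), so each partial derivative differentiates a single univariate factor and the full product can be computed once per sample and reused for every coordinate.

```python
import math


def gauss(u, h):
    """Univariate Gaussian kernel with bandwidth h."""
    return math.exp(-0.5 * (u / h) ** 2) / (math.sqrt(2.0 * math.pi) * h)


def kde_and_gradient(x, samples, h):
    """Density and gradient at x for a Gaussian product-kernel estimate.

    Assumption: the same bandwidth h is used in every dimension.
    """
    n, dim = len(samples), len(x)
    density = 0.0
    grad = [0.0] * dim
    for s in samples:
        diffs = [xj - sj for xj, sj in zip(x, s)]
        # Product of the univariate kernels, computed once per sample.
        prod = 1.0
        for u in diffs:
            prod *= gauss(u, h)
        density += prod
        # Differentiating factor j only rescales the shared product.
        for j, u in enumerate(diffs):
            grad[j] += -(u / h ** 2) * prod
    return density / n, [g / n for g in grad]
```

Calling `kde_and_gradient([0.3, 0.4], [(0.0, 0.0), (1.0, 0.5)], 0.7)` returns the density estimate and a two-element gradient; the result is exact for this kernel, with no approximation introduced by reusing the product.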
Next, we present a novel algorithm for manifold unwrapping based on tracing the gradient flow along a manifold estimated using density derivatives.
This allows a direct and geometrically intuitive approach consistent with theory from differential geometry.
Promising results are shown on both real and synthetic data sets.
Finally, we provide a novel framework for robust mode seeking.
It is based on ensemble clustering and resampling techniques.
This yields a clustering algorithm that is both robust to parameter choices and capable of handling data sets of very high dimension.
Concretely, we build the ensemble by running multiple instances of a k-nearest-neighbor mode seeking algorithm.
We show good results on benchmark tests, as well as a case study involving medical health records.
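As a rough illustration of the ensemble idea described in the abstract, the sketch below runs a simple k-nearest-neighbor mode seeking step for several values of k and combines the runs through a co-association count. It is not the method of Paper 4: the density proxy (inverse distance to the k-th neighbor), the majority-vote linking rule, and the names `knn_mode_seek` and `ensemble_labels` are assumptions made for this example, and the resampling step mentioned in the abstract is omitted.

```python
import math


def knn_mode_seek(points, k):
    """Label each point by the index of the mode it climbs to."""
    n = len(points)
    neighbors, density = [], []
    for i in range(n):
        order = sorted(range(n), key=lambda j: math.dist(points[i], points[j]))
        knn = order[:k + 1]  # the point itself plus its k nearest neighbors
        neighbors.append(knn)
        # Assumed density proxy: inverse distance to the k-th neighbor.
        density.append(1.0 / (math.dist(points[i], points[knn[-1]]) + 1e-12))
    # Each point points at the densest member of its own neighborhood.
    parent = [max(neighbors[i], key=lambda j: density[j]) for i in range(n)]
    labels = []
    for i in range(n):
        j = i
        while parent[j] != j:  # follow pointers uphill until a mode is reached
            j = parent[j]
        labels.append(j)
    return labels


def ensemble_labels(points, ks):
    """Combine one mode-seeking run per k using co-association counts."""
    n = len(points)
    runs = [knn_mode_seek(points, k) for k in ks]
    # co[i][j] counts the runs in which i and j share a mode.
    co = [[sum(r[i] == r[j] for r in runs) for j in range(n)] for i in range(n)]
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                # Link points that agree in more than half of the runs.
                if labels[j] == -1 and 2 * co[i][j] > len(runs):
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels
```

Because the final labels depend on agreement across runs rather than on any single k, a poor choice of k in one run has limited influence on the result, which is the kind of robustness the abstract refers to.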
Description
The papers of this thesis are not available in Munin.
Paper 1: Shaker, M., Myhre, J. N., Erdogmus, D.: “Computationally Efficient Exact Calculation of Kernel Density Derivatives”. Available in Journal of Signal Processing Systems 2015, 81(3):321-332.
Paper 2: Myhre, J. N., Shaker, M., Kaba, M. D., Erdogmus, D.: “Manifold unwrapping using density ridges”. (Manuscript).
Paper 3: Shaker, M., Myhre, J. N., Kaba, M. D., Erdogmus, D.: “Invertible nonlinear cluster unwrapping”. Available in 2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP). ISBN: 978-1-4799-3694-6.
Paper 4: Myhre, J. N., Mikalsen, K., Løkse, S., Jenssen, R.: “A robust clustering using a kNN mode seeking ensemble”. (Manuscript).
Publisher
UiT The Arctic University of Norway (UiT Norges arktiske universitet)
Copyright 2016 The Author(s)