Show simple item record

dc.contributor.advisor  Jenssen, Robert
dc.contributor.author  Løkse, Sigurd
dc.date.accessioned  2020-11-24T10:46:19Z
dc.date.available  2020-11-24T10:46:19Z
dc.date.issued  2020-12-11
dc.description.abstract  <p>Kernel methods have been a central part of the machine learning arsenal for several decades. Within this framework, unsupervised learning has been a particularly challenging area. This is due to the inherent nature of unsupervised learning tasks, where important information about the structure of the data is unknown to the user, making it difficult to design a kernel or system to solve the problem at hand. This thesis aims to bridge this knowledge gap on a multitude of challenges within the field. <p>Firstly, we address an important challenge within kernel methods for unsupervised learning, namely that of kernel parameter sensitivity. The process of finding the best parameter for the problem at hand usually depends on information which is unavailable in unsupervised tasks. Inspired by ideas from ensemble learning, we design a kernel for vectorial data with missing elements that automatically adapts to the inherent structures of the data, in order to decouple the parameter choice from the problem at hand. We perform experiments on spectral clustering and unsupervised ranking tasks with promising results. <p>Next, we develop a kernelized approximation method for unsupervised ranking using the Personalized PageRank (PPR). This method is based on novel insights on the PPR, which, through a PPR-specific low-rank representation akin to Kernel PCA, naturally lead to an out-of-sample approximation. The method is based on the spectrum of a specific matrix. We provide error bounds for the approximation, which are used to order the eigenvectors/eigenvalues to minimize the approximation error. We perform a range of experiments to support our novel insights and show that our method may even outperform the PPR. <p>The final part of this thesis synergistically combines kernel methods and neural networks in various unsupervised tasks. 
Firstly, we design a kernelized autoencoder that incorporates similarities between data points through a kernel function in order to learn meaningful representations in code space. Secondly, we propose a novel deep learning approach to clustering, utilizing kernel-based information-theoretic losses, with promising experimental results when compared to state-of-the-art methods on challenging problems. Finally, we incorporate an unsupervised dimensionality reduction method (e.g. Kernel PCA) between the reservoir and readout layer of an Echo State Network in order to capture the dynamics of the time series while reducing the dimensionality of the trainable parameters and improving accuracy.  en_US
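The abstract's low-rank spectral view of Personalized PageRank can be illustrated with a generic sketch. This is not the thesis's Kernel PPR method (Paper II's specific matrix, kernel construction, and ordering criterion are not given here); it is a standard spectral approximation, assuming a symmetrically normalized adjacency matrix and a toy graph, where components are ranked by the weight each eigenvalue receives in the matrix inverse:

```python
import numpy as np

# Toy undirected graph (5 nodes) as an adjacency matrix.
A = np.array([
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 1, 0, 1],
    [0, 0, 0, 1, 0],
], dtype=float)

alpha = 0.85                       # teleportation (damping) parameter
d = A.sum(axis=1)
D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
S = D_inv_sqrt @ A @ D_inv_sqrt    # symmetrically normalized adjacency

# Exact PPR-style scores for a seed node u:
#   pi = (1 - alpha) * (I - alpha * S)^{-1} e_u
u = 0
e_u = np.zeros(5)
e_u[u] = 1.0
pi_exact = (1 - alpha) * np.linalg.solve(np.eye(5) - alpha * S, e_u)

# Low-rank approximation: S is symmetric, so eigh gives orthonormal
# eigenvectors. In the eigenbasis the inverse acts as 1 / (1 - alpha * lam),
# so keeping the k eigenpairs with the largest such weights minimizes the
# dropped mass -- the "ordering of eigenvectors/eigenvalues" idea.
lam, U = np.linalg.eigh(S)
w = 1.0 / (1.0 - alpha * lam)
keep = np.argsort(-np.abs(w))[:3]  # keep k = 3 of 5 components
pi_approx = (1 - alpha) * (U[:, keep] * w[keep]) @ (U[:, keep].T @ e_u)

print(np.round(pi_exact, 4))
print(np.round(pi_approx, 4))
print("max abs error:", np.max(np.abs(pi_exact - pi_approx)))
```

Because the eigendecomposition is computed once, new seed vectors `e_u` can be scored by two small matrix-vector products, which is the sense in which such a representation yields an out-of-sample approximation.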
dc.description.doctoraltype  ph.d.  en_US
dc.description.popularabstract  Within the field of machine learning, unsupervised learning refers to learning with no (or minimal) support from labels or prior knowledge. With large amounts of unlabeled data available, and manual labelling of data being resource intensive, unsupervised learning will be increasingly important for machine learning in the future, with applications in e.g. medical data, marketing, logistics and identifying fraudulent activity. To this end, we address several challenges within unsupervised learning and develop novel methodology by leveraging so-called kernels and kernel methods. The contributions of this thesis are twofold. Firstly, we focus on pure kernel methods and develop a kernel for data with missing elements which automatically adapts to the inherent structures in the data. In addition, we develop new kernel-based ranking methodology. Secondly, we propose several novel deep learning methods which are fused with kernels in order to solve unsupervised learning tasks.  en_US
dc.identifier.isbn  978-82-8236-414-0 (print), 978-82-8236-415-7 (PDF)
dc.identifier.uri  https://hdl.handle.net/10037/19911
dc.language.iso  eng  en_US
dc.publisher  UiT Norges arktiske universitet  en_US
dc.publisher  UiT The Arctic University of Norway  en_US
dc.relation.haspart  <p>Paper I: Løkse, S., Bianchi, F.M., Salberg, A.-B. & Jenssen, R. Unsupervised learning using <i>PCKID</i> - A Probabilistic Cluster Kernel for Incomplete Data. (Submitted manuscript). <p>Paper II: Løkse, S. & Jenssen, R. Kernel Personalized PageRank. (Submitted manuscript). <p>Paper III: Kampffmeyer, M., Løkse, S., Bianchi, F.M., Jenssen, R. & Livi, L. (2018). The deep kernelized autoencoder. <i>Applied Soft Computing, 71</i>, 816-825. Also available at <a href=https://doi.org/10.1016/j.asoc.2018.07.029>https://doi.org/10.1016/j.asoc.2018.07.029</a>. Accepted manuscript version available in Munin at <a href=https://hdl.handle.net/10037/16437>https://hdl.handle.net/10037/16437</a>. <p>Paper IV: Kampffmeyer, M., Løkse, S., Bianchi, F.M., Livi, L., Salberg, A.-B. & Jenssen, R. (2019). Deep divergence-based approach to clustering. <i>Neural Networks, 113</i>, 91-101. Also available in Munin at <a href=https://hdl.handle.net/10037/17759>https://hdl.handle.net/10037/17759</a>. <p>Paper V: Løkse, S., Bianchi, F.M. & Jenssen, R. (2017). Training Echo State Networks with Regularization through Dimensionality Reduction. <i>Cognitive Computation, 9</i>, 364–378. Also available at <a href=https://doi.org/10.1007/s12559-017-9450-z>https://doi.org/10.1007/s12559-017-9450-z</a>. Submitted manuscript version available in Munin at <a href=https://hdl.handle.net/10037/13086>https://hdl.handle.net/10037/13086</a>.  en_US
dc.rights.accessRights  openAccess  en_US
dc.rights.holder  Copyright 2020 The Author(s)
dc.subject.courseID  DOKTOR-004
dc.subject  Machine Learning  en_US
dc.subject  VDP::Technology: 500::Information and communication technology: 550  en_US
dc.subject  VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550  en_US
dc.title  Leveraging Kernels for Unsupervised Learning  en_US
dc.type  Doctoral thesis  en_US
dc.type  Doktorgradsavhandling  en_US


Associated file(s)


This item appears in the following collection(s)
