dc.contributor.advisor | Jenssen, Robert | |
dc.contributor.author | Vikjord, Vidar Vangen | |
dc.date.accessioned | 2012-11-02T10:00:39Z | |
dc.date.available | 2012-11-02T10:00:39Z | |
dc.date.issued | 2012-06-01 | |
dc.description.abstract | The machine learning field based on information theory has received a lot of attention in recent years. Through kernel estimation of the probability density functions, methods developed with information theoretic measures are able to use all the statistical information available in the data, not just a finite number of moments. However, by using kernel estimation, the methods are dependent on choosing a suitable bandwidth parameter and have trouble dealing with data which vary on different scales.
In this thesis, the field of information theoretic learning has been explored using k-nearest neighbor estimates for the probability density functions instead. The developed estimators of the information theoretic measures was used in a clustering routine and compared with the traditional kernel estimators.Performing clustering on a range of datasets and comparing the performance, the new method proved to provide superior results without the need of tuning any parameters. The performance difference was found to be especially large when clustering datasets where groups were on different scales. | en |
dc.identifier.uri | https://hdl.handle.net/10037/4608 | |
dc.identifier.urn | URN:NBN:no-uit_munin_4324 | |
dc.language.iso | eng | en |
dc.publisher | Universitetet i Tromsø | en |
dc.publisher | University of Tromsø | en |
dc.rights.accessRights | openAccess | |
dc.rights.holder | Copyright 2012 The Author(s) | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/3.0 | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) | en_US |
dc.subject.courseID | FYS-3921 | en |
dc.subject | VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Kunnskapsbaserte systemer: 425 | en |
dc.subject | VDP::Mathematics and natural science: 400::Information and communication science: 420::Knowledge based systems: 425 | en |
dc.title | Information theoretic learning with K nearest neighbors :
a new clustering algorithm | en |
dc.type | Master thesis | en |
dc.type | Mastergradsoppgave | en |