Vis enkel innførsel

dc.contributor.authorMikalsen, Karl Øyvind
dc.contributor.authorRuiz, Cristina Soguero
dc.contributor.authorBianchi, Filippo Maria
dc.contributor.authorRevhaug, Arthur
dc.contributor.authorJenssen, Robert
dc.date.accessioned2021-11-10T09:37:07Z
dc.date.available2021-11-10T09:37:07Z
dc.date.issued2021-02-20
dc.description.abstractThe time series cluster kernel (TCK) provides a powerful tool for analysing multivariate time series subject to missing data. TCK is designed using an ensemble learning approach in which Bayesian mixture models form the base models. Because of the Bayesian approach, TCK can naturally deal with missing values without resorting to imputation and the ensemble strategy ensures robustness to hyperparameters, making it particularly well suited for unsupervised learning. However, TCK assumes missing at random and that the underlying missingness mechanism is ignorable, i.e. uninformative, an assumption that does not hold in many real-world applications, such as e.g. medicine. To overcome this limitation, we present a kernel capable of exploiting the potentially rich information in the missing values and patterns, as well as the information from the observed data. In our approach, we create a representation of the missing pattern, which is incorporated into mixed mode mixture models in such a way that the information provided by the missing patterns is effectively exploited. Moreover, we also propose a semi-supervised kernel, capable of taking advantage of incomplete label information to learn more accurate similarities. Experiments on benchmark data, as well as a real-world case study of patients described by longitudinal electronic health record data who potentially suffer from hospital-acquired infections, demonstrate the effectiveness of the proposed methodsen_US
dc.identifier.citationMikalsen KØ, Ruiz CS, Bianchi FM, Revhaug A, Jenssen R. Time series cluster kernels to exploit informative missingness and incomplete label information. Pattern Recognition. 2021en_US
dc.identifier.cristinIDFRIDAID 1892147
dc.identifier.doi10.1016/j.patcog.2021.107896
dc.identifier.issn0031-3203
dc.identifier.issn1873-5142
dc.identifier.urihttps://hdl.handle.net/10037/22966
dc.language.isoengen_US
dc.publisherElsevieren_US
dc.relation.journalPattern Recognition
dc.relation.projectIDinfo:eu-repo/grantAgreement/RCN/IKTPLUSS-IKT/239844/Norway/Next Generation Kernel-Based Machine Learning for Big Missing Data Applied to Earth Observation//en_US
dc.rights.accessRightsopenAccessen_US
dc.rights.holderCopyright 2021 The Author(s)en_US
dc.subjectVDP::Mathematics and natural science: 400::Physics: 430en_US
dc.subjectVDP::Matematikk og Naturvitenskap: 400::Fysikk: 430en_US
dc.titleTime series cluster kernels to exploit informative missingness and incomplete label informationen_US
dc.type.versionpublishedVersionen_US
dc.typeJournal articleen_US
dc.typeTidsskriftartikkelen_US
dc.typePeer revieweden_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel