dc.contributor.author | Gautam, Srishti | |
dc.contributor.author | Hohne, Marina Marie-Claire | |
dc.contributor.author | Hansen, Stine | |
dc.contributor.author | Jenssen, Robert | |
dc.contributor.author | Kampffmeyer, Michael | |
dc.date.accessioned | 2022-11-30T09:32:24Z | |
dc.date.available | 2022-11-30T09:32:24Z | |
dc.date.issued | 2022-11-12 | |
dc.description.abstract | Current machine learning models have shown high efficiency in solving a wide variety of real-world problems. However, their black box character poses a major challenge for the comprehensibility and traceability of the underlying decision-making strategies. As a remedy, numerous post-hoc and self-explanation methods have been developed to interpret the models’ behavior. Those methods, in addition, enable the identification of artifacts that, inherent in the training data, can be erroneously learned by the model as class-relevant features. In this work, we provide a detailed case study of a representative for the state-of-the-art self-explaining network, ProtoPNet, in the presence of a spectrum of artifacts. Accordingly, we identify the main drawbacks of ProtoPNet, especially its coarse and spatially imprecise explanations. We address these limitations by introducing Prototypical Relevance Propagation (PRP), a novel method for generating more precise model-aware explanations. Furthermore, in order to obtain a clean, artifact-free dataset, we propose to use multi-view clustering strategies for segregating the artifact images using the PRP explanations, thereby suppressing the potential artifact learning in the models. | en_US |
dc.identifier.citation | Gautam S, Hohne MM, Hansen S, Jenssen R, Kampffmeyer MC. This looks more like that: Enhancing Self-Explaining Models by Prototypical Relevance Propagation. Pattern Recognition. 2022 | en_US |
dc.identifier.cristinID | FRIDAID 2084854 | |
dc.identifier.doi | https://doi.org/10.1016/j.patcog.2022.109172 | |
dc.identifier.issn | 0031-3203 | |
dc.identifier.issn | 1873-5142 | |
dc.identifier.uri | https://hdl.handle.net/10037/27611 | |
dc.language.iso | eng | en_US |
dc.publisher | Elsevier | en_US |
dc.relation.ispartof | Gautam, S. (2024). Towards Interpretable, Trustworthy and Reliable AI. (Doctoral thesis). <a href=https://hdl.handle.net/10037/33143>https://hdl.handle.net/10037/33143</a>. | |
dc.relation.journal | Pattern Recognition | |
dc.rights.accessRights | openAccess | en_US |
dc.rights.holder | Copyright 2022 The Author(s) | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0 | en_US |
dc.rights | Attribution 4.0 International (CC BY 4.0) | en_US |
dc.title | This looks more like that: Enhancing Self-Explaining Models by Prototypical Relevance Propagation | en_US |
dc.type.version | publishedVersion | en_US |
dc.type | Journal article | en_US |
dc.type | Tidsskriftartikkel | en_US |
dc.type | Peer reviewed | en_US |