dc.contributor.author | Jha, Debesh | |
dc.contributor.author | Riegler, Michael Alexander | |
dc.contributor.author | Johansen, Dag | |
dc.contributor.author | Halvorsen, Pål | |
dc.contributor.author | Johansen, Håvard D. | |
dc.date.accessioned | 2023-03-27T11:04:16Z | |
dc.date.available | 2023-03-27T11:04:16Z | |
dc.date.issued | 2020-09-01 | |
dc.description.abstract | Semantic image segmentation is the process of labeling each pixel of an image with its corresponding class. An encoder-decoder based approach, like U-Net and its variants, is a popular strategy for solving medical image segmentation tasks. To improve the performance of U-Net on various segmentation tasks, we propose a novel architecture called DoubleU-Net, which is a combination of two U-Net architectures stacked on top of each other. The first U-Net uses a pre-trained VGG-19 as the encoder, which has already learned features from ImageNet and can be transferred to another task easily. To capture more semantic information efficiently, we added another U-Net at the bottom. We also adopt Atrous Spatial Pyramid Pooling (ASPP) to capture contextual information within the network. We have evaluated DoubleU-Net using four medical segmentation datasets, covering various imaging modalities such as colonoscopy, dermoscopy, and microscopy. Experiments on the MICCAI 2015 segmentation challenge, the CVC-ClinicDB, the 2018 Data Science Bowl challenge, and the Lesion boundary segmentation datasets demonstrate that the DoubleU-Net outperforms U-Net and the baseline models. Moreover, DoubleU-Net produces more accurate segmentation masks, especially in the case of the CVC-ClinicDB and MICCAI 2015 segmentation challenge datasets, which have challenging images such as smaller and flat polyps. These results show the improvement over the existing U-Net model. The encouraging results, produced on various medical image segmentation datasets, show that DoubleU-Net can be used as a strong baseline for both medical image segmentation and cross-dataset evaluation testing to measure the generalizability of Deep Learning (DL) models. | en_US |
dc.identifier.citation | Jha, Riegler, Johansen, Halvorsen, Johansen. DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation. IEEE International Symposium on Computer-Based Medical Systems. 2020 | en_US |
dc.identifier.cristinID | FRIDAID 1835631 | |
dc.identifier.doi | 10.1109/CBMS49503.2020.00111 | |
dc.identifier.issn | 2372-9198 | |
dc.identifier.uri | https://hdl.handle.net/10037/28864 | |
dc.language.iso | eng | en_US |
dc.publisher | IEEE | en_US |
dc.relation.journal | IEEE International Symposium on Computer-Based Medical Systems | |
dc.relation.projectID | Norges forskningsråd: 263248 | en_US |
dc.rights.accessRights | openAccess | en_US |
dc.rights.holder | Copyright 2020 The Author(s) | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0 | en_US |
dc.rights | Attribution 4.0 International (CC BY 4.0) | en_US |
dc.subject | VDP::Matematikk og naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420 | en_US |
dc.subject | VDP::Mathematics and natural scienses: 400::Information and communication science: 420 | en_US |
dc.subject | Fordøyelseskanalen / Gastrointestinal Tract | en_US |
dc.subject | Mage-tarmsykdommer / gastrointestinale sykdommer / Gastrointestinal Diseases | en_US |
dc.subject | Maskinlæring / Machine learning | en_US |
dc.title | DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation | en_US |
dc.type.version | acceptedVersion | en_US |
dc.type | Journal article | en_US |
dc.type | Tidsskriftartikkel | en_US |
dc.type | Peer reviewed | en_US |