A multi-centre polyp detection and segmentation dataset for generalisability assessment
Permanent link
https://hdl.handle.net/10037/28669Date
2023-02-06Type
Journal articleTidsskriftartikkel
Peer reviewed
Author
Ali, Sharib; Jha, Debesh; Ghatwary, Noha; Realdon, Stefano; Cannizzaro, Renato; Salem, Osama E.; Lamarque, Dominique; Daul, Christian; Riegler, Michael Alexander; Ånonsen, Kim Vidar; Petlund, Andreas; Halvorsen, Pål; Rittscher, Jens; de Lange, Thomas; East, James EAbstract
Polyps in the colon are widely known cancer precursors identifed by colonoscopy. Whilst most polyps
are benign, the polyp’s number, size and surface structure are linked to the risk of colon cancer.
Several methods have been developed to automate polyp detection and segmentation. However,
the main issue is that they are not tested rigorously on a large multicentre purpose-built dataset, one
reason being the lack of a comprehensive public dataset. As a result, the developed methods may not
generalise to diferent population datasets. To this extent, we have curated a dataset from six unique
centres incorporating more than 300 patients. The dataset includes both single frame and sequence
data with 3762 annotated polyp labels with precise delineation of polyp boundaries verifed by six
senior gastroenterologists. To our knowledge, this is the most comprehensive detection and pixellevel segmentation dataset (referred to as PolypGen) curated by a team of computational scientists
and expert gastroenterologists. The paper provides insight into data construction and annotation
strategies, quality assurance, and technical validation.
Publisher
Springer NatureCitation
Ali, Jha, Ghatwary, Realdon, Cannizzaro, Salem, Lamarque, Daul, Riegler, Ånonsen, Petlund, Halvorsen, Rittscher, de Lange, East. A multi-centre polyp detection and segmentation dataset for generalisability assessment. Scientific Data. 2023;10Metadata
Show full item recordCollections
Copyright 2023 The Author(s)