Show simple item record

dc.contributor.authorTafavvoghi, Masoud
dc.contributor.authorBongo, Lars Ailo Aslaksen
dc.contributor.authorShvetsov, Nikita
dc.contributor.authorBusund, Lill-Tove Rasmussen
dc.contributor.authorMøllersen, Kajsa
dc.date.accessioned2024-08-23T11:56:08Z
dc.date.available2024-08-23T11:56:08Z
dc.date.issued2024-02-01
dc.description.abstractAdvancements in digital pathology and computing resources have made a significant impact in the field of computational pathology for breast cancer diagnosis and treatment. However, access to high-quality labeled histopathological images of breast cancer is a big challenge that limits the development of accurate and robust deep learning models. In this scoping review, we identified the publicly available datasets of breast H&E-stained whole-slide images (WSIs) that can be used to develop deep learning algorithms. We systematically searched 9 scientific literature databases and 9 research data repositories and found 17 publicly available datasets containing 10 385 H&E WSIs of breast cancer. Moreover, we reported image metadata and characteristics for each dataset to assist researchers in selecting proper datasets for specific tasks in breast cancer computational pathology. In addition, we compiled 2 lists of breast H&E patches and private datasets as supplementary resources for researchers. Notably, only 28% of the included articles utilized multiple datasets, and only 14% used an external validation set, suggesting that the performance of other developed models may be susceptible to overestimation. The TCGA-BRCA was used in 52% of the selected studies. This dataset has a considerable selection bias that can impact the robustness and generalizability of the trained algorithms. There is also a lack of consistent metadata reporting of breast WSI datasets that can be an issue in developing accurate deep learning models, indicating the necessity of establishing explicit guidelines for documenting breast WSI dataset characteristics and metadata.en_US
dc.identifier.citationTafavvoghi, Bongo, Shvetsov, Busund, Møllersen. Publicly available datasets of breast histopathology H&E whole-slide images: A scoping review. Journal of Pathology Informatics. 2024;15en_US
dc.identifier.cristinIDFRIDAID 2252303
dc.identifier.doi10.1016/j.jpi.2024.100363
dc.identifier.issn2229-5089
dc.identifier.issn2153-3539
dc.identifier.urihttps://hdl.handle.net/10037/34411
dc.language.isoengen_US
dc.publisherElsevieren_US
dc.relation.journalJournal of Pathology Informatics
dc.rights.accessRightsopenAccessen_US
dc.rights.holderCopyright 2024 The Author(s)en_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0en_US
dc.rightsAttribution 4.0 International (CC BY 4.0)en_US
dc.titlePublicly available datasets of breast histopathology H&E whole-slide images: A scoping reviewen_US
dc.type.versionpublishedVersionen_US
dc.typeJournal articleen_US
dc.typeTidsskriftartikkelen_US
dc.typePeer revieweden_US


File(s) in this item

Thumbnail

This item appears in the following collection(s)

Show simple item record

Attribution 4.0 International (CC BY 4.0)
Except where otherwise noted, this item's license is described as Attribution 4.0 International (CC BY 4.0)