Vis enkel innførsel

dc.contributor.authorNgo, Phuong Dinh
dc.contributor.authorTejedor Hernandez, Miguel Angel
dc.contributor.authorOlsen Svenning, Therese
dc.contributor.authorChomutare, Taridzo Fred
dc.contributor.authorBudrionis, Andrius
dc.contributor.authorDalianis, Hercules
dc.date.accessioned2024-04-17T10:18:13Z
dc.date.available2024-04-17T10:18:13Z
dc.date.issued2024
dc.description.abstractThis study discusses the methods and challenges of deidentifying and pseudonymizing Norwegian clinical text for research purposes. The results of the NorDeid tool for deidentification and pseudonymization on different types of protected health information were evaluated and discussed, as well as the extension of its functionality with regular expressions to identify specific types of sensitive information. This research used a clinical corpus of adult patients treated in a gastro-surgical department in Norway, which contains approximately nine million clinical notes. The study also highlights the challenges posed by the unique language and clinical terminology of Norway and emphasizes the importance of protecting privacy and the need for customized approaches to meet legal and research requirements.en_US
dc.descriptionSource at <a href=https://aclanthology.org/2024.caldpseudo-1.0>https://aclanthology.org/2024.caldpseudo-1.0</a>.en_US
dc.identifier.citationNgo, Tejedor Hernandez, Olsen Svenning, Chomutare, Budrionis, Dalianis. Deidentifying a Norwegian clinical corpus - An effort to create a privacy-preserving Norwegian large clinical language model. Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo 2024). 2024en_US
dc.identifier.cristinIDFRIDAID 2262189
dc.identifier.urihttps://hdl.handle.net/10037/33415
dc.language.isoengen_US
dc.publisherACLen_US
dc.relation.journalProceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo 2024)
dc.rights.accessRightsopenAccessen_US
dc.rights.holderCopyright 2024 The Author(s)en_US
dc.titleDeidentifying a Norwegian clinical corpus - An effort to create a privacy-preserving Norwegian large clinical language modelen_US
dc.type.versionacceptedVersionen_US
dc.typeJournal articleen_US
dc.typeTidsskriftartikkelen_US
dc.typePeer revieweden_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel