ub.xmlui.mirage2.page-structure.muninLogoub.xmlui.mirage2.page-structure.openResearchArchiveLogo
    • EnglishEnglish
    • norsknorsk
  • Velg spraakEnglish 
    • EnglishEnglish
    • norsknorsk
  • Administration/UB
View Item 
  •   Home
  • Fakultet for naturvitenskap og teknologi
  • Institutt for informatikk
  • Artikler, rapporter og annet (informatikk)
  • View Item
  •   Home
  • Fakultet for naturvitenskap og teknologi
  • Institutt for informatikk
  • Artikler, rapporter og annet (informatikk)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

De-identifying Norwegian Clinical Text using Resources from Swedish and Danish

Permanent link
https://hdl.handle.net/10037/32958
Thumbnail
View/Open
article.pdf (194.7Kb)
Submitted manuscript version (PDF)
Date
2023
Type
Journal article
Tidsskriftartikkel

Author
Lamproudis, Anastasios; Mora, Sara; Olsen Svenning, Therese; Torsvik, Torbjørn; Chomutare, Taridzo Fred; Ngo, Phuong Dinh; Dalianis, Hercules
Abstract
The lack of relevant annotated datasets represents one key limitation in the application of Natural Language Processing techniques in a broad number of tasks, among them Protected Health Information (PHI) identification in Norwegian clinical text. In this work, the possibility of exploiting resources from Swedish, a very closely related language, to Norwegian is explored. The Swedish dataset is annotated with PHI information. Different processing and text augmentation techniques are evaluated, along with their impact in the final performance of the model. The augmentation techniques, such as injection and generation of both Norwegian and Scandinavian Named Entities into the Swedish training corpus, showed to increase the performance in the de-identification task for both Danish and Norwegian text.This trend was also confirmed by the evaluation of model performance on a sample Norwegian gastro surgical clinical text.
Description
Source at https://knowledge.amia.org/event-data.
Citation
Lamproudis, Mora, Olsen Svenning, Torsvik, Chomutare, Ngo, Dalianis. De-identifying Norwegian Clinical Text using Resources from Swedish and Danish. AMIA Annual Symposium Proceedings. 2023;2023:456-464
Metadata
Show full item record
Collections
  • Artikler, rapporter og annet (informatikk) [481]
Copyright 2023 The Author(s)

Browse

Browse all of MuninCommunities & CollectionsAuthor listTitlesBy Issue DateBrowse this CollectionAuthor listTitlesBy Issue Date
Login

Statistics

View Usage Statistics
UiT

Munin is powered by DSpace

UiT The Arctic University of Norway
The University Library
uit.no/ub - munin@ub.uit.no

Accessibility statement (Norwegian only)