Vis enkel innførsel

dc.contributor.advisorSvendsen, Kristian
dc.contributor.authorCheang, Chi Kei
dc.date.accessioned2021-05-14T05:45:38Z
dc.date.available2021-05-14T05:45:38Z
dc.date.issued2019-05-14en
dc.description.abstractBackground: The pressure to share more data and being more transparency of clinical study reports has grown and becomes an important topic in recent years. Before clinical data and clinical results can be shared they must undergo anonymization. How anonymization of clinical data affects the utility is poorly-studied, especially in pharmacoepidemiology. Objective: The aim of the study is to describe and evaluate how anonymization of simulated clinical data will affect the data utility of pharmacoepidemiological analyses of these data. Method: We have simulated five clinical datasets with different characteristics, associations, types of outcome and study populations. Suppression, generalization, randomization and k-anonymity were used as our anonymization approaches. These methods will be evaluated by the change in the data and statistical results before and after anonymization. Result: K-anonymity and suppression were the methods that affected the simulated clinical data the most, while generalization and randomization affected the data least. With k-anonymity and suppression there is a risk to overestimating the clinical results due to the elimination of unique records. On the other hand, generalization and randomization preserved the most data utility but they were less effective in anonymizing the data. Conclusion: Our study revealed that different anonymization approaches can affect the clinical results differently. The more we anonymize a record or attribute, the less utility is provided. It is therefore important to construct a balance of data utility and effectiveness of anonymization before the clinical data are published. More investigations about how anonymization of clinical data affects data utility are needed in order to maximize the benefit of using anonymized clinical data to improve public health.en_US
dc.identifier.urihttps://hdl.handle.net/10037/21177
dc.language.isoengen_US
dc.publisherUiT Norges arktiske universitetno
dc.publisherUiT The Arctic University of Norwayen
dc.rights.holderCopyright 2019 The Author(s)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0en_US
dc.rightsAttribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)en_US
dc.subject.courseIDFAR-3911
dc.subjectVDP::Medisinske Fag: 700::Helsefag: 800::Samfunnsfarmasi: 812en_US
dc.subjectVDP::Medical disciplines: 700::Health sciences: 800::Community pharmacy: 812en_US
dc.subjectVDP::Medisinske Fag: 700::Helsefag: 800::Andre helsefag: 829en_US
dc.subjectVDP::Medical disciplines: 700::Health sciences: 800::Other health science disciplines: 829en_US
dc.titleHow will anonymization of simulated clinical data affect the data utility of pharmacoepidemiological studies?en_US
dc.typeMastergradsoppgaveno
dc.typeMaster thesisen


Tilhørende fil(er)

Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Med mindre det står noe annet, er denne innførselens lisens beskrevet som Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)