dc.contributor.author | Yigzaw, Kassaye Yitbarek | |
dc.contributor.author | Michalas, Antonis | |
dc.contributor.author | Bellika, Johan Gustav | |
dc.date.accessioned | 2017-03-01T11:21:21Z | |
dc.date.available | 2017-03-01T11:21:21Z | |
dc.date.issued | 2016-08-12 | |
dc.description.abstract | Collecting data via a questionnaire and analyzing them while preserving respondents' privacy may increase the number of respondents and the truthfulness of their responses. It may also reduce the systematic differences between respondents and non-respondents. In this paper, we propose a privacy- preserving method for collecting and analyzing survey responses using secure multi-party computation. The method is secure under the semi-honest adversarial model. The proposed method computes a wide variety of statistics. Total and stratified statistical counts are computed using the secure protocols developed in this paper. Then, additional statistics, such as a contingency table, a chi-square test, an odds ratio, and logistic regression, are computed within the R statistical environment using the statistical counts as building blocks. The method was evaluated on a questionnaire data set of 3158 respondents sampled for a medical study and simulated questionnaire data sets of up to 50 000 respondents. The computation time for the statistical analyses linearly scales as the number of respondents increases. The results show that the method is efficient and scalable for practical use. It can also be used for other applications in which categorical data are collected. | en_US |
dc.description | Source: <a href=http://dx.doi.org/10.1109/ACCESS.2016.2599851>doi: 10.1109/ACCESS.2016.2599851</a> | en_US |
dc.identifier.citation | Yigzaw KY, Michalas A, Bellika JG. Secure and scalable statistical computation of questionnaire data in R. IEEE Access. 2016;4:4635-4645 | en_US |
dc.identifier.cristinID | FRIDAID 1397197 | |
dc.identifier.doi | 10.1109/ACCESS.2016.2599851 | |
dc.identifier.issn | 2169-3536 | |
dc.identifier.uri | https://hdl.handle.net/10037/10398 | |
dc.language.iso | eng | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers | en_US |
dc.relation.journal | IEEE Access | |
dc.relation.projectID | info:eu-repo/grantAgreement/RCN/IKTPLUSS/ 248150/Norway/ Assessing the feasibility of the Learning Healthcare System toolbox | en_US |
dc.relation.uri | http://ieeexplore.ieee.org/document/7542506/ | |
dc.rights.accessRights | openAccess | en_US |
dc.subject | bloom filter | en_US |
dc.subject | privacy | en_US |
dc.subject | questionnaire | en_US |
dc.subject | statistical analysis | en_US |
dc.subject | secure multi-party computation | en_US |
dc.subject | secret sharing | en_US |
dc.subject | VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Sikkerhet og sårbarhet: 424 | en_US |
dc.subject | VDP::Mathematics and natural science: 400::Information and communication science: 420::Security and vulnerability: 424 | en_US |
dc.subject | VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550 | en_US |
dc.subject | VDP::Technology: 500::Information and communication technology: 550 | en_US |
dc.title | Secure and scalable statistical computation of questionnaire data in R | en_US |
dc.type | Journal article | en_US |
dc.type | Tidsskriftartikkel | en_US |
dc.type | Peer reviewed | en_US |