dc.contributor.advisor | Pedersen, Edvard | |
dc.contributor.advisor | Olav Melberg, Hans | |
dc.contributor.author | Søreide, Anders | |
dc.date.accessioned | 2024-08-09T05:34:36Z | |
dc.date.available | 2024-08-09T05:34:36Z | |
dc.date.issued | 2024-05-15 | en |
dc.description.abstract | Querying and exploring health data can lead to the discovery of new rela-
tions between conditions, medications, hospital events, etc. For this purpose,
temporal health queries are useful since the order in which events happen is
important.
Many of the querying tools available do not address the unique needs of
temporal health queries, making these queries difficult and time-consuming
to perform. One tool made for this purpose, Snotra, enables temporal health
queries with a syntax that is human-readable and easy to understand and
write. Problems in the technical implementation and underlying architecture
of Snotra currently prevent it from being used to query large datasets from
health registers.
By implementing a subset of Snotra operations we can compare to design a
new underlying engine for Snotra to handle larger datasets. This thesis ex-
plores possible avenues to fix the underlying architecture of Snotra, comparing
a selection of approaches including SQL, Dataframes, and custom low-level
querying functions. The most promising approach is further developed into a
prototype supporting a small subset of Snotra operations.
This work shows how Polars extended with custom Rust query functions
is a viable path for implementing performant and scalable temporal health
queries. | en_US |
dc.identifier.uri | https://hdl.handle.net/10037/34239 | |
dc.language.iso | eng | en_US |
dc.publisher | UiT Norges arktiske universitet | no |
dc.publisher | UiT The Arctic University of Norway | en |
dc.rights.holder | Copyright 2024 The Author(s) | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0 | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | en_US |
dc.subject.courseID | INF-3990 | |
dc.subject | Temporal health queries | en_US |
dc.subject | Dataframes | en_US |
dc.subject | Health registers | en_US |
dc.title | A before B: Investigations into how best to perform Temporal Health Queries | en_US |
dc.type | Mastergradsoppgave | no |
dc.type | Master thesis | en |