Vis enkel innførsel

dc.contributor.advisorAilo Bongo, Lars
dc.contributor.advisorJuselius, Jonas
dc.contributor.authorLau, Ka Hin
dc.date.accessioned2022-08-03T05:34:57Z
dc.date.available2022-08-03T05:34:57Z
dc.date.issued2022-05-15en
dc.description.abstractIn large simulations, like predicting the movement of ocean particles, it is common that simulation executions are related when they share one or more inputs. When the number of simulations increases, it becomes harder for users who run the simulations to keep track of all the simulations. Also, more storage spaces are wasted if there are multiple copies of the same input files. This thesis describes a system that collects data from previous simulations, allowing users to search for the data they need to run the next simulation. Also, the system identifies the same files that were used in previous simulations, which allows users to re-use these files instead of copying the files to a new simulation folder to use them. Among the simulations that were executed in our current environment, the system identifies around 11\% of input files that are shared by the simulations. Users can refer to the same file to use it instead of copying the file to new simulation folders. The conclusion is that the system helps users who run simulations to reduce their efforts and time to find input files that are used in previous simulations when they set up for a new simulation. Also, it saves storage space on the computing cluster where the simulations run on by identifying the duplicated data.en_US
dc.identifier.urihttps://hdl.handle.net/10037/25914
dc.language.isoengen_US
dc.publisherUiT Norges arktiske universitetno
dc.publisherUiT The Arctic University of Norwayen
dc.rights.holderCopyright 2022 The Author(s)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0en_US
dc.rightsAttribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)en_US
dc.subject.courseIDINF-3990
dc.subjectVDP::Technology: 500::Information and communication technology: 550::Computer technology: 551en_US
dc.subjectVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550::Datateknologi: 551en_US
dc.titleManagement of large geospatial datasetsen_US
dc.typeMastergradsoppgaveno
dc.typeMaster thesisen


Tilhørende fil(er)

Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Med mindre det står noe annet, er denne innførselens lisens beskrevet som Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)