Introducing Soft Option-Critic for Blood Glucose Control in Type 1 Diabetes : Exploiting Abstraction of Actions for Automated Insulin Administration

Jenssen, Christian

dc.contributor.advisor	Myhre, Jonas
dc.contributor.author	Jenssen, Christian
dc.date.accessioned	2020-10-07T19:31:20Z
dc.date.available	2020-10-07T19:31:20Z
dc.date.issued	2020-07-15
dc.description.abstract	Type 1 Diabetes (T1D) is an autoimmune disease where the insulin-producing cells are damaged and unable to produce sufficient amounts of insulin, causing an inability to regulate the body's blood sugar levels. Administrating insulin is necessary for blood glucose regulation, requiring diligent and continuous care from the patient to avoid critical health risks. The dynamics governing insulin-glucose are complex, where aspects such as diet, exercise and sleep have a substantial effect, making it a difficult burden for the patient. Reinforcement learning (RL) has been proposed as a solution for automated insulin administration, with the potential to learn personalized solutions for insulin control adapted to the patient. In this thesis policy-based RL-methods for T1D management are investigated and a new method is developed; Soft option-critic (SOC) is designed to better account for differing situations affecting the blood glucose, using temporally extended actions called options. Further extensions of the method are implemented, using key elements from deep Q-learning algorithms. The experiments are twofold; Several experiments are conducted to thoroughly assess the performance of SOC and its extensions on T1D in-silico patients: The first part of the experiments are done on the already solved environment lunar lander (LL) to analyze the merits of using options in the SOC-formulation. The second part consists of the diabetes experiments using a insulin-glucose simulator including scenarios with varying meals and bolus. The results show that SOC and its extension outperforms the benchmark algorithms on LL, learning options for improved sample-efficiency. On the diabetes experiments they performed comparable to the best benchmark model, beating the optimal baseline control method. The resulting policy was able to predict and account for meals, improving time-in-range (TIR) substantially.	en_US
dc.identifier.uri	https://hdl.handle.net/10037/19549
dc.language.iso	eng	en_US
dc.publisher	UiT Norges arktiske universitet	en_US
dc.publisher	UiT The Arctic University of Norway	en_US
dc.rights.accessRights	openAccess	en_US
dc.rights.holder	Copyright 2020 The Author(s)
dc.rights.uri	https://creativecommons.org/licenses/by-nc-sa/4.0	en_US
dc.rights	Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)	en_US
dc.subject.courseID	FYS-3941
dc.subject	VDP::Technology: 500::Medical technology: 620	en_US
dc.subject	VDP::Teknologi: 500::Medisinsk teknologi: 620	en_US
dc.title	Introducing Soft Option-Critic for Blood Glucose Control in Type 1 Diabetes : Exploiting Abstraction of Actions for Automated Insulin Administration	en_US
dc.type	Master thesis	en_US
dc.type	Mastergradsoppgave	en_US

File(s) in this item

Name:: thesis.pdf
Size:: 1.678Mb
Format:: PDF

View/Open

Name:: license.txt
Size:: 1.093Kb
Format:: Text file

View/Open

This item appears in the following collection(s)

Mastergradsoppgaver i teknologi - anvendt fysikk [67]

Show simple item record

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)