ub.xmlui.mirage2.page-structure.muninLogoub.xmlui.mirage2.page-structure.openResearchArchiveLogo
    • EnglishEnglish
    • norsknorsk
  • Velg spraakEnglish 
    • EnglishEnglish
    • norsknorsk
  • Administration/UB
View Item 
  •   Home
  • Fakultet for naturvitenskap og teknologi
  • Institutt for teknologi og sikkerhet
  • Artikler, rapporter og annet (teknologi og sikkerhet)
  • View Item
  •   Home
  • Fakultet for naturvitenskap og teknologi
  • Institutt for teknologi og sikkerhet
  • Artikler, rapporter og annet (teknologi og sikkerhet)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Data driven control based on Deep Q-Network algorithm for heading control and path following of a ship in calm water and waves

Permanent link
https://hdl.handle.net/10037/26286
DOI
https://doi.org/10.1016/j.oceaneng.2022.111802
Thumbnail
View/Open
article.pdf (22.39Mb)
Accepted manuscript version (PDF)
Date
2022-07-06
Type
Journal article
Tidsskriftartikkel
Peer reviewed

Author
Sivaraj, Sivaraman; Rajendran, Suresh; Perera, Lokukaluge Prasad Channa
Abstract
A reinforcement learning algorithm based on Deep Q-Networks (DQN) is used for the path following and heading control of a ship in calm water and waves. The rudder action of the ship is selected based on the developed DQN model. The spatial positions, linear velocities, yaw rate, Heading Error (HE) and Cross Track Error (CTE) represent the state-space, and a set of rudder angles represents the action space of the DQN model. The state space variables are in continuous space and action spaces are in discrete space. The decaying -greedy method is used for the exploration. Reward functions are modeled such that the agent will try to reduce the Cross Track Error and the Heading Error. Based on the literature available, the L7 model of a KVLCC2 tanker is used for testing the algorithm. The vessel dynamics are represented using a 3DoF maneuvering model that includes hydrodynamic, propeller, rudder and wave forces. The wave disturbances are calculated from the second-order mean drift forces . The environment is assumed to have the Markov property. The CTE and HE are calculated based on the Line of Sight (LOS) Algorithm. The effect of Pre-trained weights on different heading actions is investigated based on the exploration threshold. The DQN is trained and tested for heading control and path-following in calm water and different wave headings.
Publisher
Elsevier
Citation
Sivaraj, Rajendran, Perera. Data driven control based on Deep Q-Network algorithm for heading control and path following of a ship in calm water and waves. Ocean Engineering. 2022
Metadata
Show full item record
Collections
  • Artikler, rapporter og annet (teknologi og sikkerhet) [361]
Copyright 2022 The Author(s)

Browse

Browse all of MuninCommunities & CollectionsAuthor listTitlesBy Issue DateBrowse this CollectionAuthor listTitlesBy Issue Date
Login

Statistics

View Usage Statistics
UiT

Munin is powered by DSpace

UiT The Arctic University of Norway
The University Library
uit.no/ub - munin@ub.uit.no

Accessibility statement (Norwegian only)