RMIT University
Browse

Deep reinforcement learning approach to optimize the driving performance of shield tunnelling machines

journal contribution
posted on 2024-11-03, 09:06 authored by Khalid Elbaz, Annan ZhouAnnan Zhou, Shui-Long Shen
This paper proposes a deep reinforcement learning (DRL)-based model as a valuable tool to improve the performance of the driving system (i.e. thrust force and cutterhead torque) of a shield tunnelling machine. The proposed model integrates deep-Q learning algorithm (DQL) and particle swarm optimization (PSO) based on an extreme learning machine (ELM). Specifically, the DQL–PSO model initialized the biases and weights in the ELM to achieve the optimal convergence rate and avoid instability. The DQL–PSO model evaluates the reward of action at each step and thus guides the particles to perform the appropriate action in real time. The DRL process data included shield operational parameters, geometry, and geological conditions. Field data collected from the Shenzhen railway tunnelling case study were used to validate the superiority and effectiveness of the presented DQL–PSO model. The algorithm was also evaluated using four numerical benchmark problems and compared with a theoretical model. The results revealed that the promising potential of DRL as a decision tool efficiently supports the formulation of target strategy and demonstrated its potential for engineering applications.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1016/j.tust.2023.105104
  2. 2.
    ISSN - Is published in 08867798

Journal

Tunnelling and Underground Space Technology

Volume

136

Number

105104

Start page

1

End page

17

Total pages

17

Publisher

Elsevier

Place published

United Kingdom

Language

English

Copyright

© 2023 Elsevier Ltd. All rights reserved.

Former Identifier

2006122787

Esploro creation date

2023-06-23

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC