Traffic Navigation for Urban Air Mobility with Reinforcement Learning

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 88
  • Download : 0
Assuring stability of the guidance law for quadrotor-type Urban Air Mobility (UAM) is important since it is assumed to operate in urban areas. Model free reinforcement learning was intensively applied for this purpose in recent studies. In reinforcement learning, the environment is an important part of training. Usually, a Proximal Policy Optimization (PPO) algorithm is used widely for reinforcement learning of quadrotors. However, PPO algorithms for quadrotors tend to fail to guarantee the stability of the guidance law in the environment as the search space increases. In this work, we show the improvements of stability in a multi-agent quadrotor-type UAM environment by applying the Soft Actor-Critic (SAC) reinforcement learning algorithm. The simulations were performed in Unity. Our results achieved three times better reward in the Urban Air Mobility environment than when trained with the PPO algorithm and our approach also shows faster training time than the PPO algorithm.
Publisher
SPRINGER-VERLAG SINGAPORE PTE LTD
Issue Date
2021-11
Language
English
Citation

Asia-Pacific International Symposium on Aerospace Technology (APISAT), pp.31 - 42

ISSN
1876-1100
DOI
10.1007/978-981-19-2635-8_3
URI
http://hdl.handle.net/10203/305058
Appears in Collection
AE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0