TY - GEN
T1 - Hierarchical Clustering-Based State Grouping Reinforcement Learning for Switching Decision of Autonomous Vehicles
AU - Ouyang, Wenjie
AU - Jiao, Yiwen
AU - Liu, Yang
AU - Li, Yang
AU - Hu, Manjiang
AU - Qin, Hongmao
PY - 2023
Y1 - 2023
N2 - Reinforcement learning (RL) has gained wide attention, but its implementation in autonomous vehicles is still limited by insufficient sample efficiency and heavy training costs. The training efficiency of RL agents is influenced by the dimension of the state space, which can be partitioned to reduce the complexity of sampling and computation. This study proposes a hierarchical clustering-based state grouping reinforcement learning (HCSG-RL) method for the switching decision of autonomous vehicles. First, we partition the base state space into groups and generate a hierarchical tree of state space groups. Then, we train multiple sub-agents for each node in the hierarchical tree. Finally, we add these trained-well sub-model into master policy. This method allows us to fully explore all state spaces and improve the training efficiency of individual agents, which handles the 'long-tail' issue and the curse of dimensionality issue. We conduct experiments in a simulation environment and results show that the proposed method has 16-72% reward improvement compared to the tree model in different road length.
AB - Reinforcement learning (RL) has gained wide attention, but its implementation in autonomous vehicles is still limited by insufficient sample efficiency and heavy training costs. The training efficiency of RL agents is influenced by the dimension of the state space, which can be partitioned to reduce the complexity of sampling and computation. This study proposes a hierarchical clustering-based state grouping reinforcement learning (HCSG-RL) method for the switching decision of autonomous vehicles. First, we partition the base state space into groups and generate a hierarchical tree of state space groups. Then, we train multiple sub-agents for each node in the hierarchical tree. Finally, we add these trained-well sub-model into master policy. This method allows us to fully explore all state spaces and improve the training efficiency of individual agents, which handles the 'long-tail' issue and the curse of dimensionality issue. We conduct experiments in a simulation environment and results show that the proposed method has 16-72% reward improvement compared to the tree model in different road length.
KW - autonomous switching
KW - hierarchical clustering
KW - reinforcement learning
KW - state grouping
UR - http://www.scopus.com/inward/record.url?scp=85180129997&partnerID=8YFLogxK
U2 - 10.1109/ICUS58632.2023.10318413
DO - 10.1109/ICUS58632.2023.10318413
M3 - Conference contribution
AN - SCOPUS:85180129997
T3 - Proceedings of 2023 IEEE International Conference on Unmanned Systems, ICUS 2023
SP - 1375
EP - 1380
BT - Proceedings of 2023 IEEE International Conference on Unmanned Systems, ICUS 2023
A2 - Song, Rong
PB - IEEE
T2 - 2023 IEEE International Conference on Unmanned Systems, ICUS 2023
Y2 - 13 October 2023 through 15 October 2023
ER -