A new reinforcement learning-based variable speed limit control approach to improve traffic efficiency against freeway jam waves

Yu Han; Andreas Hegyi; Le Zhang; Zhengbing He; Edward Chung; Pan Liu

doi:10.1016/j.trc.2022.103900

Yu Han*, Andreas Hegyi, Le Zhang, Zhengbing He, Edward Chung, Pan Liu

*Corresponding author for this work

Transport and Planning

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Conventional reinforcement learning (RL) models of variable speed limit (VSL) control systems (and traffic control systems in general) cannot be trained in a real traffic process because new control actions are usually explored randomly, which may result in high costs (delays) due to exploration and learning. For this reason, existing RL-based VSL control approaches need a traffic simulator for training. However, the performance of those approaches depends on the accuracy of the simulators. This paper proposes a new RL-based VSL control approach to overcome these problems. The proposed approach is designed to improve traffic efficiency by using VSLs against freeway jam waves. It applies an iterative training framework in which the optimal control policy is updated by exploring new control actions both online and offline in each iteration. The explored control actions are evaluated in the real traffic process, so the RL model does not learn only from a traffic simulator. The proposed VSL control approach is tested using a macroscopic traffic simulation model to represent real-world traffic flow dynamics. Compared with existing VSL control approaches, the proposed approach is shown to have two advantages: (i) it alleviates the impact of model mismatch, which occurs in both model-based VSL control approaches and existing RL-based VSL control approaches, by replacing knowledge from the models with knowledge from the real process, and (ii) it significantly reduces the exploration and learning costs compared to existing RL-based VSL control approaches.

Original language	English
Article number	103900
Number of pages	22
Journal	Transportation Research Part C: Emerging Technologies
Volume	144
DOIs	https://doi.org/10.1016/j.trc.2022.103900
Publication status	Published - 2022

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Data-driven approach
Freeway traffic control
Reinforcement learning
Variable speed limits

Access to Document

10.1016/j.trc.2022.103900

1-s2.0-S0968090X22003138-main (Final published version, 4.39 MB)

Cite this

@article{b76f96649e4948e4b56e4df2b9f0b67b,

title = "A new reinforcement learning-based variable speed limit control approach to improve traffic efficiency against freeway jam waves",

abstract = "Conventional reinforcement learning (RL) models of variable speed limit (VSL) control systems (and traffic control systems in general) cannot be trained in a real traffic process because new control actions are usually explored randomly, which may result in high costs (delays) due to exploration and learning. For this reason, existing RL-based VSL control approaches need a traffic simulator for training. However, the performance of those approaches depends on the accuracy of the simulators. This paper proposes a new RL-based VSL control approach to overcome these problems. The proposed approach is designed to improve traffic efficiency by using VSLs against freeway jam waves. It applies an iterative training framework in which the optimal control policy is updated by exploring new control actions both online and offline in each iteration. The explored control actions are evaluated in the real traffic process, so the RL model does not learn only from a traffic simulator. The proposed VSL control approach is tested using a macroscopic traffic simulation model to represent real-world traffic flow dynamics. Compared with existing VSL control approaches, the proposed approach is shown to have two advantages: (i) it alleviates the impact of model mismatch, which occurs in both model-based VSL control approaches and existing RL-based VSL control approaches, by replacing knowledge from the models with knowledge from the real process, and (ii) it significantly reduces the exploration and learning costs compared to existing RL-based VSL control approaches.",

keywords = "Data-driven approach, Freeway traffic control, Reinforcement learning, Variable speed limits",

author = "Yu Han and Andreas Hegyi and Le Zhang and Zhengbing He and Edward Chung and Pan Liu",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2022",

doi = "10.1016/j.trc.2022.103900",

language = "English",

volume = "144",

journal = "Transportation Research Part C: Emerging Technologies",

issn = "0968-090X",

publisher = "Elsevier",

}

TY - JOUR

T1 - A new reinforcement learning-based variable speed limit control approach to improve traffic efficiency against freeway jam waves

AU - Han, Yu

AU - Hegyi, Andreas

AU - Zhang, Le

AU - He, Zhengbing

AU - Chung, Edward

AU - Liu, Pan

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2022

Y1 - 2022

AB - Conventional reinforcement learning (RL) models of variable speed limit (VSL) control systems (and traffic control systems in general) cannot be trained in a real traffic process because new control actions are usually explored randomly, which may result in high costs (delays) due to exploration and learning. For this reason, existing RL-based VSL control approaches need a traffic simulator for training. However, the performance of those approaches depends on the accuracy of the simulators. This paper proposes a new RL-based VSL control approach to overcome these problems. The proposed approach is designed to improve traffic efficiency by using VSLs against freeway jam waves. It applies an iterative training framework in which the optimal control policy is updated by exploring new control actions both online and offline in each iteration. The explored control actions are evaluated in the real traffic process, so the RL model does not learn only from a traffic simulator. The proposed VSL control approach is tested using a macroscopic traffic simulation model to represent real-world traffic flow dynamics. Compared with existing VSL control approaches, the proposed approach is shown to have two advantages: (i) it alleviates the impact of model mismatch, which occurs in both model-based VSL control approaches and existing RL-based VSL control approaches, by replacing knowledge from the models with knowledge from the real process, and (ii) it significantly reduces the exploration and learning costs compared to existing RL-based VSL control approaches.

KW - Data-driven approach

KW - Freeway traffic control

KW - Reinforcement learning

KW - Variable speed limits

UR - http://www.scopus.com/inward/record.url?scp=85140007420&partnerID=8YFLogxK

U2 - 10.1016/j.trc.2022.103900

DO - 10.1016/j.trc.2022.103900

M3 - Article

AN - SCOPUS:85140007420

SN - 0968-090X

VL - 144

JO - Transportation Research Part C: Emerging Technologies

JF - Transportation Research Part C: Emerging Technologies

M1 - 103900

ER -
