Automatic learning of cyclist's compliance for speed advice at intersections - a reinforcement learning-based approach

Azita Dabiri; Andreas Hegyi; Serge Hoogendoorn

doi:10.1109/ITSC.2019.8916847

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

2 Citations (Scopus)

Abstract

Although there exist algorithms that give speed advice to cyclists approaching traffic lights with uncertain timing, they all need to know, and thus assume, the cyclist's response to the advice in order to optimize it. To relax this assumption, this paper proposes an algorithm that combines reinforcement learning and planning to learn the cyclist's reaction to the advice and uses this information to plan the best next advice on-the-fly. Rather than the single search procedure that is conventional in existing architectures, the algorithm uses two sample-based search procedures. This makes it possible to obtain an accurate local approximation of the action-value function despite the short computation time available in each decision epoch. The algorithm is tested in a simulation case study, which confirms the impact of a proper initialisation of the action-value function as well as the importance of using two search procedures.

Original language	English
Title of host publication	2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019
Place of Publication	Piscataway, NJ, USA
Publisher	IEEE
Pages	2375-2380
ISBN (Electronic)	9781538670248
DOIs	https://doi.org/10.1109/ITSC.2019.8916847
Publication status	Published - 2019
Event	22nd IEEE International Conference on Intelligent Transportation Systems, ITSC 2019, Auckland, New Zealand, 27 Oct 2019 → 30 Oct 2019, https://www.itsc2019.org/

Conference

Conference	22nd IEEE International Conference on Intelligent Transportation Systems, ITSC 2019
Country/Territory	New Zealand
City	Auckland
Period	27/10/19 → 30/10/19
Internet address	https://www.itsc2019.org/

Access to Document

10.1109/ITSC.2019.8916847

Cite this

@inproceedings{d7739dcf4ea2489bb3862e064433b186,

title = "Automatic learning of cyclist's compliance for speed advice at intersections - a reinforcement learning-based approach",

abstract = "Although there exist algorithms that give speed advice to cyclists approaching traffic lights with uncertain timing, they all need to know, and thus assume, the cyclist's response to the advice in order to optimize it. To relax this assumption, this paper proposes an algorithm that combines reinforcement learning and planning to learn the cyclist's reaction to the advice and uses this information to plan the best next advice on-the-fly. Rather than the single search procedure that is conventional in existing architectures, the algorithm uses two sample-based search procedures. This makes it possible to obtain an accurate local approximation of the action-value function despite the short computation time available in each decision epoch. The algorithm is tested in a simulation case study, which confirms the impact of a proper initialisation of the action-value function as well as the importance of using two search procedures.",

author = "Azita Dabiri and Andreas Hegyi and Serge Hoogendoorn",

year = "2019",

doi = "10.1109/ITSC.2019.8916847",

language = "English",

pages = "2375--2380",

booktitle = "2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019",

publisher = "IEEE",

address = "United States",

note = "22nd IEEE International Conference on Intelligent Transportation Systems, ITSC 2019 ; Conference date: 27-10-2019 Through 30-10-2019",

url = "https://www.itsc2019.org/",

}

Dabiri, A, Hegyi, A & Hoogendoorn, S 2019, Automatic learning of cyclist's compliance for speed advice at intersections - a reinforcement learning-based approach. in 2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019., 8916847, IEEE, Piscataway, NJ, USA, pp. 2375-2380, 22nd IEEE International Conference on Intelligent Transportation Systems, ITSC 2019, Auckland, New Zealand, 27/10/19. https://doi.org/10.1109/ITSC.2019.8916847

Automatic learning of cyclist's compliance for speed advice at intersections - a reinforcement learning-based approach. / Dabiri, Azita; Hegyi, Andreas; Hoogendoorn, Serge.
2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019. Piscataway, NJ, USA: IEEE, 2019. p. 2375-2380 8916847.

TY - GEN

T1 - Automatic learning of cyclist's compliance for speed advice at intersections - a reinforcement learning-based approach

AU - Dabiri, Azita

AU - Hegyi, Andreas

AU - Hoogendoorn, Serge

PY - 2019

Y1 - 2019

N2 - Although there exist algorithms that give speed advice to cyclists approaching traffic lights with uncertain timing, they all need to know, and thus assume, the cyclist's response to the advice in order to optimize it. To relax this assumption, this paper proposes an algorithm that combines reinforcement learning and planning to learn the cyclist's reaction to the advice and uses this information to plan the best next advice on-the-fly. Rather than the single search procedure that is conventional in existing architectures, the algorithm uses two sample-based search procedures. This makes it possible to obtain an accurate local approximation of the action-value function despite the short computation time available in each decision epoch. The algorithm is tested in a simulation case study, which confirms the impact of a proper initialisation of the action-value function as well as the importance of using two search procedures.

UR - http://www.scopus.com/inward/record.url?scp=85076816920&partnerID=8YFLogxK

U2 - 10.1109/ITSC.2019.8916847

DO - 10.1109/ITSC.2019.8916847

M3 - Conference contribution

SP - 2375

EP - 2380

BT - 2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019

PB - IEEE

CY - Piscataway, NJ, USA

T2 - 22nd IEEE International Conference on Intelligent Transportation Systems, ITSC 2019

Y2 - 27 October 2019 through 30 October 2019

ER -
