Abstract
In this paper, we analyze the regret incurred by a computationally efficient exploration strategy, known as naive exploration, for controlling unknown partially observable systems within the Linear Quadratic Gaussian (LQG) framework. We introduce a two-phase control algorithm called LQG-NAIVE, which involves an initial phase of injecting Gaussian input signals to obtain a system model, followed by a second phase of an interplay between naive exploration and control in an episodic fashion. We show that LQG-NAIVE achieves a regret growth rate of Õ(√T), i.e., O(√T) up to logarithmic factors after T time steps, and we validate its performance through numerical simulations. Additionally, we propose LQG-IF2E, which extends the exploration signal to a 'closed-loop' setting by incorporating the Fisher Information Matrix (FIM). We provide compelling numerical evidence of the competitive performance of LQG-IF2E compared to LQG-NAIVE.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the European Control Conference, ECC 2024 |
| Publisher | IEEE |
| Pages | 1795-1801 |
| Number of pages | 7 |
| ISBN (Electronic) | 978-3-9071-4410-7 |
| DOIs | |
| Publication status | Published - 2024 |
| Event | 2024 European Control Conference, ECC 2024 - Stockholm, Sweden Duration: 25 Jun 2024 → 28 Jun 2024 |
Conference
| Conference | 2024 European Control Conference, ECC 2024 |
|---|---|
| Country/Territory | Sweden |
| City | Stockholm |
| Period | 25/06/24 → 28/06/24 |
Bibliographical note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-careOtherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.