Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive Exploration

Archith Athrey, Othmane Mazhar, Meichen Guo, Bart De Schutter, Shengling Shi

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

11 Downloads (Pure)

Abstract

In this paper, we analyze the regret incurred by a computationally efficient exploration strategy, known as naive exploration, for controlling unknown partially observable systems within the Linear Quadratic Gaussian (LQG) framework. We introduce a two-phase control algorithm called LQG-NAIVE, which involves an initial phase of injecting Gaussian input signals to obtain a system model, followed by a second phase of an interplay between naive exploration and control in an episodic fashion. We show that LQG-NAIVE achieves a regret growth rate of Õ(√T), i.e., O(√T) up to logarithmic factors after T time steps, and we validate its performance through numerical simulations. Additionally, we propose LQG-IF2E, which extends the exploration signal to a 'closed-loop' setting by incorporating the Fisher Information Matrix (FIM). We provide compelling numerical evidence of the competitive performance of LQG-IF2E compared to LQG-NAIVE.

Original languageEnglish
Title of host publicationProceedings of the European Control Conference, ECC 2024
PublisherIEEE
Pages1795-1801
Number of pages7
ISBN (Electronic)978-3-9071-4410-7
DOIs
Publication statusPublished - 2024
Event2024 European Control Conference, ECC 2024 - Stockholm, Sweden
Duration: 25 Jun 202428 Jun 2024

Conference

Conference2024 European Control Conference, ECC 2024
Country/TerritorySweden
CityStockholm
Period25/06/2428/06/24

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Fingerprint

Dive into the research topics of 'Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive Exploration'. Together they form a unique fingerprint.

Cite this