TY - GEN
T1 - Adaptive Online Non-stochastic Control
AU - Mhaisen, Naram
AU - Iosifidis, George
PY - 2024
Y1 - 2024
N2 - We tackle the problem of Non-stochastic Control (NSC) with the aim of obtaining algorithms whose policy regret is proportional to the difficulty of the controlled environment. Namely, we tailor the Follow The Regularized Leader (FTRL) framework to dynamical systems by using regularizers that are proportional to the actual witnessed costs. The main challenge arises from using the proposed adaptive regularizers in the presence of a state, or equivalently, a memory, which couples the effect of the online decisions and requires new tools for bounding the regret. Via new analysis techniques for NSC and FTRL integration, we obtain novel disturbance action controllers (DAC) with sub-linear data adaptive policy regret bounds that shrink when the trajectory of costs has small gradients, while staying sub-linear even in the worst case.
AB - We tackle the problem of Non-stochastic Control (NSC) with the aim of obtaining algorithms whose policy regret is proportional to the difficulty of the controlled environment. Namely, we tailor the Follow The Regularized Leader (FTRL) framework to dynamical systems by using regularizers that are proportional to the actual witnessed costs. The main challenge arises from using the proposed adaptive regularizers in the presence of a state, or equivalently, a memory, which couples the effect of the online decisions and requires new tools for bounding the regret. Via new analysis techniques for NSC and FTRL integration, we obtain novel disturbance action controllers (DAC) with sub-linear data adaptive policy regret bounds that shrink when the trajectory of costs has small gradients, while staying sub-linear even in the worst case.
KW - Follow the Regularized Leader
KW - Non-stochastic control
KW - Online learning
UR - http://www.scopus.com/inward/record.url?scp=85203697175&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85203697175
VL - 242
T3 - Proceedings of Machine Learning Research
SP - 248
EP - 259
BT - Proceedings of Machine Learning Research
A2 - Abate, A.
A2 - Cannon, M.
A2 - Margellos, K.
A2 - Papachristodoulou, A.
T2 - 6th Annual Learning for Dynamics and Control Conference, L4DC 2024
Y2 - 15 July 2024 through 17 July 2024
ER -