Lazy Lagrangians for Optimistic Learning With Budget Constraints

Daron Anderson; George Iosifidis; Douglas J. Leith

doi:10.1109/TNET.2022.3222404

Lazy Lagrangians for Optimistic Learning With Budget Constraints

Daron Anderson, George Iosifidis, Douglas J. Leith

Embedded Systems

Research output: Contribution to journal › Article › Scientific › peer-review

2 Citations (Scopus)

8 Downloads (Pure)

Abstract

We consider the general problem of online convex optimization with time-varying budget constraints in the presence of predictions for the next cost and constraint functions, that arises in a plethora of network resource management problems. A novel saddle-point algorithm is designed by combining a Follow-The-Regularized-Leader iteration with prediction-adaptive dynamic steps. The algorithm achieves O(T(3β/4) regret and O(T(1+β)/2) constraint violation bounds that are tunable via parameter β ∈ [1/2,1) and have constant factors that shrink with the predictions quality, achieving eventually O(1) regret for perfect predictions. Our work extends the seminal FTRL framework for this new OCO setting and outperforms the respective state-of-the-art greedy-based solutions which naturally cannot benefit from predictions, without imposing conditions on the (unknown) quality of predictions, the cost functions or the geometry of constraints, beyond convexity.

Original language	English
Pages (from-to)	1935 - 1949
Number of pages	15
Journal	IEEE/ACM Transactions on Networking
Volume	31
Issue number	5
DOIs	https://doi.org/10.1109/TNET.2022.3222404
Publication status	Published - 2023

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Network control
network management
online convex optimization (OCO)
online learning
resource allocation

Access to Document

10.1109/TNET.2022.3222404

Lazy_Lagrangians_for_Optimistic_Learning_With_Budget_ConstraintsFinal published version, 1.13 MB

Cite this

@article{e343b1be344f4de5b3c0a8f6140f02c7,

title = "Lazy Lagrangians for Optimistic Learning With Budget Constraints",

abstract = "We consider the general problem of online convex optimization with time-varying budget constraints in the presence of predictions for the next cost and constraint functions, that arises in a plethora of network resource management problems. A novel saddle-point algorithm is designed by combining a Follow-The-Regularized-Leader iteration with prediction-adaptive dynamic steps. The algorithm achieves O(T(3β/4) regret and O(T(1+β)/2) constraint violation bounds that are tunable via parameter β ∈ [1/2,1) and have constant factors that shrink with the predictions quality, achieving eventually O(1) regret for perfect predictions. Our work extends the seminal FTRL framework for this new OCO setting and outperforms the respective state-of-the-art greedy-based solutions which naturally cannot benefit from predictions, without imposing conditions on the (unknown) quality of predictions, the cost functions or the geometry of constraints, beyond convexity.",

keywords = "Network control, network management, online convex optimization (OCO), online learning, resource allocation",

author = "Daron Anderson and George Iosifidis and Leith, {Douglas J.}",

note = "Green Open Access added to TU Delft Institutional Repository {\textquoteleft}You share, we take care!{\textquoteright} – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. ",

year = "2023",

doi = "10.1109/TNET.2022.3222404",

language = "English",

volume = "31",

pages = "1935 -- 1949",

journal = "IEEE/ACM Transactions on Networking",

issn = "1063-6692",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "5",

}

TY - JOUR

T1 - Lazy Lagrangians for Optimistic Learning With Budget Constraints

AU - Anderson, Daron

AU - Iosifidis, George

AU - Leith, Douglas J.

N1 - Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2023

Y1 - 2023

N2 - We consider the general problem of online convex optimization with time-varying budget constraints in the presence of predictions for the next cost and constraint functions, that arises in a plethora of network resource management problems. A novel saddle-point algorithm is designed by combining a Follow-The-Regularized-Leader iteration with prediction-adaptive dynamic steps. The algorithm achieves O(T(3β/4) regret and O(T(1+β)/2) constraint violation bounds that are tunable via parameter β ∈ [1/2,1) and have constant factors that shrink with the predictions quality, achieving eventually O(1) regret for perfect predictions. Our work extends the seminal FTRL framework for this new OCO setting and outperforms the respective state-of-the-art greedy-based solutions which naturally cannot benefit from predictions, without imposing conditions on the (unknown) quality of predictions, the cost functions or the geometry of constraints, beyond convexity.

AB - We consider the general problem of online convex optimization with time-varying budget constraints in the presence of predictions for the next cost and constraint functions, that arises in a plethora of network resource management problems. A novel saddle-point algorithm is designed by combining a Follow-The-Regularized-Leader iteration with prediction-adaptive dynamic steps. The algorithm achieves O(T(3β/4) regret and O(T(1+β)/2) constraint violation bounds that are tunable via parameter β ∈ [1/2,1) and have constant factors that shrink with the predictions quality, achieving eventually O(1) regret for perfect predictions. Our work extends the seminal FTRL framework for this new OCO setting and outperforms the respective state-of-the-art greedy-based solutions which naturally cannot benefit from predictions, without imposing conditions on the (unknown) quality of predictions, the cost functions or the geometry of constraints, beyond convexity.

KW - Network control

KW - network management

KW - online convex optimization (OCO)

KW - online learning

KW - resource allocation

UR - http://www.scopus.com/inward/record.url?scp=85147300647&partnerID=8YFLogxK

U2 - 10.1109/TNET.2022.3222404

DO - 10.1109/TNET.2022.3222404

M3 - Article

AN - SCOPUS:85147300647

SN - 1063-6692

VL - 31

SP - 1935

EP - 1949

JO - IEEE/ACM Transactions on Networking

JF - IEEE/ACM Transactions on Networking

IS - 5

ER -

Lazy Lagrangians for Optimistic Learning With Budget Constraints

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this