Residential demand response of thermostatically controlled loads using batch Reinforcement Learning

F Ruelens; BJ Claessens; S Vandael; Bart De Schutter; Robert Babuska; R Belmans

doi:10.1109/TSG.2016.2517211

Residential demand response of thermostatically controlled loads using batch Reinforcement Learning

F Ruelens, BJ Claessens, S Vandael, Bart De Schutter, Robert Babuska, R Belmans

Research output: Contribution to journal › Article › Scientific › peer-review

240 Citations (Scopus)

113 Downloads (Pure)

Abstract

Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.

Original language	English
Pages (from-to)	2149-2159
Journal	IEEE Transactions on Smart Grid
Volume	8
Issue number	5
DOIs	https://doi.org/10.1109/TSG.2016.2517211
Publication status	Published - 2017

Bibliographical note

Accepted Author Manuscript

Keywords

Load management
Water heating
Resistance heating
Atmospheric modeling
Load modeling
Feature extraction
Learning (artificial intelligence)

Access to Document

10.1109/TSG.2016.2517211

07401112_1-3Accepted author manuscript, 1.26 MB

Cite this

@article{2a6f9e25d53845d5a8dd2c2f9b453bdb,

title = "Residential demand response of thermostatically controlled loads using batch Reinforcement Learning",

abstract = "Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.",

keywords = "Load management, Water heating, Resistance heating, Atmospheric modeling, Load modeling, Feature extraction, Learning (artificial intelligence)",

author = "F Ruelens and BJ Claessens and S Vandael and {De Schutter}, Bart and Robert Babuska and R Belmans",

note = "Accepted Author Manuscript",

year = "2017",

doi = "10.1109/TSG.2016.2517211",

language = "English",

volume = "8",

pages = "2149--2159",

journal = "IEEE Transactions on Smart Grid",

issn = "1949-3053",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "5",

}

TY - JOUR

T1 - Residential demand response of thermostatically controlled loads using batch Reinforcement Learning

AU - Ruelens, F

AU - Claessens, BJ

AU - Vandael, S

AU - De Schutter, Bart

AU - Babuska, Robert

AU - Belmans, R

N1 - Accepted Author Manuscript

PY - 2017

Y1 - 2017

N2 - Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.

AB - Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.

KW - Load management

KW - Water heating

KW - Resistance heating

KW - Atmospheric modeling

KW - Load modeling

KW - Feature extraction

KW - Learning (artificial intelligence)

UR - http://resolver.tudelft.nl/uuid:2a6f9e25-d538-45d5-a8dd-2c2f9b453bdb

U2 - 10.1109/TSG.2016.2517211

DO - 10.1109/TSG.2016.2517211

M3 - Article

SN - 1949-3053

VL - 8

SP - 2149

EP - 2159

JO - IEEE Transactions on Smart Grid

JF - IEEE Transactions on Smart Grid

IS - 5

ER -

Residential demand response of thermostatically controlled loads using batch Reinforcement Learning

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this