Residential demand response of thermostatically controlled loads using batch Reinforcement Learning

F Ruelens, BJ Claessens, S Vandael, Bart De Schutter, Robert Babuska, R Belmans

Research output: Contribution to journalArticleScientificpeer-review

240 Citations (Scopus)
113 Downloads (Pure)

Abstract

Driven by recent advances in batch Reinforcement Learning (RL), this paper contributes to the application of batch RL to demand response. In contrast to conventional model-based approaches, batch RL techniques do not require a system identification step, making them more suitable for a large-scale implementation. This paper extends fitted Q-iteration, a standard batch RL technique, to the situation when a forecast of the exogenous data is provided. In general, batch RL techniques do not rely on expert knowledge about the system dynamics or the solution. However, if some expert knowledge is provided, it can be incorporated by using the proposed policy adjustment method. Finally, we tackle the challenge of finding an open-loop schedule required to participate in the day-ahead market. We propose a model-free Monte Carlo method that uses a metric based on the state-action value function or Q-function and we illustrate this method by finding the day-ahead schedule of a heat-pump thermostat. Our experiments show that batch RL techniques provide a valuable alternative to model-based controllers and that they can be used to construct both closed-loop and open-loop policies.
Original languageEnglish
Pages (from-to)2149-2159
JournalIEEE Transactions on Smart Grid
Volume8
Issue number5
DOIs
Publication statusPublished - 2017

Bibliographical note

Accepted Author Manuscript

Keywords

  • Load management
  • Water heating
  • Resistance heating
  • Atmospheric modeling
  • Load modeling
  • Feature extraction
  • Learning (artificial intelligence)

Fingerprint

Dive into the research topics of 'Residential demand response of thermostatically controlled loads using batch Reinforcement Learning'. Together they form a unique fingerprint.

Cite this