Learning a reward function for user-preferred appliance scheduling

Nikolina Čović*, Jochen L. Cremer, Hrvoje Pandžić

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Accelerated development of demand response service provision by the residential sector is crucial for reducing carbon-emissions in the power sector. Along with the infrastructure advancement, encouraging the end users to participate is crucial. End users highly value their privacy and control, and want to be included in the service design and decision-making process when creating the daily appliance operation schedules. Furthermore, unless they are financially or environmentally motivated, they are generally not prepared to sacrifice their comfort to help balance the power system. In this paper, we present an inverse-reinforcement-learning-based model that helps create the end users’ daily appliance schedules without asking them to explicitly state their needs and wishes. By using their past consumption data, the end consumers will implicitly participate in the creation of those decisions and will thus be motivated to continue participating in the provision of demand response services.

Original languageEnglish
Article number110667
Number of pages7
JournalElectric Power Systems Research
Volume235
DOIs
Publication statusPublished - 2024

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

  • Demand response
  • Inverse reinforcement learning
  • User comfort

Fingerprint

Dive into the research topics of 'Learning a reward function for user-preferred appliance scheduling'. Together they form a unique fingerprint.

Cite this