{ "cells": [ { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Low Emission Building Control with Probabilistic Deep Reinforcement Learning\n", "\n", "## [Scott Jeen](https://enjeeneer.io/)\n", "\n", "### Refficiency Lunch, 2nd November 2021" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# 1. Motivation" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 1.1 Cool (and the gang)\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 1.2 Idealised Building Control\n", "\n", "\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# 2. Current Approaches to RL for Building Control" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 2.1 Model-free RL\n", "\n", "Rollouts | Network\n", ":-------------------------:|:-------------------------:\n", "|\n", "![](images/model_free.gif) | ![](images/model_free_network1.png)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 2.2 Model-based RL\n", "Rollouts | Network\n", ":-------------------------:|:-------------------------:\n", "|\n", "![](images/model_based.gif) | ![](images/model_based_network.png)\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 2.3 Qualitative Comparison\n", "\n", "| | **Pros** 💚 | **Cons** 💔 |\n", "|----------------|------------------|------------------------------|\n", "|**Model-free** | Performance | Data inefficiency |\n", "|**Model-based** | Data efficiency | Performance; Model bias |\n", "\n" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 2.4 Quantitative Comparison\n", "| | **SOA Energy Efficiency** | **SOA Data Efficiency** | **Authors** |\n", "|----------------|------------------|------------------------------|----------------------|\n", "|**Model-free** | **10-30%** | 1-2 years *simulated* data | [1,2] |\n", "|**Model-based** | 5-10% | **3-12 hours** *live* data | [3,4] |\n", "\n", "\n", "- To scale, agents must learn quickly *online*. Can we get model-free performance with model-based agents?" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# 3. Probabilistic, Model-based Reinforcement Learning\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 3.1 Mitigating Model Bias\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 3.1 Mitigating Model Bias\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 3.1 Mitigating Model Bias\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 3.1 Mitigating Model Bias\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 3.1 Mitigating Model Bias\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 3.2 Probabilistic Trajectory Sampling\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# 4. Experiments" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 4.1 Building Simulations [5]\n", "some *blue* text | some *blue* text \n", ":-------------------------:|:-------------------------:\n", "|\n", " | \n", "|\n", " | " ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 4.2 Annual Carbon Emissions\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 4.3 Temperature Control\n", "\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 4.4 Load Shifting\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 4.4 Load Shifting\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 4.4 Load Shifting\n", "" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "## 4.5 Mixed-Use Results\n", "\n", "| | Annual Energy Consumption (kWh) ⚡ | Average Temperature (oC) 🌡️ | Emissions (tCO2) |\n", "|---------------------------|--------------------------------|--------------------------|--------------------------|\n", "| Current | 132, 776 | 20.11 | 65.15 |\n", "| Model-free | 103, 363 | 21.99 | 50.91 |\n", "| Model-based | 137, 616 | 20.59 | 67.61 |\n", "| Probabilistic Model-based | **97, 149** | 20.41 | **47.73** | |" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# 5. Future Work\n", "\n", "- In-situ verification\n", "- What is optimal?\n", "- What is the necessary feature space?\n", "- Can we share learning between buildings?" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Thanks!" ] } ], "metadata": { "celltoolbar": "Slideshow", "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.9" } }, "nbformat": 4, "nbformat_minor": 4 }