Energy Load Forecasting

Energy Load Forecasting#

Accurate forecasting of expected energy load is identified as one of the highest leverage contribution areas of Machine/Deep Learning toward transitioning to a renewable-based electrical infrastructure. In this demo, we cover a use case on forecasting energy load using hourly energy demand generation data.

This dataset is originally from ENTSOE transparency platform, and contains 4 years of hourly electrical consumption, coal consumption, effluent gas generation, hourly forecasted and actual energy demand, and energy pricing generation. For this demo, we only use the actual energy demand column (total actual load). Though energy load forecast is affected by external factors like weather, and unit pricing, the load data exhibits a strong diurnal pattern and autoregressive structure. So, we treat the forecasting problem as univariate forecasting. The model predicts the next 12-hour energy load from the last three days’ data.

For this demo, we use a pre-trained Keras model weights which can be downloaded into your file system. We use aix360’s Time Series Individual Conditional Expectation (TSLimeExplainer) to explain the energy load forecaster’s sensitivity, i.e., how the model forecast is impacted by the input. Further, we analyze the impact of the input time series in terms of time series data statistics.

For more algorithmic details on TSICE, you can refer to Time Series Individual Conditional Expectation.

To start this hands on demo, skip to Instructions.

Time Series Individual Conditional Expectation#

TSICEExplainer is a model agnostic, black box forecaster explainer and provides local explanations for time series data. This algorithm adapts traditional Individual Conditional Expectation plot to time series modality. Timeseries data is highly correlated. Many algorithms often use features derived from the timeseries for forecasting. But these features cannot be changed independently. The TSICE relies on temporally focused sampling method (via timeseries perturbation) to explore this feature space and establishes the connection from this feature space to the model output.

Instructions#

Create a new notebook energy_load_forecasting.ipynb in Jupyter lab to run the demo on energy load forecasting use case. Refer to the instructions in prerequisites.
Follow all the below sections and execute the code by pasting into the newly created notebook energy_load_forecasting.ipynb.

Imports#

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

import os
import warnings
from typing import Union
# to suppress unrelated tensorflow warnings while importing aix360.datasets
warnings.filterwarnings("ignore")
os.environ["TF_CPP_MIN_LOG_LEVEL"] = "3"

import numpy as np 
import pandas as pd
import tensorflow as tf
import matplotlib.pyplot as plt
from plotly.subplots import make_subplots
from plotly import graph_objects as go
from IPython.display import Image
from sklearn.linear_model import LinearRegression
from aix360.algorithms.tsice import TSICEExplainer

Load Dataset#

Download the dataset from Kaggle manually. Unzip the file and move energy_dataset.csv to the tutorial folder aix360_demo. For use-case and dataset description, refer to introduction.

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell. The data has NaN values and so, imputing using interpolate to have uniform frequency.

energy_df = pd.read_csv('energy_dataset.csv', header=0, index_col=0)
energy_df.index = pd.to_datetime(energy_df.index, utc=True)
energy_df = energy_df.sort_index()
energy_df.index.freq = pd.infer_freq(energy_df.index)

total_energy_col = 'total load actual'
energy_df[total_energy_col] = energy_df[total_energy_col].interpolate(method='time', axis=0)

energy_df[total_energy_col].describe()

Plot the dataset#

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell to plot energy load from Feb 2015.

_, ax = plt.subplots(1, 1, figsize=(20, 3))
_ = energy_df[total_energy_col]['2015-02-01':'2015-02-28'].plot(color='green', ax=ax)
_ = ax.set_xlabel('time', fontsize=16)
_ = ax.set_ylabel(total_energy_col, fontsize=16)

Data Preprocessing#

Prepare the data for the model by running a 72-hour sliding window on each partition. And extract the history input (72-hour) and forecast (12-hour) pair at the same time.

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

history_length, lookahead = 72, 12
x = energy_df[total_energy_col].values
t_x = list(energy_df[total_energy_col].index)
data_len = x.shape[0]

X_all = list()
t_all = list()

for i in range(data_len - history_length - lookahead):
    x_seg = x[i:i+history_length + lookahead]
    if (np.max(x_seg) - np.min(x_seg)) > 2000:
        X_all.append(x_seg)
        t_all.append(t_x[(i+history_length):(i+history_length + lookahead)])
X_all = np.array(X_all)[..., np.newaxis]

X, y = X_all[:, :history_length], X_all[:, history_length:]

Train Test Split#

Train a Convolution Network based (Dilated Convolution Network) deep learning model, that forecast the next 12 hours’ energy demand by looking at the last 3 days (72 hours) energy consumption. For the model training purposes, we split the data into three parts, (1) training (80%), (2) validation (10%), and (3) test (10%).

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

n_data = X.shape[0]
train_len = int(0.8 * n_data)
validation_len = int(0.9 * n_data)

X_train, y_train = X[:train_len], y[:train_len] 
X_val, y_val = X[train_len:validation_len], y[train_len:validation_len]
X_test, y_test = X[validation_len:], y[validation_len:]
t_val = t_all[validation_len:]

Robust Scaling of the data#

Apply robust quantile scaling i.e., scale the data using lower(1%) and upper(99%) quantile values. As a result, forecasted values are scaled as well.

class RobustQuantileScaler:
    def __init__(self):
        self.data_lo = None
        self.data_hi = None

    def transform(self, data, y=None):
        self.data_lo, self.data_hi = np.quantile(data, q=(0.01, 0.99), axis=1, keepdims=True)
        data_X = (data - self.data_lo) / np.maximum((self.data_hi - self.data_lo), 1)

        data_y = None
        if y is not None:
            data_y = (y - self.data_lo) / np.maximum((self.data_hi - self.data_lo), 1)

        return data_X, data_y
    
    def inverse_transform(self, data):
        return data * (self.data_hi - self.data_lo) + self.data_lo

X_train, y_train = RobustQuantileScaler().transform(X_train, y_train)
X_val, y_val = RobustQuantileScaler().transform(X_val, y_val)

X_input_scaler = RobustQuantileScaler()
X_in, _ = X_input_scaler.transform(X_test[:3000])

Load Pre-Trained Model#

Build a deep neural network model composed of dilated causal convolution layers followed by dense layers. And it uses residual connection between the convolution layers. Outputs from the convolution layers are concatenated, and then further passed to the dense FC (fully connected) layer to produce the forecast. The model is configured for a fixed length input of history_length (=72) size and produce a fixed length forecast of length lookahead (=12).

Download model weights from here to the demo folder aix360_demo created in prerequisites. If the weights cannot be downloaded, model can be trained using the training data.

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

d_input = tf.keras.layers.Input((history_length, 1))
x = d_input

x1 = tf.keras.layers.Conv1D(16, kernel_size=5, padding='causal', activation='relu')(x)
x2 = tf.keras.layers.Conv1D(16, kernel_size=5, dilation_rate=2, padding='causal', activation='relu')(x1 + x)
x3 = tf.keras.layers.Conv1D(16, kernel_size=5, dilation_rate=4, padding='causal', activation='relu')(x2 + x1)
x4 = tf.keras.layers.Conv1D(16, kernel_size=5, dilation_rate=4, padding='causal', activation='relu')(x3 + x2)
x = tf.concat([x1, x2, x3, x4], axis=-1)
x = tf.keras.layers.TimeDistributed(tf.keras.layers.Dense(1, activation='gelu'))(x)

x = tf.keras.layers.Flatten()(x)
x = tf.keras.layers.Dense(16, activation='elu')(x)
x = tf.keras.layers.Dense(lookahead, activation='linear')(x)
x_out = tf.keras.layers.Reshape((lookahead, 1))(x)

model = tf.keras.Model(d_input, x_out)
model.compile(loss="mse", optimizer='adam')

if not os.path.exists('energy_load_forecast.h5'):
    callback = tf.keras.callbacks.EarlyStopping(monitor='val_loss', min_delta=0, patience=5)
    history = model.fit(X_train, y_train, 
                        validation_data=[X_val[:100], y_val[:100]], 
                        batch_size=256, 
                        epochs=500, 
                        verbose=0, 
                        callbacks=[callback])
    model.save_weights('energy_load_forecast.h5', save_format="h5")
else:
    model.load_weights('energy_load_forecast.h5')

class ForecastingPipeline:
    def __init__(self, model):
        self.model = model
    
    def __call__(self, x):
        if x.shape[0] == 1:
            x = x.T
        
        if len(x.shape) != 3:
            x = x[np.newaxis, ...]

        x_in_scaler = RobustQuantileScaler()
        x_in, _ = x_in_scaler.transform(x)
        y = self.model.predict(x_in, verbose=0)
        y = x_in_scaler.inverse_transform(y)
        return y[..., 0]

forecast = ForecastingPipeline(model)

y_out = forecast(X_test[:3000])
i_start, i_end = 0, 1500

y_true = np.concatenate([y_test[_] for _ in range(i_start, i_end, lookahead)], axis=0)
y_pred = np.concatenate([y_out[_] for _ in range(i_start, i_end, lookahead)], axis=0)
t_test =  np.concatenate([t_all[_] for _ in range(i_start, i_end, lookahead)], axis=0)

_ , ax = plt.subplots(1, 1, figsize=(20, 5))
ax.plot(t_test, y_pred, label='predicted', color='red')
ax.plot(t_test, y_true[:, 0], label='actual', color='green')
ax.legend()
plt.title("Model forecasts")
plt.show()

Initializing TSICE Explainer#

TSICEExplainer uses TSPerturber for generating perturbed/simulated data and TSFeatures (Latest/Range Feature) to derive time series structural features for further analysis. The TSICEExplainer explains the trend in the model forecast change by varying the selected part of the time series.

Available perturbers are “block-bootstrap”(default), “frequency”, “moving_average”, and “shift”. Available aggregation metrics (stats) are “median”, “mean”, “min”, “max”, “std”, “range”, “intercept”, “trend”, “rsquared”, “max_variation”. explanation_window_start and explanation_window_length are used to select observation window to be explained.

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

tsice_explainer = TSICEExplainer(forecast, 
                             input_length=history_length, 
                             forecast_lookahead=lookahead,
                             n_variables=1,
                             perturbers=[
                                  dict(type="block-bootstrap", 
                                       window_length=5, 
                                       block_length=5, 
                                       block_swap=2),
                              ],
                             features_to_analyze=['std', 'mean', 'max_variation', 'trend'],
                             explanation_window_length=12)

Compute TSICE Explanation#

TSICEExplainer produces a local explanation, for the current instance. The local explanation can be produced via the explain_instance API call. For the explanation generation TSICEExplainer carries out time series sampling. For detailed analysis, we often required large number of samples. Specify number of samples to use for the explanation generation using n_perturbation.

The following cells demonstrate the usage of TSICE to explain impact of latest part of the input timeseries.

Paste the below code snippet into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

tsice_explanation = tsice_explainer.explain_instance(energy_df[[total_energy_col]]['2017-05-11':'2017-05-13'], n_perturbations=100)

Plot TSICE Explanation#

Paste the below code snippet with plot utilities into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

def add_timeseries(fig, ts, color="green", name="time series", showlegend=False):
    timestamps = ts.index

    trace = go.Scatter(
        name=name,
        x=timestamps,
        y=ts[ts.columns[0]],
        mode="lines",
        line=dict(color=color),
        showlegend=showlegend,
    )

    fig.add_trace(trace)


def plot_timeseries(
    ts,
    color: Union[str, dict] = "green",
    fig=None,
    name="time series",
):

    showlegend = True
    if type(ts) == dict:
        data = ts
        if type(color) == str:
            color = {k: color for k in data}
    elif type(ts) == list:
        data = {}
        for k, ts_data in enumerate(ts):
            data[k] = ts_data
        if type(color) == str:
            color = {k: color for k in data}
    else:
        data = {}
        data["default"] = ts
        color = {"default": color}

    if fig is None:
        fig = go.Figure()

    first = True
    for key, ts in data.items():
        if not first:
            showlegend = False

        add_timeseries(fig, ts, color=color[key], showlegend=showlegend, name=name)
        first = False

    return fig


def plot_tsice_explanation(explanation, forecast_horizon):
    original_ts = pd.DataFrame(explanation["data_x"])
    perturbations = explanation["perturbations"]
    forecasts_on_perturbations = explanation["forecasts_on_perturbations"]

    new_perturbations = []
    new_timestamps = []
    pred_ts = []

    original_ts.index.freq = pd.infer_freq(original_ts.index)
    for i in range(1, forecast_horizon + 1):
        new_timestamps.append(original_ts.index[-1] + (i * original_ts.index.freq))

    for perturbation in perturbations:
        new_perturbations.append(pd.DataFrame(perturbation))

    for forecast in forecasts_on_perturbations:
        pred_ts.append(pd.DataFrame(forecast, index=new_timestamps))

    pred_original_ts = pd.DataFrame(
        explanation["current_forecast"], index=new_timestamps
    )

    # plot perturbed timeseries
    fig = plot_timeseries(
        ts=new_perturbations, color="lightgreen", name="perturbed timeseries samples"
    )
    # plot original timeseries
    plot_timeseries(
        ts=original_ts, fig=fig, name="input/original timeseries", color="green"
    )

    # plot varying forecast range
    plot_timeseries(
        ts=pred_ts, color="lightblue", fig=fig, name="forecast on perturbed samples"
    )

    # plot original forecast
    fig = plot_timeseries(
        ts=pred_original_ts, fig=fig, color="blue", name="original forecast"
    )

    fig.update_layout(template="plotly_white")
    fig.update_layout(
        title_text="Time Series Individual Conditional Expectation (TSICE) Plot"
    )

    fig.update_xaxes(title_text="Datetime")

    fig.update_yaxes(title_text=f"{original_ts.columns[0]}")

    return fig

fig = plot_tsice_explanation(tsice_explanation, forecast_horizon=lookahead)
dataset_plot_bytes = fig.to_image(format="png", width=1400, height=800)
Image(dataset_plot_bytes)

Plot the TSICE with derived features to analyze the impact of the input time series on the forecast, in terms of time series data statistics.

Paste the below code snippet with plot utilities into a cell in energy_load_forecasting.ipynb in Jupyter lab and run the cell.

def plot_tsice_with_observed_features(explanation, feature_per_row=2):
    df = pd.DataFrame(explanation["data_x"])
    n_row = int(np.ceil(len(explanation["feature_names"]) / feature_per_row))
    feat_values = np.array(explanation["feature_values"])

    spec = [[{} for _ in range(feature_per_row)] for _ in range(n_row)]

    fig = make_subplots(n_row, feature_per_row, specs=spec)

    row_id = 1
    col_id = 1
    showlegend = True
    for i, feat in enumerate(explanation["feature_names"]):
        x_feat = feat_values[i, :, 0]
        trend_fit = LinearRegression()
        trend_line = trend_fit.fit(x_feat.reshape(-1, 1), explanation["signed_impact"])
        x_trend = np.linspace(min(x_feat), max(x_feat), 101)
        y_trend = trend_line.predict(x_trend[..., np.newaxis])

        fig.add_trace(
            go.Scatter(
                x=x_feat,
                y=explanation["signed_impact"],
                mode="markers",
                showlegend=False,
            ),
            row=row_id,
            col=col_id,
        )

        fig.add_trace(
            go.Scatter(
                x=x_trend,
                y=y_trend,
                line=dict(color="green"),
                mode="lines",
                name="correlation between forecast and observed feature",
                showlegend=showlegend,
            ),
            row=row_id,
            col=col_id,
        )
        current_value = explanation["current_feature_values"][i][0]
        reference_line = go.Scatter(
            x=[current_value, current_value],
            y=[
                np.min(explanation["signed_impact"]) - 1,
                np.max(explanation["signed_impact"]) + 1,
            ],
            mode="lines",
            line=go.scatter.Line(color="firebrick", dash="dash"),
            showlegend=showlegend,
            name="current value",
        )
        fig.add_trace(reference_line, row=row_id, col=col_id)

        fig.update_xaxes(title=f"<b>{feat}<b>", row=row_id, col=col_id)
        fig.update_yaxes(title=f"<b>&#916; forecast<b>", row=row_id, col=col_id)

        showlegend = False
        if col_id == feature_per_row:
            col_id = 1
            row_id += 1
        else:
            col_id += 1

    fig.update_layout(
        title="<b>Impact of Derived Variable On The Forecast</b>",
        plot_bgcolor="white",
        showlegend=True,
        legend=dict(yanchor="top", y=0.99, xanchor="left", x=0.01),
    )
    return fig


fig = plot_tsice_with_observed_features(tsice_explanation)
dataset_plot_bytes = fig.to_image(format="png", width=1400, height=800)
Image(dataset_plot_bytes)

Continue to the next demo on Concrete Comprehensive Strength.

Energy Load Forecasting

Contents

Energy Load Forecasting#

Time Series Individual Conditional Expectation#

Instructions#

Imports#

Load Dataset#

Plot the dataset#

Data Preprocessing#

Train Test Split#

Robust Scaling of the data#

Load Pre-Trained Model#

Initializing TSICE Explainer#

Compute TSICE Explanation#

Plot TSICE Explanation#