{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# The Keras Functional API\n", "> In this chapter, you'll become familiar with the basics of the Keras functional API. You'll build a simple functional network using functional building blocks, fit it to data, and make predictions. This is a summary of the lecture \"Advanced Deep Learning with Keras\" from DataCamp.\n", "\n", "- toc: true \n", "- badges: true\n", "- comments: true\n", "- author: Chanseok Kang\n", "- categories: [Python, Datacamp, Tensorflow-Keras, Deep_Learning]\n", "- image: images/plot_model.png" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import tensorflow as tf\n", "import numpy as np\n", "import pandas as pd\n", "import matplotlib.pyplot as plt\n", "\n", "plt.rcParams['figure.figsize'] = (8, 8)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Keras input and dense layers\n", "- Inputs and outputs\n", " - Input layer\n", " - Output layer" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Input layers\n", "The first step in creating a neural network model is to define the Input layer. This layer takes in raw data, usually in the form of NumPy arrays. The shape of the Input layer defines how many variables your neural network will use. For example, if the input data has 10 columns, you define an Input layer with a shape of `(10,)`.\n", "\n", "In this case, you are only using one input in your network." 
] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "from tensorflow.keras.layers import Input\n", "\n", "# Create an input layer of shape (1,)\n", "input_tensor = Input(shape=(1, ))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Dense layers\n", "Once you have an Input layer, the next step is to add a Dense layer.\n", "\n", "Dense layers learn a weight matrix, where the first dimension of the matrix is the dimension of the input data, and the second dimension is the dimension of the output data. Recall that your Input layer has a shape of 1. In this case, your output layer will also have a shape of 1. This means that the Dense layer will learn a 1x1 weight matrix, plus a single bias term (two trainable parameters in total).\n", "\n", "In this exercise, you will add a dense layer to your model, after the input layer." ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [], "source": [ "from tensorflow.keras.layers import Dense\n", "\n", "# Input layer\n", "input_tensor = Input(shape=(1, ))\n", "\n", "# Dense layer\n", "output_layer = Dense(1)\n", "\n", "# Connect the dense layer to the input_tensor\n", "output_tensor = output_layer(input_tensor)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Output layers\n", "Output layers are simply Dense layers! Output layers are used to reduce the dimension of the inputs to the dimension of the outputs. You'll learn more about output dimensions in chapter 4, but for now, you'll always use a single output in your neural networks, which is equivalent to `Dense(1)` or a dense layer with a single unit." 
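, "\n", "As a rough illustration of what such a single-unit layer computes, here is a minimal pure-Python sketch. It assumes the Keras defaults for `Dense` (a bias term and no activation function); `dense_forward` and `dense_param_count` are hypothetical helper names used only for this example, not part of Keras.\n", "\n", "
```python
# Minimal sketch of a fully connected (dense) layer in plain Python.
# dense_forward and dense_param_count are hypothetical helpers, not Keras functions.

def dense_forward(inputs, weights, biases):
    # inputs: list of n_inputs numbers
    # weights: nested list with n_inputs rows and n_units columns
    # biases: list of n_units numbers
    n_units = len(biases)
    return [
        sum(x * row[j] for x, row in zip(inputs, weights)) + biases[j]
        for j in range(n_units)
    ]

def dense_param_count(n_inputs, n_units, use_bias=True):
    # One weight per (input, unit) pair, plus one bias per unit if enabled.
    return n_inputs * n_units + (n_units if use_bias else 0)

# One input and one unit: a 1x1 weight matrix and a single bias.
output = dense_forward([3.0], [[2.0]], [0.5])
print(output)                    # [6.5]
print(dense_param_count(1, 1))   # 2
print(dense_param_count(10, 1))  # 11
```
", "\n", "With one input and one unit, the count comes to 2: one weight plus one bias."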
] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [], "source": [ "# Input layer\n", "input_tensor = Input(shape=(1, ))\n", "\n", "# Create a dense layer and connect the dense layer to the input_tensor in one step\n", "# Note that we did this in 2 steps in the previous exercise, but are doing it in one step now\n", "output_tensor = Dense(1)(input_tensor)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Build and compile a model\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Build a model\n", "Once you've defined an input layer and an output layer, you can build a Keras model. The model object is how you tell Keras where the model starts and stops: where data comes in and where predictions come out.\n", "\n" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [], "source": [ "from tensorflow.keras.models import Model\n", "\n", "input_tensor = Input(shape=(1, ))\n", "output_tensor = Dense(1)(input_tensor)\n", "\n", "# Build the model\n", "model = Model(input_tensor, output_tensor)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Compile a model\n", "The final step in creating a model is compiling it: you have to compile a model before you can fit it to data. Compiling sets the training configuration (the optimizer and the loss function) and prepares the model to meet some data!\n", "\n", "During compilation, you specify the optimizer to use for fitting the model to the data, and a loss function. `'adam'` is a good default optimizer to use, and will generally work well. The right loss function depends on the problem at hand. Mean squared error is a common loss function and will optimize for predicting the mean, as is done in least squares regression.\n", "\n", "Mean absolute error optimizes for predicting the median, as is done in median regression (quantile regression at the 0.5 quantile). For this dataset, `'mean_absolute_error'` works pretty well, so use it as your loss function." 
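, "\n", "To see why mean squared error targets the mean while mean absolute error targets the median, the sketch below searches for the single constant prediction that minimizes each loss on a small made-up sample. The numbers are illustrative only and are not taken from the basketball data; `mse` and `mae` are hypothetical helpers written for this example, not Keras functions.\n", "\n", "
```python
import statistics

def mse(values, c):
    # Mean squared error of predicting the constant c for every value.
    return sum((v - c) ** 2 for v in values) / len(values)

def mae(values, c):
    # Mean absolute error of predicting the constant c for every value.
    return sum(abs(v - c) for v in values) / len(values)

# Made-up sample with one large outlier.
values = [1, 2, 3, 4, 100]

# Try every integer constant from 0 to 100 and keep the best one per loss.
candidates = range(0, 101)
best_mse = min(candidates, key=lambda c: mse(values, c))
best_mae = min(candidates, key=lambda c: mae(values, c))

print(best_mse)  # 22, the mean of the sample
print(best_mae)  # 3, the median of the sample
```
", "\n", "The outlier pulls the MSE-optimal constant far from most of the sample, while the MAE-optimal constant stays at the middle value."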
] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "# Compile the model\n", "model.compile(optimizer='adam', loss='mean_absolute_error')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Visualize a model\n", "Now that you've compiled the model, take a look at the result of your hard work! You can do this by looking at the model summary, as well as its plot.\n", "\n", "The summary will tell you the names of the layers, as well as how many units they have and how many parameters are in the model." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "> Note: Before using `plot_model`, you need to install pydot, pydotplus, and graphviz. After installing them, restart the kernel.\n", "```\n", "sudo apt install graphviz\n", "pip install pydot pydotplus graphviz\n", "```" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Model: \"model_1\"\n", "_________________________________________________________________\n", "Layer (type) Output Shape Param # \n", "=================================================================\n", "input_5 (InputLayer) [(None, 1)] 0 \n", "_________________________________________________________________\n", "dense_3 (Dense) (None, 1) 2 \n", "=================================================================\n", "Total params: 2\n", "Trainable params: 2\n", "Non-trainable params: 0\n", "_________________________________________________________________\n" ] }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "from tensorflow.keras.utils import plot_model\n", "\n", "# Summarize the model\n", "model.summary()\n", "\n", "# Plot the model\n", "plot_model(model, to_file='../images/plot_model.png')\n", "\n", "# Display the image\n", "data = plt.imread('../images/plot_model.png')\n", "plt.imshow(data);" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Fit and evaluate a model\n", "- Basketball Data\n", " - Goal: Predict tournament outcomes\n", " - Data Available: team ratings from the tournament organizers\n", "- Input\n", " - Seed difference (`seed_diff`)\n", "- Output\n", " - Score difference (`score_diff`)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Fit the model to the tournament basketball data\n", "Now that the model is compiled, you are ready to fit it to some data!\n", "\n", "In this exercise, you'll use a dataset of scores from US College Basketball tournament games. Each row of the dataset has the team ids: `team_1` and `team_2`, as integers. It also has the seed difference between the teams (seeds are assigned by the tournament committee and represent a ranking of how strong the teams are) and the score difference of the game (e.g. if `team_1` wins by 5 points, the score difference is 5).\n", "\n", "To fit the model, you provide a matrix of X variables (in this case one column: the seed difference) and a matrix of Y variables (in this case one column: the score difference)." ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " season team_1 team_2 home seed_diff score_diff score_1 score_2 won\n", "0 1985 288 73 0 -3 -9 41 50 0\n", "1 1985 5929 73 0 4 6 61 55 1\n", "2 1985 9884 73 0 5 -4 59 63 0\n", "3 1985 73 288 0 3 9 50 41 1\n", "4 1985 3920 410 0 1 -9 54 63 0" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "games_tourney = pd.read_csv('./dataset/games_tourney.csv')\n", "games_tourney.head()" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [], "source": [ "from sklearn.model_selection import train_test_split\n", "\n", "games_tourney_train, games_tourney_test = train_test_split(games_tourney, test_size=0.3)" ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [], "source": [ "input_tensor = Input(shape=(1, ))\n", "output_tensor = Dense(1)(input_tensor)\n", "\n", "model = Model(input_tensor, output_tensor)\n", "model.compile(optimizer='adam', loss='mean_absolute_error')" ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "21/21 [==============================] - 0s 8ms/step - loss: 9.5143 - val_loss: 9.5148\n" ] } ], "source": [ "# Now fit the model\n", "model.fit(games_tourney_train['seed_diff'], games_tourney_train['score_diff'],\n", " epochs=1,\n", " batch_size=128,\n", " validation_split=0.1,\n", " verbose=True);" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Evaluate the model on a test set\n", "After fitting the model, you can evaluate it on new data. You will give the model a new `X` matrix (also called test data), allow it to make predictions, and then compare to the known `y` variable (also called target data).\n", "\n", "In this case, you'll use data from the post-season tournament to evaluate your model. 
The test games were held out by `train_test_split` above and were not used to train your model, so the loss on them is a reasonable estimate of how well your model performs out-of-sample." ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "9.419431686401367\n" ] } ], "source": [ "# Load the X variable from the test data\n", "X_test = games_tourney_test['seed_diff']\n", "\n", "# Load the y variable from the test data\n", "y_test = games_tourney_test['score_diff']\n", "\n", "# Evaluate the model on the test data\n", "print(model.evaluate(X_test, y_test, verbose=False))" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.6" } }, "nbformat": 4, "nbformat_minor": 4 }