{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "## Approach\n", "**[Fashion-MNIST](https://github.com/zalandoresearch/fashion-mnist)** is a dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes. The dataset serves as a direct drop-in replacement for the original [MNIST dataset](http://yann.lecun.com/exdb/mnist/) for benchmarking machine learning algorithms. It shares the same image size and structure of training and testing splits.\n", "\n", "In this work, I train a Convolutional Neural Network (CNN) classifier with 3 convolution layers using the Keras deep learning library. The model is compiled with the `categorical_crossentropy` loss function and the `Adam` optimizer, and is first trained for 10 epochs with a batch size of 256. I then add **data augmentation**, which generates new training samples by rotating, shifting, shearing, and zooming the existing ones, and train for another 50 epochs.\n", "\n", "I first split the original training data (60,000 images) into 80% training (48,000 images) and 20% validation (12,000 images) to optimize the classifier, while keeping the test data (10,000 images) to finally evaluate the accuracy of the model on data it has never seen. Comparing training and validation accuracy shows whether I'm overfitting: if validation accuracy is still at or above training accuracy, I can keep training for more epochs (possibly with a lower learning rate); if training accuracy climbs well above validation accuracy, I should stop before the model over-trains." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/site-packages/h5py/__init__.py:36: FutureWarning: Conversion of the second argument of issubdtype from `float` to `np.floating` is deprecated. 
In future, it will be treated as `np.float64 == np.dtype(float).type`.\n", " from ._conv import register_converters as _register_converters\n", "Using TensorFlow backend.\n" ] } ], "source": [ "import numpy as np\n", "import pandas as pd\n", "from keras.utils import to_categorical\n", "from sklearn.model_selection import train_test_split\n", "\n", "# Load training and test data into dataframes\n", "data_train = pd.read_csv('data/fashion-mnist_train.csv')\n", "data_test = pd.read_csv('data/fashion-mnist_test.csv')\n", "\n", "# X holds the training images (pixel columns); y holds the one-hot encoded labels (first column)\n", "X = np.array(data_train.iloc[:, 1:])\n", "y = to_categorical(np.array(data_train.iloc[:, 0]))\n", "\n", "# Split the original training data into sub-training (80%) and validation (20%) sets\n", "X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=13)\n", "\n", "# X_test holds the test images, and y_test holds the one-hot encoded test labels\n", "X_test = np.array(data_test.iloc[:, 1:])\n", "y_test = to_categorical(np.array(data_test.iloc[:, 0]))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Processing Data\n", "After loading and splitting the data, I preprocess the images by reshaping them into the shape the network expects and scaling them so that all values are in the [0, 1] interval. For instance, the training images come out of the CSV as an integer array of shape (48000, 784) with values in the [0, 255] interval. I transform it into a float32 array of shape (48000, 28, 28, 1) with values between 0 and 1." 
] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": true }, "outputs": [], "source": [ "# Each image's dimension is 28 x 28\n", "img_rows, img_cols = 28, 28\n", "input_shape = (img_rows, img_cols, 1)\n", "\n", "# Prepare the training images: reshape to (n, 28, 28, 1), cast to float32, scale to [0, 1]\n", "X_train = X_train.reshape(X_train.shape[0], img_rows, img_cols, 1)\n", "X_train = X_train.astype('float32')\n", "X_train /= 255\n", "\n", "# Prepare the test images\n", "X_test = X_test.reshape(X_test.shape[0], img_rows, img_cols, 1)\n", "X_test = X_test.astype('float32')\n", "X_test /= 255\n", "\n", "# Prepare the validation images\n", "X_val = X_val.reshape(X_val.shape[0], img_rows, img_cols, 1)\n", "X_val = X_val.astype('float32')\n", "X_val /= 255" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## CNN with 3 Convolutional Layers\n", "This CNN takes as input tensors of shape *(image_height, image_width, image_channels)*. In this case, I configure the CNN to process inputs of size *(28, 28, 1)*, which is the format of the Fashion-MNIST images. I do this by passing the argument *input_shape=(28, 28, 1)* to the first layer.\n", "\n", "* The 1st layer is a *Conv2D* layer for the **convolution** operation, which extracts features from the input images by sliding a convolution filter over the input to produce a feature map. Here I choose 32 filters of size 3 x 3. \n", "* The 2nd layer is a *MaxPooling2D* layer for the **max-pooling** operation, which reduces the spatial dimensions of each feature map, helping shorten training time and reduce the number of parameters in later layers. Here I choose a pooling window of size 2 x 2.\n", "* To combat overfitting, I add a *Dropout* layer as the 3rd layer. **Dropout** is a powerful regularization technique: it forces the model to learn multiple independent representations of the same data by randomly disabling neurons during training. 
In this model, dropout will randomly disable 25% of the outputs.\n", "* I repeat these steps to add more hidden layers: 2 *Conv2D* layers, 1 *MaxPooling2D* layer, and 2 *Dropout* layers (disabling 25% and 40% of the outputs, respectively).\n", "* The next step is to feed the last output tensor into a stack of *Dense* layers, otherwise known as **fully-connected** layers. These densely connected classifiers process vectors, which are 1D, whereas the current output is a 3D tensor. Thus, I need to **flatten** the 3D outputs to 1D, and then add 2 *Dense* layers on top.\n", "* I add another *Dropout* layer between these 2 *Dense* layers to disable 30% of the outputs.\n", "* I do a 10-way classification (as there are 10 classes of fashion images), using a final layer with 10 outputs and a softmax activation. **Softmax** turns the 10 outputs into a probability distribution over the classes: each class is assigned a probability, and the class with the highest probability is the model’s prediction for the input." ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": true }, "outputs": [], "source": [ "import keras\n", "from keras.models import Sequential\n", "from keras.layers import Dense, Dropout, Flatten\n", "from keras.layers import Conv2D, MaxPooling2D\n", "\n", "cnn3 = Sequential()\n", "# Block 1: 32 filters of size 3 x 3, 2 x 2 max-pooling, 25% dropout\n", "cnn3.add(Conv2D(32, kernel_size=(3, 3), activation='relu', input_shape=input_shape))\n", "cnn3.add(MaxPooling2D((2, 2)))\n", "cnn3.add(Dropout(0.25))\n", "\n", "# Block 2: 64 filters, 2 x 2 max-pooling, 25% dropout\n", "cnn3.add(Conv2D(64, kernel_size=(3, 3), activation='relu'))\n", "cnn3.add(MaxPooling2D(pool_size=(2, 2)))\n", "cnn3.add(Dropout(0.25))\n", "\n", "# Block 3: 128 filters, no pooling, 40% dropout\n", "cnn3.add(Conv2D(128, kernel_size=(3, 3), activation='relu'))\n", "cnn3.add(Dropout(0.4))\n", "\n", "# Flatten the 3D feature maps into 1D vectors for the dense classifier\n", "cnn3.add(Flatten())\n", "\n", "cnn3.add(Dense(128, activation='relu'))\n", "cnn3.add(Dropout(0.3))\n", "cnn3.add(Dense(10, activation='softmax'))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "When compiling the model, I choose **categorical_crossentropy** as the loss function (which is relevant for 
multiclass, single-label classification problems) and the **Adam** optimizer.\n", "* The cross-entropy loss measures how far the predicted class probabilities are from the true one-hot labels. The formula for calculating cross-entropy loss is given [here](https://en.wikipedia.org/wiki/Cross_entropy). The categorical variant is used because there are 10 classes to predict from. If there were only 2 classes, I would have used binary_crossentropy.\n", "* The Adam optimizer is an extension of SGD (Stochastic Gradient Descent) that adapts the step size for each weight. The optimizer is responsible for updating the weights of the neurons using the gradients computed by backpropagation: the derivative of the loss function with respect to each weight is scaled by the learning rate and subtracted from that weight. That is how a neural network learns." ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "collapsed": true }, "outputs": [], "source": [ "cnn3.compile(loss=keras.losses.categorical_crossentropy,\n", " optimizer=keras.optimizers.Adam(),\n", " metrics=['accuracy'])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let’s look at how the dimensions of the feature maps change with every successive layer:" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "_________________________________________________________________\n", "Layer (type) Output Shape Param # \n", "=================================================================\n", "conv2d_1 (Conv2D) (None, 26, 26, 32) 320 \n", "_________________________________________________________________\n", "max_pooling2d_1 (MaxPooling2 (None, 13, 13, 32) 0 \n", "_________________________________________________________________\n", "dropout_1 (Dropout) (None, 13, 13, 32) 0 \n", "_________________________________________________________________\n", "conv2d_2 (Conv2D) (None, 11, 11, 64) 18496 \n", "_________________________________________________________________\n", "max_pooling2d_2 (MaxPooling2 (None, 5, 5, 64) 0 \n", 
"_________________________________________________________________\n", "dropout_2 (Dropout) (None, 5, 5, 64) 0 \n", "_________________________________________________________________\n", "conv2d_3 (Conv2D) (None, 3, 3, 128) 73856 \n", "_________________________________________________________________\n", "dropout_3 (Dropout) (None, 3, 3, 128) 0 \n", "_________________________________________________________________\n", "flatten_1 (Flatten) (None, 1152) 0 \n", "_________________________________________________________________\n", "dense_1 (Dense) (None, 128) 147584 \n", "_________________________________________________________________\n", "dropout_4 (Dropout) (None, 128) 0 \n", "_________________________________________________________________\n", "dense_2 (Dense) (None, 10) 1290 \n", "=================================================================\n", "Total params: 241,546\n", "Trainable params: 241,546\n", "Non-trainable params: 0\n", "_________________________________________________________________\n" ] } ], "source": [ "cnn3.summary()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "* 241,546 parameters are available to be trained.\n", "* The output of the *Conv2D* and *MaxPooling2D* layers are 3D tensors of shape *(height, width, channels)*.\n", "* The number of channels is controlled by the 1st argument passed to the *Conv2D* layer (32).\n", "* The (3, 3, 128) outputs from the 3rd *Dropout* layer are flattened into vectors of shape (1152,) before going through 2 *Dense* layers.\n", "\n", "## Training the Model\n", "As previously mentioned, I train the model with batch size of 256 and 10 epochs on both training and validation data." 
] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Train on 48000 samples, validate on 12000 samples\n", "Epoch 1/10\n", "48000/48000 [==============================] - 65s 1ms/step - loss: 0.8479 - acc: 0.6865 - val_loss: 0.5098 - val_acc: 0.8076\n", "Epoch 2/10\n", "48000/48000 [==============================] - 69s 1ms/step - loss: 0.5232 - acc: 0.8047 - val_loss: 0.4146 - val_acc: 0.8526\n", "Epoch 3/10\n", "48000/48000 [==============================] - 79s 2ms/step - loss: 0.4510 - acc: 0.8366 - val_loss: 0.3688 - val_acc: 0.8669\n", "Epoch 4/10\n", "48000/48000 [==============================] - 65s 1ms/step - loss: 0.4039 - acc: 0.8529 - val_loss: 0.3481 - val_acc: 0.8741\n", "Epoch 5/10\n", "48000/48000 [==============================] - 66s 1ms/step - loss: 0.3762 - acc: 0.8612 - val_loss: 0.3221 - val_acc: 0.8810\n", "Epoch 6/10\n", "48000/48000 [==============================] - 68s 1ms/step - loss: 0.3594 - acc: 0.8696 - val_loss: 0.3105 - val_acc: 0.8869\n", "Epoch 7/10\n", "48000/48000 [==============================] - 78s 2ms/step - loss: 0.3397 - acc: 0.8778 - val_loss: 0.2960 - val_acc: 0.8923\n", "Epoch 8/10\n", "48000/48000 [==============================] - 63s 1ms/step - loss: 0.3266 - acc: 0.8810 - val_loss: 0.2847 - val_acc: 0.8977\n", "Epoch 9/10\n", "48000/48000 [==============================] - 75s 2ms/step - loss: 0.3162 - acc: 0.8836 - val_loss: 0.2884 - val_acc: 0.8947\n", "Epoch 10/10\n", "48000/48000 [==============================] - 61s 1ms/step - loss: 0.3074 - acc: 0.8878 - val_loss: 0.2700 - val_acc: 0.9028\n" ] } ], "source": [ "history3 = cnn3.fit(X_train, y_train,\n", " batch_size=256,\n", " epochs=10,\n", " verbose=1,\n", " validation_data=(X_val, y_val))" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Test loss: 0.24964626643657684\n", "Test accuracy: 
0.9079\n" ] } ], "source": [ "score3 = cnn3.evaluate(X_test, y_test, verbose=0)\n", "print('Test loss:', score3[0])\n", "print('Test accuracy:', score3[1])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "My test accuracy is 90.79%, pretty good!\n", "\n", "## Data Augmentation\n", "Overfitting can be caused by having too few samples to learn from, which makes it hard to train a model that generalizes to new data. Given infinite data, my model would be exposed to every possible aspect of the data distribution at hand: I would never overfit. \n", "\n", "**Data augmentation** takes the approach of generating more training data from existing training samples, by augmenting the samples via a number of random transformations that yield believable-looking images. The goal is that at training time, my model will never see the exact same picture twice. This helps expose the model to more aspects of the data and generalize better.\n", "\n", "In Keras, this can be done by configuring a number of random transformations to be performed on the images read by the ImageDataGenerator instance.\n", "* *rotation_range* is a value in degrees (0–180), a range within which to randomly rotate pictures.\n", "* *width_shift_range* and *height_shift_range* are ranges (as a fraction of total width or height) within which to randomly translate pictures horizontally or vertically.\n", "* *shear_range* is for randomly applying shearing transformations.\n", "* *zoom_range* is for randomly zooming inside pictures." 
] }, { "cell_type": "code", "execution_count": 8, "metadata": { "collapsed": true }, "outputs": [], "source": [ "from keras.preprocessing.image import ImageDataGenerator\n", "gen = ImageDataGenerator(rotation_range=8, width_shift_range=0.08, shear_range=0.3,\n", " height_shift_range=0.08, zoom_range=0.08)\n", "batches = gen.flow(X_train, y_train, batch_size=256)\n", "val_batches = gen.flow(X_val, y_val, batch_size=256)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let's train the network using data augmentation." ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Epoch 1/50\n", "187/187 [==============================] - 66s 355ms/step - loss: 0.4831 - acc: 0.8195 - val_loss: 0.4110 - val_acc: 0.8404\n", "Epoch 2/50\n", "187/187 [==============================] - 71s 378ms/step - loss: 0.4413 - acc: 0.8350 - val_loss: 0.3684 - val_acc: 0.8633\n", "Epoch 3/50\n", "187/187 [==============================] - 78s 416ms/step - loss: 0.4205 - acc: 0.8437 - val_loss: 0.3511 - val_acc: 0.8684\n", "Epoch 4/50\n", "187/187 [==============================] - 69s 370ms/step - loss: 0.4098 - acc: 0.8478 - val_loss: 0.3550 - val_acc: 0.8614\n", "Epoch 5/50\n", "187/187 [==============================] - 65s 348ms/step - loss: 0.3997 - acc: 0.8510 - val_loss: 0.3362 - val_acc: 0.8744\n", "Epoch 6/50\n", "187/187 [==============================] - 67s 361ms/step - loss: 0.3943 - acc: 0.8524 - val_loss: 0.3537 - val_acc: 0.8675\n", "Epoch 7/50\n", "187/187 [==============================] - 71s 377ms/step - loss: 0.3892 - acc: 0.8560 - val_loss: 0.3249 - val_acc: 0.8750\n", "Epoch 8/50\n", "187/187 [==============================] - 72s 384ms/step - loss: 0.3793 - acc: 0.8593 - val_loss: 0.3259 - val_acc: 0.8770\n", "Epoch 9/50\n", "187/187 [==============================] - 71s 382ms/step - loss: 0.3739 - acc: 0.8601 - val_loss: 0.3197 - val_acc: 0.8802\n", "Epoch 10/50\n", "187/187 
[==============================] - 75s 402ms/step - loss: 0.3700 - acc: 0.8618 - val_loss: 0.3248 - val_acc: 0.8796\n", "Epoch 11/50\n", "187/187 [==============================] - 73s 390ms/step - loss: 0.3657 - acc: 0.8648 - val_loss: 0.3177 - val_acc: 0.8790\n", "Epoch 12/50\n", "187/187 [==============================] - 63s 337ms/step - loss: 0.3607 - acc: 0.8649 - val_loss: 0.3151 - val_acc: 0.8823\n", "Epoch 13/50\n", "187/187 [==============================] - 64s 340ms/step - loss: 0.3581 - acc: 0.8665 - val_loss: 0.3046 - val_acc: 0.8869\n", "Epoch 14/50\n", "187/187 [==============================] - 66s 352ms/step - loss: 0.3577 - acc: 0.8649 - val_loss: 0.2992 - val_acc: 0.8876\n", "Epoch 15/50\n", "187/187 [==============================] - 64s 340ms/step - loss: 0.3500 - acc: 0.8686 - val_loss: 0.3014 - val_acc: 0.8867\n", "Epoch 16/50\n", "187/187 [==============================] - 63s 337ms/step - loss: 0.3497 - acc: 0.8711 - val_loss: 0.3065 - val_acc: 0.8849\n", "Epoch 17/50\n", "187/187 [==============================] - 66s 351ms/step - loss: 0.3547 - acc: 0.8696 - val_loss: 0.3068 - val_acc: 0.8861\n", "Epoch 18/50\n", "187/187 [==============================] - 62s 333ms/step - loss: 0.3439 - acc: 0.8707 - val_loss: 0.2992 - val_acc: 0.8887\n", "Epoch 19/50\n", "187/187 [==============================] - 65s 349ms/step - loss: 0.3445 - acc: 0.8727 - val_loss: 0.2916 - val_acc: 0.8931\n", "Epoch 20/50\n", "187/187 [==============================] - 75s 402ms/step - loss: 0.3384 - acc: 0.8734 - val_loss: 0.3072 - val_acc: 0.8845\n", "Epoch 21/50\n", "187/187 [==============================] - 63s 336ms/step - loss: 0.3402 - acc: 0.8731 - val_loss: 0.2955 - val_acc: 0.8875\n", "Epoch 22/50\n", "187/187 [==============================] - 61s 324ms/step - loss: 0.3391 - acc: 0.8756 - val_loss: 0.2951 - val_acc: 0.8912\n", "Epoch 23/50\n", "187/187 [==============================] - 64s 343ms/step - loss: 0.3352 - acc: 0.8755 - val_loss: 0.2813 - 
val_acc: 0.8937\n", "Epoch 24/50\n", "187/187 [==============================] - 61s 327ms/step - loss: 0.3328 - acc: 0.8750 - val_loss: 0.2912 - val_acc: 0.8902\n", "Epoch 25/50\n", "187/187 [==============================] - 64s 343ms/step - loss: 0.3273 - acc: 0.8774 - val_loss: 0.2873 - val_acc: 0.8952\n", "Epoch 26/50\n", "187/187 [==============================] - 66s 353ms/step - loss: 0.3306 - acc: 0.8775 - val_loss: 0.2816 - val_acc: 0.8913\n", "Epoch 27/50\n", "187/187 [==============================] - 64s 341ms/step - loss: 0.3221 - acc: 0.8790 - val_loss: 0.2978 - val_acc: 0.8876\n", "Epoch 28/50\n", "187/187 [==============================] - 64s 343ms/step - loss: 0.3290 - acc: 0.8784 - val_loss: 0.2906 - val_acc: 0.8890\n", "Epoch 29/50\n", "187/187 [==============================] - 77s 410ms/step - loss: 0.3232 - acc: 0.8812 - val_loss: 0.2892 - val_acc: 0.8894\n", "Epoch 30/50\n", "187/187 [==============================] - 69s 371ms/step - loss: 0.3232 - acc: 0.8794 - val_loss: 0.2750 - val_acc: 0.8971\n", "Epoch 31/50\n", "187/187 [==============================] - 64s 344ms/step - loss: 0.3213 - acc: 0.8821 - val_loss: 0.3017 - val_acc: 0.8852\n", "Epoch 32/50\n", "187/187 [==============================] - 63s 338ms/step - loss: 0.3244 - acc: 0.8792 - val_loss: 0.2788 - val_acc: 0.8952\n", "Epoch 33/50\n", "187/187 [==============================] - 67s 356ms/step - loss: 0.3173 - acc: 0.8820 - val_loss: 0.2817 - val_acc: 0.8931\n", "Epoch 34/50\n", "187/187 [==============================] - 67s 357ms/step - loss: 0.3161 - acc: 0.8842 - val_loss: 0.2762 - val_acc: 0.8952\n", "Epoch 35/50\n", "187/187 [==============================] - 68s 365ms/step - loss: 0.3169 - acc: 0.8837 - val_loss: 0.2794 - val_acc: 0.8969\n", "Epoch 36/50\n", "187/187 [==============================] - 78s 415ms/step - loss: 0.3197 - acc: 0.8800 - val_loss: 0.2869 - val_acc: 0.8882\n", "Epoch 37/50\n", "187/187 [==============================] - 76s 404ms/step - 
loss: 0.3168 - acc: 0.8822 - val_loss: 0.2767 - val_acc: 0.8928\n", "Epoch 38/50\n", "187/187 [==============================] - 79s 423ms/step - loss: 0.3066 - acc: 0.8846 - val_loss: 0.2743 - val_acc: 0.8975\n", "Epoch 39/50\n", "187/187 [==============================] - 66s 356ms/step - loss: 0.3132 - acc: 0.8825 - val_loss: 0.2677 - val_acc: 0.9027\n", "Epoch 40/50\n", "187/187 [==============================] - 64s 340ms/step - loss: 0.3093 - acc: 0.8850 - val_loss: 0.2735 - val_acc: 0.8946\n", "Epoch 41/50\n", "187/187 [==============================] - 66s 351ms/step - loss: 0.3074 - acc: 0.8862 - val_loss: 0.2695 - val_acc: 0.8980\n", "Epoch 42/50\n", "187/187 [==============================] - 64s 340ms/step - loss: 0.3089 - acc: 0.8860 - val_loss: 0.2713 - val_acc: 0.8992\n", "Epoch 43/50\n", "187/187 [==============================] - 68s 362ms/step - loss: 0.3082 - acc: 0.8840 - val_loss: 0.2751 - val_acc: 0.8970\n", "Epoch 44/50\n", "187/187 [==============================] - 66s 354ms/step - loss: 0.3063 - acc: 0.8849 - val_loss: 0.2619 - val_acc: 0.9019\n", "Epoch 45/50\n", "187/187 [==============================] - 64s 343ms/step - loss: 0.3051 - acc: 0.8873 - val_loss: 0.2639 - val_acc: 0.9031\n", "Epoch 46/50\n", "187/187 [==============================] - 67s 358ms/step - loss: 0.3063 - acc: 0.8856 - val_loss: 0.2689 - val_acc: 0.8987\n", "Epoch 47/50\n", "187/187 [==============================] - 79s 424ms/step - loss: 0.3087 - acc: 0.8849 - val_loss: 0.2760 - val_acc: 0.8954\n", "Epoch 48/50\n", "187/187 [==============================] - 66s 356ms/step - loss: 0.3028 - acc: 0.8869 - val_loss: 0.2690 - val_acc: 0.8967\n", "Epoch 49/50\n", "187/187 [==============================] - 65s 347ms/step - loss: 0.3028 - acc: 0.8858 - val_loss: 0.2712 - val_acc: 0.8959\n", "Epoch 50/50\n", "187/187 [==============================] - 67s 358ms/step - loss: 0.3042 - acc: 0.8880 - val_loss: 0.2675 - val_acc: 0.8990\n" ] } ], "source": [ "history3 = 
cnn3.fit_generator(batches, steps_per_epoch=48000//256, epochs=50,\n", " validation_data=val_batches, validation_steps=12000//256, use_multiprocessing=True)" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Test loss: 0.22910109297037123\n", "Test accuracy: 0.9117\n" ] } ], "source": [ "score3 = cnn3.evaluate(X_test, y_test, verbose=0)\n", "print('Test loss:', score3[0])\n", "print('Test accuracy:', score3[1])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Okay, I improved the accuracy to 91.17%!\n", "\n", "## Results\n", "Let's plot training and validation accuracy as well as training and validation loss." ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "import matplotlib.pyplot as plt\n", "%matplotlib inline\n", "\n", "accuracy = history3.history['acc']\n", "val_accuracy = history3.history['val_acc']\n", "loss = history3.history['loss']\n", "val_loss = history3.history['val_loss']\n", "epochs = range(len(accuracy))\n", "\n", "plt.plot(epochs, accuracy, 'bo', label='Training accuracy')\n", "plt.plot(epochs, val_accuracy, 'b', label='Validation accuracy')\n", "plt.title('Training and validation accuracy')\n", "plt.legend()\n", "plt.figure()\n", "\n", "plt.plot(epochs, loss, 'bo', label='Training loss')\n", "plt.plot(epochs, val_loss, 'b', label='Validation loss')\n", "plt.title('Training and validation loss')\n", "plt.legend()\n", "plt.show()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "These plots look decent: The training curves are closely tracking the validation curves.\n", "\n", "## Classification Report\n", "I can summarize the performance of my classifier as follows:" ] }, { "cell_type": 
"code", "execution_count": 12, "metadata": { "collapsed": true }, "outputs": [], "source": [ "# get the predictions for the test data\n", "predicted_classes = cnn3.predict_classes(X_test)\n", "\n", "# get the indices to be plotted\n", "y_true = data_test.iloc[:, 0]\n", "correct = np.nonzero(predicted_classes==y_true)[0]\n", "incorrect = np.nonzero(predicted_classes!=y_true)[0]" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " precision recall f1-score support\n", "\n", " Class 0 0.85 0.86 0.86 1000\n", " Class 1 0.99 0.99 0.99 1000\n", " Class 2 0.92 0.83 0.87 1000\n", " Class 3 0.93 0.94 0.93 1000\n", " Class 4 0.88 0.83 0.85 1000\n", " Class 5 0.98 0.98 0.98 1000\n", " Class 6 0.68 0.78 0.73 1000\n", " Class 7 0.95 0.96 0.96 1000\n", " Class 8 0.99 0.99 0.99 1000\n", " Class 9 0.98 0.96 0.97 1000\n", "\n", "avg / total 0.91 0.91 0.91 10000\n", "\n" ] } ], "source": [ "from sklearn.metrics import classification_report\n", "target_names = [\"Class {}\".format(i) for i in range(10)]\n", "print(classification_report(y_true, predicted_classes, target_names=target_names))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "It's apparent that the classifier is underperforming for class 6 in terms of both precision and recall. For class 0, the classifier is slightly lacking precision; whereas for class 2 and 4, it is slightly lacking recall.\n", "\n", "Perhaps I would gain more insight after visualizing the correct and incorrect predictions.\n", "\n", "Here is a subset of correctly predicted classes." 
] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# Plot the first 9 correctly classified test images (idx avoids overwriting the `correct` array)\n", "for i, idx in enumerate(correct[:9]):\n", "    plt.subplot(3,3,i+1)\n", "    plt.imshow(X_test[idx].reshape(28,28), cmap='gray', interpolation='none')\n", "    plt.title(\"Predicted {}, Class {}\".format(predicted_classes[idx], y_true[idx]))\n", "plt.tight_layout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "And here is a subset of incorrectly predicted classes:" ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# Plot the first 9 misclassified test images (idx avoids overwriting the `incorrect` array)\n", "for i, idx in enumerate(incorrect[0:9]):\n", "    plt.subplot(3,3,i+1)\n", "    plt.imshow(X_test[idx].reshape(28,28), cmap='gray', interpolation='none')\n", "    plt.title(\"Predicted {}, Class {}\".format(predicted_classes[idx], y_true[idx]))\n", "plt.tight_layout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Visualizing What My Model Learns\n", "It’s often said that deep-learning models are “black boxes”: learning representations that are difficult to extract and present in a human-readable form. Although this is partially true for certain types of deep-learning models, it’s definitely not true for convnets. The representations learned by convnets are highly amenable to visualization, in large part because they’re representations of visual concepts.\n", "\n", "Here I attempt to visualize the intermediate CNN outputs (intermediate activations). Visualizing intermediate activations consists of displaying the feature maps that are output by various convolution and pooling layers in a network, given a certain input (the output of a layer is often called its *activation*, the output of the activation function). 
This gives a view into how an input is decomposed into the different filters learned by the network. \n", "\n", "I want to visualize feature maps with three dimensions: width, height, and depth (channels). Each channel encodes relatively independent features, so the proper way to visualize these feature maps is by independently plotting the contents of every channel as a 2D image.\n", "\n", "I first get an input test image (#1994)." ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAP8AAAD8CAYAAAC4nHJkAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMS4yLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvNQv5yAAAD91JREFUeJzt3WuMXeV1xvFnzczx+A62IbZjrJoQq4kDihNGTtSgipYQAUE1URWEI0VGQTWVgtRU+RBEPxSpX1BaSKOoQnKCgxOlkKiEglraQKwmJC2lDIhrCNca2YMvGBtsbOy5nNUPs4kGM3vtw7m76/+TRnNmr7PPWd4zj8/lPft9zd0FIJ+BXjcAoDcIP5AU4QeSIvxAUoQfSIrwA0kRfiApwg8kRfiBpIa6eWdzbNjnakE37xJI5biOatxPWCPXbSn8ZnaJpG9LGpT0PXe/Kbr+XC3Qp+yiVu4SQOBh39HwdZt+2m9mg5L+QdKlktZJ2mRm65q9PQDd1cpr/g2SXnT3l919XNKdkja2py0AndZK+FdJ2jXj593Ftncxsy1mNmpmoxM60cLdAWinjr/b7+5b3X3E3UdqGu703QFoUCvhH5O0esbPZxXbAJwCWgn/I5LWmtnZZjZH0lWS7m1PWwA6remhPnefNLPrJP1M00N929z9mbZ1BqCjWhrnd/f7JN3Xpl4AdBEf7wWSIvxAUoQfSIrwA0kRfiApwg8kRfiBpAg/kBThB5Ii/EBShB9IivADSRF+ICnCDyRF+IGkCD+QFOEHkiL8QFKEH0iK8ANJEX4gKcIPJEX4gaQIP5AU4QeSIvxAUoQfSIrwA0kRfiCpllbpNbOdko5ImpI06e4j7WgKQOe1FP7CH7n7gTbcDoAu4mk/kFSr4XdJ95vZo2a2pR0NAeiOVp/2X+DuY2b2AUkPmNlv3f3BmVco/lPYIklzNb/FuwPQLi098rv7WPF9v6S7JW2Y5Tpb3X3E3UdqGm7l7gC0UdPhN7MFZrboncuSPifp6XY1BqCzWnnav1zS3Wb2zu38o7v/e1u6AtBxTYff3V+W9PE29gKgixjqA5Ii/EBShB9IivADSRF+ICnCDyRF+IGkCD+QFOEHkiL8QFKEH0iK8ANJEX4gKcIPJNWO2XvRYzZU/mv0ycnWbvv8j4X1g+ctDutLbn+o+TsfGIzr9anO3bbXK+oe16fnuSgvD9XKb3piPL7tNuGRH0iK8ANJEX4gKcIPJEX4gaQIP5AU4QeSYpz/VFAxZhyN5Q+t+mC878J4CbXnv7QorP/ii38b1rf88+dLa1NvvBnuWzmO38JYvQ1W7dv8MZ++Qvw5gG6N5Ud45AeSIvxAUoQfSIrwA0kRfiApwg8kRfiBpCrH+c1sm6TLJe1393OLbUsl/VjSGkk7JV3p7oc612ZyVeeOB+PdY3+6Jtx13qX7wvrQI
/F49w1jl4X1fT9cXlpb8ecLwn0nx14N662cz9/pcfbBxfE8By99o3yehHl742O+/Dv/1VRPJ2vkkf92SZectO16STvcfa2kHcXPAE4hleF39wclHTxp80ZJ24vL2yVd0ea+AHRYs6/5l7v7nuLyXknlz+0A9KWW3/Bzd5dU+qLUzLaY2aiZjU7oRKt3B6BNmg3/PjNbKUnF9/1lV3T3re4+4u4jNQ03eXcA2q3Z8N8raXNxebOke9rTDoBuqQy/md0h6SFJv29mu83sGkk3SbrYzF6Q9NniZwCnkMpxfnffVFK6qM299FYn54hv0eQfnx/WX/5SMC48EZ93/oG/WRjWP3LL02H9nPmvhfWNyx4vrQ3+Ip4b/y9/dVVYX/Wv8e9s4b+U37fq8Wcn6hvWhfUXvhJH53sXfj+s75r4bWnt6sWlr6IlSRc9c01pzf+n8XUS+IQfkBThB5Ii/EBShB9IivADSRF+IKn+mrq7Yorqlm66YqrmVpeyjgzMj6fH3vuV9WH9tD+JT209b/h4ae2lA8vCfVfcvCusjyzeGdYfeuOcsP7m5LzS2vBAfMx/+dm/D+vzL47/Xp74ZvlptXNtItz33Dm/DOuvTsZDhbcf+oOwPqjyYc67Bsp/n5L0xofnlNamnmg8QzzyA0kRfiApwg8kRfiBpAg/kBThB5Ii/EBS/TXOXzVFdSs33cFx/OOXbwjrY5viMeVrP/6zsP7zfR8N6/VgOemL1zwX7ltlx4GPhPVlw8fC+sHx8um5zxh+K9z3+4fi43pG7UhYXzOn/HTj0WMfCve9de+qsD5ejz83clotHquPrDotngX/8IVvl9am7o9Pk56JR34gKcIPJEX4gaQIP5AU4QeSIvxAUoQfSKq/xvlbYOeXL3ksSa98/rSwfvUXHwjrd75cPn322yfKx10l6bqP/SqsP/j62rC+funusL7/xKLS2qtvx//u6DMCknT2gtfD+oTH492TQT36DIAkPXc8XgJyYS1e/u3fxs8trQ1ZPB4+dyj+bMaAVUz9XXFchwbKp4L/77fjORIWLSz/exscZJwfQAXCDyRF+IGkCD+QFOEHkiL8QFKEH0iqcpzfzLZJulzSfnc/t9h2o6Q/k/TOCdM3uPt9lbc1d1iDaz5cWj/2nfic+90HTi+tnbkkPrf7rDmHw/r9FefML10QnLceD1dXjuOvmBf3fuBEvIz2q0fLx/KPTpTP8S5JH12yL6xXjWevqMXHtRbMzb908Gi479zT47H20yv2r6l8LH3FUHzMT69YU2CRxY+bj42Xf/ZCko57rbS2YjA+pv8055Oltarf17uu28B1bpd0ySzbv+Xu64uvyuAD6C+V4Xf3ByUd7EIvALqoldf815nZk2a2zcyWtK0jAF3RbPhvlXSOpPWS9ki6ueyKZrbFzEbNbHR8Mp7vDUD3NBV+d9/n7lPuXpf0XUmlMy26+1Z3H3H3kTlD8YKVALqnqfCb2coZP35B0tPtaQdAtzQy1HeHpAslnWFmuyX9taQLzWy9JJe0U9K1HewRQAdUht/dN82y+bam7m18Qr6rfK35qVvPC3cfWl9+bvjeVfE/ZfXq+Lz0T525M6yvnVc+Hr5sMJ5//pxa+fzxkrR0MB7PPlqPn6BNBE/gqs4rr/J6PX6ptmtiWVg/Xi8fzz44FX9A4vnDK8L6kcnhsD4+Fc81EDk+Vd63JB04Fvc+ORX/zt44VL7/QC0+J/+DPwk+u/Fa41N08Ak/ICnCDyRF+IGkCD+QFOEHkiL8QFLmHVwW+2SnDSzzT8+9rLReH4+HvFQvP0Wznw0uWxpfYUk8vbZNVvy7J4LTT+vxsFH9zfj00foxPpJ9KnnYd+iwH2xofJdHfiApwg8kRfiBpAg/kBThB5Ii/EBShB9IqrtLdJtJA+X/39j568LdJxfGp1lGaoeOh3U7Fi/3bMfK9/ej8Vh41Vi6DsenBNtgxf/RwTGtYvPnhfWhxfEU1Jobn1ZbXxicElzRttcqT
sm1eDjbJss/4+AV+2qw+duWJK/YP/qdTQ3H/+6hw+V/i/b8f8b3O7OFhq8J4P8Vwg8kRfiBpAg/kBThB5Ii/EBShB9Iqqvj/F6vx+eHP/JUuH80+jkwP55iemBJ+fLeklRfsjiuLy4fD6/Xzgj3rRrztal4TgWrmHPBTpSf728T8VwAVb15RW+V4+Fvj5fXquZvqOC1Fv58h+Oly4PVvSVJdryi97nxZ1Jssnx/O1HxmPy/Y+W1E+XH+2Q88gNJEX4gKcIPJEX4gaQIP5AU4QeSIvxAUpUDpWa2WtIPJC2X5JK2uvu3zWyppB9LWiNpp6Qr3f1Q51qNVc0vXzn//Fj50uFVqiZJb22R7GrRSHz3VmVAP3CP5xmYqZFH/klJX3f3dZI+LemrZrZO0vWSdrj7Wkk7ip8BnCIqw+/ue9z9seLyEUnPSlolaaOk7cXVtku6olNNAmi/9/Wa38zWSPqEpIclLXf3PUVpr6ZfFgA4RTQcfjNbKOkuSV9z93dNSufTC/7N+vLSzLaY2aiZjU4onicPQPc0FH4zq2k6+D9y958Wm/eZ2cqivlLS/tn2dfet7j7i7iM1xZM9AuieyvCbmUm6TdKz7n7LjNK9kjYXlzdLuqf97QHolEbOifyMpC9LesrMHi+23SDpJkk/MbNrJL0i6crOtAigEyrD7+6/VvlQ9UXtbQdAt/AJPyApwg8kRfiBpAg/kBThB5Ii/EBShB9IivADSRF+ICnCDyRF+IGkCD+QFOEHkiL8QFKEH0iK8ANJEX4gKcIPJEX4gaQIP5AU4QeSIvxAUoQfSIrwA0kRfiApwg8kRfiBpAg/kBThB5Ii/EBSleE3s9Vm9h9m9hsze8bM/qLYfqOZjZnZ48XXZZ1vF0C7DDVwnUlJX3f3x8xskaRHzeyBovYtd/+7zrUHoFMqw+/ueyTtKS4fMbNnJa3qdGMAOut9veY3szWSPiHp4WLTdWb2pJltM7MlJftsMbNRMxud0ImWmgXQPg2H38wWSrpL0tfc/bCkWyWdI2m9pp8Z3Dzbfu6+1d1H3H2kpuE2tAygHRoKv5nVNB38H7n7TyXJ3fe5+5S71yV9V9KGzrUJoN0aebffJN0m6Vl3v2XG9pUzrvYFSU+3vz0AndLIu/2fkfRlSU+Z2ePFthskbTKz9ZJc0k5J13akQwAd0ci7/b+WZLOU7mt/OwC6hU/4AUkRfiApwg8kRfiBpAg/kBThB5Ii/EBShB9IivADSRF+ICnCDyRF+IGkCD+QFOEHkjJ3796dmb0m6ZUZm86QdKBrDbw//dpbv/Yl0Vuz2tnb77n7mY1csavhf8+dm426+0jPGgj0a2/92pdEb83qVW887QeSIvxAUr0O/9Ye33+kX3vr174kemtWT3rr6Wt+AL3T60d+AD3Sk/Cb2SVm9pyZvWhm1/eihzJmttPMnipWHh7tcS/bzGy/mT09Y9tSM3vAzF4ovs+6TFqPeuuLlZuDlaV7euz6bcXrrj/tN7NBSc9LuljSbkmPSNrk7r/paiMlzGynpBF37/mYsJn9oaS3JP3A3c8ttn1T0kF3v6n4j3OJu3+jT3q7UdJbvV65uVhQZuXMlaUlXSHpavXw2AV9XakeHLdePPJvkPSiu7/s7uOS7pS0sQd99D13f1DSwZM2b5S0vbi8XdN/PF1X0ltfcPc97v5YcfmIpHdWlu7psQv66olehH+VpF0zft6t/lry2yXdb2aPmtmWXjczi+XFsumStFfS8l42M4vKlZu76aSVpfvm2DWz4nW78Ybfe13g7p+UdKmkrxZPb/uST79m66fhmoZWbu6WWVaW/p1eHrtmV7xut16Ef0zS6hk/n1Vs6wvuPlZ83y/pbvXf6sP73lkktfi+v8f9/E4/rdw828rS6oNj108rXvci/I9IWmtmZ5vZHElXSbq3B328h5ktKN6IkZktkPQ59d/qw/dK2lxc3izpnh728i79snJz2crS6vGx67sVr92961+SLtP0O/4vS
fqrXvRQ0teHJD1RfD3T694k3aHpp4ETmn5v5BpJyyTtkPSCpJ9LWtpHvf1Q0lOSntR00Fb2qLcLNP2U/klJjxdfl/X62AV99eS48Qk/ICne8AOSIvxAUoQfSIrwA0kRfiApwg8kRfiBpAg/kNT/AfTO5GiwjYv/AAAAAElFTkSuQmCC\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "test_im = X_train[1994]\n", "plt.imshow(test_im.reshape(28, 28), cmap='viridis', interpolation='none')\n", "plt.show()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "To extract the feature maps I want to look at, I create a Keras model that takes batches of images as input and outputs the activations of the convolution and pooling layers. To do this, I use the Keras `Model` class. A model is instantiated using two arguments: an input tensor (or list of input tensors) and an output tensor (or list of output tensors). The resulting object is a Keras model that maps the specified inputs to the specified outputs. When fed an image, this model returns the values of the layer activations in the original model."
] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/site-packages/ipykernel_launcher.py:6: UserWarning: Update your `Model` call to the Keras 2 API: `Model(inputs=Tensor(\"co..., outputs=[" ] }, "execution_count": 17, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAQQAAAECCAYAAAAYUakXAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMS4yLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvNQv5yAAADdxJREFUeJzt3V+MHeV9xvHnsb1rk7Ud2yK4juvECUKpojS1o40VFVQRpUEUqQJuaIkUOVIkIwUkI+WiiIuGm0pWGqCp1KCaYsWViNtUQOEChbgWEkFBhDWxbGOnEBG72FnbuC7YcTH779eLHf9y6nhnhvNvzsL3I63OOfO+Z97fGdbPzsx5mXFECAAkaUHTBQAYHAQCgEQgAEgEAoBEIABIBAKA1Fgg2L7R9n/a/qXte5qqow7bR2wfsL3P9ljT9bSyvcP2KdsHW5atsr3b9mvF48oma7xojlrvs3282Lb7bN/UZI1FTetsP2v7kO1XbG8tlg/cdi2pta3t6ibmIdheKOlVSV+WdEzSS5Juj4hDfS+mBttHJI1GxOmma7mU7T+R9BtJ/xwRnymWfVvSmYjYVoTtyoj4qybrLOq6XK33SfpNRHynydpa2V4jaU1EvGx7maS9km6R9DUN2HYtqfU2tbFdm9pD2CTplxHxekRMSPoXSTc3VMu8FhHPSTpzyeKbJe0snu/U7C9I4+aodeBExHhEvFw8PyfpsKS1GsDtWlJrW5oKhLWS3mh5fUwdfIg+CEk/tr3X9pami6lhdUSMF89PSFrdZDE13GV7f3FI0fhueCvb6yVtlPSiBny7XlKr1MZ25aRiPddFxOck/ZmkO4td33khZo8JB3l++kOSrpa0QdK4pPubLee3bC+V9JikuyPibGvboG3Xy9Ta1nZtKhCOS1rX8vr3i2UDKSKOF4+nJD2h2UOeQXayOLa8eIx5quF65hQRJyNiOiJmJD2sAdm2toc0+w/s0Yh4vFg8kNv1crW2u12bCoSXJF1j+xO2hyX9paSnGqqllO2R4mSNbI9IukHSwfJ3Ne4pSZuL55slPdlgLaUu/gMr3KoB2La2LekRSYcj4oGWpoHbrnPV2u52beRbBkkqvgb5O0kLJe2IiL9ppJAKtj+p2b0CSVok6QeDVKvtXZKul3SlpJOSviXp3yX9UNLHJB2VdFtENH4yb45ar9fsbm1IOiLpjpbj9EbYvk7STyQdkDRTLL5Xs8fmA7VdS2q9XW1s18YCAcDg4aQigEQgAEgEAoBEIABIBAKA1GggzJNpwJKotVeotTfarbXpPYR5s4FFrb1Crb0xLwMBwADpaGKS7RslfVezsw3/KSK2lfUfHhqJJUtW5OuJyfMaHhppe/x+otbeoNbeuLTWCxfe0sTkeVe9b1G7AxYXOfkHtVzkxPZTZRc5WbJkhT6/8RvtDgmgTS/9/Hu1+nVyyMBFToD3mU4CYb5d5ARAhbYPGeoqvv7YIkmLF3+418MB6EAnewi1LnISEdsjYjQiRufLCRngg6qTQJg3FzkBUE/bhwwRMWX7LknP6
LcXOXmla5UB6LuOziFExNOSnu5SLQAaxkxFAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQFnXyZttHJJ2TNC1pKiJGu1EUgGZ0FAiFL0bE6S6sB0DDOGQAkDoNhJD0Y9t7bW/pRkEAmtPpIcN1EXHc9lWSdtv+RUQ819qhCIotkrR48Yc7HA5AL3W0hxARx4vHU5KekLTpMn22R8RoRIwOD410MhyAHms7EGyP2F528bmkGyQd7FZhAPqvk0OG1ZKesH1xPT+IiB91pSoAjWg7ECLidUl/1MVa0K7ZUO5MRGWXyeVDlX3OrSv/lVr8dvU4I8culHfoxueVqj9zt8bphj59H8jXjgASgQAgEQgAEoEAIBEIABKBACARCABSN/73Z/SYZ6q+L69ex9RI+X/qKw6NV65jwbHjlX3G//Wzpe1np6v/Bo38Y8UHmq6ey6CF1RvFU+XtoepxYlH153FFvQsmp6vHWVg+TizqzpwJ9hAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQGJiUsNcY47NorfeKV/H+fJ2SVrwkfIL3E7VmHRUx/q/2F/a/syv91Wu47N7v1Havnqs+vN6qnrDVm3XOpOOppYtruyzYGqmtH3h6XOV6zj1xd8rbT/3sfL3T7xa728/ewgAEoEAIBEIABKBACARCAASgQAgEQgAEoEAIH0wJyb1684/NSyYqL5azv+uL59UtPjpV6sHOvpG3ZJ66rbXv1TZ55mt3y5tv2Hvlsp1zLy0orLPR39S/nswdPJs5TomP7q0ss///EH5Ha/WPF/9e7T8VxOl7V/Z+mxp+3d3VX8WiT0EAC0IBACJQACQCAQAiUAAkAgEAIlAAJAcXfguva7ly9bG5z93Z+8HqvpM/ZqHUGOc4Tf+u7LP1JH/qlvR+8KiNeUXA/nF366pXMfff2FXZZ/jk6tK27f97MbKdXzqwQuVfWb2Hars06mFq68qbX/h9L/p7clTlb+QlXsItnfYPmX7YMuyVbZ3236teFxZq2oAA63OIcP3JV0alfdI2hMR10jaU7wGMM9VBkJEPCfpzCWLb5a0s3i+U9ItXa4LQAPaPam4OiIu3i74hKTVXaoHQIM6/pYhZs9Kznl2zfYW22O2xyYmz3c6HIAeajcQTtpeI0nF46m5OkbE9ogYjYjR4aGRNocD0A/tBsJTkjYXzzdLerI75QBoUp2vHXdJekHSp2wfs/11Sdskfdn2a5L+tHgNYJ7r+8SkTRvK78pTaaa63oXvTJa2T48Md1ZDYWJ5+YUvPjR2pHId02++2ZVagDIvxh6djTOdT0wC8MFBIABIBAKARCAASAQCgEQgAEgEAoDU3xu1zNS4MUnFvIhYUH3RkRN/XH6TjnPXvlO5jiUHrqjss3bbT0vbq2/BAgwW9hAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQOrrxKSJFQt05M+XlvbxVPnEo0/uOFo5zlXfO1DRXrkK4AOJPQQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkPo6MWn41+f18b9+oaN1THWpFgC/iz0EAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAqA8H2DtunbB9sWXaf7eO29xU/N/W2TAD9UGcP4fuSbrzM8gcjYkPx83R3ywLQhMpAiIjnJJ3pQy0AGtbJOYS7bO8vDilWdq0iAI1pNxAeknS1pA2SxiXdP1dH21tsj9kem9S7bQ4HoB/aCoSIOBkR0xExI+lhSZtK+m6PiNGIGB3S4nbrBNAHbQWC7TUtL2+VdHCuvgDmj8rrIdjeJel6SVfaPibpW5Kut71BUkg6IumOHtYIoE8qAyEibr/M4kd6UAuAhjFTEUAiEAAkAgFAIhAAJAIBQ
CIQACQCAUDq641aumHh8uXVndauLm2eWjVSuYp3Vw5X9vnQf+wvbZ+5cKFyHQNj0x9WdolF1X8/Fr31TnmHyepb7XhisrxDVbskLakxTd4ub4+oXEUMD1WPMzNTXkaNbaJ3J8rrWLGsfIzXn68eQ+whAGhBIABIBAKARCAASAQCgEQgAEgEAoBEIABI825i0vTZs9WdKvpUTEeRJC2p0ad8usk887MDlV3qbLfpzitBO8ZPlDZH1LvAMXsIABKBACARCAASgQAgEQgAEoEAIBEIABKBACD1d2LS0is0s3FDaZehg78qbZ9+6+1uVgSgBXsIABKBACARCAASgQAgEQgAEoEAIBEIANLAXSBl8jOfaLoE4P3n5z+t1a1yD8H2OtvP2j5k+xXbW4vlq2zvtv1a8biyw5IBNKzOIcOUpG9GxKclfUHSnbY/LekeSXsi4hpJe4rXAOaxykCIiPGIeLl4fk7SYUlrJd0saWfRbaekW3pVJID+eE8nFW2vl7RR0ouSVkfEeNF0QlL5LZcBDLzagWB7qaTHJN0dEf/vssYREZIue+9s21tsj9kem5g831GxAHqrViDYHtJsGDwaEY8Xi0/aXlO0r5F06nLvjYjtETEaEaPDQyPdqBlAj9T5lsGSHpF0OCIeaGl6StLm4vlmSU92vzwA/VRnHsK1kr4q6YDtfcWyeyVtk/RD21+XdFTSbb0pEUC/VAZCRDyvuW/a86XulgOgSUxdBpAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAiUAAkAgEAIlAAJAIBACJQACQCAQAqTIQbK+z/aztQ7Zfsb21WH6f7eO29xU/N/W+XAC9tKhGnylJ34yIl20vk7TX9u6i7cGI+E7vygPQT5WBEBHjksaL5+dsH5a0tteFAei/93QOwfZ6SRslvVgsusv2fts7bK/scm0A+qx2INheKukxSXdHxFlJD0m6WtIGze5B3D/H+7bYHrM9NjF5vgslA+iVWoFge0izYfBoRDwuSRFxMiKmI2JG0sOSNl3uvRGxPSJGI2J0eGikW3UD6IE63zJY0iOSDkfEAy3L17R0u1XSwe6XB6Cf6nzLcK2kr0o6YHtfsexeSbfb3iApJB2RdEdPKgTQN46I/g1mvynpaMuiKyWd7lsBnaHW3qDW3ri01o9HxEeq3tTXQPidwe2xiBhtrID3gFp7g1p7o91amboMIBEIAFLTgbC94fHfC2rtDWrtjbZqbfQcAoDB0vQeAoABQiAASAQCgEQgAEgEAoD0fyBR7361nYJCAAAAAElFTkSuQmCC\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "from keras import models\n", "# extracts the outputs of the top 8 layers\n", "layer_outputs = [layer.output for layer in cnn3.layers[:8]]\n", "\n", "# creates a model that will return these outputs, given the model input\n", "activation_model = models.Model(input=cnn3.input, output=layer_outputs)\n", "\n", "# returns a list of Numpy arrays: one array per layer activation\n", "activations = activation_model.predict(test_im.reshape(1,28,28,1))\n", "\n", "# activation of the 1st convolution layer\n", 
"first_layer_activation = activations[0]\n", "\n", "# display the 3rd channel of the activation of the 1st layer of the original model\n", "plt.matshow(first_layer_activation[0, :, :, 3], cmap='viridis')" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 18, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAQQAAAECCAYAAAAYUakXAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMS4yLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvNQv5yAAADuRJREFUeJzt3W2MHeV5xvHr2vXaxm/EDmAccHixaKQ0FYZuTFtQ5YqEEtIKUCVUVEWOimQ+BAmkfCjiS1CVqKgKJP2EZAqNWxGqqEChFbShLi2pVNwY6mKDG0ypCTh+iXHALxh713v3ww53Vo53nuGcPWfO0v9PWu058zxn5t7x2cszZ559xhEhAJCkobYLADA4CAQAiUAAkAgEAIlAAJAIBACptUCwfa3tH9l+zfadbdXRhO1dtrfZ3mp7S9v1TGX7Idv7bW+fsmyZ7Wds76y+L22zxg9MU+vdtndX+3ar7evarLGqaaXtZ22/Yvtl27dXywduv9bU2tF+dRvjEGwPS3pV0uclvSXph5JujohX+l5MA7Z3SRqNiANt13Iq278p6Yikv4yIz1TL/lTSwYi4pwrbpRHxR23WWdV1ulrvlnQkIr7ZZm1T2V4haUVEvGh7saQXJN0g6csasP1aU+tN6mC/tnWEsEbSaxHxekSckPTXkq5vqZZZLSKek3TwlMXXS9pYPd6oyTdI66apdeBExJ6IeLF6fFjSDknnaQD3a02tHWkrEM6T9OaU52+pix+iD0LS922/YHt928U0sDwi9lSP90pa3mYxDdxm+6XqlKL1w/CpbF8o6TJJmzXg+/WUWqUO9isfKjZzVURcLukLkr5SHfrOCjF5TjjI49Pvl7RK0mpJeyTd2245P2d7kaRHJd0REYemtg3afj1NrR3t17YCYbeklVOen18tG0gRsbv6vl/S45o85Rlk+6pzyw/OMfe3XM+0ImJfRJyMiAlJD2hA9q3tEU3+gj0cEY9Viwdyv56u1k73a1uB8ENJl9i+yPZcSb8v6cmWaqlle2H1YY1sL5R0jaTt9a9q3ZOS1lWP10l6osVaan3wC1a5UQOwb21b0oOSdkTEfVOaBm6/Tldrp/u1lasMklRdBvm2pGFJD0XEN1oppMD2xZo8KpCkOZK+O0i12n5E0lpJZ0naJ+lrkv5W0vckfVLSG5JuiojWP8ybpta1mjysDUm7JN065Ty9FbavkvQDSdskTVSL79LkuflA7deaWm9WB/u1tUAAMHj4UBFAIhAAJAIBQCIQACQCAUBqNRBmyTBgSdTaK9TaG53W2vYRwqzZwaLWXqHW3piVgQBggHQ1MMn2tZL+TJOjDf88Iu6p6z/X82K+FubzMR3XiOZ1vP1+otbeoNbeOLXW93VUJ+K4S6/rOBA6meRkiZfFFb66o+0B6Nzm2KRDcbAYCN2cMjDJCfAR000gzLZJTgAUzOn1BqrLH+slab4W9HpzALrQzRFCo0lOImJDRIxGxOhs+UAG+P+qm0CYNZOcAGim41OGiBi3fZukf9TPJzl5ecYqA9B3XX2GEBFPSXpqhmoB0DJGKgJIBAK
ARCAASAQCgEQgAEgEAoBEIABIBAKARCAASAQCgEQgAEgEAoBEIABIBAKARCAASAQCgEQgAEgEAoBEIABIBAKARCAASAQCgEQgAEgEAoBEIABIBAKARCAASAQCgEQgAEgEAoBEIABIBAKARCAASAQCgDSnmxfb3iXpsKSTksYjYnQmigLQjq4CofJbEXFgBtYDoGWcMgBI3QZCSPq+7Rdsr5+JggC0p9tThqsiYrftcyQ9Y/u/I+K5qR2qoFgvSfO1oMvNAeilro4QImJ39X2/pMclrTlNnw0RMRoRoyOa183mAPRYx4Fge6HtxR88lnSNpO0zVRiA/uvmlGG5pMdtf7Ce70bEP8xIVQBa0XEgRMTrki6dwVrQouGlS4t94oIVxT4eO1nfYWy8uI6Tr/5PsQ96g8uOABKBACARCAASgQAgEQgAEoEAIBEIANJM/PkzWuZ55SHhQ4sX1ba/t+bi4jp2ry2/XUYOu7Y96pslSZ/8+q76DhOFsQ7oGEcIABKBACARCAASgQAgEQgAEoEAIBEIABKBACAxMOkjYGjVBcU+B9Z8vLb96CfKI4aGTkS5z1h9+7FzJ4rrOPJ79ff7Wfzk1uI64vjxYp++cWHffvYzxVUcW3FGbXsM1W9j4p+fL25D4ggBwBQEAoBEIABIBAKARCAASAQCgEQgAEgEAoDEwKSWDS8/p9jn6JoLa9vfWVX+Z5x/sH5Q0ZL/LQ8Y+ti2d4p9jvzSmbXtEyPDxXX84R8/Vtv+jS9+sbiOc58eKfZZuvknte1x5GhxHe9dUZ5p6se/Xf//7ujlrxXX8Qdn/Vdt+1+8eWVtu7cXRoxVOEIAkAgEAIlAAJAIBACJQACQCAQAiUAAkBiH0EPDv/ypYp99Vy0r9jl4af2ditzgTkZje+uv/y/ZVZ78JH70erHPvLPrJ/s4+z/L2/n6Rb9T2/6Dz327uI7zr6m/U5Uk7Rk/Utv+xnj9pCSS9H6Uxzu8fPy82va/2X15cR1/su0Lte3z/2VxbfvJn5XrlBocIdh+yPZ+29unLFtm+xnbO6vvSxttDcBAa3LK8B1J156y7E5JmyLiEkmbqucAZrliIETEc5IOnrL4ekkbq8cbJd0ww3UBaEGnHyouj4g91eO9kpbPUD0AWtT1VYaICEnTflJke73tLba3jGmAZsIF8As6DYR9tldIUvV9/3QdI2JDRIxGxOiI5nW4OQD90GkgPClpXfV4naQnZqYcAG1qctnxEUn/LulTtt+yfYukeyR93vZOSZ+rngOY5Tz5EUB/LPGyuMJX92173Ziz4txin3evrL9j0k/WNtjQkvLEFX57bm37ma82uOtSYTMLfloe3OTyHCqa/3f/Ue7Upfj1S4t9dn65fp9J0u/+av0doD4+Up4g5a+2X1Hsc9bT9afKSx97qbiOiffeq+9QuDvU5ol/0qE4WHyjMHQZQCIQACQCAUAiEAAkAgFAIhAAJAIBQBq4cQjDS5bUto//SoMbY1yzoLZ9bNWx4jpWnvOzYp/Dx+uvdR/cV/+zSNLIgfLEFZ/41/Ha9gUvvlFcx8Tbp/7B6ilc/r8hxk4U+6Adnlc/1uH540/r0MTbjEMA0ByBACARCAASgQAgEQgAEoEAIBEIABKBACD19c5NJ1Ys1Ju3/EZtn5HP1g8IOjFWnshjwby3a9vnjdQP9JGkkeHydt7ZWX/XpYv/vjz5yfxXyndDGt+7r7a9XGn/DC8t3LMnGsyyUhAny+uYOFqYUETS0Bnza9u9aGF5O+efXe4zt/7XbOhE+f04vLf+92Li0OHadp8oT6IjcYQAYAoCAUAiEAAkAgFAIhAAJAIBQCIQACQCAUDq68CkuYcmtHLTkdo+b6p+YMtQg3EtC3bWD9VZ+OPyoJU5B8rDfVadWz/zUgw3mIXozMXlPhfV30WqycCWGBmuX8d75dmQ/H6Du0wdqv/3jbHyOuJEfZ840WDmpgYDoCaOFu7MVGqXpH3T3uc4lYYENZmzrPwvXNh
GwwFhHCEASAQCgEQgAEgEAoBEIABIBAKARCAASH0dh6Cjx6TnX6rtsvL5PtVS0OS6rwtzmzSZkqLJ5CYzcR27pPtpS/BRUDxCsP2Q7f22t09Zdrft3ba3Vl/X9bZMAP3Q5JThO5KuPc3yb0XE6urrqZktC0AbioEQEc9JKtwpFMBHQTcfKt5m+6XqlKIwsyaA2aDTQLhf0ipJqyXtkXTvdB1tr7e9xfaWMR3vcHMA+qGjQIiIfRFxMib/hOoBSWtq+m6IiNGIGB1R/T3sAbSro0CwvWLK0xslbZ+uL4DZozgOwfYjktZKOsv2W5K+Jmmt7dWavAS+S9KtPawRQJ8UAyEibj7N4gd7UAuAljF0GUAiEAAkAgFAIhAAJAIBQCIQACQCAUDq7wQpTQzV31BkeMmi4ir8sTNr22PB/OI6JhbMLfYpGTrW4MYm79bf2ESS4lj9DWGa8MhIfYc55bdCLDyjvKGhwnQu4+UpYXyyMF1LlKeEaXKTHI0UfmY3mOKmQZ8Yru/j8fL0NDGn/ucZOlz/HvFbzd7PHCEASAQCgEQgAEgEAoBEIABIBAKARCAASAQCgDR4A5Mm6geunHzn3fI6mvTpgyZ3ZQJmQmloU8SJRuvhCAFAIhAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQCIQACQCAUAqBoLtlbaftf2K7Zdt314tX2b7Gds7q+9Le18ugF5qcoQwLumrEfFpSb8m6Su2Py3pTkmbIuISSZuq5wBmsWIgRMSeiHixenxY0g5J50m6XtLGqttGSTf0qkgA/fGhPkOwfaGkyyRtlrQ8IvZUTXslLZ/RygD0XeNAsL1I0qOS7oiIQ1PbIiIknfYe3bbX295ie8uYjndVLIDeahQItkc0GQYPR8Rj1eJ9tldU7Ssk7T/dayNiQ0SMRsToiObNRM0AeqTJVQZLelDSjoi4b0rTk5LWVY/XSXpi5ssD0E9NbtRypaQvSdpme2u17C5J90j6nu1bJL0h6abelAigX4qBEBH/JsnTNF89s+UAaBMjFQEkAgFAIhAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQCIQACQCAUAiEAAkAgFAIhAAJAIBQCoGgu2Vtp+1/Yrtl23fXi2/2/Zu21urr+t6Xy6AXprToM+4pK9GxIu2F0t6wfYzVdu3IuKbvSsPQD8VAyEi9kjaUz0+bHuHpPN6XRiA/vtQnyHYvlDSZZI2V4tus/2S7YdsL53h2gD0WeNAsL1I0qOS7oiIQ5Lul7RK0mpNHkHcO83r1tveYnvLmI7PQMkAeqVRINge0WQYPBwRj0lSROyLiJMRMSHpAUlrTvfaiNgQEaMRMTqieTNVN4AeaHKVwZIelLQjIu6bsnzFlG43Sto+8+UB6KcmVxmulPQlSdtsb62W3SXpZturJYWkXZJu7UmFAPrGEdG/jdk/lfTGlEVnSTrQtwK6Q629Qa29cWqtF0TE2aUX9TUQfmHj9paIGG2tgA+BWnuDWnuj01oZugwgEQgAUtuBsKHl7X8Y1Nob1NobHdXa6mcIAAZL20cIAAYIgQAgEQgAEoEAIBEIANL/AfTPNGXYoXjUAAAAAElFTkSuQmCC\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# display the 6th channel of the activation of the 1st layer of the original model\n", "plt.matshow(first_layer_activation[0, :, :, 6], cmap='viridis')" ] }, { "cell_type": "markdown", 
"metadata": {}, "source": [ "Let's plot a complete visualization of the activations in the network. For each convolution layer among the eight layers extracted above, I take every channel of its activation map and tile the channels side by side into one large image grid." ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/site-packages/ipykernel_launcher.py:15: RuntimeWarning: invalid value encountered in true_divide\n", " from ipykernel import kernelapp as app\n" ] }, { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [
"# names of the same eight layers whose activations were extracted above\n",
"layer_names = [layer.name for layer in cnn3.layers[:8]]\n",
"\n",
"images_per_row = 16\n",
"for layer_name, layer_activation in zip(layer_names, activations):\n",
"    # only the convolution layers are visualized\n",
"    if layer_name.startswith('conv'):\n",
"        n_features = layer_activation.shape[-1]  # number of channels\n",
"        size = layer_activation.shape[1]  # each feature map is size x size\n",
"        n_cols = n_features // images_per_row\n",
"        display_grid = np.zeros((size * n_cols, images_per_row * size))\n",
"        for col in range(n_cols):\n",
"            for row in range(images_per_row):\n",
"                # copy so the stored activations are not modified in place\n",
"                channel_image = layer_activation[0, :, :, col * images_per_row + row].copy()\n",
"                # standardize, then map to a displayable [0, 255] range\n",
"                channel_image -= channel_image.mean()\n",
"                # the small epsilon avoids dividing by zero for all-zero channels\n",
"                channel_image /= channel_image.std() + 1e-5\n",
"                channel_image *= 64\n",
"                channel_image += 128\n",
"                channel_image = np.clip(channel_image, 0, 255).astype('uint8')\n",
"                display_grid[col * size : (col + 1) * size,\n",
"                             row * size : (row + 1) * size] = channel_image\n",
"        scale = 1. / size\n",
"        plt.figure(figsize=(scale * display_grid.shape[1],\n",
"                            scale * display_grid.shape[0]))\n",
"        plt.title(layer_name)\n",
"        plt.grid(False)\n",
"        plt.imshow(display_grid, aspect='auto', cmap='viridis')"
] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.6.4" } }, "nbformat": 4, "nbformat_minor": 2 }