{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# EqualWidthDiscretiser + OrdinalEncoder\n", "\n", "\n", "This is very useful for linear models, because by using discretisation + a monotonic encoding, we create monotonic variables with the target, from those that before were not originally. And this tends to help improve the performance of the linear model. " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## EqualWidthDiscretiser\n", "\n", "The EqualWidthDiscretiser() divides continuous numerical variables into\n", "intervals of the same width, that is, equidistant intervals. Note that the\n", "proportion of observations per interval may vary.\n", "\n", "The number of intervals\n", "in which the variable should be divided must be indicated by the user.\n", "\n", "Note: Check out the EqualWidthDiscretiser notebook to learn more about this transformer." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## OrdinalEncoder\n", "The OrdinalEncoder() will replace the variable labels by digits, from 1 to the number of different labels. \n", "\n", "If we select \"arbitrary\", then the encoder will assign numbers as the labels appear in the variable (first come first served).\n", "\n", "If we select \"ordered\", the encoder will assign numbers following the mean of the target value for that label. So labels for which the mean of the target is higher will get the number 1, and those where the mean of the target is smallest will get the number n.\n", "\n", "Note: Check out the OrdinalEncoder notebook to know more about this transformer." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "import numpy as np\n", "import matplotlib.pyplot as plt\n", "\n", "from sklearn.model_selection import train_test_split\n", "from sklearn.pipeline import Pipeline\n", "\n", "\n", "from feature_engine.discretisation import EqualWidthDiscretiser\n", "from feature_engine.encoding import OrdinalEncoder\n", "\n", "plt.rcParams[\"figure.figsize\"] = [15,5]" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [], "source": [ "# Load titanic dataset from OpenML\n", "\n", "def load_titanic():\n", " data = pd.read_csv('https://www.openml.org/data/get_csv/16826755/phpMYEkMl')\n", " data = data.replace('?', np.nan)\n", " data['cabin'] = data['cabin'].astype(str).str[0]\n", " data['pclass'] = data['pclass'].astype('O')\n", " data['age'] = data['age'].astype('float').fillna(data.age.median())\n", " data['fare'] = data['fare'].astype('float').fillna(data.fare.median())\n", " data['embarked'].fillna('C', inplace=True)\n", " data.drop(labels=['boat', 'body', 'home.dest', 'name', 'ticket'], axis=1, inplace=True)\n", " return data" ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
pclasssurvivedsexagesibspparchfarecabinembarked
011female29.000000211.3375BS
111male0.916712151.5500CS
210female2.000012151.5500CS
310male30.000012151.5500CS
410female25.000012151.5500CS
\n", "
" ], "text/plain": [ " pclass survived sex age sibsp parch fare cabin embarked\n", "0 1 1 female 29.0000 0 0 211.3375 B S\n", "1 1 1 male 0.9167 1 2 151.5500 C S\n", "2 1 0 female 2.0000 1 2 151.5500 C S\n", "3 1 0 male 30.0000 1 2 151.5500 C S\n", "4 1 0 female 25.0000 1 2 151.5500 C S" ] }, "execution_count": 4, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data = load_titanic()\n", "data.head()" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "X_train : (916, 8)\n", "X_test : (393, 8)\n" ] } ], "source": [ "# let's separate into training and testing set\n", "X = data.drop(['survived'], axis=1)\n", "y = data.survived\n", "\n", "X_train, X_test, y_train, y_test = train_test_split(\n", " X, y, test_size=0.3, random_state=0)\n", "\n", "print(\"X_train :\", X_train.shape)\n", "print(\"X_test :\", X_test.shape)" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAA20AAAE/CAYAAADVKysfAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjEsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+j8jraAAAgAElEQVR4nO3df7Bc913e8fcTORjjBGLj+I4qaSKXakLsuLHpHZFOWriJ01iJGWQ6445SQ+Riqv5witPRDMgwQ6CtZtw/TGECBkSSRjMxMWrAtSZOIR4ltymdEMdOHGzZca3EwlakWpAfODelpjKf/nGPYFGurPtzz3d3368Zze5+7zm7z/n6Smcfn7NnU1VIkiRJktr0kr4DSJIkSZLOztImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiSpCUleneRzSb6R5Cf7ziO14ry+A0iSJEmdnwJmq+rqvoNILfFImyRJklrxKuDwUldK4oEIjTVLm7QMSfYk+WJ3+sZjSX6kG1+X5I4kf5rkqSTvTFKndyZJvivJ+5KcSPLlJP8hybp+t0aSpP4l+TjwRuBXkswlubU7VfK5JM8k+fmBZTd3+9ebkzwNfLwb//Ekjyf5WpLfT/KqfrZGWl2WNml5vgj8Q+C7gF8APphkPfDPgbcCVwHfB1x/xnr7gVPA3wGuBt4C/MSQMkuS1KyqehPwP4B3VtXLgM8D7wBeAVwH/KskZ+5XfxB4DXBt97OfAf4x8MruuT40pPjSmkpV9Z1BGnlJHgbeDdwK/HZV/UY3/mbgfuClwHcDTwOvqKo/737+dmBXVb2xl+CSJDUkySzwwap67wI/+yWgqurfJtkMPAV8T1V9qfv5fwM+XFXv6x6/BJgDXlNVfzycLZDWhkfapGVI8o4kDyf5epKvA68FLgH+FvDMwKKD91/FfHk7MbDebwCXDiu3JEmjIsn3J/lEkj9J8mfAv2R+XzvozP3sLw/sY78KBNgwnMTS2vFDm9ISdefH/yZwDfCpqnqhO9IW4ASwcWDxTQP3nwGeBy6pqlPDyitJ0oj6LeBXgLdW1f/tjrSdWdoGTxl7BthbVXcNK6A0LB5pk5buQuZ3En8CkOSfMX+kDeAAcGuSDUleAfz06ZWq6gTwMeCOJN+Z5CVJvifJDw43viRJI+HlwFe7wrYV+KfnWP7XgduSXAF/dfGvG9Y6pDQMljZpiarqMeAO4FPAs8CVwP/sfvybzBezPwI+B3yU+QuPvND9/B3AtwGPAV8DPgysH1Z2SZJGyL8G/l2SbwA/x/z/GD2rqroH+I/A3UmeAx5l/uJg0sjzQiTSGkryVuDXq8pLDkuSJGlZPNImraIkFyR5W5Lzkmxg/oqS9/SdS5IkSaPLI23SKkryHcB/B74X+HPgPuDWqnqu12CSJEkaWZY2SZIkSWqYp0dKkiRJUsMsbZIkSZLUsCa+XPuSSy6pzZs3L3v9b37zm1x44YWrF2gIzDwco5Z51PKCmYellcwPPfTQn1bVK/vOocVb6T4W2vn969Okz8Gkbz84B5O+/bD2c/Bi+9gmStvmzZt58MEHl73+7OwsMzMzqxdoCMw8HKOWedTygpmHpZXMSf647wxampXuY6Gd378+TfocTPr2g3Mw6dsPaz8HL7aP9fRISZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIad13cAabE277lv0csevf26NUwiSZPlkS//GTct8t9g//2VpNXnkTZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJ6lGSo0keSfJwkge7sYuT3J/kye72ooHlb0tyJMkTSa7tL7kkaVgsbZIk9e+NVXVVVU13j/cAh6pqC3Coe0ySy4EdwBXANuDOJOv6CCxJGh5LmyRJ7dkO7O/u7weuHxi/u6qer6qngCPA1h7ySZKGyEv+S5LUrwI+lqSA36iqfcBUVZ0AqKoTSS7tlt0A/OHAuse6sb8hyS5gF8DU1BSzs7MrCjh1Aey+8tSill3pa7Vqbm5ubLdtMSZ9+8E5mPTth37nwNImSVK/3lBVx7tidn+SL7zIsllgrL5lYL747QOYnp6umZmZFQV8z133cscji3vLcPTGlb1Wq2ZnZ1npPI6ySd9+cA4mffuh3znw9EhJknpUVce725PAPcyf7vhskvUA3e3JbvFjwKaB1TcCx4eXVpLUB0ubJEk9SXJhkpefvg+8BXgUOAjs7BbbCdzb3T8I7EhyfpLLgC3AA8NNLUkaNk+PlCSpP1PAPUlgfp/8W1X1e0k+AxxIcjPwNHADQFUdTnIAeAw4BdxSVS/0E12SNCyWNkmSelJVXwJet8D4V4BrzrLOXmDvGkeTJDXE0yMlSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYpU2SJEmSGnbO0pZkU5JPJHk8yeEkt3bjFye5P8mT3e1FA+vcluRIkieSXLuWGyBJkiRJ42wxR9pOAbur6jXA64FbklwO7AEOVdUW4FD3mO5nO4ArgG3AnUnWrUV4SZIkSRp35yxtVXWiqj7b3f8G8DiwAdgO7O8W2w9c393fDtxdVc9X1VPAEWDrageXJEmSpEmwpM+0JdkMXA18GpiqqhMwX+yAS7vFNgDPDKx2rBuTJEmSJC3ReYtdMMnLgN8B3lVVzyU566ILjNUCz7cL2AUwNTXF7OzsYqN8i7m5uRWt3wczL93uK08tetnTOfvOvFSjlhfMPCyjmFmSJK2ORZW2JC9lvrDdVVW/2w0/m2R9VZ1Ish442Y0fAzYNrL4ROH7mc1bVPmAfwPT0dM3MzCxvC5h/g76S9ftg5qW7ac99i1726I0zQP+Zl2rU8oKZh2UUM0uSpNWxmKtHBngf8HhV/eLAjw4CO7v7O4F7B8Z3JDk/yWXAFuCB1YssSZIkSZNjMUfa3gD8GPBIkoe7sZ8BbgcOJLkZeBq4AaCqDic5ADzG/JUnb6mqF1Y9uSRJkiRNgHOWtqr6Axb+nBrANWdZZy+wdwW5JEmSJEks8eqRkiRJkqThsrRJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmS1LMk65J8LslHuscXJ7k/yZPd7UUDy96W5EiSJ5Jc219qSdKwWNokSerfrcDjA4/3AIeqagtwqHtMksuBHcAVwDbgziTrhpxVkjRkljZJknqUZCNwHfDegeHtwP7u/n7g+oHxu6vq+ap6CjgCbB1WVklSPyxtkiT165eAnwL+cmBsqqpOAHS3l3bjG4BnBpY71o1JksbYeX0HkCRpUiX5IeBkVT2UZGYxqywwVgs87y5gF8DU1BSzs7MricnUBbD7ylOLWnalr9Wqubm5sd22xZj07QfnYNK3H/qdA0ubJEn9eQPww0neBnw78J1JPgg8m2R9VZ1Ish442S1/DNg0sP5G4PiZT1pV+4B9ANPT0zUzM7OikO+5617ueGRxbxmO3riy12rV7OwsK53HUTbp2w/OwaRvP/Q7B54eKUlST6rqtqraWFWbmb/AyMer6keBg8DObrGdwL3d/YPAjiTnJ7kM2AI8MOTYkqQh80ibJEntuR04kORm4GngBoCqOpzkAPAYcAq4pape6C+mJGkYLG2SJDWgqmaB2e7+V4BrzrLcXmDv0IJJknrn6ZGSJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNO2dpS/L+JCeTPDow9vNJvpzk4e7P2wZ+dluSI0meSHLtWgWXJEmSpEmwmCNtHwC2LTD+n6rqqu7PRwGSXA7sAK7o1rkzybrVCitJkiRJk+acpa2qPgl8dZHPtx24u6qer6qngCPA1hXkkyRJkqSJdt4K1n1nkncADwK7q+prwAbgDweWOdaNfYsku4BdAFNTU8zOzi47yNzc3IrW74OZl273lacWvezpnH1nXqpRywtmHpZRzCxJklbHckvbrwH/Hqju9g7gx4EssGwt9ARVtQ/YBzA9PV0zMzPLjDL/Bn0l6/fBzEt30577Fr3s0RtngP4zL9Wo5QUzD8soZpYkSatjWVePrKpnq+qFqvpL4Df561MgjwGbBhbdCBxfWURJkiRJmlzLKm1J1g88/BHg9JUlDwI7kpyf5DJgC/DAyiJKkiRJ0uQ65+mRST4EzACXJDkGvBuYSXIV86c+HgX+BUBVHU5yAHgMOAXcUlUvrE10SZIkSRp/5yxtVfX2BYbf9yLL7wX2riSUJEmSJGnesk6PlCRJkiQNh6VNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkqSeJPn2JA8k+XySw0l+oRu/OMn9SZ7sbi8aWOe2JEeSPJHk2v7SS5KGxdImSVJ/ngfeVFWvA64CtiV5PbAHOFRVW4BD3WOSXA7sAK4AtgF3JlnXS3JJ0tBY2iRJ6knNm+sevrT7U8B2YH83vh+4vru/Hbi7qp6vqqeAI8DWIUaWJPXA0iZJUo+SrEvyMHASuL+qPg1MVdUJgO720m7xDcAzA6sf68YkSWPsvL4DSJI0yarqBeCqJK8A7kny2hdZPAs9xbcslOwCdgFMTU0xOzu7ooxTF8DuK08tatmVvlar5ubmxnbbFmPStx+cg0nffuh3DixtkiQ1oKq+nmSW+c+qPZtkfVWdSLKe+aNwMH9kbdPAahuB4ws81z5gH8D09HTNzMysKNt77rqXOx5Z3FuGozeu7LVaNTs7y0rncZRN+vaDczDp2w/9zoGnR0qS1JMkr+yOsJHkAuDNwBeAg8DObrGdwL3d/YPAjiTnJ7kM2AI8MNzUkqRh80ibJEn9WQ/s764A+RLgQFV9JMmngANJbgaeBm4AqKrDSQ4AjwGngFu60yslSWPM0iZJUk+q6o+AqxcY/wpwzVnW2QvsXeNokqSGeHqkJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDTtnaUvy/iQnkzw6MHZxkvuTPNndXjTws9uSHEnyRJJr1yq4JEmSJE2CxRxp+wCw7YyxPcChqtoCHOoek+RyYAdwRbfOnUnWrVpaSZIkSZow5yxtVfVJ4KtnDG8H9nf39wPXD4zfXVXPV9VTwBFg6ypllSRJkqSJs9zPtE1V1QmA7vbSbnwD8MzAcse6MUmSJEnSMpy3ys+XBcZqwQWTXcAugKmpKWZnZ5f9onNzcytavw9mXrrdV55a9LKnc/adealGLS+YeVhGMbMkSVodyy1tzyZZX1UnkqwHTnbjx4BNA8ttBI4v9ARVtQ/YBzA9PV0zMzPLjDL/Bn0l6/fBzEt30577Fr3s0RtngP4zL9Wo5QUzD8soZpYkSatjuadHHgR2dvd3AvcOjO9Icn6Sy4AtwAMriyhJkiRJk+ucR9qSfAiYAS5Jcgx4N3A7cCDJzcDTwA0AVXU4yQHgMeAUcEtVvbBG2SVJkiRp7J2ztFXV28/yo2vOsvxeYO9KQkmSJEmS5i339EhJkiRJ0hBY2iRJkiSpYZY2SZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSepJkU5JPJHk8yeEkt3bjFye5P8mT3e1FA+vcluRIkieSXNtfeknSsFjaJEnqzylgd1W9Bng9cEuSy4E9wKGq2gIc6h7T/WwHcAWwDbgzybpekkuShsbSJklST6rqRFV9trv/DeBxYAOwHdjfLbYfuL67vx24u6qer6qngCPA1uGmliQNm6VNkqQGJNkMXA18GpiqqhMwX+yAS7vFNgDPDKx2rBuTJI2x8/oOIEnSpEvyMuB3gHdV1XNJzrroAmO1wPPtAnYBTE1NMTs7u6J8UxfA7itPLWrZlb5Wq+bm5sZ22xZj0rcfnINJ337odw4sbZIk9SjJS5kvbHdV1e92w88mWV9VJ5KsB05248eATQOrbwSOn/mcVbUP2AcwPT1dMzMzK8r4nrvu5Y5HFveW4eiNK3utVs3OzrLSeRxlk7794BxM+vZDv3Pg6ZGSJPUk84fU3gc8XlW/OPCjg8DO7v5O4N6B8R1Jzk9yGbAFeGBYeSVJ/fBImyRJ/XkD8GPAI0ke7sZ+BrgdOJDkZuBp4AaAqjqc5ADwGPNXnrylql4YfmxJ0jBZ2iRJ6klV/QELf04N4JqzrLMX2LtmoSRJzfH0SEmSJElqmKVNkiRJkhrm6ZGSJGnVbN5z36KXPXr7dWuYRJLGh0faJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYl/zXomzecx+7rzzFTYu4lPNSLuG8lEtDS5IkSZPII22SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSw7wQicbS6QucLObiKUu5cIokSZI0bB5pkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJath5K1k5yVHgG8ALwKmqmk5yMfDbwGbgKPBPquprK4spSZIkSZNpNY60vbGqrqqq6e7xHuBQVW0BDnWPJUmSJEnLsKIjbWexHZjp7u8HZoGfXoPXUaM277mv7wiSJEnS2FjpkbYCPpbkoSS7urGpqjoB0N1eusLXkCRJkqSJtdIjbW+oquNJLgXuT/KFxa7YlbxdAFNTU8zOzi47xNzc3IrW78OoZd595SmmLpi/HSWLydzSf4dR+70AMw/LKGaWJEmrY0WlraqOd7cnk9wDbAWeTbK+qk4kWQ+cPMu6+4B9ANPT0zUzM7PsHLOzs6xk/T60kHlppzGex+4rT3HHI2txRu3aWUzmozfODCfMIrTwe7FUZh6OUcwsSZJWx7JPj0xyYZKXn74PvAV4FDgI7OwW2wncu9KQkiRJkjSpVnLYZAq4J8np5/mtqvq9JJ8BDiS5GXgauGHlMSVJkiRpMi27tFXVl4DXLTD+FeCalYSSJGkSJHk/8EPAyap6bTd21u87TXIbcDPz34/6k1X1+z3EliQN2Wp8T5skSVqeDwDbzhhb8PtOk1wO7ACu6Na5M8m64UWVJPVltK4qIfVsKRdvOXr7dWuYRNI4qKpPJtl8xvDZvu90O3B3VT0PPJXkCPMXAPvUMLJKkvrjkTZJktpytu873QA8M7DcsW5MkjTmPNImSdJoyAJjteCCq/hdqLC477xcjlH67sFJ/67ESd9+cA4mffuh3zmwtEmS1Jazfd/pMWDTwHIbgeMLPcFqfhcqwHvuundNvqezpe/JPJdJ/67ESd9+cA4mffuh3znw9EhJktpytu87PQjsSHJ+ksuALcADPeSTJA2ZR9okSepJkg8xf9GRS5IcA94N3M4C33daVYeTHAAeA04Bt1TVC70ElyQNlaVNkqSeVNXbz/KjBb/vtKr2AnvXLpEkqUWeHilJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zAuRSA3YvOc+YP7La2/q7p/N0duvG0YkSZIkNcIjbZIkSZLUMEubJEmSJDXM0iZJkiRJDfMzbZp4m8/xGTJJkiSpTx5pkyRJkqSGWdokSZIkqWGWNkmSJElqmJ9pGzN+PkuSJEkaLx5pkyRJkqSGWdokSZIkqWGeHimtEU9VlSRJ0mrwSJskSZIkNcwjbZKApR0ZPHr7dWuYRJIkSYM80iZJkiRJDbO0SZIkSVLDPD1SkiSNFU/3ljRuxqa0+Q+0JEnjyyvySppkY1PaJEnSaLGISdLiWNqkMeYbIkmSpNFnaZO0ZINlcPeVp7jpRcqhpyNLkiStjFePlCRJkqSGeaRNGjGe8ihJkjRZLG2S1pRXdpUkSVoZT4+UJEmSpIZZ2iRJkiSpYRN5eqSna0mSJEkaFRNZ2tbKUsrgB7ZduIZJJEmSJI0LS5skncGj8ZIkqSWWNkkTYSlfCC5JktQSL0QiSZIkSQ2ztEmSJElSwzw9siePfPnPFn16lp+Z0aRYymfJJEmSJoWlbQT4RlaSJEmaXJa2c7AwSZI0vha7n9995Slm1jaKJJ2VpU2SJEmL5teiSMNnaZOkFfDNizQ5Wjj7xn9H5vlvrybNml09Msm2JE8kOZJkz1q9jiRJk8Z9rCRNljU50pZkHfCrwD8CjgGfSXKwqh5bi9eTpHFz5v9FfrEvBPf/Ik8W97GSNHnW6vTIrcCRqvoSQJK7ge2AOxRJklbGfaxGhqcxSqtjrUrbBuCZgcfHgO9fo9eSpJGwVp+H8U3RxHEfO8HG+e97C58ZlM40+Hv5Yme9wNr+nUtVrf6TJjcA11bVT3SPfwzYWlX/ZmCZXcCu7uGrgSdW8JKXAH+6gvX7YObhGLXMo5YXzDwsrWR+VVW9su8Qk6yHfSy08/vXp0mfg0nffnAOJn37Ye3n4Kz72LU60nYM2DTweCNwfHCBqtoH7FuNF0vyYFVNr8ZzDYuZh2PUMo9aXjDzsIxiZq2Zoe5jwd8/cA4mffvBOZj07Yd+52Ctrh75GWBLksuSfBuwAzi4Rq8lSdIkcR8rSRNmTY60VdWpJO8Efh9YB7y/qg6vxWtJkjRJ3MdK0uRZsy/XrqqPAh9dq+c/w6qdAjJEZh6OUcs8annBzMMyipm1Roa8jwV//8A5mPTtB+dg0rcfepyDNbkQiSRJkiRpdazVZ9okSZIkSatg5Etbkm1JnkhyJMmevvMsJMn7k5xM8ujA2MVJ7k/yZHd7UZ8ZByXZlOQTSR5PcjjJrd14y5m/PckDST7fZf6FbrzZzABJ1iX5XJKPdI+bzguQ5GiSR5I8nOTBbqzZ3ElekeTDSb7Q/U7//cbzvrqb29N/nkvyrpYza3yNwj52NSx1P53ktm5OnkhybT+pV89y9vtjOAdLfh8xbnMAS3tfMqbbv6T3OMOcg5EubUnWAb8KvBW4HHh7ksv7TbWgDwDbzhjbAxyqqi3Aoe5xK04Bu6vqNcDrgVu6eW058/PAm6rqdcBVwLYkr6ftzAC3Ao8PPG4972lvrKqrBi5723LuXwZ+r6q+F3gd8/PdbN6qeqKb26uAvwf8H+AeGs6s8TRC+9jV8AEWuZ/u5mAHcEW3zp3dXI2yJe33x3QOlvQ+YkznABb5vmSMtx8W+R5n2HMw0qUN2AocqaovVdVfAHcD23vO9C2q6pPAV88Y3g7s7+7vB64faqgXUVUnquqz3f1vMP+XdwNtZ66qmusevrT7UzScOclG4DrgvQPDzeY9hyZzJ/lO4AeA9wFU1V9U1ddpNO8CrgG+WFV/zOhk1vgYiX3saljifno7cHdVPV9VTwFHmJ+rkbWM/f44zsFS30eM3Rws8X3J2G3/i2hiDka9tG0Anhl4fKwbGwVTVXUC5v+xBC7tOc+CkmwGrgY+TeOZu0P6DwMngfurqvXMvwT8FPCXA2Mt5z2tgI8leSjJrm6s1dx/G/gT4D93p3u8N8mFtJv3TDuAD3X3RyWzxsco72NXw9n+zo31vCxyvz+Wc7DE9xHjOAdLeV8yjtsPS3uPM9Q5GPXSlgXGvBzmKknyMuB3gHdV1XN95zmXqnqhO6VsI7A1yWv7znQ2SX4IOFlVD/WdZRneUFXfx/wpU7ck+YG+A72I84DvA36tqq4GvsmInFaY+S9N/mHgv/SdRRPLfezCxnZelrDfH8s5WOL7iLGag2W8Lxmr7R+wlPc4Q52DUS9tx4BNA483Asd7yrJUzyZZD9Ddnuw5z9+Q5KXM/8N9V1X9bjfcdObTutPfZpk/v7jVzG8AfjjJUeZPOXpTkg/Sbt6/UlXHu9uTzH/Waivt5j4GHOv+bynAh5kvca3mHfRW4LNV9Wz3eBQya7yM8j52NZzt79xYzssS9/tjOQenLfJ9xLjNwVLfl4zb9gNLfo8z1DkY9dL2GWBLksu6/yu9AzjYc6bFOgjs7O7vBO7tMcvfkCTMfwbo8ar6xYEftZz5lUle0d2/AHgz8AUazVxVt1XVxqrazPzv7cer6kdpNO9pSS5M8vLT94G3AI/SaO6q+t/AM0le3Q1dAzxGo3nP8Hb++tRIGI3MGi+jvI9dDWf7O3cQ2JHk/CSXAVuAB3rIt2qWsd8fxzlY6vuIsZqDZbwvGavth2W9xxnuHFTVSP8B3gb8L+CLwM/2necsGT8EnAD+H/Ot/Gbgu5m/As2T3e3FfeccyPsPmD+8+0fAw92ftzWe+e8Cn+syPwr8XDfebOaB7DPAR0YhL/OfEft89+fw6b9zLedm/ipgD3a/G/8VuKjlvF3m7wC+AnzXwFjTmf0znn9GYR+7Stu5pP008LPdnDwBvLXv/Kuw/Uve7/y3fO4AAABoSURBVI/hHCz5fcS4zcHAdi3qfcm4bf9y3uMMcw7SvaAkSZIkqUGjfnqkJEmSJI01S5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLD/j+UgSC64ypSUwAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# we will use two continuous variables for the transformations\n", "\n", "X_train[[\"age\", 'fare']].hist(bins=30)\n", "plt.show()" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "Pipeline(memory=None,\n", " steps=[('EqualWidthDiscretiser',\n", " EqualWidthDiscretiser(bins=5, return_boundaries=False,\n", " return_object=True,\n", " variables=['age', 'fare'])),\n", " ('OrdinalEncoder',\n", " OrdinalEncoder(encoding_method='ordered',\n", " variables=['age', 'fare']))],\n", " verbose=False)" ] }, "execution_count": 7, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# set up the discretiser\n", "ewd = EqualWidthDiscretiser(\n", " bins=5,\n", " variables=['age', 'fare'],\n", " # important: return values as categorical\n", " return_object=True)\n", "\n", "# set up the encoder\n", "oe = OrdinalEncoder(variables=['age', 'fare'])\n", "\n", "# pipeline\n", "transformer = Pipeline(steps=[('EqualWidthDiscretiser', ewd),\n", " ('OrdinalEncoder', oe),\n", " ])\n", "\n", "transformer.fit(X_train, y_train)" ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "{'age': [-inf,\n", " 14.933359999999999,\n", " 29.700019999999995,\n", " 44.46667999999999,\n", " 59.23333999999999,\n", " inf],\n", " 'fare': [-inf, 102.46584, 204.93168, 307.39752, 409.86336, inf]}" ] }, "execution_count": 8, "metadata": {}, "output_type": "execute_result" } ], "source": [ "transformer.named_steps['EqualWidthDiscretiser'].binner_dict_" ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "{'age': {4: 0, 1: 1, 2: 2, 3: 3, 0: 4}, 'fare': {0: 0, 2: 1, 1: 2, 4: 3}}" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "transformer.named_steps['OrdinalEncoder'].encoder_dict_" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
pclasssexagesibspparchfarecabinembarked
11393male2000nS
5332female1010nS
4592male2100nS
11503male1000nS
3932male1000nS
\n", "
" ], "text/plain": [ " pclass sex age sibsp parch fare cabin embarked\n", "1139 3 male 2 0 0 0 n S\n", "533 2 female 1 0 1 0 n S\n", "459 2 male 2 1 0 0 n S\n", "1150 3 male 1 0 0 0 n S\n", "393 2 male 1 0 0 0 n S" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "train_t = transformer.transform(X_train)\n", "test_t = transformer.transform(X_test)\n", "\n", "test_t.head()" ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAbkAAAFNCAYAAACdVxEnAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjEsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+j8jraAAAgAElEQVR4nO3dd5hU9dn/8ffN0rv0uvTecQXBEmtEDKKiBiyJlegTLEHRYBLF8og91oQYY3yMCtJUVBQr2JW2u3RZ+tJ7L1vu3x8z5DeuW4YyO7NnP6/r2uuaM+d7ztxnzux85nvmzPeYuyMiIhJEZeJdgIiISKwo5EREJLAUciIiElgKORERCSyFnIiIBJZCTkREAkshJ0fNzKab2Q1HuWyyme0xs6TjXVfEY4wys9cKmb/AzM44ynW7mbU+6uISnJldbGZrwvuoR7zrORJB3zdyZBRypZyZrTSz/eE3sw1m9oqZVY3R45xzeNrdV7t7VXfPOd6PFS137+Tu04v7cUvIm/ATwLDwPpob72KOl2P5YBaExy+NFHICMMDdqwLdgR7AyDjXI/HXDFhwNAuaWdnjXEvCCPK2BZVCTv7L3TcA0wiFHQBmdrKZfWNmO8wsraDDe2bWysw+M7OtZrbFzF43s5rhef8BkoF3wz3Gu8ysebhHUzbcppGZTTGzbWaWYWY3Rqx7lJmNN7NXzWx3+DBjSsT8u81sbXjeEjM7O6K08oUs99/eZfgxJprZm+G2c8ysWxFPWX8zWx7e3sfN7L//T2Z2nZktMrPtZjbNzJqF7/8i3CQt/Fz82sxmmNmg8PxTw89L//D0OWaWWtR6w/Pam9nH4edwiZldHjHvFTN7wczeD2/f92bWKp/9WMHM9gBJ4RqXhe//o5ktCy+70MwujljmGjP72sz+ambbgFHh9TxhZqvNbKOZjTGzSvk9iYW9diL2051mlm5mO8P7qGLE/BFmtt7M1pnZdQXtLDP7X+A04Pnwc/98+P5nLHRodpeZzTaz0yKWOfy6eM3MdgHXmFkLM/si/Fx8En5eX4tYJt//mYIeX2LM3fVXiv+AlcA54dtNgHnAM+HpxsBWoD+hD0TnhqfrhudPB24I324dnl8BqAt8ATyd3+OEp5sDDpQNT88A/gZUJBSym4Gzw/NGAQfCdSQBo4HvwvPaAWuARhHrbVXUcvls+yggC7gUKAfcCawAyhXwvDnwOVCLUID/GPFcXARkAB2AssCfgW/yLNs6YvoB4Lnw7XuAZcCjEfOeKWq9QJXw83BteF5PYAvQKTz/FWAb0Cs8/3VgXCGvi7w1XgY0Cr8Ofg3sBRqG510DZAO3hNddCXgamBJ+fqoB7wKjC3isaF47P4QfvxawCLgpPK8fsBHoHH4O3shbe57Hmn54P0XcdxVQO1z7HcAGoGKe18VF4W2vBHxL6HBueeBUYBfw2pH+z+ivmN7j4l2A/uL8Agi9gewBdoffHD4Faobn3Q38J0/7acBvw7cL/IcNvynMzfM4+YYc0BTIAapFzB8NvBK+PQr4JGJeR2B/+HZrYBNwDnkCqbDl8tYUbhsZgGWA9cBpBWyfA/0ipv8H+DR8+wPg+jzr2gc0i1g2MkDOBtLDtz8EbuD/h/gM4JKi1ksoeL7MU+M/gPvCt18BXoqY1x9YXMjrosCgCM9PBQaGb18DrI6YZ4RCsFXEfX2AFVG+JvN77VwVMf0YMCZ8+2XgkYh5bQurnShCBtgOdIt4XXwRMS+ZUKBXjrjvNf5/yB31/4z+YvOnw5UCcJG7VwPOANoDdcL3NwMuCx922WFmOwh9cm2YdwVmVs/MxoUPG+4i9I9fJ2+7AjQCtrn77oj7VhH6VHzYhojb+4CKZlbW3TOA2wm9GW0K19CoqOUKqGPN4RvungtkhmsryJqI26si2jYDnol4zrYReuNvTP6+BdqaWX1CvdhXgaZmVodQz+vwIc7C1tsM6J1nX10JNIh4nLzPRdQnGJnZb8wsNWLdnfnp/o18LuoClYHZEe0/DN+f37qjee0UVHsjfr4fjoiZ3RE+BLwzXGuNQrbt8Gt1XwHzo/6fkeKhkJP/cvcZhD7xPxG+aw2hT6U1I/6quPsj+Sw+mtAn6K7uXp3QISCLXH0hD70OqGVm1SLuSwbWRln3G+5+KqE3GAcejWa5fDQ9fMNC3681CddWZHtC9R5uuwb4XZ7nrZK7f1NA/fuA2cBtwHx3PwR8AwwHlrn7lijWuwaYkWdeVXe/+YifhTzC3/v9ExgG1Hb3msB8Ct6/W4D9hA6VHq6lhodObspPUa+dwqzn5/uhMD95HYa/f7sbuBw4IbxtOyl429YTeq1Wjrgv8vGL+p/RZV+KmUJO8noaONfMuhP6RD3AzM4zsyQzq2hmZ5hZk3yWq0bosOcOM2sMjMgzfyPQMr8HdPc1hN7UR4cfoytwPaHvjQplZu3M7Cwzq0Do+7f9hA59Ho0TzeyScE/vduAg8F0h7UeY2Qlm1pRQQL0Zvn8MMNLMOoVrrGFml0Usl99zMYNQiMwIT0/PM13Uet8j1Bu82szKhf9OMrMOUW99waoQenPeHH7cawn15PIV7gX/E/irmdULL9PYzM4rYJGiXjuFGU/oZJCO4eC5r4j2eZ/7aoQOP24GyprZvUD1ghZ291XALEIn15Q3sz7AgIgmRf3PFPh/ILGhkJOfcPfNhA6X/SUcPgMJnQyxmdCn1BHk/7q5n9DJDjuB94HJeeaPBv4cPoRzZz7LDyH0Pd064C1C3yV9HEXJFYBHCPUeNgD1wvUejXcIfbe1Hbia0HdhWUW0n03o+6n3gX8BuPtbhHqT48KH3+YD50csNwr4v/BzcfgMyBmE3nC/KGC60PWGD/X+EhhM6DncEG5b4UifhLzcfSHwJKHDqhuBLsDXRSx2N6GTZL4L1/oJoZOE8lPUa6ew2j4g9MHss/DjfVbEIs8Al1ro7NRnCX1f9gGhE4dWEfqgtKaQ5SF0GLgPoRNKHiL04eZguJ6i/mfyPr7EmLmr9yxiZqMInaxwVbxrkZLFzN4kdBJPUb1IiQP15EREjkD4MHArMytjZv0I9dzejnddkj/9el9E5Mg0IHRItTahM3Bv9gANfRY0OlwpIiKBpcOVIiISWAo5EREJrBL3nVydOnW8efPm8S5DREQSyOzZs7e4+89G1SlxIde8eXNmzZoV7zJERCSBmFm+Q7rpcKWIiASWQk5ERAJLISciIoGlkBMRkcBSyImISGAp5EREJLAUciIiElgxCzkze9nMNpnZ/ALmm5k9a2YZZpZuZj1jVYuIiJROsezJvQL0K2T++UCb8N9Q4O8xrEVEREqhmIWcu38BbCukyUDgVQ/5DqhpZg1jVY+IiCSW9MwdfLRgQ0wfI57DejXmp5eZzwzftz5vQzMbSqi3R3JycrEUJyIisZGVk8sLn2fw3GcZtKpbhbM71CepjMXkseIZcvltUb4Xt3P3F4EXAVJSUnQBPBGREipj026Gj08jPXMnF/dozKgBnWIWcBDfkMsEmkZMNwHWxakWERGJodxc59/frOSxDxdTuXwSf7+yJ+d3if03VPEMuSnAMDMbB/QGdrr7zw5ViohIyZa5fR93Tkjju+XbOLt9PUYP6kK9ahWL5bFjFnJmNhY4A6hjZpnAfUA5AHcfA0wF+gMZwD7g2ljVIiIixc/dmTg7k/vfXYi789igrlyW0gSz2B2ezCtmIefuQ4qY78DvY/X4IiISP1v2HGTk5Hl8vHAjvVrU4snLutG0VuVir6PEXTRVREQS24fzN3DPW/PYczCbP1/QgetOaUGZGJ5cUhiFnIiIHBe7DmQxasoCJs9ZS+fG1Xnq8u60rV8trjUp5ERE5Jh9nbGFERPS2Lj7ILee3YZbzmpNuaT4D4+skBMRkaO2/1AOj364mFe+WUnLulWYdHNfujetGe+y/kshJyIiRyV1zQ6Gj09l+ea9XNO3OXf3a0+l8knxLusnFHIiInJEsnJyee7TpbwwfRn1q1Xg9Rt6c0rrOvEuK18KORERidqPG3czfHwq89fuYlDPJtx3YUeqVywX77IKpJATEZEi5eY6L3+9gsemLaFqhbKMuepE+nVuEO+yiqSQExGRQq3ZFhqW6/sV2zi3Y30evrgLdatViHdZUVHIiYhIvtyd8bPW8MC7CzEzHr+0K5eeWLzDch0rhZyIiPzMpt0HGDlpHp8u3kSflrV5/LKuNDmh+IflOlYKORER+YkP5q3nnrfmse9QDn/5VUeu7ds8bsNyHSuFnIiIALBzf2hYrrfmrqVrkxo8dXk3WteL77Bcx0ohJyIifLl0MyMmpLN5z0FuP6cNvz8zMYblOlYKORGRUmzfoWwe+WAxr367ilZ1q/Dib/rStUniDMt1rBRyIiKl1JzV27ljfBortuzl+lNbMOK8dlQsl1jDch0rhZyISClzKDuXZz9dyt+mZ9CwRiXeuLE3fVsl5rBcx0ohJyJSiizZsJs/vJnKwvW7uOzEJvxlQGIPy3WsFHIiIqVATq7z0pfLefKjH6leqSwvXn0iv+yU+MNyHSuFnIhIwK3euo87JqQyc+V2zusUGpardtWSMSzXsVLIiYgElLszbuYaHnxvIUlmPHV5Ny7u0bhEDct1rBRyIiIBtGnXAe6elM7nSzZzSuvaPHZpNxrXrBTvsoqdQk5EJGDeS1/Hn9+ez4GsHEYN6Mhv+pTcYbmOlUJORCQgduw7xL3vLGBK2jq6Na3JU5d3o1XdqvEuK64UciIiATDjx83cNTGNrXsOMfzctvzPGa0oG4BhuY6VQk5EpATbdyibh6cu4rXvVtOmXlX+9duT6Ny4RrzLShgKORGREmr2qm0MH5/G6m37uPG0Ftzxy+ANy3WsFHIiIiXMwewcnv5kKf+YsYxGNSsx9saTObll7XiXlZBiGnJm1g94BkgCXnL3R/LMPwF4GWgFHACuc/f5saxJRKQkW7R+F394M5XFG3Yz+KSm/PlXHalaQf2VgsTsmTGzJOAF4FwgE5hpZlPcfWFEs3uAVHe/2Mzah9ufHauaRERKqpxc58UvlvPUx0uoUak8//ptCmd3qB/vshJeLOO/F5Dh7ssBzGwcMBCIDLmOwGgAd19sZs3NrL67b4xhXSIiJcqqrXsZPj6N2au2c37nBvzvxV2oVaV8vMsqEWIZco2BNRHTmUDvPG3SgEuAr8ysF9AMaAIo5ESk1HN3Xv9+NQ9PXURSGePpX3dnYPdGpWpYrmMVy5DLby94nulHgGfMLBWYB8wFsn+2IrOhwFCA5OTk41ymiEji2bAzNCzXjB83c2rrOjx+WVca1ih9w3Idq1iGXCbQNGK6CbAusoG77wKuBbDQR5MV4T/ytHsReBEgJSUlb1CKiATKlLR1/OXt+RzMzuGBgZ24qnezUjss17GKZcjNBNqYWQtgLTAYuCKygZnVBPa5+yHgBuCLcPCJiJQ62/ce4i/vzOe99PX0SK7Jk5d1o2UpH5brWMUs5Nw928yGAdMI/YTgZXdfYGY3heePAToAr5pZDqETUq6PVT0iIons8yWbuHtiOtv3HWLEee343ektNSzXcRDTH1e4+1Rgap77xkTc/hZoE8saREQS2d6D2Tz0/iLG/rCadvWr8e9rT6JTIw3LdbzoF4QiInEyc+U27hifxprt+/jd6S35w7ltNSzXcaaQExEpZgezc3jq4x958YvlNDmhEm8O7UOvFrXiXVYgKeRERIrRgnU7Gf5mGks27mZIr2T+dEEHDcsVQ3pmRUSKQXZOLv/4YjlPf/IjNSuX59/XnMSZ7evFu6zAU8iJiMTYii17GT4+lbmrd3BB14Y8NLAzJ2hYrmKhkBMRiRF357XvVvHw1MWUSzKeHdKDC7s1indZpYpCTkQkBtbv3M9dE9P5cukWTm9bl8cGdaVBjYrxLqvUUciJiBxH7s47qeu49535ZOU4D13UmSt7J2tQ5ThRyImIHCfb9h7iz2/PY+q8DfRMrslTl3eneZ0q8S6rVFPIiYgcB58u2sjdk+axc/8h7urXjt+d3ookDaocdwo5EZFjsOdgNg+9t5BxM9fQvkE1Xr2uFx0bVY93WRKmkBMROUrfL9/KHRPSWLdjPzef0Yrbz2lDhbIaliuRKORERI7QgawcnvxoCS99tYLkWpUZ/7s+pDTXsFyJSCEnInIE5q/dyR/eTGXppj1c2TuZe/p3oIqG5UpY2jMiIlHIzsnl79OX8cynS6ldtTyvXHsSZ7TTsFyJTiEnIlKEZZv3MHx8GmlrdnBht0Y8MLATNStrWK6SQCEnIlKA3Fzn1W9X8siHi6lYLonnhvRggIblKlEUciIi+Vi3Yz8jJqbxdcZWzmhXl0cHdaV+dQ3LVdIo5EREIrg7b81dy31TFpCT64y+pAuDT2qqYblKKIWciEjY1j0HueeteUxbsJGTmp/AE5d1o1ltDctVkinkRESAjxduZOTkdHbtz2bk+e254bSWGpYrABRyIlKq7T6QxQPvLmTC7Ew6NKzOazd0o30DDcsVFAo5ESm1vl22lTsnpLF+535+f2Yrbju7LeXLlol3WXIcKeREpNQ5kJXD49OW8K+vVtCiThUm3NSXE5udEO+yJAYUciJSqqRn7mD4+DQyNu3hN32a8cfz21O5vN4Kg0p7VkRKhaycXF74PIPnPsugbtUKvHpdL05vWzfeZUmMKeREJPAyNu1h+PhU0jN3clH3Rtx/YWdqVC4X77KkGCjkRCSwcnOdV75ZyaMfLqZy+ST+dmVP+ndpGO+ypBgp5EQkkDK372PEhHS+Xb6Vs9vXY/SgLtSrpmG5SpuYhpyZ9QOeAZKAl9z9kTzzawCvAcnhWp5w93/HsiYRCTZ3Z+LsTO5/dyHuzqODunB5ioblKq1iFnJmlgS8AJwLZAIzzWyKuy+MaPZ7YKG7DzCzusASM3vd3Q/Fqi4RCa4tew4ycvI8Pl64kV4tavHkZd1oWqtyvMuSOIplT64XkOHuywHMbBwwEIgMOQeqWegjVlVgG5Adw5pEJKA+nL+BP701j90HsvlT/w5cf2oLymhYrlIvliHXGFgTMZ0J9M7T5nlgCrAOqAb82t1zY1iTiATMrgNZjJqygMlz1tKpUXXGDu1O2/rV4l2WJIhYhlx+H6E8z/R5QCpwFtAK+NjMvnT3XT9ZkdlQYChAcnJyDEoVkZLo64wtjJiQxsbdB7n1rNYMO6uNhuWSn4jlqyETaBox3YRQjy3StcBkD8kAVgDt867I3V909xR3T6lbVz/eFCnt9h/KYdSUBVz50vdULJfExJv6MPyX7RRw8jOx7MnNBNqYWQtgLTAYuCJPm9XA2cCXZlYfaAcsj2FNIlLCpa7ZwfDxqSzfvJdr+jbn7n7tqVQ+Kd5lSYKKWci5e7aZDQOmEfoJwcvuvsDMbgrPHwM8CLxiZvMIHd682923xKomESm5snJyee6zDF74PIN61Srw+g29OaV1nXiXJQkupr+Tc/epwNQ8942JuL0O+GUsaxCRkm/pxt38YXwq89fu4pKejblvQCdqVNKwXFI0jXgiIgnL3Xn9+9U8+N5CqlQoy5iretKvs4blkugp5EQkIe3cn8XIyelMnbeB09vW5cnLulG3WoV4lyUljEJORBLO3NXbuWXsXDbsPMAfz2/P0NNa6ofdclQUciKSMHJznX9+uZzHpy2hfvWKjL+pDz2TdcVuOXoKORFJCFv3HOSOCWlMX7KZfp0a8OigrrrmmxwzhZyIxN03y7Zw+7hUduzP4sGLOnNV72RdNUCOC4WciMRNdk4uz36WwXOfLaVFnSq8cm0vOjaqHu+yJECKDDkzO8Xdvy7qPhGRI7F+535uG5fKDyu2MahnEx4Y2IkqFfS5W46vaF5RzwE9o7hPRCQqny7ayJ0T0jiYnctTl3fjkp5N4l2SBFSBIWdmfYC+QF0zGx4xqzqhYbpERI7IoexcHv1wMf/6agUdGlbn+St60Kpu1XiXJQFWWE+uPKELmZYldK23w3YBl8ayKBEJnlVb93LL2LmkZ+7kt32aMbJ/ByqW0+dlia0CQ87dZwAzzOwVd19lZlXcfW8x1iYiAfFu2jpGTp5HGYMxV51Iv84N4l2SlBLRfCfXyMw+INSrSzazbsDv3P1/YluaiJR0+w/l8MB7Cxj7wxp6Jtfk2SE9aHJC5XiXJaVINCH3NKEreE8BcPc0Mzs9plWJSIn348bdDHtjDj9u3MPNZ7Ri+LltKZeki5pK8YrqfF13X5Pnh5k5sSlHREo6d+fNmWsY9e4CqlYoy6vX9eL0tnXjXZaUUtGE3Boz6wu4mZUHbgUWxbYsESmJdh/I4p635vNu2jpOaV2bv/66O/WqVYx3WVKKRRNyNwHPAI2BTOAj4PexLEpESp70zB3cMnYumdv3M+K8dtz0i1Yk6coBEmdFhpy7bwGuLIZaRKQEcnde/nolj3ywiLpVKzBu6Mmc1LxWvMsSAaIb1uvZfO7eCcxy93eOf0kiUlJs33uIOyek8eniTZzToT5PXNaVmpXLx7sskf+K5nBlRaA9MCE8PQhYAFxvZme6++2xKk5EEtcPK7Zx69i5bNt7iPsGdOSavs115QBJONGEXGvgLHfPBjCzvxP6Xu5cYF4MaxORBJST67zweQZPf/IjybUqM+nmvnRpUiPeZYnkK5qQawxUIXSIkvDtRu6eY2YHY1aZiCScTbsOcPubqXyzbCsDuzfioYs6U62iLmwqiSuakHsMSDWz6YABpwMPm1kV4JMY1iYiCWT6kk3cMT6NvYeyeWxQVy5LaaLDk5LwCg05MytD6DdxfYFehELuHndfF24yIrbliUi8ZeXk8sRHS/jHjOW0q1+NcVecTJv61YpeUCQBFBpy7p5rZk+6ex9AZ1KKlDJrtu3jlrFzSV2zgyG9krlvQEddOUBKlGgOV35kZoOAye7usS5IRBLDB/PWc9ekdHB4/ooe/Kpro3iXJHLEogm54YRONsk2swOEDlm6u1ePaWUiEhcHsnJ46P2FvPbdaro1qcFzQ3qSXFtXDpCSKZoRT3TwXaSUyNi0h2FvzGHxht3ceFoLRpzXnvJldeUAKbmiugqBmZ0AtCH0w3AA3P2LWBUlIsVv4uxM/vL2fCqWK8PL16RwVvv68S5J5JhFM6zXDcBtQBMgFTgZ+BY4K4pl+xEa3DkJeMndH8kzfwT/f1zMskAHoK67bzuCbRCRY7DnYDb3vj2fyXPX0rtFLZ4Z3IMGNXTlAAmGaI5D3AacBKxy9zOBHsDmohYysyTgBeB8oCMwxMw6RrZx98fdvbu7dwdGAjMUcCLFZ8G6nQx47iveTl3LbWe34Y0bT1bASaBEc7jygLsfMDPMrIK7LzazdlEs1wvIcPflAGY2DhgILCyg/RBgbFRVi8gxcXde/XYV//v+Ik6oUo7XbziZPq1qx7sskeMumpDLNLOawNvAx2a2HVhXxDIQGg5sTeR6gN75NTSzykA/YFgU6xWRY7BzXxZ3TUpj2oKNnNmuLk9c1o3aVSvEuyyRmIjm7MqLwzdHmdnnQA3ggyjWnd94PwX9zm4A8HVBhyrNbCgwFCA5OTmKhxaR/MxetY1bx6aycdcB/tS/A9ef2oIyurCpBFiR38mZ2X8O33b3Ge4+BXg5inVnAk0jpptQcA9wMIUcqnT3F909xd1T6tatG8VDi0ik3Fznb9MzuPwf31GmDEy8uS83nt5SASeBF83hyk6RE+ETSk6MYrmZQBszawGsJRRkV+RtZGY1gF8AV0WxThE5Qpt3H2T4+FS+XLqFC7o0ZPSgLlTXlQOklCgw5MxsJHAPUMnMdh2+GzgEvFjUit0928yGAdMI/YTgZXdfYGY3heePCTe9GPjI3fce/WaISH6+WrqF299MZfeBLB6+uAtDejXVlQOkVLGihqM0s9HuPrKY6ilSSkqKz5o1K95liCS07Jxc/vrJj/xt+jJa1a3K81f0oH0DjcQnwWVms909Je/90Zx4kjABJyJFW7tjP7eNncusVdu5PKUJoy7sROXyUQ1uJBI4euWLBMhHCzYwYmI62Tm5PDO4OwO7N453SSJxVdh3ci3cfUVxFiMiR+dgdg6jpy7mlW9W0rlxdZ4b0pMWdarEuyyRuCusJzcRONHMPnX3s4urIBE5Miu27GXYG3NYsG4X1/Rtzsj+7alQVhc2FYHCQ66Mmd0HtDWz4XlnuvtTsStLRKLxTupa7pk8j7JJZfjnb1I4t6OuHCASqbCQGwxcFG6ja8qJJJB9h7IZNWUB42dlktLsBJ4d0oNGNSvFuyyRhFNgyLn7EuBRM0t392iG8RKRYrB4wy6GvTGXZZv3MOzM1tx+ThvKJunCpiL5iebsym/M7Cng9PD0DOABd98Zu7JEJC93540fVvPAuwupVrEc/7muN6e2qRPvskQSWjQh9zIwH7g8PH018G/gklgVJSI/tetAFiMnzeP9ees5rU0dnrq8O3Wr6coBIkWJJuRaufugiOn7zSw1VgWJyE+lrtnBLWPnsG7HAe7u157faWBlkahFE3L7zexUd/8KwMxOAfbHtiwRyc11/vXVCh79cDH1q1dk/O/6cGKzE+JdlkiJEk3I3QS8Gr5aAMB24LexK0lEtu45yJ0T0vh8yWbO61SfxwZ1o0ZlXTlA5EhFM3ZlGtDNzKqHp3cVsYiIHINvl23l9jfnsn1vFg8M7MTVJzfTlQNEjlLUY1cq3ERiKyfXefbTpTz32VKa167Cy9ecRKdGNYpeUEQKpAGaRRLAhp0HuG3cXL5fsY1LejTmwYs6U6WC/j1FjpX+i0Ti7PPFm7hjQhoHsnJ48rJuDDqxSbxLEgmMqELOzPoCzSPbu/urMapJpFQ4lJ3L49MW888vV9C+QTVeuLInrepWjXdZIoFSZMiZ2X+AVkAqkBO+2wGFnMhRWr11H7eMnUNa5k6uPrkZf7qgAxXL6coBIsdbND25FKCju3usixEpDd5LX8fISfPA4O9X9uT8Lg3jXZJIYEUTcvOBBsD6GNciEmgHsnK4/92FjP1hNT2Sa/Ls4B40rVU53mWJBFo0IVcHWGhmPwAHD9/p7hfGrCqRgFm6cTfD3pjLko27uekXrbjjl20ppysHiMRcNCE3KtZFiASVuzNhVib3TplPlfJleeXak839tdUAABXKSURBVDijXb14lyVSakQz4smM4ihEJGh2H8jiz2/P553UdfRpWZunB3enfvWK8S5LpFSJ5uzKk4HngA5AeSAJ2Ovu1WNcm0iJNS9zJ7eMncPqbfsYfm5bfn9ma5J05QCRYhfN4crngcHABEJnWv4GaBPLokRKKnfn31+vZPQHi6hdpQLjhvahV4ta8S5LpNSK6sfg7p5hZknungP828y+iXFdIiXO9r2HGDExnU8WbeTs9vV44rJunFClfLzLEinVogm5fWZWHkg1s8cI/ZSgSmzLEilZZq7cxq1j57Jlz0H+8quOXHdKc105QCQBRHMO89XhdsOAvUBTYFChS4iUEjm5zvOfLWXwi99RvmwZJt3cl+tPbaGAE0kQ0ZxducrMKgEN3f3+YqhJpETYtOsAfxifytcZWxnQrREPX9yZahV1YVORRFJkT87MBhAat/LD8HR3M5sSzcrNrJ+ZLTGzDDP7YwFtzjCzVDNbYGb6uYKUCDN+3Ez/Z79k9qrtPHJJF54d3F0BJ5KAov0xeC9gOoC7p5pZ86IWMrMk4AXgXCATmGlmU9x9YUSbmsDfgH7uvtrM9CtZSWhZObk8+dGPjJmxjLb1q/LGjSfTtn61eJclIgWIJuSy3X3nUXzH0AvIcPflAGY2DhgILIxocwUw2d1XA7j7piN9EJHikrl9H7eOncuc1TsY0qsp9/6qE5XK68oBIoksqgGazewKIMnM2gC3AtH8hKAxsCZiOhPonadNW6CcmU0HqgHP6Dp1kog+nL+euyamk+vw3JAeDOjWKN4liUgUogm5W4A/ERqceSwwDXgwiuXy6/rlvVxPWeBE4GygEvCtmX3n7j/+ZEVmQ4GhAMnJyVE8tMjxcSArh4enLuLVb1fRtUkNnhvSg2a19QsakZIimrMr9xEKuT8d4bozCf3c4LAmwLp82mxx973AXjP7AugG/CTk3P1F4EWAlJQUXddOisWyzXsY9sZcFq3fxfWntuDufu0pX1ZXDhApSQoMuaLOoIziUjszgTZm1gJYS2hosCvytHkHeN7MyhIaF7M38NeiihaJtUmzM/nLO/OpULYM//ptCmd3qB/vkkTkKBTWk+tD6Du1scD35H/4sUDunm1mwwgd3kwCXnb3BWZ2U3j+GHdfZGYfAulALvCSu88/iu0QOS72HszmL+/MZ/KctfRqUYtnBnenYY1K8S5LRI6Sued/9C/8E4BzgSFAV+B9YKy7Lyi+8n4uJSXFZ82aFc8SJKAWrtvFsLFzWLFlL7ec1YZbz2pNWV3YVKREMLPZ7p6S9/4Ce3LhwZg/BD40swqEwm66mT3g7s/FrlSR4uXuvPbdKh58fxE1K5Xj9Rt607dVnXiXJSLHQaEnnoTD7QJCAdcceBaYHPuyRIrHzn1Z3D0pnQ8XbOAXbevy5OXdqFO1QrzLEpHjpLATT/4P6Ax8ANyv78okaGav2s6tY+eycdcB7unfnhtObUkZXdhUJFAK68ldTeiqA22BWyNGPDHAdWVwKalyc51/fLGcJz5aQsMaFZlwUx96JJ8Q77JEJAYK+05O37hL4GzZc5Dh49P44sfN9O/SgNGXdKVGJQ2sLBJUUV0ZXCQIvs7Ywu1vprJzfxYPXdSZK3sn67pvIgGnkJPAy87J5elPlvLC9Axa1qnCq9f1okNDHW0XKQ0UchJo63bs57Zxc5m5cjuXntiEBwZ2onJ5vexFSgv9t0tgfbxwIyMmppGVnctff92Ni3s0iXdJIlLMFHISOAezc3jkg8X8++uVdGpUneeG9KBl3arxLktE4kAhJ4Gycsteho2dw/y1u7imb3NG9m9PhbK6sKlIaaWQk0BwdybOzuT+dxeSVMb4x9Uncl6nBvEuS0TiTCEnJd66Hfu55615TF+ymV7Na/HXwd1pXFNXDhARhZyUYO7O+FlreOi9RWTnOvcN6Mhv+zTX0Fwi8l8KOSmR1u7Yzx8npfPl0i30blGLxy7tSrPaVeJdlogkGIWclCjuztgf1vDw1EXkuvPgwE5c2buZem8iki+FnJQYa7btY+TkeXyVsYU+LWvz2KVdaVqrcrzLEpEEppCThJeb67z+w2oemboIgIcu6swVvZLVexORIinkJKGt3rqPuyal8d3ybZzaug6PDOpCkxPUexOR6CjkJCHl5jr/+W4Vj3ywmKQyxiOXdOHXJzXVVQNE5Igo5CThrNq6lxET0/lhxTZOb1uX0Zd00e/eROSoKOQkYeTmOq98s5LHpi2mXJkyPDaoK5elNFHvTUSOmkJOEsKKLXu5a2IaM1du58x2dXn4ki40rKHem4gcG4WcxFVOrvPvr1fw+LQlVChbhicv68YlPRur9yYix4VCTuJm2eY9jJiQxpzVOzi7fT0evqQL9atXjHdZIhIgCjkpdjm5zktfLufJj3+kUrkk/vrrblzUXb03ETn+FHJSrDI27ebOCemkrtnBuR3r878Xdaaeem8iEiMKOSkW2Tm5vPjlcp7+ZClVyifxzODuXNitkXpvIhJTCjmJuSUbdjNiYhrpmTvp16kBD17UmbrVKsS7LBEpBWIacmbWD3gGSAJecvdH8sw/A3gHWBG+a7K7PxDLmqT4ZOXk8o8Zy3j20wyqVizL81f04IIuDdV7E5FiE7OQM7Mk4AXgXCATmGlmU9x9YZ6mX7r7r2JVh8THovW7GDExjflrd3FB14Y8cGEnaldV701Eilcse3K9gAx3Xw5gZuOAgUDekJMAycrJ5e/Tl/HcZ0upXrEcf7uyJ/27NIx3WSJSSsUy5BoDayKmM4He+bTrY2ZpwDrgTndfEMOaJIYWrNvJiAnpLFy/iwu7NWLUhZ2oVaV8vMsSkVIsliGX3xcvnmd6DtDM3feYWX/gbaDNz1ZkNhQYCpCcnHy865RjdCg7l+c/z+Bvn2dQs3J5xlx1Iv06N4h3WSIiMQ25TKBpxHQTQr21/3L3XRG3p5rZ38ysjrtvydPuReBFgJSUlLxBKXE0f+1O7pyQxuINu7m4R2Pu/VVHTlDvTUQSRCxDbibQxsxaAGuBwcAVkQ3MrAGw0d3dzHoBZYCtMaxJjpOD2Tk8/1kGf5u+jFpVyvPP36Rwbsf68S5LROQnYhZy7p5tZsOAaYR+QvCyuy8ws5vC88cAlwI3m1k2sB8Y7O7qqSW49MwdjJiQzpKNu7mkZ6j3VrOyem8iknispGVKSkqKz5o1K95llEoHsnJ49tOl/OOL5dSpWp7Rl3ThrPbqvYlI/JnZbHdPyXu/RjyRqMxdvZ0RE9PJ2LSHy1Oa8KcLOlKjUrl4lyUiUiiFnBTqQFYOf/3kR/75xXLqV6/IK9eexBnt6sW7LBGRqCjkpECzV23nrolpLNu8l8EnNeWeCzpQvaJ6byJScijk5GcOZOXw5EdLeOmrFTSqUYlXr+vF6W3rxrssEZEjppCTn5i1cht3TUxn+Za9XNk7mT+e355q6r2JSAmlkBMA9h/K4fFpS/j3N6He2+s39OaU1nXiXZaIyDFRyAnfL9/KXZPSWbV1H1ef3Iy7z29P1Qp6aYhIyad3slJs36FsHvtwCa98s5KmtSrxxo296dtKvTcRCQ6FXCn17bKt3D0pndXb9nFN3+bc1a8dlcvr5SAiwaJ3tVJm78FsHvlgMf/5bhXNalfmzaEn07tl7XiXJSISEwq5UuTrjC3cPSmdtTv2c90pLRhxXjsqlU+Kd1kiIjGjkCsFdh/IYvQHi3nj+9W0qFOFCb/rQ0rzWvEuS0Qk5hRyAffl0s38cdI81u3cz42ntWD4ueq9iUjpoZALqF0Hshg9dRFjf1hDy7pVmHhTX05sdkK8yxIRKVYKuQCavmQTIyfPY+OuA/zuFy35wzltqVhOvTcRKX0UcgGyc38WD723kAmzM2ldryqTbu5Lj2T13kSk9FLIBcRnizcycvI8tuw5xP+c0Ypbz26j3puIlHoKuRJu574sHnhvIZPmZNKufjX++ZsUujapGe+yREQSgkKuBPtk4UbueWseW/ce4pazWjPsrNZUKKvem4jIYQq5Emj73kPc/+4C3k5dR/sG1Xj5mpPo3LhGvMsSEUk4CrkSZtqCDfzprfns2HeI285uw+/PbE35smXiXZaISEJSyJUQ2/YeYtSUBUxJW0fHhtX5v+tOolMj9d5ERAqjkCsBPpi3nr+8M5+d+7MYfm5bbj6jFeWS1HsTESmKQi6Bbd1zkHunLOD99PV0blyd/1zfmw4Nq8e7LBGREkMhl4Dcnffnrefedxaw+0AWI85rx9DTW6r3JiJyhBRyCWbz7oPc+858Ppi/ga5NavD4pSfTrkG1eJclIlIiKeQShLszJW0do6YsYO/BHO7u154bT2tBWfXeRESOmkIuAWzafYA/vzWfjxZupHvTmjx+aVfa1FfvTUTkWCnk4sjdeSd1HfdNWcD+rBzu6d+e609tSVIZi3dpIiKBENNjYWbWz8yWmFmGmf2xkHYnmVmOmV0ay3oSycZdB7jx1Vnc/mYqrepWYeqtpzH09FYKOBGR4yhmPTkzSwJeAM4FMoGZZjbF3Rfm0+5RYFqsakkk7s6kOWt54N0FHMzO5c8XdODaU1oo3EREYiCWhyt7ARnuvhzAzMYBA4GFedrdAkwCTophLQlhw84DjJyczudLNnNS8xN47NJutKhTJd5liYgEVixDrjGwJmI6E+gd2cDMGgMXA2cR4JBzdybMzuTB9xaSlZPLfQM68ts+zSmj3puISEzFMuTyewf3PNNPA3e7e45ZwW/4ZjYUGAqQnJx83AosDut27OePk+fxxY+b6dWiFo8N6kpz9d5ERIpFLEMuE2gaMd0EWJenTQowLhxwdYD+Zpbt7m9HNnL3F4EXAVJSUvIGZUJyd8bNXMP/vr+InFzn/gs7cfXJzdR7ExEpRrEMuZlAGzNrAawFBgNXRDZw9xaHb5vZK8B7eQOuJMrcvo+Rk+fx5dIt9GlZm0cHdSW5duV4lyUiUurELOTcPdvMhhE6azIJeNndF5jZTeH5Y2L12PHi7rzxw2oefn8RAA9d1JkreiWr9yYiEicx/TG4u08Fpua5L99wc/drYllLrK3Zto+7J6XzzbKtnNK6No9c0pWmtdR7ExGJJ414coxyc53Xv1/F6A8WU8aMhy/uwpBeTSnsRBoRESkeCrljsGrrXu6elM53y7dxWps6PDKoK41rVop3WSIiEqaQOwq5uc6r367k0Q+XULaM8digrlyW0kS9NxGRBKOQO0Irt+zlronp/LByG2e0q8voS7rQsIZ6byIiiUghF6WcXOeVb1by+LTFlEsqw+OXduXSE9V7ExFJZAq5KCzfvIe7JqYza9V2zmpfj4cv7kKDGhXjXZaIiBRBIVeInFzn5a9W8MRHS6hYLomnLu/GxT0aq/cmIlJCKOQKkLFpDyMmpjF39Q7O6VCfhy/uTL3q6r2JiJQkCrk8snNyeemrFTz18Y9ULp/EM4O7c2G3Ruq9iYiUQAq5CEs37ubOiemkrdnBeZ3q8+BFnalXTb03EZGSSiFHqPf2jy+W88wnS6lasSzPDenBr7o2VO9NRKSEK/Uht2TDbkZMTCM9cycXdGnI/QM7UadqhXiXJSIix0GpDbmsnFzGTF/Gs58tpXrFcrxwRU8u6Now3mWJiMhxVCpDbtH6Xdw5IY0F63YxoFsjRg3oSG313kREAqfUhVxOrvP7N+awa38WY67qSb/O6r2JiARVqQu5pDLG80N60rBGRU6oUj7e5YiISAyVupAD6NioerxLEBGRYlAm3gWIiIjEikJOREQCSyEnIiKBpZATEZHAUsiJiEhgKeRERCSwFHIiIhJYCjkREQkshZyIiASWQk5ERALL3D3eNRwRM9sMrDoOq6oDbDkO6ykJtK3BVZq2V9saTMdrW5u5e928d5a4kDtezGyWu6fEu47ioG0NrtK0vdrWYIr1tupwpYiIBJZCTkREAqs0h9yL8S6gGGlbg6s0ba+2NZhiuq2l9js5EREJvtLckxMRkYALfMiZWT8zW2JmGWb2x3zmm5k9G56fbmY941Hn8RDFtp5hZjvNLDX8d2886jxWZvaymW0ys/kFzA/MPoWotjco+7WpmX1uZovMbIGZ3ZZPm8Ds2yi3Nyj7tqKZ/WBmaeFtvT+fNrHZt+4e2D8gCVgGtATKA2lAxzxt+gMfAAacDHwf77pjuK1nAO/Fu9bjsK2nAz2B+QXMD8Q+PYLtDcp+bQj0DN+uBvwY1P/XI9jeoOxbA6qGb5cDvgdOLo59G/SeXC8gw92Xu/shYBwwME+bgcCrHvIdUNPMGhZ3ocdBNNsaCO7+BbCtkCZB2adAVNsbCO6+3t3nhG/vBhYBjfM0C8y+jXJ7AyG8v/aEJ8uF//KeEBKTfRv0kGsMrImYzuTnL6Jo2pQE0W5Hn/Ahgw/MrFPxlFbsgrJPj0Sg9quZNQd6EPrEHymQ+7aQ7YWA7FszSzKzVGAT8LG7F8u+LXusK0hwls99eT89RNOmJIhmO+YQGvpmj5n1B94G2sS8suIXlH0arUDtVzOrCkwCbnf3XXln57NIid63RWxvYPatu+cA3c2sJvCWmXV298jvmWOyb4Pek8sEmkZMNwHWHUWbkqDI7XD3XYcPGbj7VKCcmdUpvhKLTVD2aVSCtF/NrByhN/zX3X1yPk0CtW+L2t4g7dvD3H0HMB3ol2dWTPZt0ENuJtDGzFqYWXlgMDAlT5spwG/CZ/acDOx09/XFXehxUOS2mlkDM7Pw7V6E9v/WYq809oKyT6MSlP0a3oZ/AYvc/akCmgVm30azvQHat3XDPTjMrBJwDrA4T7OY7NtAH65092wzGwZMI3T24cvuvsDMbgrPHwNMJXRWTwawD7g2XvUeiyi39VLgZjPLBvYDgz18WlNJYmZjCZ11VsfMMoH7CH2RHah9elgU2xuI/QqcAlwNzAt/dwNwD5AMgdy30WxvUPZtQ+D/zCyJUFCPd/f3iuO9WCOeiIhIYAX9cKWIiJRiCjkREQkshZyIiASWQk5ERAJLISciIoGlkBNJUGZ2a3iE+tfjXYtISaWfEIgkKDNbDJzv7iuiaFvW3bOLoSyREiXQPwYXKanMbAyhyyZNMbPXCI3QXonQD4KvdfclZnYNcAFQEahiZgOA54AuhP63R7n7O/GoXyRRqCcnkqDMbCWQAhwC9oVHtTkHuNndB4VD7iGgq7tvM7OHgYXu/lp4CKUfgB7uvjdOmyASd+rJiSS+GoSGRGpDaFT2chHzPnb3w9ea+yVwoZndGZ6uSGiIqEXFVqlIglHIiSS+B4HP3f3i8HXHpkfMi+ylGTDI3ZcUX2kiiU1nV4okvhrA2vDtawppNw24JWLU+h4xrksk4SnkRBLfY8BoM/ua0BUmCvIgoUOZ6WY2PzwtUqrpxBMREQks9eRERCSwFHIiIhJYCjkREQkshZyIiASWQk5ERAJLISciIoGlkBMRkcBSyImISGD9P43TWvojONvaAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# let's explore the monotonic relationship\n", "plt.figure(figsize=(7,5))\n", "pd.concat([test_t,y_test], axis=1).groupby(\"fare\")[\"survived\"].mean().plot()\n", "plt.title(\"Relationship between fare and target\")\n", "plt.xlabel(\"fare\")\n", "plt.ylabel(\"Mean of target\")\n", "plt.show()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Note how the bins are monotonically ordered with the target." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [] } ], "metadata": { "kernelspec": { "display_name": "fengine", "language": "python", "name": "fengine" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.8.2" }, "toc": { "base_numbering": 1, "nav_menu": {}, "number_sections": true, "sideBar": true, "skip_h1_title": false, "title_cell": "Table of Contents", "title_sidebar": "Contents", "toc_cell": false, "toc_position": {}, "toc_section_display": true, "toc_window_display": false } }, "nbformat": 4, "nbformat_minor": 4 }