{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# EqualFrequencyDiscretiser + WoEEncoder\n", "\n", "This is very useful for linear models, because by using discretisation + a monotonic encoding, we create monotonic variables with the target, from those that before were not originally. And this tends to help improve the performance of the linear model. " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## EqualFrequencyDiscretiser\n", "\n", "The EqualFrequencyDiscretiser() divides continuous numerical variables\n", "into contiguous equal frequency intervals, that is, intervals that contain\n", "approximately the same proportion of observations.\n", "\n", "The interval limits are determined by the quantiles. The number of intervals,\n", "i.e., the number of quantiles in which the variable should be divided is\n", "determined by the user.\n", "\n", "Note: Check out the EqualFrequencyDiscretiser notebook to larn more about this transformer." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## WoEEncoder\n", "\n", "This encoder replaces the labels by the weight of evidence.\n", "\n", "**It only works for binary classification.**\n", "\n", "Note: Check out the WoEEncoder notebook to learn more about this transformer." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "import numpy as np\n", "import matplotlib.pyplot as plt\n", "\n", "from sklearn.model_selection import train_test_split\n", "from sklearn.pipeline import Pipeline\n", "\n", "from feature_engine.discretisation import EqualFrequencyDiscretiser\n", "from feature_engine.encoding import WoEEncoder\n", "\n", "plt.rcParams[\"figure.figsize\"] = [15,5]" ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "# Load titanic dataset from OpenML\n", "\n", "def load_titanic():\n", " data = pd.read_csv('https://www.openml.org/data/get_csv/16826755/phpMYEkMl')\n", " data = data.replace('?', np.nan)\n", " data['cabin'] = data['cabin'].astype(str).str[0]\n", " data['pclass'] = data['pclass'].astype('O')\n", " data['age'] = data['age'].astype('float').fillna(data.age.median())\n", " data['fare'] = data['fare'].astype('float').fillna(data.fare.median())\n", " data['embarked'].fillna('C', inplace=True)\n", " data.drop(labels=['boat', 'body', 'home.dest', 'name', 'ticket'], axis=1, inplace=True)\n", " return data" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
pclasssurvivedsexagesibspparchfarecabinembarked
011female29.000000211.3375BS
111male0.916712151.5500CS
210female2.000012151.5500CS
310male30.000012151.5500CS
410female25.000012151.5500CS
\n", "
" ], "text/plain": [ " pclass survived sex age sibsp parch fare cabin embarked\n", "0 1 1 female 29.0000 0 0 211.3375 B S\n", "1 1 1 male 0.9167 1 2 151.5500 C S\n", "2 1 0 female 2.0000 1 2 151.5500 C S\n", "3 1 0 male 30.0000 1 2 151.5500 C S\n", "4 1 0 female 25.0000 1 2 151.5500 C S" ] }, "execution_count": 3, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data = load_titanic()\n", "data.head()" ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "X_train : (916, 8)\n", "X_test : (393, 8)\n" ] } ], "source": [ "# let's separate into training and testing set\n", "X = data.drop(['survived'], axis=1)\n", "y = data.survived\n", "\n", "X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)\n", "\n", "print(\"X_train :\" ,X_train.shape)\n", "print(\"X_test :\" ,X_test.shape)" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAA20AAAE/CAYAAADVKysfAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjEsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+j8jraAAAgAElEQVR4nO3df7Bc913e8fcTORjjBGLj+I4qaSKXakLsuLHpHZFOWriJ01iJGWQ6445SQ+Riqv5witPRDMgwQ6CtZtw/TGECBkSSRjMxMWrAtSZOIR4ltymdEMdOHGzZca3EwlakWpAfODelpjKf/nGPYFGurPtzz3d3368Zze5+7zm7z/n6Smcfn7NnU1VIkiRJktr0kr4DSJIkSZLOztImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiSpCUleneRzSb6R5Cf7ziO14ry+A0iSJEmdnwJmq+rqvoNILfFImyRJklrxKuDwUldK4oEIjTVLm7QMSfYk+WJ3+sZjSX6kG1+X5I4kf5rkqSTvTFKndyZJvivJ+5KcSPLlJP8hybp+t0aSpP4l+TjwRuBXkswlubU7VfK5JM8k+fmBZTd3+9ebkzwNfLwb//Ekjyf5WpLfT/KqfrZGWl2WNml5vgj8Q+C7gF8APphkPfDPgbcCVwHfB1x/xnr7gVPA3wGuBt4C/MSQMkuS1KyqehPwP4B3VtXLgM8D7wBeAVwH/KskZ+5XfxB4DXBt97OfAf4x8MruuT40pPjSmkpV9Z1BGnlJHgbeDdwK/HZV/UY3/mbgfuClwHcDTwOvqKo/737+dmBXVb2xl+CSJDUkySzwwap67wI/+yWgqurfJtkMPAV8T1V9qfv5fwM+XFXv6x6/BJgDXlNVfzycLZDWhkfapGVI8o4kDyf5epKvA68FLgH+FvDMwKKD91/FfHk7MbDebwCXDiu3JEmjIsn3J/lEkj9J8mfAv2R+XzvozP3sLw/sY78KBNgwnMTS2vFDm9ISdefH/yZwDfCpqnqhO9IW4ASwcWDxTQP3nwGeBy6pqlPDyitJ0oj6LeBXgLdW1f/tjrSdWdoGTxl7BthbVXcNK6A0LB5pk5buQuZ3En8CkOSfMX+kDeAAcGuSDUleAfz06ZWq6gTwMeCOJN+Z5CVJvifJDw43viRJI+HlwFe7wrYV+KfnWP7XgduSXAF/dfGvG9Y6pDQMljZpiarqMeAO4FPAs8CVwP/sfvybzBezPwI+B3yU+QuPvND9/B3AtwGPAV8DPgysH1Z2SZJGyL8G/l2SbwA/x/z/GD2rqroH+I/A3UmeAx5l/uJg0sjzQiTSGkryVuDXq8pLDkuSJGlZPNImraIkFyR5W5Lzkmxg/oqS9/SdS5IkSaPLI23SKkryHcB/B74X+HPgPuDWqnqu12CSJEkaWZY2SZIkSWqYp0dKkiRJUsMsbZIkSZLUsCa+XPuSSy6pzZs3L3v9b37zm1x44YWrF2gIzDwco5Z51PKCmYellcwPPfTQn1bVK/vOocVb6T4W2vn969Okz8Gkbz84B5O+/bD2c/Bi+9gmStvmzZt58MEHl73+7OwsMzMzqxdoCMw8HKOWedTygpmHpZXMSf647wxampXuY6Gd378+TfocTPr2g3Mw6dsPaz8HL7aP9fRISZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIad13cAabE277lv0csevf26NUwiSZPlkS//GTct8t9g//2VpNXnkTZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJ6lGSo0keSfJwkge7sYuT3J/kye72ooHlb0tyJMkTSa7tL7kkaVgsbZIk9e+NVXVVVU13j/cAh6pqC3Coe0ySy4EdwBXANuDOJOv6CCxJGh5LmyRJ7dkO7O/u7weuHxi/u6qer6qngCPA1h7ySZKGyEv+S5LUrwI+lqSA36iqfcBUVZ0AqKoTSS7tlt0A/OHAuse6sb8hyS5gF8DU1BSzs7MrCjh1Aey+8tSill3pa7Vqbm5ubLdtMSZ9+8E5mPTth37nwNImSVK/3lBVx7tidn+SL7zIsllgrL5lYL747QOYnp6umZmZFQV8z133cscji3vLcPTGlb1Wq2ZnZ1npPI6ySd9+cA4mffuh3znw9EhJknpUVce725PAPcyf7vhskvUA3e3JbvFjwKaB1TcCx4eXVpLUB0ubJEk9SXJhkpefvg+8BXgUOAjs7BbbCdzb3T8I7EhyfpLLgC3AA8NNLUkaNk+PlCSpP1PAPUlgfp/8W1X1e0k+AxxIcjPwNHADQFUdTnIAeAw4BdxSVS/0E12SNCyWNkmSelJVXwJet8D4V4BrzrLOXmDvGkeTJDXE0yMlSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYpU2SJEmSGnbO0pZkU5JPJHk8yeEkt3bjFye5P8mT3e1FA+vcluRIkieSXLuWGyBJkiRJ42wxR9pOAbur6jXA64FbklwO7AEOVdUW4FD3mO5nO4ArgG3AnUnWrUV4SZIkSRp35yxtVXWiqj7b3f8G8DiwAdgO7O8W2w9c393fDtxdVc9X1VPAEWDrageXJEmSpEmwpM+0JdkMXA18GpiqqhMwX+yAS7vFNgDPDKx2rBuTJEmSJC3ReYtdMMnLgN8B3lVVzyU566ILjNUCz7cL2AUwNTXF7OzsYqN8i7m5uRWt3wczL93uK08tetnTOfvOvFSjlhfMPCyjmFmSJK2ORZW2JC9lvrDdVVW/2w0/m2R9VZ1Ish442Y0fAzYNrL4ROH7mc1bVPmAfwPT0dM3MzCxvC5h/g76S9ftg5qW7ac99i1726I0zQP+Zl2rU8oKZh2UUM0uSpNWxmKtHBngf8HhV/eLAjw4CO7v7O4F7B8Z3JDk/yWXAFuCB1YssSZIkSZNjMUfa3gD8GPBIkoe7sZ8BbgcOJLkZeBq4AaCqDic5ADzG/JUnb6mqF1Y9uSRJkiRNgHOWtqr6Axb+nBrANWdZZy+wdwW5JEmSJEks8eqRkiRJkqThsrRJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmS1LMk65J8LslHuscXJ7k/yZPd7UUDy96W5EiSJ5Jc219qSdKwWNokSerfrcDjA4/3AIeqagtwqHtMksuBHcAVwDbgziTrhpxVkjRkljZJknqUZCNwHfDegeHtwP7u/n7g+oHxu6vq+ap6CjgCbB1WVklSPyxtkiT165eAnwL+cmBsqqpOAHS3l3bjG4BnBpY71o1JksbYeX0HkCRpUiX5IeBkVT2UZGYxqywwVgs87y5gF8DU1BSzs7MricnUBbD7ylOLWnalr9Wqubm5sd22xZj07QfnYNK3H/qdA0ubJEn9eQPww0neBnw78J1JPgg8m2R9VZ1Ish442S1/DNg0sP5G4PiZT1pV+4B9ANPT0zUzM7OikO+5617ueGRxbxmO3riy12rV7OwsK53HUTbp2w/OwaRvP/Q7B54eKUlST6rqtqraWFWbmb/AyMer6keBg8DObrGdwL3d/YPAjiTnJ7kM2AI8MOTYkqQh80ibJEntuR04kORm4GngBoCqOpzkAPAYcAq4pape6C+mJGkYLG2SJDWgqmaB2e7+V4BrzrLcXmDv0IJJknrn6ZGSJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNO2dpS/L+JCeTPDow9vNJvpzk4e7P2wZ+dluSI0meSHLtWgWXJEmSpEmwmCNtHwC2LTD+n6rqqu7PRwGSXA7sAK7o1rkzybrVCitJkiRJk+acpa2qPgl8dZHPtx24u6qer6qngCPA1hXkkyRJkqSJdt4K1n1nkncADwK7q+prwAbgDweWOdaNfYsku4BdAFNTU8zOzi47yNzc3IrW74OZl273lacWvezpnH1nXqpRywtmHpZRzCxJklbHckvbrwH/Hqju9g7gx4EssGwt9ARVtQ/YBzA9PV0zMzPLjDL/Bn0l6/fBzEt30577Fr3s0RtngP4zL9Wo5QUzD8soZpYkSatjWVePrKpnq+qFqvpL4Df561MgjwGbBhbdCBxfWURJkiRJmlzLKm1J1g88/BHg9JUlDwI7kpyf5DJgC/DAyiJKkiRJ0uQ65+mRST4EzACXJDkGvBuYSXIV86c+HgX+BUBVHU5yAHgMOAXcUlUvrE10SZIkSRp/5yxtVfX2BYbf9yLL7wX2riSUJEmSJGnesk6PlCRJkiQNh6VNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkqSeJPn2JA8k+XySw0l+oRu/OMn9SZ7sbi8aWOe2JEeSPJHk2v7SS5KGxdImSVJ/ngfeVFWvA64CtiV5PbAHOFRVW4BD3WOSXA7sAK4AtgF3JlnXS3JJ0tBY2iRJ6knNm+sevrT7U8B2YH83vh+4vru/Hbi7qp6vqqeAI8DWIUaWJPXA0iZJUo+SrEvyMHASuL+qPg1MVdUJgO720m7xDcAzA6sf68YkSWPsvL4DSJI0yarqBeCqJK8A7kny2hdZPAs9xbcslOwCdgFMTU0xOzu7ooxTF8DuK08tatmVvlar5ubmxnbbFmPStx+cg0nffuh3DixtkiQ1oKq+nmSW+c+qPZtkfVWdSLKe+aNwMH9kbdPAahuB4ws81z5gH8D09HTNzMysKNt77rqXOx5Z3FuGozeu7LVaNTs7y0rncZRN+vaDczDp2w/9zoGnR0qS1JMkr+yOsJHkAuDNwBeAg8DObrGdwL3d/YPAjiTnJ7kM2AI8MNzUkqRh80ibJEn9WQ/s764A+RLgQFV9JMmngANJbgaeBm4AqKrDSQ4AjwGngFu60yslSWPM0iZJUk+q6o+AqxcY/wpwzVnW2QvsXeNokqSGeHqkJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSwyxtkiRJktQwS5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDTtnaUvy/iQnkzw6MHZxkvuTPNndXjTws9uSHEnyRJJr1yq4JEmSJE2CxRxp+wCw7YyxPcChqtoCHOoek+RyYAdwRbfOnUnWrVpaSZIkSZow5yxtVfVJ4KtnDG8H9nf39wPXD4zfXVXPV9VTwBFg6ypllSRJkqSJs9zPtE1V1QmA7vbSbnwD8MzAcse6MUmSJEnSMpy3ys+XBcZqwQWTXcAugKmpKWZnZ5f9onNzcytavw9mXrrdV55a9LKnc/adealGLS+YeVhGMbMkSVodyy1tzyZZX1UnkqwHTnbjx4BNA8ttBI4v9ARVtQ/YBzA9PV0zMzPLjDL/Bn0l6/fBzEt30577Fr3s0RtngP4zL9Wo5QUzD8soZpYkSatjuadHHgR2dvd3AvcOjO9Icn6Sy4AtwAMriyhJkiRJk+ucR9qSfAiYAS5Jcgx4N3A7cCDJzcDTwA0AVXU4yQHgMeAUcEtVvbBG2SVJkiRp7J2ztFXV28/yo2vOsvxeYO9KQkmSJEmS5i339EhJkiRJ0hBY2iRJkiSpYZY2SZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSepJkU5JPJHk8yeEkt3bjFye5P8mT3e1FA+vcluRIkieSXNtfeknSsFjaJEnqzylgd1W9Bng9cEuSy4E9wKGq2gIc6h7T/WwHcAWwDbgzybpekkuShsbSJklST6rqRFV9trv/DeBxYAOwHdjfLbYfuL67vx24u6qer6qngCPA1uGmliQNm6VNkqQGJNkMXA18GpiqqhMwX+yAS7vFNgDPDKx2rBuTJI2x8/oOIEnSpEvyMuB3gHdV1XNJzrroAmO1wPPtAnYBTE1NMTs7u6J8UxfA7itPLWrZlb5Wq+bm5sZ22xZj0rcfnINJ337odw4sbZIk9SjJS5kvbHdV1e92w88mWV9VJ5KsB05248eATQOrbwSOn/mcVbUP2AcwPT1dMzMzK8r4nrvu5Y5HFveW4eiNK3utVs3OzrLSeRxlk7794BxM+vZDv3Pg6ZGSJPUk84fU3gc8XlW/OPCjg8DO7v5O4N6B8R1Jzk9yGbAFeGBYeSVJ/fBImyRJ/XkD8GPAI0ke7sZ+BrgdOJDkZuBp4AaAqjqc5ADwGPNXnrylql4YfmxJ0jBZ2iRJ6klV/QELf04N4JqzrLMX2LtmoSRJzfH0SEmSJElqmKVNkiRJkhrm6ZGSJGnVbN5z36KXPXr7dWuYRJLGh0faJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYl/zXomzecx+7rzzFTYu4lPNSLuG8lEtDS5IkSZPII22SJEmS1DBLmyRJkiQ1zNImSZIkSQ2ztEmSJElSw7wQicbS6QucLObiKUu5cIokSZI0bB5pkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJapilTZIkSZIaZmmTJEmSpIZZ2iRJkiSpYZY2SZIkSWqYpU2SJEmSGmZpkyRJkqSGWdokSZIkqWGWNkmSJElqmKVNkiRJkhpmaZMkSZKkhlnaJEmSJKlhljZJkiRJath5K1k5yVHgG8ALwKmqmk5yMfDbwGbgKPBPquprK4spSZIkSZNpNY60vbGqrqqq6e7xHuBQVW0BDnWPJUmSJEnLsKIjbWexHZjp7u8HZoGfXoPXUaM277mv7wiSJEnS2FjpkbYCPpbkoSS7urGpqjoB0N1eusLXkCRJkqSJtdIjbW+oquNJLgXuT/KFxa7YlbxdAFNTU8zOzi47xNzc3IrW78OoZd595SmmLpi/HSWLydzSf4dR+70AMw/LKGaWJEmrY0WlraqOd7cnk9wDbAWeTbK+qk4kWQ+cPMu6+4B9ANPT0zUzM7PsHLOzs6xk/T60kHlppzGex+4rT3HHI2txRu3aWUzmozfODCfMIrTwe7FUZh6OUcwsSZJWx7JPj0xyYZKXn74PvAV4FDgI7OwW2wncu9KQkiRJkjSpVnLYZAq4J8np5/mtqvq9JJ8BDiS5GXgauGHlMSVJkiRpMi27tFXVl4DXLTD+FeCalYSSJGkSJHk/8EPAyap6bTd21u87TXIbcDPz34/6k1X1+z3EliQN2Wp8T5skSVqeDwDbzhhb8PtOk1wO7ACu6Na5M8m64UWVJPVltK4qIfVsKRdvOXr7dWuYRNI4qKpPJtl8xvDZvu90O3B3VT0PPJXkCPMXAPvUMLJKkvrjkTZJktpytu873QA8M7DcsW5MkjTmPNImSdJoyAJjteCCq/hdqLC477xcjlH67sFJ/67ESd9+cA4mffuh3zmwtEmS1Jazfd/pMWDTwHIbgeMLPcFqfhcqwHvuundNvqezpe/JPJdJ/67ESd9+cA4mffuh3znw9EhJktpytu87PQjsSHJ+ksuALcADPeSTJA2ZR9okSepJkg8xf9GRS5IcA94N3M4C33daVYeTHAAeA04Bt1TVC70ElyQNlaVNkqSeVNXbz/KjBb/vtKr2AnvXLpEkqUWeHilJkiRJDbO0SZIkSVLDLG2SJEmS1DBLmyRJkiQ1zAuRSA3YvOc+YP7La2/q7p/N0duvG0YkSZIkNcIjbZIkSZLUMEubJEmSJDXM0iZJkiRJDfMzbZp4m8/xGTJJkiSpTx5pkyRJkqSGWdokSZIkqWGWNkmSJElqmJ9pGzN+PkuSJEkaLx5pkyRJkqSGWdokSZIkqWGeHimtEU9VlSRJ0mrwSJskSZIkNcwjbZKApR0ZPHr7dWuYRJIkSYM80iZJkiRJDbO0SZIkSVLDPD1SkiSNFU/3ljRuxqa0+Q+0JEnjyyvySppkY1PaJEnSaLGISdLiWNqkMeYbIkmSpNFnaZO0ZINlcPeVp7jpRcqhpyNLkiStjFePlCRJkqSGeaRNGjGe8ihJkjRZLG2S1pRXdpUkSVoZT4+UJEmSpIZZ2iRJkiSpYRN5eqSna0mSJEkaFRNZ2tbKUsrgB7ZduIZJJEmSJI0LS5skncGj8ZIkqSWWNkkTYSlfCC5JktQSL0QiSZIkSQ2ztEmSJElSwzw9siePfPnPFn16lp+Z0aRYymfJJEmSJoWlbQT4RlaSJEmaXJa2c7AwSZI0vha7n9995Slm1jaKJJ2VpU2SJEmL5teiSMNnaZOkFfDNizQ5Wjj7xn9H5vlvrybNml09Msm2JE8kOZJkz1q9jiRJk8Z9rCRNljU50pZkHfCrwD8CjgGfSXKwqh5bi9eTpHFz5v9FfrEvBPf/Ik8W97GSNHnW6vTIrcCRqvoSQJK7ge2AOxRJklbGfaxGhqcxSqtjrUrbBuCZgcfHgO9fo9eSpJGwVp+H8U3RxHEfO8HG+e97C58ZlM40+Hv5Yme9wNr+nUtVrf6TJjcA11bVT3SPfwzYWlX/ZmCZXcCu7uGrgSdW8JKXAH+6gvX7YObhGLXMo5YXzDwsrWR+VVW9su8Qk6yHfSy08/vXp0mfg0nffnAOJn37Ye3n4Kz72LU60nYM2DTweCNwfHCBqtoH7FuNF0vyYFVNr8ZzDYuZh2PUMo9aXjDzsIxiZq2Zoe5jwd8/cA4mffvBOZj07Yd+52Ctrh75GWBLksuSfBuwAzi4Rq8lSdIkcR8rSRNmTY60VdWpJO8Efh9YB7y/qg6vxWtJkjRJ3MdK0uRZsy/XrqqPAh9dq+c/w6qdAjJEZh6OUcs8annBzMMyipm1Roa8jwV//8A5mPTtB+dg0rcfepyDNbkQiSRJkiRpdazVZ9okSZIkSatg5Etbkm1JnkhyJMmevvMsJMn7k5xM8ujA2MVJ7k/yZHd7UZ8ZByXZlOQTSR5PcjjJrd14y5m/PckDST7fZf6FbrzZzABJ1iX5XJKPdI+bzguQ5GiSR5I8nOTBbqzZ3ElekeTDSb7Q/U7//cbzvrqb29N/nkvyrpYza3yNwj52NSx1P53ktm5OnkhybT+pV89y9vtjOAdLfh8xbnMAS3tfMqbbv6T3OMOcg5EubUnWAb8KvBW4HHh7ksv7TbWgDwDbzhjbAxyqqi3Aoe5xK04Bu6vqNcDrgVu6eW058/PAm6rqdcBVwLYkr6ftzAC3Ao8PPG4972lvrKqrBi5723LuXwZ+r6q+F3gd8/PdbN6qeqKb26uAvwf8H+AeGs6s8TRC+9jV8AEWuZ/u5mAHcEW3zp3dXI2yJe33x3QOlvQ+YkznABb5vmSMtx8W+R5n2HMw0qUN2AocqaovVdVfAHcD23vO9C2q6pPAV88Y3g7s7+7vB64faqgXUVUnquqz3f1vMP+XdwNtZ66qmusevrT7UzScOclG4DrgvQPDzeY9hyZzJ/lO4AeA9wFU1V9U1ddpNO8CrgG+WFV/zOhk1vgYiX3saljifno7cHdVPV9VTwFHmJ+rkbWM/f44zsFS30eM3Rws8X3J2G3/i2hiDka9tG0Anhl4fKwbGwVTVXUC5v+xBC7tOc+CkmwGrgY+TeOZu0P6DwMngfurqvXMvwT8FPCXA2Mt5z2tgI8leSjJrm6s1dx/G/gT4D93p3u8N8mFtJv3TDuAD3X3RyWzxsco72NXw9n+zo31vCxyvz+Wc7DE9xHjOAdLeV8yjtsPS3uPM9Q5GPXSlgXGvBzmKknyMuB3gHdV1XN95zmXqnqhO6VsI7A1yWv7znQ2SX4IOFlVD/WdZRneUFXfx/wpU7ck+YG+A72I84DvA36tqq4GvsmInFaY+S9N/mHgv/SdRRPLfezCxnZelrDfH8s5WOL7iLGag2W8Lxmr7R+wlPc4Q52DUS9tx4BNA483Asd7yrJUzyZZD9Ddnuw5z9+Q5KXM/8N9V1X9bjfcdObTutPfZpk/v7jVzG8AfjjJUeZPOXpTkg/Sbt6/UlXHu9uTzH/Waivt5j4GHOv+bynAh5kvca3mHfRW4LNV9Wz3eBQya7yM8j52NZzt79xYzssS9/tjOQenLfJ9xLjNwVLfl4zb9gNLfo8z1DkY9dL2GWBLksu6/yu9AzjYc6bFOgjs7O7vBO7tMcvfkCTMfwbo8ar6xYEftZz5lUle0d2/AHgz8AUazVxVt1XVxqrazPzv7cer6kdpNO9pSS5M8vLT94G3AI/SaO6q+t/AM0le3Q1dAzxGo3nP8Hb++tRIGI3MGi+jvI9dDWf7O3cQ2JHk/CSXAVuAB3rIt2qWsd8fxzlY6vuIsZqDZbwvGavth2W9xxnuHFTVSP8B3gb8L+CLwM/2necsGT8EnAD+H/Ot/Gbgu5m/As2T3e3FfeccyPsPmD+8+0fAw92ftzWe+e8Cn+syPwr8XDfebOaB7DPAR0YhL/OfEft89+fw6b9zLedm/ipgD3a/G/8VuKjlvF3m7wC+AnzXwFjTmf0znn9GYR+7Stu5pP008LPdnDwBvLXv/Kuw/Uve7/y3fO4AAABoSURBVI/hHCz5fcS4zcHAdi3qfcm4bf9y3uMMcw7SvaAkSZIkqUGjfnqkJEmSJI01S5skSZIkNczSJkmSJEkNs7RJkiRJUsMsbZIkSZLUMEubJEmSJDXM0iZJkiRJDbO0SZIkSVLD/j+UgSC64ypSUwAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# we will use two continuous variables for the transformations\n", "X_train[[\"age\", 'fare']].hist(bins=30)\n", "plt.show()" ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "Pipeline(memory=None,\n", " steps=[('EqualFrequencyDiscretiser',\n", " EqualFrequencyDiscretiser(q=4, return_boundaries=False,\n", " return_object=True,\n", " variables=['age', 'fare'])),\n", " ('WoEEncoder', WoEEncoder(variables=['age', 'fare']))],\n", " verbose=False)" ] }, "execution_count": 8, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# set up the discretiser\n", "\n", "efd = EqualFrequencyDiscretiser(\n", " q=4,\n", " variables=['age', 'fare'],\n", " # important: return values as categorical\n", " return_object=True)\n", "\n", "# set up the encoder\n", "woe = WoEEncoder(variables=['age', 'fare'])\n", "\n", "# pipeline\n", "transformer = Pipeline(steps=[('EqualFrequencyDiscretiser', efd),\n", " ('WoEEncoder', woe),\n", " ])\n", "\n", "transformer.fit(X_train, y_train)" ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "{'age': [-inf, 23.0, 28.0, 35.0, inf],\n", " 'fare': [-inf, 7.8958, 14.4542, 31.275, inf]}" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "transformer.named_steps['EqualFrequencyDiscretiser'].binner_dict_" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "{'age': {0: 0.07533270507296917,\n", " 1: -0.260402163917158,\n", " 2: 0.3237107275657203,\n", " 3: 0.05769015189511875},\n", " 'fare': {0: -0.5990108946387251,\n", " 1: -0.41504696424627724,\n", " 2: 0.142571903020815,\n", " 3: 0.7852653023249282}}" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "transformer.named_steps['WoEEncoder'].encoder_dict_" ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
pclasssexagesibspparchfarecabinembarked
11393male0.05769000-0.599011nS
5332female0.075333010.142572nS
4592male0.057690100.142572nS
11503male-0.260402000.142572nS
3932male-0.260402000.785265nS
\n", "
" ], "text/plain": [ " pclass sex age sibsp parch fare cabin embarked\n", "1139 3 male 0.057690 0 0 -0.599011 n S\n", "533 2 female 0.075333 0 1 0.142572 n S\n", "459 2 male 0.057690 1 0 0.142572 n S\n", "1150 3 male -0.260402 0 0 0.142572 n S\n", "393 2 male -0.260402 0 0 0.785265 n S" ] }, "execution_count": 11, "metadata": {}, "output_type": "execute_result" } ], "source": [ "train_t = transformer.transform(X_train)\n", "test_t = transformer.transform(X_test)\n", "\n", "test_t.head()" ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAbkAAAFNCAYAAACdVxEnAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjEsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+j8jraAAAgAElEQVR4nO3dd3yV9d3/8dcnCWHvsCEksofMyLa3A62zatU6oUoVd+1tbbX3ba1tbbV2qFVbb7SIDEXqwm21rspQwhQQFAhh770zPr8/zkV/aUzIAXJynZzzfj4eeeRc87y/Z33ONb7XMXdHREQkEaWEHUBERCRWVORERCRhqciJiEjCUpETEZGEpSInIiIJS0VOREQSloqcHDMz+8jMrjvGZTPNbI+ZpVZ2rhL3cZ+ZTTzC9EVmdsoxrtvNrOMxh4tzZnaRma0OnqO+Yec5Gon+3MjRUZFLcma20sz2Bx9mG8xsnJnVi9H9DD887O6r3L2euxdV9n1Fy917uPtHVX2/1eRD+A/ArcFzNDfsMJXleL6YJcL9JyMVOQE4393rAX2AvsDPQs4j4WsPLDqWBc0srZKzxI1EbluiUpGTf3P3DcC7RIodAGY2yMymm9kOM5tf3u49M+tgZh+Y2VYz22Jmk8ysUTBtApAJvB5sMf7UzLKCLZq0YJ7WZvaamW0zs2Vmdn2Jdd9nZlPMbLyZ7Q52M+aUmH6Xma0Npi01s9NLREs/wnL/3roM7uNFM3shmHeOmfWu4CE7x8xWBO39vZn9+/1kZqPM7Esz225m75pZ+2D8J8Es84PH4jIz+9jMLg6mDwsel3OC4eFmNq+i9QbTuprZe8FjuNTMvldi2jgze8LM3gza95mZdSjjeaxpZnuA1CDj8mD83Wa2PFh2sZldVGKZa8xsmpk9bGbbgPuC9fzBzFaZ2UYze9LMapf1IB7ptVPiebrTzBaY2c7gOapVYvpPzGy9ma0zs1HlPVlm9hvgZODx4LF/PBj/qEV2ze4ys9lmdnKJZQ6/Liaa2S7gGjPLNrNPgsfi/eBxnVhimTLfM+Xdv8SYu+svif+AlcDw4HZb4Avg0WC4DbAVOIfIF6IzguFmwfSPgOuC2x2D6TWBZsAnwCNl3U8wnAU4kBYMfwz8BahFpMhuBk4Ppt0HHAhypAIPADODaV2A1UDrEuvtUNFyZbT9PqAAuASoAdwJ5AE1ynncHPgQaEKkgH9V4rG4EFgGdAPSgHuA6aWW7Vhi+FfAY8Ht/wGWA78rMe3RitYL1A0eh2uDaf2ALUCPYPo4YBswIJg+CZh8hNdF6YyXAq2D18FlwF6gVTDtGqAQuC1Yd23gEeC14PGpD7wOPFDOfUXz2vk8uP8mwJfAjcG0s4CNQM/gMXiudPZS9/XR4eepxLirgaZB9h8DG4BapV4XFwZtrw3MILI7Nx0YBuwCJh7te0Z/VfQZF3YA/YX8Aoh8gOwBdgcfDv8EGgXT7gImlJr/XeD7we1y37DBh8LcUvdTZpED2gFFQP0S0x8AxgW37wPeLzGtO7A/uN0R2AQMp1RBOtJypTMF85YsgCnAeuDkctrnwFklhm8G/hncfhv4Qal17QPal1i2ZAE5HVgQ3H4HuI7/X8Q/Br5b0XqJFJ5/lcr4f8AvgtvjgKdLTDsHWHKE10W5hSKYPg+4ILh9DbCqxDQjUgQ7lBg3GMiL8jVZ1mvn6hLDDwFPBrfHAg+WmNb5SNmJosgA24HeJV4Xn5SYlkmkoNcpMW4i/7/IHfN7Rn+x+dPuSgG40N3rA6cAXYGMYHx74NJgt8sOM9tB5Jtrq9IrMLPmZjY52G24i8gbP6P0fOVoDWxz990lxuUT+VZ82IYSt/cBtcwszd2XAT8i8mG0KcjQuqLlysmx+vANdy8G1gTZyrO6xO38EvO2Bx4t8ZhtI/LB34ayzQA6m1kLIlux44F2ZpZBZMvr8C7OI623PTCw1HN1FdCyxP2UfiyiPsHIzEaa2bwS6+7Jfz6/JR+LZkAdYHaJ+d8Jxpe17mheO+Vlb803n4ejYmY/DnYB7wyyNjxC2w6/VveVMz3q94xUDRU5+Td3/5jIN/4/BKNWE/lW2qjEX113f7CMxR8g8g26l7s3ILILyEqu/gh3vQ5oYmb1S4zLBNZGmfs5dx9G5APGgd9Fs1wZ2h2+YZHja22DbBXOTyTv4XlXAzeUetxqu/v0cvLvA2YDtwML3f0QMB24A1ju7luiWO9q4ONS0+q5+01H/SiUEhz3ewq4FWjq7o2AhZT//G4B9hPZVXo4S0OPnNxUlopeO0eynm8+D0fyH6/D4PjbXcD3gMZB23ZSftvWE3mt1ikxruT9V/Se0c++VDEVOSntEeAMM+tD5Bv1+Wb2bTNLNbNaZnaKmbUtY7n6RHZ77jCzNsBPSk3fCJxQ1h26+2oiH+oPBPfRC/gBkeNGR2RmXczsNDOrSeT4234iuz6PRX8z+26wpfcj4CAw8wjz/8TMGptZOyIF6oVg/JPAz8ysR5CxoZldWmK5sh6Lj4kUkY+D4Y9KDVe03jeIbA2OMLMawd9JZtYt6taXry6RD+fNwf1eS2RLrkzBVvBTwMNm1jxYpo2ZfbucRSp67RzJFCIng3QPCs8vKpi/9GNfn8jux81AmpndCzQob2F3zwdyiZxck25mg4HzS8xS0Xum3PeBxIaKnPwHd99MZHfZz4PicwGRkyE2E/mW+hPKft38ksjJDjuBN4GXS01/ALgn2IVzZxnLX0HkON064BUix5LeiyJyTeBBIlsPG4DmQd5jMZXIsa3twAgix8IKKph/NpHjU28CfwNw91eIbE1ODna/LQTOLrHcfcCzwWNx+AzIj4l84H5SzvAR1xvs6j0TuJzIY7ghmLfm0T4Ipbn7YuCPRHarbgROBKZVsNhdRE6SmRlkfZ/ISUJlqei1c6RsbxP5YvZBcH8fVLDIo8AlFjk79c9Ejpe9TeTEoXwiX5RWH2F5iOwGHkzkhJL7iXy5ORjkqeg9U/r+JcbMXVvPImZ2H5GTFa4OO4tUL2b2ApGTeCraipQQaEtOROQoBLuBO5hZipmdRWTL7dWwc0nZ1HtfROTotCSyS7UpkTNwb/IEuvRZotHuShERSVjaXSkiIglLRU5ERBJWtTsml5GR4VlZWWHHEBGRODJ79uwt7v6Nq+pUuyKXlZVFbm5u2DFERCSOmFmZl3TT7koREUlYKnIiIpKwVORERCRhqciJiEjCUpETEZGEpSInIiIJS0VOREQSVkyLnJmdZWZLzWyZmd1dzjynmNk8M1tkZh+XNY+IiMixiFlncDNLBZ4AziBype5ZZvZa8AOMh+dpBPwFOMvdVx3+FWEREZHKEMstuQHAMndf4e6HgMlEfneppCuBl919FYC7b4phHhERiRPuzuz8bbyzcH1M7yeWl/Vqw3/+jPwaYGCpeToDNczsI6A+8Ki7jy+9IjMbDYwGyMzMjElYERGJvUOFxbz1xXqemZbH/DU76di8Ht/u0RIzi8n9xbLIlZW49I/XpQH9gdOB2sAMM5vp7l/9x0LuY4AxADk5OfoBPBGRambrnoM8//kqxs/IZ9Pug5zQrC73X9iT7/ZrE7MCB7EtcmuAdiWG2wLryphni7vvBfaa2SdAb+ArRESk2luyYRfPfLqSV+at5VBhMd/q3IyHLsniW52akZISu+J2WCyL3Cygk5llA2uBy4kcgytpKvC4maUB6UR2Zz4cw0wiIhJjxcXOh0s3MXZaHtOWbaVWjRQu6d+Wa4dk0alF/SrNErMi5+6FZnYr8C6QCox190VmdmMw/Ul3/9LM3gEWAMXA0+6+MFaZREQkdvYcLOSl2Wt4ZloeK7fuo1XDWtx1VleuGNCORnXSQ8lk7tXrEFdOTo7r9+REROLH6m37eHb6Sl6YtZrdBwvpm9mIUUOzOatnS2qkVs01R8xstrvnlB5f7X40VUREwufuzFq5nbGf5vGPxRtIMeOcE1tx7dAs+mY2Djvev6nIiYhI1A4WFvHG/PU8Mz2PhWt30ahODW78rw6MGNyeVg1rhx3vG1TkRESkQlv2HGTSzFVMmJnPlj0H6di8Hr+96EQu6tuG2umpYccrl4qciIiUa/G6XTwzLY+p89ZxqKiYU7s0Y9SwbIZ1zIhp/7bKoiInIiL/oajY+eeXGxk7LY+ZK7ZRu0Yql53UjmuGZtGhWb2w4x0VFTkREQFg94EC/p67hnHTV7Jq2z7aNKrNz87uyuUnZdKwTo2w4x0TFTkRkSS3aus+xk1fyZTc1ew5WEhO+8bcfXZXzuzegrQq6gIQKypyIiJJyN2ZuWIbY6fl8f6XG0k14/zerbl2aBa92jYKO16lUZETEUkiBwqKeG3+OsZ+mseSDbtpUjedW0/tyNWD2tOiQa2w41U6FTkRkSSwafcBJs5cxaSZ+Wzde4guLerzu4tP5II+bahVI367ABwvFTkRkQS2cO1Oxk7L4/X56ygsdk7v2pxRQ7MZ3KFptegCcLxU5EREEkxRsfPe4g2M/XQln6/cRp30VK4a2J7vD8kiO6Nu2PGqlIqciEiC2HWggCmzVjNu+krWbN9P28a1uefcblya046GtatnF4DjpSInIlLN5W3Zy7PTV/L33NXsPVTEgOwm3HNud87o3oLUKvhh0nimIiciUg25O9OXb2Xsp3l8sHQTaSmRLgCjhmbTs03DsOPFDRU5EZFq5EBBEVPnrWXspytZunE3Teum88PTOnHVoEya10+8LgDHS0VORKQa2LjrABNm5PPc56vYtvcQ3Vo14PeX9OL83q0TugvA8VKRExGJYwvW7GDsp3m8sWA9Re6c0a0F1w7NZtAJTZKiC8DxUpETEYkzhUXF/GPxRsZ+mkdu/nbq1Uxj5OAsrhmSRWbTOmHHq1ZU5ERE4sTOfQVMnrWK8TPyWbtjP5lN6nDved25NKct9WslZxeA46UiJyISsuWb9zBu2kpenL2G/QVFDD6hKfd9pwendW2e9F0AjpeKnIhICNydf329hbHT8vho6WbSU1O4oE9rrh2aTffWDcKOlzBU5EREqtD+Q0W8Mnctz0zL4+tNe8ioV5M7zujMlQMzyahXM+x4CUdFTkSkCqzfuZ/xM/J5/vNV7NhXQI/WDfjT93pzbq9W1ExTF4BYUZETEYmhuau2M3baSt76Yj3uzpndWzJqWDYnZTVWF4AqoCInIlLJCoqKeXvhBp6ZlsfcVTuoXzONUUOzGDk4i3ZN1AWgKqnIiYhUku17D/H8rFWMn57Phl0HyGpah19+pwcX929LvZr6uA2DHnURkeP09cbdPDN9JS/PWcOBgmKGdczgNxf15NQuzUlRF4BQqciJiByD4mLn468388y0lXzy1WbS01L4bt82XDM0i64t1QUgXqjIiYgchX2HCnlpTqQLwIrNe2levyZ3ntmZKwZk0lRdAOKOipyISBTW7tjP+Bkref6zVew6UEivtg159PI+nN2zFelpKWHHk3KoyImIlMPdmbNqO2M/Xck7izYAcFaPlowalkW/THUBqA5U5ERESjlUWMxbX6znmWl5zF+zkwa10rju5GxGDs6iTaPaYceTo6AiJyIS2Lb3EM99ls/4Gfls2n2QE5rV5dcX9uTifm2ok66Py+pIz5qIJL2lG3bzzLQ8Xpm7loOFxXyrczN+d0kW/9WpmboAVHMqciKSlIqLnQ+XbmLstDymLdtKrRopXNy/LdcOyaJTi/phx5NKoiInIkll78FCXpy9hnHTV5K3ZS8tG9Tip2d14YqTMmlcNz3seFLJVOREJCms3raP8TNWMnnWanYfKKRvZiMeu6IvZ/VsSY1UdQFIVCpyIpKw3J1ZK7cz9tM8/rF4A2bGOSe24tqhkS4AkvhU5EQk4RwsLOLNBesZOy2PhWt30ahODW74rw6MHNyeVg3VBSCZqMiJSMLYsucgk2auYuJn+WzefZCOzevx24tO5KK+baidrh8mTUYqciJS7S1et4tnpuUxdf46DhUWc0qXZowams3JnTJ0VZIkpyInItVSUbHzwZJNjP00jxkrtlK7RiqX5bTjmqFZdGhWL+x4EidU5ESkWtl9oIC/50a6AKzato82jWrzs7O7cvlJmTSsUyPseBJnVOREpFpYtXUf46avZEruavYcLCSnfWPuPrsrZ3ZvQZq6AEg5VOREJG65OzNXbGPstDze/3IjqWac16sV1w7Npne7RmHHk2ogpkXOzM4CHgVSgafd/cFS008BpgJ5waiX3f1XscwkIvHvQEERr89fx9hpK/ly/S6a1E3nllM6MmJwe1o0qBV2PKlGYlbkzCwVeAI4A1gDzDKz19x9calZ/+Xu58Uqh4hUH5t2H2DizFU891k+W/YcokuL+vzu4hO5oE8batVQFwA5erHckhsALHP3FQBmNhm4AChd5EQkyS1cu5Ox0/J4ff46Coqc07s2Z9SwbIZ0aKouAHJcYlnk2gCrSwyvAQaWMd9gM5sPrAPudPdFMcwkInGiqNh5b/FGxk7L4/O8bdRJT+Wqge35/pAssjPqhh1PEkQsi1xZX7+81PAcoL277zGzc4BXgU7fWJHZaGA0QGZmZmXnFJEqtOtAAVNmrWbc9JWs2b6fNo1qc8+53bg0px0Na6sLgFSuWBa5NUC7EsNtiWyt/Zu77ypx+y0z+4uZZbj7llLzjQHGAOTk5JQulCJSDeRt2cuz01fy99zV7D1UxICsJtxzbjeGd1MXAImdWBa5WUAnM8sG1gKXA1eWnMHMWgIb3d3NbACQAmyNYSYRqULuzvTlW3lmWh7/XLKJtBTj/N6tGTU0m55tGoYdT5JAzIqcuxea2a3Au0S6EIx190VmdmMw/UngEuAmMysE9gOXu7u21ESquQMFRUydt5axn65k6cbdNK2bzm2ndeLqQZk0r68uAFJ1rLrVlJycHM/NzQ07hoiUYeOuA0ycmc+kz1axbe8hurVqwKihWZzfu7W6AEhMmdlsd88pPV5XPBGR47ZgzQ7GfprHm1+sp7DYGd6tBaOGZjPohCbqAiChUpETkWNSWFTMPxZvZOyneeTmb6dezTRGDMri+0Pa076pugBIfFCRE5GjsnNfAZNnrWL8jHzW7thPZpM63Htedy7NaUv9WuoCIPFFRU5EorJ88x7GTVvJi7PXsL+giMEnNOUX53fn9G4tSE3RLkmJTypyIlIud+dfX2/hmWl5fLh0M+mpKVzQpzXXDs2me+sGYccTqZCKnIh8w/5DRbwydy3PTMvj6017yKhXk/8e3pkrB2bSrH7NsOOJRE1FTkT+bf3O/UyYkc9zn69ix74CerRuwB8v7c15vVtRM01dAKT6UZETEeau2s7YaSt5+4v1FLtzZveWjBqWzUlZjdUFQKo1FTmRJFVQVMw7Czcwdloec1ftoH7NNK4ZksX3h2TRrkmdsOOJVAoVOZEks2PfIZ7/fDXjZ6xk/c4DZDWtwy+/04OL+7elXk19JEhi0StaJEms3bGfJz5cxstz1nCgoJihHZty/4U9ObVLc1LUBUASlIqcSBKYvnwLt0yaw95DRXy3bxuuGZpF15bqAiCJT0VOJIG5OxNm5vPL1xeTnVGXl0fm6Fe3JamoyIkkqEOFxfzitYU8//lqhndrzsOX9dFltyTpqMiJJKDNuw9y08TZ5OZv55ZTO/DjM7rouJskJRU5kQTzxZqdjJ6Qy/Z9h3jsir6c37t12JFEQqMiJ5JAps5by09fXEBGvZq8dNMQerRuGHYkkVCpyIkkgKJi5w//WMpfP1rOgKwm/OXqfmTU0zUmRVTkRKq5XQcKuP35uXy4dDNXDszkvvN7kJ6WEnYskbigIidSja3YvIfrxueyaus+7r+wJ1cPah92JJG4oiInUk19tHQTtz0/lxqpKUy6biADT2gadiSRuKMiJ1LNuDtjPlnB795ZQpeWDXhqZH/aNtYFlUXKoiInUo0cKCjiZy9/wStz13Luia34/aW9qJOut7FIefTuEKkm1u/czw0TZrNgzU7uPLMzt5zaUb/1JlIBFTmRamB2/jZumDCH/YcKeWpkDmd0bxF2JJFqQUVOJM5NmbWae15dSKtGtXju+oF0blE/7Egi1YaKnEicKiwq5v43v2Tc9JWc3CmDx67oS6M66WHHEqlWKuwxamZDoxknIpVn+95DjBz7OeOmr+QHw7J55pqTVOBEjkE0W3KPAf2iGCcilWDpht1cPz6XDTsP8IdLe3NJ/7ZhRxKptsotcmY2GBgCNDOzO0pMagCkxjqYSDJ6Z+EG7pgyj3o103jhhkH0zWwcdiSRau1IW3LpQL1gnpJHuncBl8QylEiyKS52HvtgGQ+//xW92zVizIj+tGhQK+xYItVeuUXO3T8GPjazce6eb2Z13X1vFWYTSQp7DxZy59/n8/bCDXy3Xxt+e9GJ1KqhnSUilSGaS5W3NrPFwJcAZtbbzP4S21giyWH1tn1c/NfpvLtoA/ec240/XtpbBU6kEkVz4skjwLeB1wDcfb6ZfSumqUSSwPTlW7hl0hyKip1x1w7gW52bhR1JJOFE1U/O3VeXunxQUWziiCQ+d2f8jHx+9cZisjPq8tTIHLIz6oYdSyQhRVPkVpvZEMDNLB34IcGuSxE5OocKi7l36kImz1rN8G7NefiyPtSvVSPsWCIJK5oidyPwKNAGWAP8A7gllqFEEtHm3Qe5aeJscvO3c+upHbnjjM6kpOgCyyKxVGGRc/ctwFVVkEUkYX2xZiejJ+Syfd8hHr+yL+f1ah12JJGkUGGRM7M/lzF6J5Dr7lMrP5JIYpk6by0/fXEBGfVq8tJNQ+jRumHYkUSSRjRdCGoBfYCvg79eQBPgB2b2SAyziVRrRcXOg28v4fbJ8+jdthFTbx2qAidSxaI5JtcROM3dCwHM7K9EjsudAXwRw2wi1dauAwXc/vxcPly6masGZvKL83uQnhbNd0oRqUzRFLk2QF0iuygJbrd29yIzOxizZCLV1IrNe7hufC6rtu7j/gt7cvWg9mFHEkla0RS5h4B5ZvYRYMC3gN+aWV3g/RhmE6l2Plq6iduen0uN1BQmXTeQgSc0DTuSSFI7YpEzsxQifeKGAAOIFLn/cfd1wSw/iW08kerB3RnzyQp+984SurRswFMj+9O2cZ2wY4kkvSMWOXcvNrM/uvtgQGdSipThQEERd7+0gFfnrePcE1vx+0t7USc9qosJiUiMRfNO/IeZXQy87O4e60Ai1cn6nfu5YcJsvli7k598uws3n9KBUpfAE5EQRVPk7iByskmhmR0gssvS3b1BTJOJxLnZ+du4YcIcDhQU8dSIHIZ3bxF2JBEppcJzmt29vrunuHu6uzcIhqMqcGZ2lpktNbNlZnb3EeY7ycyKzEw/xirVwpRZq7lizGfUrZnKKzcPUYETiVNRHTgws8ZAJyIdwwFw908qWCYVeIJIf7o1wCwze83dF5cx3++Ad48uukjVKygq5jdvfsm46Ss5uVMGj13Rl0Z10sOOJSLliOayXtcBtwNtgXnAIGAGcFoFiw4Alrn7imA9k4ELgMWl5rsNeAk46aiSi1Sx7XsPcctzc5i+fCs/GJbNz87uSlqqOniLxLNo3qG3EylA+e5+KtAX2BzFcm2A1SWG1wTj/s3M2gAXAU9GlVYkJEs27OI7T3xKbv52/nBpb35+XncVOJFqIJrdlQfc/YCZYWY13X2JmXWJYrmyTjErfXbmI8BdwdVTyl+R2WhgNEBmZmYUdy1Sed5ZuIE7psyjXs00Xhg9iL6ZjcOOJCJRiqbIrTGzRsCrwHtmth1YV8EyENlya1diuG0Zy+UAk4MClwGcY2aF7v5qyZncfQwwBiAnJ0fdGKRKFBc7j32wjIff/4re7RoxZkR/WjSoVfGCIhI3ovk9uYuCm/eZ2YdAQ+DtKNY9C+hkZtnAWuBy4MpS684+fNvMxgFvlC5wImHYe7CQH0+ZzzuLNvDdfm347UUnUqtGatixROQoRXPiyQR3HwHg7h8fHgeMONJy7l5oZrcSOWsyFRjr7ovM7MZguo7DSVxavW0f14/P5auNu7nn3G78YFi2OniLVFPR7K7sUXIgOOW/fzQrd/e3gLdKjSuzuLn7NdGsUySWpi/fwi2T5lDs8OyoAZzcqVnYkUTkOJR7epiZ/czMdgO9zGxX8Lcb2ISuYykJxt15dvpKRvztc5rWq8nUW4aqwIkkgHK35Nz9AeABM3vA3X9WhZlEqtShwmLunbqQybNWM7xbcx6+rA/1a9UIO5aIVIJoTjxRgZOEtXn3QW6aOJvc/O3cempH7jijMykpOv4mkij0eyCStL5Ys5PRE3LZvu8Qj1/Zl/N6tQ47kohUsiMdk8sub5pIdTd13loueXI6KWa8dNMQFTiRBHWk6xK9CGBm/6yiLCIxV1TsPPj2Em6fPI/e7Rrx2q1D6dG6YdixRCRGjrS7MsXMfgF0NrM7Sk909z/FLpZI5dt1oIDbn5/Lh0s3c9XATH5xfg/S03T9SZFEdqQidzlwYTBP/aqJIxIbyzfv4frxuazauo/7L+zJ1YPahx1JRKrAkboQLAV+Z2YL3D2ay3iJxKUPl27ih8/PpUZqCpOuG8jAE5qGHUlEqkg0Z1dON7M/Ad8Khj8GfuXuO2MXS+T4uTtjPlnBg+8soVvLBowZ2Z+2jeuEHUtEqlA0RW4ssBD4XjA8AngG+G6sQokcrwMFRdz90gJenbeOc3u14veX9KJOunrMiCSbaN71Hdz94hLDvzSzebEKJHK81u/cz+jxs1m4bic/+XYXbj6lgy6wLJKkoily+81smLt/CmBmQ4H9sY0lcmxm52/jhglzOFBQxFMjchjevUXYkUQkRNEUuRuB8WZ2uDPRduD7sYskcmxemLWKe15dSOtGtXn++oF0aqGTgkWSXTTXrpwP9DazBsHwrpinEjkKBUXF/ObNLxk3fSUnd8rgsSv60qhOetixRCQORH0kXsVN4tH2vYe45bk5TF++leuGZXP32V1JS1UHbxGJ0OlmUm0t2bCL68fnsnHXQf54aW8u7t827EgiEmdU5KRaemfhBu6YMo96NdN4YfQg+mY2DjuSiMShqIqcmQ0BskrO7+7jY5RJpFzFxc6fP/iaR97/mt7tGjFmRH9aNKgVdiwRiVMVFgtAsaoAABYESURBVDkzmwB0AOYBRcFoB1TkpErtPVjIj6fM551FG7i4X1t+c1FPatVIDTuWiMSxaLbkcoDu7u6xDiNSntXb9nH9+Fy+2ribn5/XnVFDs9TBW0QqFE2RWwi0BNbHOItImaYv28Itz82h2OHZUQM4uVOzsCOJSDURTZHLABab2efAwcMj3f07MUslQuQCy+Nn5POrNxaTnVGXp0fmkJVRN+xYIlKNRFPk7ot1CJHSDhYWce+ri3ghdzXDuzXn4cv6UL9WjbBjiUg1E80VTz6uiiAih23efZAbJ85mdv52bj21I3ec0ZmUFB1/E5GjF83ZlYOAx4BuQDqQCux19wYxziZJ6Is1Oxk9IZcd+wp4/Mq+nNerddiRRKQai2Z35ePA5cDfiZxpORLoFMtQkpymzlvLT19cQEa9mrx402B6tG5Y8UIiIkcQVWdwd19mZqnuXgQ8Y2bTY5xLkkhRsfPQu0v4v49XMCC7CX+9qh9N69UMO5aIJIBoitw+M0sH5pnZQ0S6EugUN6kUO/cXcPvkuXy0dDNXDczkF+f3ID1NF1gWkcoRTZEbAaQAtwL/DbQDLj7iEiJRWL55D9ePz2XV1n3cf2FPrh7UPuxIIpJgojm7Mt/MagOt3P2XVZBJksCHSzfxw+fnkp6awqTrBjLwhKZhRxKRBFThfiEzO5/IdSvfCYb7mNlrsQ4micndefLj5YwaN4t2jesw9dahKnAiEjPRdgYfAHwE4O7zzCwrZokkYR0oKOKulxYwdd46zu3Vit9f0os66fq1JxGJnWg+YQrdfacuhivHY/3O/YweP5uF63byk2934eZTOugCyyISc1FdoNnMrgRSzawT8ENAXQgkarPzt3HDhDkcKCjiqRE5DO/eIuxIIpIkojlX+zagB5GLMz8P7AJ+FMtQkjhemLWKy8fMpF7NVF65eYgKnIhUqWjOrtwH/G/wJxKVgqJifvPml4ybvpKTO2Xw+BX9aFhHF1gWkapVbpGr6AxK/dSOlGf73kPcPGkOM1Zs5bph2dx9dlfSUtXBW0Sq3pG25AYDq4nsovwM0FkCUqElG3Zx/fhcNu46yB8v7c3F/duGHUlEktiRilxL4AzgCuBK4E3geXdfVBXBpPp5Z+EG7pgyj3o103hh9CD6ZjYOO5KIJLly9yG5e5G7v+Pu3wcGAcuAj8zstipLJ9VCcbHzyPtfcePE2XRuUZ/XbxumAiciceGIJ56YWU3gXCJbc1nAn4GXYx9Lqou9Bwv58ZT5vLNoAxf3a8tvLupJrRqpYccSEQGOfOLJs0BP4G3gl+6+sMpSSbWwaus+rh+fy9ebdvPz87ozamiWOniLSFw50pbcCGAv0Bn4YYkPLwNcvwye3KYv28LNz83BHZ4dNYCTOzULO5KIyDeUW+TcXed8yze4O+Nn5POrNxaTnVGXp0fmkJWhnxcUkfikq+NK1A4WFnHvq4t4IXc1w7u14OHLelO/ljp4i0j8UpGTqGzafYCbJs5hdv52bjutI/89vDMpKTr+JiLxLaa7JM3sLDNbambLzOzuMqZfYGYLzGyemeWa2bBY5pFjs2DNDi54fBqL1+3iiSv78eMzu6jAiUi1ELMtOTNLBZ4g0qF8DTDLzF5z98UlZvsn8Jq7u5n1AqYAXWOVSY7e1Hlr+emLC8ioV5MXbxpMj9YNw44kIhK1WO6uHAAsc/cVAGY2GbgA+HeRc/c9JeavC3gM88hRKCp2Hnp3Cf/38QoGZDfhr1f1o2m9mmHHEhE5KrEscm2IXPvysDXAwNIzmdlFwANAcyIdzyVkO/cXcPvkuXy0dDNXD8rk3vN6kJ6mk21FpPqJZZEr66DNN7bU3P0V4BUz+xbwa2D4N1ZkNhoYDZCZmVnJMaWk5Zv3cP34XFZt3cdvLurJVQPbhx1JROSYxfLr+RqgXYnhtsC68mZ290+ADmaWUca0Me6e4+45zZqp03GsfLhkExc+Po2d+wp47vpBKnAiUu3FssjNAjqZWbaZpQOXA//xG3Vm1tGCS6mYWT8gHdgaw0xSBnfnyY+XM+rZWbRrUoeptw5lQHaTsGOJiBy3mO2udPdCM7sVeBdIBca6+yIzuzGY/iRwMTDSzAqA/cBl7q6TT6rQgYIi7nppAVPnrePcXq34/SW9qJOu7pMikhisutWUnJwcz83NDTtGQli/cz+jx89m4bqd3HlmF24+pYMusCwi1ZKZzXb3nNLj9ZU9SeWu3MaNE+dwoKCIp0bkMLx7i7AjiYhUOhW5JDT581X8fOpC2jSqzfPXD6RTi/phRxIRiQkVuSRSUFTM/W8s5tkZ+ZzcKYPHr+hHwzq6wLKIJC4VuSSxfe8hbp40hxkrtnLdsGzuPrsraanq4C0iiU1FLgks2bCL68fnsnHXQf54aW8u7t827EgiIlVCRS7BvbNwPXdMmU+9mmlMuWEwfdo1CjuSiEiVUZFLUMXFzqP//JpH//k1fdo14v9G9KdFg1phxxIRqVIqcglo78FC7pgyj3cXbeTifm35zUU9qVUjNexYIiJVTkUuwazauo/rx+fy9abd/Py87owamqUO3iKStFTkEsj0ZVu4+bk5uMOzowZwciddzFpEkpuKXAJwd56dvpJfv/klJ2TU5amROWRl1A07lohI6FTkqrmDhUXc++oiXshdzfBuLXj4st7Ur6UO3iIioCJXrW3afYCbJs5hdv52bjutI/89vDMpKTr+JiJymIpcNbVgzQ5umDCbHfsKeOLKfpzbq1XYkURE4o6KXDU0dd5afvriAjLq1eTFmwbTo3XDsCOJiMQlFblq5smPl/Pg20sYmN2Ev1zVj6b1aoYdSUQkbqnIVSPvLd7Ig28v4fzerfnT93pTQxdYFhE5In1KVhPLNu3mv1+YR6+2Dfn9Jb1U4EREoqBPympg5/4Crh8/m1o1Unjy6v66RJeISJS0uzLOFRU7P5o8l9Xb9vH86EG0blQ77EgiItWGilyc+9N7S/lw6Wbuv7AnJ2U1CTuOiEi1ot2VcezNBet54sPlXDGgHVcNzAw7johItaMiF6e+XL+LO/8+n/7tG3Pfd3rolwRERI6Bilwc2r73EKMn5NKgdhp/vaofNdN0oomIyLHQMbk4U1hUzG3Pz2XjzoNMuXEwzfVr3iIix0xFLs48+PYSPl22hYcu6UWfdo3CjiMiUq1pd2UceWXuGp7+NI9rhmTxvZx2YccREan2VOTixBdrdnL3S18w6IQm/O+53cKOIyKSEFTk4sDm3QcZPSGXjHo1eeLKfrpkl4hIJdExuZAVFBVzy6Q5bN93iBdvHKJfFRARqUQqciH71euL+XzlNh69vA892+h34UREKpP2i4Vo8uermDAznxu+dQIX9GkTdhwRkYSjIheS2fnb+fnUhZzcKYOfntU17DgiIglJRS4EG3cd4MaJs2ndqDaPXdGX1BRdsktEJBZ0TK6KHSws4oYJs9l7sJCJPxhIozrpYUcSEUlYKnJVyN35+asLmbd6B09e3Y8uLeuHHUlEJKFpd2UVmjAznym5a/jhaR05q2ersOOIiCQ8FbkqMnPFVn71+mKGd2vOj4Z3DjuOiEhSUJGrAmt37OfmSXNo37QOD1/WhxSdaCIiUiVU5GJs/6EiRo/PpaCwmDEjc6hfq0bYkUREkoZOPIkhd+fulxeweP0u/vb9HDo0qxd2JBGRpKItuRh6+l95TJ23jjvP7MJpXVuEHUdEJOmoyMXIJ19t5oG3v+ScE1ty8ykdwo4jIpKUVORiIH/rXm57fi6dW9Tn95f0xkwnmoiIhEFFrpLtPVjI6PGzMYMxI3KoW1OHPUVEwqIiV4ncnTv/Pp+vN+3m8Sv6kdm0TtiRRESSmopcJXriw2W8vXAD/3NON4Z1ygg7johI0otpkTOzs8xsqZktM7O7y5h+lZktCP6mm1nvWOaJpX9+uZE/vvcVF/Vtww+GZYcdR0REiGGRM7NU4AngbKA7cIWZdS81Wx7wX+7eC/g1MCZWeWJp2aY9/GjyPHq0bsAD3z1RJ5qIiMSJWG7JDQCWufsKdz8ETAYuKDmDu0939+3B4EygbQzzxMSuAwWMnpBLeloK/zcih1o1UsOOJCIigVgWuTbA6hLDa4Jx5fkB8HYM81S64mLnR5PnsWrrPv5yVT/aNKoddiQRESkhlue3l7XPzsuc0exUIkVuWDnTRwOjATIzMysr33F7+P2v+GDJJn59QQ8GntA07DgiIlJKLLfk1gDtSgy3BdaVnsnMegFPAxe4+9ayVuTuY9w9x91zmjVrFpOwR+vtL9bz2AfLuCynHVcPah92HBERKUMsi9wsoJOZZZtZOnA58FrJGcwsE3gZGOHuX8UwS6VasmEXP/77fPpmNuJXF/bQiSYiInEqZrsr3b3QzG4F3gVSgbHuvsjMbgymPwncCzQF/hIUikJ3z4lVpsqwY98hRo+fTb2aaTx5dX9qpulEExGReBXTa065+1vAW6XGPVni9nXAdbHMUJkKi4q57fm5bNh5gMk3DKJFg1phRxIRkSPQhRWPwkPvLuVfX2/hoYt70S+zcdhxRESkArqsV5RenbuWMZ+sYOTg9nzvpHYVLyAiIqFTkYvCwrU7ueulBQzIbsLPzyt90RYREYlXKnIV2LLnIKPH59K0bjp/uaofNVL1kImIVBc6JncEBUXF3DxpDlv3HuLFG4eQUa9m2JFEROQoqMgdwf1vLObzvG08clkfTmzbMOw4IiJylLTvrRxTZq3m2Rn5XH9yNhf2PdIlN0VEJF6pyJVhzqrt3PPqQk7ulMFdZ3UNO46IiBwjFblSNu46wI0TZtOyYS0eu6IvaTrRRESk2tIneAkHC4u4ceJs9hwsZMzI/jSqkx52JBEROQ468STg7tz76iLmrtrBX6/qR9eWDcKOJCIix0lbcoGJM/N5IXc1t57akbNPbBV2HBERqQQqcsBnK7byy9cXc3rX5txxRuew44iISCVJ+iK3dsd+bp40h8ymdXj48j6kpOi34UREEkVSF7kDBUXcMCGXQ4XFjBmRQ4NaNcKOJCIilShpTzxxd3728hcsWreLp0fm0LF5vbAjiYhIJUvaLbm/fZrHK3PXcsfwzpzerUXYcUREJAaSssh9+vUWfvvWl5zdsyW3ntYx7DgiIhIjSVfkioqde6cupFPz+vzh0t6Y6UQTEZFElXTH5FJTjGdHDcAd6tZMuuaLiCSVpPyUb9ekTtgRRESkCiTd7koREUkeKnIiIpKwVORERCRhqciJiEjCUpETEZGEpSInIiIJS0VOREQSloqciIgkLBU5ERFJWCpyIiKSsMzdw85wVMxsM5BfCavKALZUwnrigdoSvxKpPWpL/Eqk9hxrW9q7e7PSI6tdkassZpbr7jlh56gMakv8SqT2qC3xK5HaU9lt0e5KERFJWCpyIiKSsJK5yI0JO0AlUlviVyK1R22JX4nUnkptS9IekxMRkcSXzFtyIiKS4JKmyJlZEzN7z8y+Dv43Lme+Rmb2opktMbMvzWxwVWetSLRtCeZNNbO5ZvZGVWaMVjRtMbN2ZvZh8HwsMrPbw8haHjM7y8yWmtkyM7u7jOlmZn8Opi8ws35h5IxWFO25KmjHAjObbma9w8gZjYraUmK+k8ysyMwuqcp8RyOatpjZKWY2L3iffFzVGaMVxWusoZm9bmbzg7Zce8x35u5J8Qc8BNwd3L4b+F058z0LXBfcTgcahZ39WNsSTL8DeA54I+zcx9oWoBXQL7hdH/gK6B529iBPKrAcOCF4vcwvnQ04B3gbMGAQ8FnYuY+zPUOAxsHts+O1PdG0pcR8HwBvAZeEnfs4npdGwGIgMxhuHnbu42jL/xz+LACaAduA9GO5v6TZkgMuIFLACP5fWHoGM2sAfAv4G4C7H3L3HVWWMHoVtgXAzNoC5wJPV1GuY1FhW9x9vbvPCW7vBr4E2lRZwiMbACxz9xXufgiYTKRNJV0AjPeImUAjM2tV1UGjVGF73H26u28PBmcCbas4Y7SieW4AbgNeAjZVZbijFE1brgRedvdVAO4er+2Jpi0O1DczA+oRKXKFx3JnyVTkWrj7eoh8aALNy5jnBGAz8Eywi+9pM6tblSGjFE1bAB4BfgoUV1WwYxBtWwAwsyygL/BZzJNFpw2wusTwGr5ZgKOZJ14cbdYfENlKjUcVtsXM2gAXAU9WYa5jEc3z0hlobGYfmdlsMxtZZemOTjRteRzoBqwDvgBud/dj+hxLO5aF4pWZvQ+0LGPS/0a5ijSgH3Cbu39mZo8S2YX280qKGLXjbYuZnQdscvfZZnZKZWY7WpXwvBxeTz0i37h/5O67KiNbJbAyxpU+ZTmaeeJF1FnN7FQiRW5YTBMdu2ja8ghwl7sXRTYa4lY0bUkD+gOnA7WBGWY2092/inW4oxRNW74NzANOAzoA75nZv47lfZ9QRc7dh5c3zcw2mlkrd18f7Coqa1N+DbDG3Q9vJbxIpMhVuUpoy1DgO2Z2DlALaGBmE9396hhFLlcltAUzq0GkwE1y95djFPVYrAHalRhuS+Tb59HOEy+iympmvYjsBj/b3bdWUbajFU1bcoDJQYHLAM4xs0J3f7VqIkYt2tfZFnffC+w1s0+A3kSOYceTaNpyLfCgRw7KLTOzPKAr8PnR3lky7a58Dfh+cPv7wNTSM7j7BmC1mXUJRp1O5EBuvImmLT9z97bungVcDnwQRoGLQoVtCfbL/w340t3/VIXZojEL6GRm2WaWTuSxfq3UPK8BI4OzLAcBOw/voo1DFbbHzDKBl4ERcbiVUFKFbXH3bHfPCt4nLwI3x2GBg+heZ1OBk80szczqAAOJHL+ON9G0ZRWRz1/MrAXQBVhxTPcW9pk2VfUHNAX+CXwd/G8SjG8NvFVivj5ALrAAeJXgLLJ4+ou2LSXmP4X4PbuywrYQ2R3mwXMyL/g7J+zsJdpwDpFvy8uB/w3G3QjcGNw24Ilg+hdATtiZj7M9TwPbSzwXuWFnPta2lJp3HHF6dmW0bQF+QuSL+UIiu/VDz32Mr7HWwD+C98tC4OpjvS9d8URERBJWMu2uFBGRJKMiJyIiCUtFTkREEpaKnIiIJCwVORERSVgqciJxysx+GPzywqSws4hUV+pCIBKnzGwJkSuK5EUxb5q7H9MFbEUSWUJd1kskUZjZk0QuGP6amU0kcpX22sB+4Fp3X2pm1xD5lYlaQF0zOx94DDiRyHv7Pnf/xhVkRJKJtuRE4pSZrSRybcVDwD53LzSz4cBN7n5xUOTuB3q5+zYz+y2w2N0nmlkjItf56+uRaxmKJCVtyYnEv4bAs2bWicjlzWqUmPaeu28Lbp9J5KLcdwbDtYBM4vP6hSJVQkVOJP79GvjQ3S8Kfk/voxLTSm6lGXCxuy+tumgi8U1nV4rEv4bA2uD2NUeY713gtuBXGzCzvjHOJRL3VORE4t9DwANmNg1IPcJ8vyayK3OBmS0MhkWSmk48ERGRhKUtORERSVgqciIikrBU5EREJGGpyImISMJSkRMRkYSlIiciIglLRU5ERBKWipyIiCSs/wdoWNiEVjBCaAAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# let's explore the monotonic relationship\n", "plt.figure(figsize=(7,5))\n", "pd.concat([test_t,y_test], axis=1).groupby(\"fare\")[\"survived\"].mean().plot()\n", "plt.title(\"Relationship between fare and target\")\n", "plt.xlabel(\"fare\")\n", "plt.ylabel(\"Mean of target\")\n", "plt.show()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Note how now the intervals are monotonically sorted respect to the target." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [] } ], "metadata": { "kernelspec": { "display_name": "fengine", "language": "python", "name": "fengine" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.8.2" }, "toc": { "base_numbering": 1, "nav_menu": {}, "number_sections": true, "sideBar": true, "skip_h1_title": false, "title_cell": "Table of Contents", "title_sidebar": "Contents", "toc_cell": false, "toc_position": {}, "toc_section_display": true, "toc_window_display": false } }, "nbformat": 4, "nbformat_minor": 4 }