{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "43f7ead10550fc82e3218c7887eb67586f68ea8b"
   },
   "source": [
    "<h2 style=\"color:Blue;\"> Advance Pandas Trick and Techniques </h2>\n",
    "\n",
    "----\n",
    "\n",
    "<h3 style=\"color:Blue;\">Index of learning</h3>  <a id=\"00\"> </a>\n",
    "\n",
    "\n",
    "---\n",
    "1. [**Basic Pandas Reading Method**](#1)\n",
    "1. [**Advance Pandas Reading Methods**](#2)\n",
    "    1. [**Manipulating Column & Index Locations and Names**](#21)\n",
    "    1. [**Data Parsing options**](#22)\n",
    "    1. [**Reading data from excel files**](#23)\n",
    "    1. [**Reading data from some other popular formats**](#24)\n",
    "1. [**Apply multiple filter criteria to a pandas DataFrame**](#3)\n",
    "1. [**Changing the datatype of a Pandas Series**](#4)\n",
    "1. [**Filter rows of a pandas DataFrame by column value**](#5)\n",
    "1. [**Selecting multiple rows and columns from a pandas DataFrame**](#6)\n",
    "1. [**Sorting a pandas DataFrame or a Series**](#7)\n",
    "1. [**Using pandas Series data structure to select a subset of the data**](#8)\n",
    "1. [**Using string methods in pandas**](#9)\n",
    "1. [**Using the axis parameter in pandas**](#10)\n",
    "1. [**Applying a function to a pandas Series or DataFrame** ](#11)\n",
    "1. [**Handling SettingWithCopyWarning**](#12)\n",
    "1. [**Handling missing values in pandas**](#13)\n",
    "1. [**Indexing in pandas dataframes**](#14)\n",
    "1. [**Merging and concatenating multiple data frames into one** ](#15)\n",
    "1. [**Modifying a Pandas Dataframe inplace**](#16)\n",
    "1. [**Removing columns from a pandas DataFrame**](#17)\n",
    "1. [**Renaming columns in a pandas DataFrame**](#18)\n",
    "1. [**Using groupby method**](#19)\n",
    "1. [**Work with dates and times data**](#20)\n",
    "1. [**Choosing the colors for the plots**](#211)\n",
    "1. [**Controlling plot aesthetics**](#221)\n",
    "1. [**Plotting categorical data**](#231)\n",
    "1. [**Plotting with data aware grids**](#241)\n",
    " \n",
    "---\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {
    "_cell_guid": "b1076dfc-b9ad-4769-8c92-a6c4dae69d19",
    "_kg_hide-input": true,
    "_kg_hide-output": true,
    "_uuid": "8f2839f25d086af736a60e9eeb907d3b93b6e0e5"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<style>\n",
       ".output_wrapper, .output {\n",
       "    height:auto !important;\n",
       "    max-height:350px;  /* your desired max-height here */\n",
       "}\n",
       ".output_scroll {\n",
       "    box-shadow:none !important;\n",
       "    webkit-box-shadow:none !important;\n",
       "}\n",
       "</style>"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "%%html\n",
    "<style>\n",
    ".output_wrapper, .output {\n",
    "    height:auto !important;\n",
    "    max-height:350px;  /* your desired max-height here */\n",
    "}\n",
    ".output_scroll {\n",
    "    box-shadow:none !important;\n",
    "    webkit-box-shadow:none !important;\n",
    "}\n",
    "</style>"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {
    "_cell_guid": "79c7e3d0-c299-4dcb-8224-4455121ee9b0",
    "_kg_hide-input": true,
    "_kg_hide-output": true,
    "_uuid": "d629ff2d2480ee46fbb7e2d37f6b5fab8052498a"
   },
   "outputs": [],
   "source": [
    "from IPython.core.interactiveshell import InteractiveShell\n",
    "InteractiveShell.ast_node_interactivity = \"all\""
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "906802c5feb7d6426f38964bfe19992f20a1a75f"
   },
   "source": [
    "# 1.Basic Pandas Reading Method <a id=\"1\"> </a> \n",
    " ---\n",
    " [**Go to top**](#00)\n",
    " \n",
    " ![](https://python-graph-gallery.com/wp-content/uploads/Pandas_Cheat_Datacamp.png)\n",
    " ![](https://ugoproto.github.io/ugo_py_doc/img/scipy_cs/Pandas_Cheat_Sheeta.png)\n",
    " ![](https://cdn-images-1.medium.com/max/2000/1*YhTbz8b8Svi22wNVvqzneg.jpeg) "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {
    "_uuid": "2013cff9b3d848e1923926032534ef7d305d9be4"
   },
   "outputs": [],
   "source": [
    "import pandas as pd\n",
    "import matplotlib.pyplot as plt\n",
    "import numpy as np\n",
    "%matplotlib inline\n",
    "import warnings\n",
    "warnings.simplefilter(action='ignore', category=FutureWarning)\n",
    "warnings.simplefilter(action='ignore', category=RuntimeWarning)\n",
    "warnings.simplefilter(action='ignore', category=UserWarning)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {
    "_uuid": "ffa1934d0c985b3cbf0defb72b3da8cc36e75a10"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                     Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv(\"../input/datasetsdifferent-format/IMDB.csv\", encoding=\"ISO-8859-1\")\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "874f13ff80642477b5c340131cef545b32c58246"
   },
   "source": [
    "# 2.Advance Pandas Reading Methods <a id=\"2\"></a>\n",
    "---\n",
    " [**Go to top**](#00)\n",
    " \n",
    " ![](https://i.stack.imgur.com/qCOaK.png)\n",
    " \n",
    "* [**Advance Pandas Reading Methods**](#2)\n",
    "    * [**Manipulating Column & Index Locations and Names**](#21)\n",
    "    * [**Data Parsing options**](#22)\n",
    "    * [**Reading data from excel files**](#23)\n",
    "    * [**Reading data from some other popular formats**](#24)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "ef5847c6c5b2ddc6c9a77e183f861ead8dfe87b9"
   },
   "source": [
    "> ### 2.1 Manipulating Columns & Index Location and Names <a id=\"21\"></a>\n",
    "\n",
    "### 1. No Header and No Columns\n",
    "* There is **no header*** and **no columns** while reading csv file here and used `encoding` because file in `ISO-8859-1` format"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {
    "_uuid": "5265eee715b0e068a2349b0cfe5383746e263d2a"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>0</th>\n",
       "      <th>1</th>\n",
       "      <th>2</th>\n",
       "      <th>3</th>\n",
       "      <th>4</th>\n",
       "      <th>5</th>\n",
       "      <th>6</th>\n",
       "      <th>7</th>\n",
       "      <th>8</th>\n",
       "      <th>9</th>\n",
       "      <th>10</th>\n",
       "      <th>11</th>\n",
       "      <th>12</th>\n",
       "      <th>13</th>\n",
       "      <th>14</th>\n",
       "      <th>15</th>\n",
       "      <th>16</th>\n",
       "      <th>17</th>\n",
       "      <th>18</th>\n",
       "      <th>19</th>\n",
       "      <th>20</th>\n",
       "      <th>21</th>\n",
       "      <th>22</th>\n",
       "      <th>23</th>\n",
       "      <th>24</th>\n",
       "      <th>25</th>\n",
       "      <th>26</th>\n",
       "      <th>27</th>\n",
       "      <th>28</th>\n",
       "      <th>29</th>\n",
       "      <th>30</th>\n",
       "      <th>31</th>\n",
       "      <th>32</th>\n",
       "      <th>33</th>\n",
       "      <th>34</th>\n",
       "      <th>35</th>\n",
       "      <th>36</th>\n",
       "      <th>37</th>\n",
       "      <th>38</th>\n",
       "      <th>39</th>\n",
       "      <th>40</th>\n",
       "      <th>41</th>\n",
       "      <th>42</th>\n",
       "      <th>43</th>\n",
       "      <th>44</th>\n",
       "      <th>45</th>\n",
       "      <th>46</th>\n",
       "      <th>47</th>\n",
       "      <th>48</th>\n",
       "      <th>49</th>\n",
       "      <th>50</th>\n",
       "      <th>51</th>\n",
       "      <th>52</th>\n",
       "      <th>53</th>\n",
       "      <th>54</th>\n",
       "      <th>55</th>\n",
       "      <th>56</th>\n",
       "      <th>57</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>X</td>\n",
       "      <td>Title</td>\n",
       "      <td>Rating</td>\n",
       "      <td>TotalVotes</td>\n",
       "      <td>Genre1</td>\n",
       "      <td>Genre2</td>\n",
       "      <td>Genre3</td>\n",
       "      <td>MetaCritic</td>\n",
       "      <td>Budget</td>\n",
       "      <td>Runtime</td>\n",
       "      <td>CVotes10</td>\n",
       "      <td>CVotes09</td>\n",
       "      <td>CVotes08</td>\n",
       "      <td>CVotes07</td>\n",
       "      <td>CVotes06</td>\n",
       "      <td>CVotes05</td>\n",
       "      <td>CVotes04</td>\n",
       "      <td>CVotes03</td>\n",
       "      <td>CVotes02</td>\n",
       "      <td>CVotes01</td>\n",
       "      <td>CVotesMale</td>\n",
       "      <td>CVotesFemale</td>\n",
       "      <td>CVotesU18</td>\n",
       "      <td>CVotesU18M</td>\n",
       "      <td>CVotesU18F</td>\n",
       "      <td>CVotes1829</td>\n",
       "      <td>CVotes1829M</td>\n",
       "      <td>CVotes1829F</td>\n",
       "      <td>CVotes3044</td>\n",
       "      <td>CVotes3044M</td>\n",
       "      <td>CVotes3044F</td>\n",
       "      <td>CVotes45A</td>\n",
       "      <td>CVotes45AM</td>\n",
       "      <td>CVotes45AF</td>\n",
       "      <td>CVotes1000</td>\n",
       "      <td>CVotesUS</td>\n",
       "      <td>CVotesnUS</td>\n",
       "      <td>VotesM</td>\n",
       "      <td>VotesF</td>\n",
       "      <td>VotesU18</td>\n",
       "      <td>VotesU18M</td>\n",
       "      <td>VotesU18F</td>\n",
       "      <td>Votes1829</td>\n",
       "      <td>Votes1829M</td>\n",
       "      <td>Votes1829F</td>\n",
       "      <td>Votes3044</td>\n",
       "      <td>Votes3044M</td>\n",
       "      <td>Votes3044F</td>\n",
       "      <td>Votes45A</td>\n",
       "      <td>Votes45AM</td>\n",
       "      <td>Votes45AF</td>\n",
       "      <td>VotesIMDB</td>\n",
       "      <td>Votes1000</td>\n",
       "      <td>VotesUS</td>\n",
       "      <td>VotesnUS</td>\n",
       "      <td>Domestic</td>\n",
       "      <td>Foreign</td>\n",
       "      <td>Worldwide</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96</td>\n",
       "      <td>20000000</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82</td>\n",
       "      <td>18000000</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72</td>\n",
       "      <td>8000000</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8</td>\n",
       "      <td>8</td>\n",
       "      <td>8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "  0                         1       ...               56             57\n",
       "0  X                     Title      ...          Foreign      Worldwide\n",
       "1  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "2  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "3  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "4  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 5,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/IMDB.csv', encoding = \"ISO-8859-1\", header=None)\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "08294d75e254363a4538f82483e9c6c354e19c53"
   },
   "source": [
    "### 2.Specify a different row as header\n",
    "* Read specific **rows as header** which is working as **column name**\n",
    "* In the result, row 2 become header of dataframe"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {
    "_uuid": "750fcf0945cb137ca4db9c504a16f48532a943d8"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>2</th>\n",
       "      <th>127 Hours (2010)</th>\n",
       "      <th>7.6</th>\n",
       "      <th>297075</th>\n",
       "      <th>Adventure</th>\n",
       "      <th>Biography</th>\n",
       "      <th>Drama</th>\n",
       "      <th>82</th>\n",
       "      <th>18000000</th>\n",
       "      <th>94 min</th>\n",
       "      <th>28939</th>\n",
       "      <th>44110</th>\n",
       "      <th>98845</th>\n",
       "      <th>78451</th>\n",
       "      <th>28394</th>\n",
       "      <th>9403</th>\n",
       "      <th>3796</th>\n",
       "      <th>1930</th>\n",
       "      <th>1161</th>\n",
       "      <th>2059</th>\n",
       "      <th>212866</th>\n",
       "      <th>44600</th>\n",
       "      <th>745</th>\n",
       "      <th>567</th>\n",
       "      <th>170</th>\n",
       "      <th>133336</th>\n",
       "      <th>106007</th>\n",
       "      <th>26152</th>\n",
       "      <th>102120</th>\n",
       "      <th>86609</th>\n",
       "      <th>14304</th>\n",
       "      <th>14895</th>\n",
       "      <th>12400</th>\n",
       "      <th>2261</th>\n",
       "      <th>649</th>\n",
       "      <th>38478</th>\n",
       "      <th>169745</th>\n",
       "      <th>7.6.1</th>\n",
       "      <th>7.6.2</th>\n",
       "      <th>7.9</th>\n",
       "      <th>7.9.1</th>\n",
       "      <th>7.9.2</th>\n",
       "      <th>7.7</th>\n",
       "      <th>7.8</th>\n",
       "      <th>7.7.1</th>\n",
       "      <th>7.5</th>\n",
       "      <th>7.5.1</th>\n",
       "      <th>7.5.2</th>\n",
       "      <th>7.3</th>\n",
       "      <th>7.3.1</th>\n",
       "      <th>7.5.3</th>\n",
       "      <th>7.6.3</th>\n",
       "      <th>7</th>\n",
       "      <th>7.7.2</th>\n",
       "      <th>7.6.4</th>\n",
       "      <th>$18,335,230</th>\n",
       "      <th>42403567</th>\n",
       "      <th>$60,738,797</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>6</td>\n",
       "      <td>Argo (2012)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>486840</td>\n",
       "      <td>Action</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>86.0</td>\n",
       "      <td>44500000.0</td>\n",
       "      <td>120 min</td>\n",
       "      <td>43875</td>\n",
       "      <td>89490</td>\n",
       "      <td>171495</td>\n",
       "      <td>115165</td>\n",
       "      <td>37332</td>\n",
       "      <td>12630</td>\n",
       "      <td>4992</td>\n",
       "      <td>2910</td>\n",
       "      <td>2020</td>\n",
       "      <td>6941</td>\n",
       "      <td>334838</td>\n",
       "      <td>67910</td>\n",
       "      <td>971</td>\n",
       "      <td>795</td>\n",
       "      <td>162</td>\n",
       "      <td>178794</td>\n",
       "      <td>146371</td>\n",
       "      <td>30643</td>\n",
       "      <td>163795</td>\n",
       "      <td>136391</td>\n",
       "      <td>24948</td>\n",
       "      <td>36215</td>\n",
       "      <td>28817</td>\n",
       "      <td>6752</td>\n",
       "      <td>740</td>\n",
       "      <td>70110</td>\n",
       "      <td>229137.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$136,025,503</td>\n",
       "      <td>96300000</td>\n",
       "      <td>$232,325,503</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>7</td>\n",
       "      <td>Arrival (2016)</td>\n",
       "      <td>8.0</td>\n",
       "      <td>370842</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Mystery</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>81.0</td>\n",
       "      <td>47000000.0</td>\n",
       "      <td>116 min</td>\n",
       "      <td>55533</td>\n",
       "      <td>87850</td>\n",
       "      <td>109536</td>\n",
       "      <td>65440</td>\n",
       "      <td>26913</td>\n",
       "      <td>10556</td>\n",
       "      <td>5057</td>\n",
       "      <td>3083</td>\n",
       "      <td>2194</td>\n",
       "      <td>4734</td>\n",
       "      <td>237437</td>\n",
       "      <td>46272</td>\n",
       "      <td>1943</td>\n",
       "      <td>1544</td>\n",
       "      <td>376</td>\n",
       "      <td>126301</td>\n",
       "      <td>101741</td>\n",
       "      <td>23163</td>\n",
       "      <td>111985</td>\n",
       "      <td>95005</td>\n",
       "      <td>15227</td>\n",
       "      <td>24027</td>\n",
       "      <td>20118</td>\n",
       "      <td>3440</td>\n",
       "      <td>537</td>\n",
       "      <td>42062</td>\n",
       "      <td>163774.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$100,546,139</td>\n",
       "      <td>102842047</td>\n",
       "      <td>$203,388,186</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   2   127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "0  3       50/50 (2011)      ...          4173591   $39,187,783 \n",
       "1  4  About Time (2013)      ...         71777528   $87,100,449 \n",
       "2  5       Amour (2012)      ...         13100000   $19,839,492 \n",
       "3  6        Argo (2012)      ...         96300000  $232,325,503 \n",
       "4  7     Arrival (2016)      ...        102842047  $203,388,186 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv(\"../input/datasetsdifferent-format/IMDB.csv\", encoding = \"ISO-8859-1\", header=2)\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "579d6d098ca78625931c91013b093d5e6b2ccbf1"
   },
   "source": [
    "### 3.Specify a column as index\n",
    "* Read specify column as index for dataframe\n",
    "* In the result, title column become Index column of dataframe"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "metadata": {
    "_uuid": "eaa4eb0bb9c5a3c63bd996147d1ac4632ef1e47a"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Title</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>12 Years a Slave (2013)</th>\n",
       "      <td>1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>127 Hours (2010)</th>\n",
       "      <td>2</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>50/50 (2011)</th>\n",
       "      <td>3</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>About Time (2013)</th>\n",
       "      <td>4</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Amour (2012)</th>\n",
       "      <td>5</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                          X  Rating      ...          Foreign      Worldwide\n",
       "Title                                    ...                                \n",
       "12 Years a Slave (2013)  1     8.1      ...        131061209  $187,733,202 \n",
       "127 Hours (2010)         2     7.6      ...         42403567   $60,738,797 \n",
       "50/50 (2011)             3     7.7      ...          4173591   $39,187,783 \n",
       "About Time (2013)        4     7.8      ...         71777528   $87,100,449 \n",
       "Amour (2012)             5     7.9      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 57 columns]"
      ]
     },
     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv(\"../input/datasetsdifferent-format/IMDB.csv\", encoding = \"ISO-8859-1\", index_col=\"Title\")\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "9b059a5914a12924ebb48694c933b5c80e22eaf3"
   },
   "source": [
    "### 4.Choose only a subset of columns to be read\n",
    "* Subset specific columns from the dataframe while reading file\n",
    "* In the result, subset the ` Title, Genre1, Genre2, Budget` columns"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "metadata": {
    "_uuid": "f4aa41dc2b262bc2a2eebef760bbba9f2791ba0a"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Title</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Budget</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>20000000.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>18000000.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>8000000.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>12000000.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>8900000.0</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                      Title     Genre1     Genre2      Budget\n",
       "0  12 Years a Slave (2013)  Biography      Drama  20000000.0\n",
       "1         127 Hours (2010)  Adventure  Biography  18000000.0\n",
       "2             50/50 (2011)     Comedy      Drama   8000000.0\n",
       "3        About Time (2013)     Comedy      Drama  12000000.0\n",
       "4             Amour (2012)      Drama    Romance   8900000.0"
      ]
     },
     "execution_count": 8,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv(\"../input/datasetsdifferent-format/IMDB.csv\", encoding = \"ISO-8859-1\", usecols=['Title','Genre1','Genre2','Budget'])\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "33993b9f2fca1e51dd0bea77f946e61b107d6bfc"
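   },
   "source": [
    "### 5. Handling Missing and NA data\n",
    "\n",
    "***Default missing value markers:*** `read_csv` treats strings such as `''` (empty), `'#N/A'`, `'#N/A N/A'`, `'#NA'`, `'-1.#IND'`, `'-1.#QNAN'`, `'-NaN'`, `'-nan'`, `'1.#IND'`, `'1.#QNAN'`, `'N/A'`, `'NA'`, `'NULL'`, `'NaN'`, and `'nan'` as `NaN`.\n",
    "\n",
    "* Use `na_values` to name additional strings that should be treated as missing while reading the data.\n",
    "* In the result, cells holding `nan`-style markers appear as `NaN`. Since `nan` is already a default marker, `na_values=['nan']` serves here only to illustrate the syntax."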
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {
    "_uuid": "7dbad608ce9b01300bca013668eae61d2a2828aa"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                     Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "(117, 58)\n"
     ]
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/IMDB.csv', encoding = \"ISO-8859-1\", na_values=['nan'])\n",
    "display(df.head())\n",
    "print(df.shape)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b65eef0d3d330b1608bcd7afad673711ca53de54"
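   },
   "source": [
    "### 6. Choose whether to skip over blank rows or not\n",
    "\n",
    "* The `skip_blank_lines` parameter controls whether blank lines are skipped while reading the data; the default, `True`, skips them.\n",
    "* Here we pass `skip_blank_lines=False`, so blank lines would be kept as rows of `NaN` instead of being skipped. The first five rows contain no blank lines, so the output looks the same as before.\n"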
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "metadata": {
    "_uuid": "10ab61642b1b00e58ea82fa2535e35864653eba7"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                     Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/IMDB.csv', encoding = \"ISO-8859-1\", skip_blank_lines=False)\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "ee0eda98b03f87df896b6b249eab34457a758741"
   },
   "source": [
    "> ### 2.2 Data Parsing options <a id=\"22\"> </a>\n",
    "\n",
    "### 1. Skip Rows\n",
    "* We can skip specific rows while reading the dataset.\n",
    "* In the result, you can see that rows `1`, `3`, and `7` are skipped from the dataframe."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "metadata": {
    "_uuid": "0007420c303217d629eb1fb838e5706fa1b1166c"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>6</td>\n",
       "      <td>Argo (2012)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>486840</td>\n",
       "      <td>Action</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>86.0</td>\n",
       "      <td>44500000.0</td>\n",
       "      <td>120 min</td>\n",
       "      <td>43875</td>\n",
       "      <td>89490</td>\n",
       "      <td>171495</td>\n",
       "      <td>115165</td>\n",
       "      <td>37332</td>\n",
       "      <td>12630</td>\n",
       "      <td>4992</td>\n",
       "      <td>2910</td>\n",
       "      <td>2020</td>\n",
       "      <td>6941</td>\n",
       "      <td>334838</td>\n",
       "      <td>67910</td>\n",
       "      <td>971</td>\n",
       "      <td>795</td>\n",
       "      <td>162</td>\n",
       "      <td>178794</td>\n",
       "      <td>146371</td>\n",
       "      <td>30643</td>\n",
       "      <td>163795</td>\n",
       "      <td>136391</td>\n",
       "      <td>24948</td>\n",
       "      <td>36215</td>\n",
       "      <td>28817</td>\n",
       "      <td>6752</td>\n",
       "      <td>740</td>\n",
       "      <td>70110</td>\n",
       "      <td>229137.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$136,025,503</td>\n",
       "      <td>96300000</td>\n",
       "      <td>$232,325,503</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>9</td>\n",
       "      <td>Before Midnight (2013)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>106553</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>3000000.0</td>\n",
       "      <td>109 min</td>\n",
       "      <td>16953</td>\n",
       "      <td>22109</td>\n",
       "      <td>31439</td>\n",
       "      <td>19251</td>\n",
       "      <td>8142</td>\n",
       "      <td>3412</td>\n",
       "      <td>1649</td>\n",
       "      <td>1033</td>\n",
       "      <td>826</td>\n",
       "      <td>1745</td>\n",
       "      <td>67076</td>\n",
       "      <td>23823</td>\n",
       "      <td>208</td>\n",
       "      <td>138</td>\n",
       "      <td>66</td>\n",
       "      <td>43312</td>\n",
       "      <td>30016</td>\n",
       "      <td>12857</td>\n",
       "      <td>37072</td>\n",
       "      <td>28401</td>\n",
       "      <td>8189</td>\n",
       "      <td>7479</td>\n",
       "      <td>5891</td>\n",
       "      <td>1470</td>\n",
       "      <td>447</td>\n",
       "      <td>12382</td>\n",
       "      <td>59116.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.4</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.5</td>\n",
       "      <td>7.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$8,114,627</td>\n",
       "      <td>3061842</td>\n",
       "      <td>$11,176,469</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title      ...         Foreign      Worldwide\n",
       "0  2        127 Hours (2010)      ...        42403567   $60,738,797 \n",
       "1  4       About Time (2013)      ...        71777528   $87,100,449 \n",
       "2  5            Amour (2012)      ...        13100000   $19,839,492 \n",
       "3  6             Argo (2012)      ...        96300000  $232,325,503 \n",
       "4  9  Before Midnight (2013)      ...         3061842   $11,176,469 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/IMDB.csv', encoding = \"ISO-8859-1\", skiprows = [1,3,7])\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "bd774460f6a6df8f91736f9969be5eca18c67017"
   },
   "source": [
    "### 2.Skip rows from footer or from end of the file\n",
    "\n",
    "* We can skip the rows from the footer.\n",
    "* In the result, "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {
    "_uuid": "e8f0a40f628ab1087eb755d14d4d005b50db4697"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>112</th>\n",
       "      <td>117</td>\n",
       "      <td>X-Men: First Class (2011)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>556713</td>\n",
       "      <td>Action</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>65.0</td>\n",
       "      <td>160000000.0</td>\n",
       "      <td>132 min</td>\n",
       "      <td>64428</td>\n",
       "      <td>96219</td>\n",
       "      <td>200144</td>\n",
       "      <td>129352</td>\n",
       "      <td>41945</td>\n",
       "      <td>12861</td>\n",
       "      <td>4799</td>\n",
       "      <td>2349</td>\n",
       "      <td>1448</td>\n",
       "      <td>3182</td>\n",
       "      <td>382107</td>\n",
       "      <td>80444</td>\n",
       "      <td>2075</td>\n",
       "      <td>1612</td>\n",
       "      <td>443</td>\n",
       "      <td>223309</td>\n",
       "      <td>176821</td>\n",
       "      <td>44428</td>\n",
       "      <td>185909</td>\n",
       "      <td>157332</td>\n",
       "      <td>26094</td>\n",
       "      <td>30217</td>\n",
       "      <td>25051</td>\n",
       "      <td>4691</td>\n",
       "      <td>780</td>\n",
       "      <td>87542</td>\n",
       "      <td>257681.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$146,408,305</td>\n",
       "      <td>207215819</td>\n",
       "      <td>$353,624,124</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>113</th>\n",
       "      <td>118</td>\n",
       "      <td>Zootopia (2016)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>309474</td>\n",
       "      <td>Animation</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>78.0</td>\n",
       "      <td>150000000.0</td>\n",
       "      <td>108 min</td>\n",
       "      <td>53626</td>\n",
       "      <td>70912</td>\n",
       "      <td>102352</td>\n",
       "      <td>57261</td>\n",
       "      <td>16719</td>\n",
       "      <td>4539</td>\n",
       "      <td>1467</td>\n",
       "      <td>733</td>\n",
       "      <td>496</td>\n",
       "      <td>1386</td>\n",
       "      <td>176202</td>\n",
       "      <td>52345</td>\n",
       "      <td>2362</td>\n",
       "      <td>1641</td>\n",
       "      <td>706</td>\n",
       "      <td>119637</td>\n",
       "      <td>87499</td>\n",
       "      <td>30813</td>\n",
       "      <td>75474</td>\n",
       "      <td>61358</td>\n",
       "      <td>13034</td>\n",
       "      <td>12353</td>\n",
       "      <td>9959</td>\n",
       "      <td>2151</td>\n",
       "      <td>518</td>\n",
       "      <td>35975</td>\n",
       "      <td>122844.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$341,268,248</td>\n",
       "      <td>682515947</td>\n",
       "      <td>$1,023,784,195</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       X       ...               Worldwide\n",
       "112  117       ...           $353,624,124 \n",
       "113  118       ...         $1,023,784,195 \n",
       "\n",
       "[2 rows x 58 columns]"
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "After Skipping the Rows\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>113</th>\n",
       "      <td>115</td>\n",
       "      <td>Wreck-It Ralph (2012)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>295125</td>\n",
       "      <td>Animation</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>72.0</td>\n",
       "      <td>165000000.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>41980</td>\n",
       "      <td>50262</td>\n",
       "      <td>96477</td>\n",
       "      <td>67934</td>\n",
       "      <td>24894</td>\n",
       "      <td>7748</td>\n",
       "      <td>2724</td>\n",
       "      <td>1190</td>\n",
       "      <td>703</td>\n",
       "      <td>1226</td>\n",
       "      <td>190983</td>\n",
       "      <td>50202</td>\n",
       "      <td>1663</td>\n",
       "      <td>1182</td>\n",
       "      <td>467</td>\n",
       "      <td>120962</td>\n",
       "      <td>90759</td>\n",
       "      <td>29003</td>\n",
       "      <td>90203</td>\n",
       "      <td>74767</td>\n",
       "      <td>14148</td>\n",
       "      <td>13706</td>\n",
       "      <td>11356</td>\n",
       "      <td>2112</td>\n",
       "      <td>614</td>\n",
       "      <td>44962</td>\n",
       "      <td>129487.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$189,422,889</td>\n",
       "      <td>281800000</td>\n",
       "      <td>$471,222,889</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>114</th>\n",
       "      <td>116</td>\n",
       "      <td>X-Men: Days of Future Past (2014)</td>\n",
       "      <td>8.0</td>\n",
       "      <td>560736</td>\n",
       "      <td>Action</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>74.0</td>\n",
       "      <td>200000000.0</td>\n",
       "      <td>132 min</td>\n",
       "      <td>91765</td>\n",
       "      <td>127521</td>\n",
       "      <td>183578</td>\n",
       "      <td>104658</td>\n",
       "      <td>33027</td>\n",
       "      <td>10059</td>\n",
       "      <td>3710</td>\n",
       "      <td>1903</td>\n",
       "      <td>1225</td>\n",
       "      <td>3301</td>\n",
       "      <td>370835</td>\n",
       "      <td>71008</td>\n",
       "      <td>3038</td>\n",
       "      <td>2403</td>\n",
       "      <td>614</td>\n",
       "      <td>220178</td>\n",
       "      <td>179039</td>\n",
       "      <td>39094</td>\n",
       "      <td>158607</td>\n",
       "      <td>135392</td>\n",
       "      <td>20927</td>\n",
       "      <td>26834</td>\n",
       "      <td>22460</td>\n",
       "      <td>3884</td>\n",
       "      <td>710</td>\n",
       "      <td>67889</td>\n",
       "      <td>229049.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$233,921,534</td>\n",
       "      <td>513941241</td>\n",
       "      <td>$747,862,775</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       X      ...            Worldwide\n",
       "113  115      ...        $471,222,889 \n",
       "114  116      ...        $747,862,775 \n",
       "\n",
       "[2 rows x 58 columns]"
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df.tail(2)\n",
    "print(\"After Skipping the Rows\")\n",
    "df = pd.read_csv('../input/datasetsdifferent-format/IMDB.csv', encoding = \"ISO-8859-1\", skipfooter=2, engine='python')\n",
    "df.tail(2)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0b64a249ab7243461062d00e86fb97c5ee21b3b6"
   },
   "source": [
    "### 3.Reading only a subset of the file or a certain number of rows\n",
    "\n",
    "* We are also Reading only a subset of the file or a certain number of rows while reading whole dataset file.\n",
    "* In the result, we can see the shape of the data before and after."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "metadata": {
    "_uuid": "39dc72f18ba37cd62d186860d5e7cfbfca0acad2"
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Before Shape: (115, 58)\n",
      "After Selecting 100 Rows\n",
      "After Shape: (100, 58)\n"
     ]
    }
   ],
   "source": [
    "print(\"Before Shape:\",df.shape)\n",
    "print(\"After Selecting 100 Rows\")\n",
    "df = pd.read_csv('../input/datasetsdifferent-format/IMDB.csv', encoding = \"ISO-8859-1\", nrows=100)\n",
    "print(\"After Shape:\",df.shape)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "cc48f56dd7a7c3c24e75d303ddd313877f85bd4d"
   },
   "source": [
 ### 2.3.Reading">
    "> ### 2.3. Reading data from Excel files <a id=\"23\"></a>\n",
    "\n",
    "### 1. Basic Excel read\n",
    "* A basic Excel read with no sheet specified; pandas `read_excel` loads the first sheet (`sheet_name=0`) by default."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {
    "_uuid": "892798063ece437ac8ad6c5c7e29eb5b807452ea"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 14,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx')\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "e3bf37852aee430f60c2b4579421c93a7432a56e"
   },
   "source": [
    "## Advanced read options\n",
    "\n",
    "`pandas.read_excel(io, sheetname=0, header=0, skiprows=None, skip_footer=0, index_col=None, names=None, parse_cols=None, parse_dates=False, date_parser=None, na_values=None, thousands=None, convert_float=True, has_index_names=None, converters=None, dtype=None, true_values=None, false_values=None, engine=None, squeeze=False, **kwds)`\n",
    "\n",
    "***Reference:*** [Pandas Doc](http://pandas.pydata.org/pandas-docs/version/0.20/generated/pandas.read_excel.html)\n",
    "\n",
    "### 2.Which Sheet to read?\n",
    "* We can select which sheet which we have to read."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 15,
   "metadata": {
    "_uuid": "8d233c938e9366f4eba90ae1ec943a123635f497"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 15,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=0)\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8ae73a56d8cc6ff09da94d8e306d9301d403acb2"
   },
   "source": [
    "### 3.Reading data from multiple sheets in an excel file\n",
    "* Find out the sheet list of the excel file"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "metadata": {
    "_uuid": "0a20ca9bff983a3cf4235747b7c14f4bcc693027"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['movies', 'by genre']"
      ]
     },
     "execution_count": 16,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df_excel = pd.ExcelFile('../input/datasetsdifferent-format/IMDB.xlsx')\n",
    "df_excel.sheet_names"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 17,
   "metadata": {
    "_uuid": "3f68aa745dc513893d650899da432ca8bb0bf115"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 17,
     "metadata": {},
     "output_type": "execute_result"
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 17,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Parse two sheets of the workbook by name into separate DataFrames\n",
    "df1 = df_excel.parse('movies')\n",
    "df2 = df_excel.parse('by genre')\n",
    "df1.head()\n",
    "df2.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "f91f240361730172a6fc594cb94a13d8daa3e206"
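   },
   "source": [
    "### 4. Choose the header row (column labels)\n",
    "\n",
    "* The `header` parameter of `read_excel()` selects which row (0-indexed) supplies the column labels. With `header=3`, the row at index 3 becomes the header and the rows above it are dropped."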
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 18,
   "metadata": {
    "_uuid": "e5c5560acc7253f9ff7a1cf3953c73517857aa22"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>3</th>\n",
       "      <th>50/50 (2011)</th>\n",
       "      <th>7.7</th>\n",
       "      <th>283935</th>\n",
       "      <th>Comedy</th>\n",
       "      <th>Drama</th>\n",
       "      <th>Romance</th>\n",
       "      <th>72</th>\n",
       "      <th>8000000</th>\n",
       "      <th>100 min</th>\n",
       "      <th>28304</th>\n",
       "      <th>47501</th>\n",
       "      <th>99524</th>\n",
       "      <th>71485</th>\n",
       "      <th>24252</th>\n",
       "      <th>7545</th>\n",
       "      <th>2381</th>\n",
       "      <th>1109</th>\n",
       "      <th>634</th>\n",
       "      <th>1202</th>\n",
       "      <th>188925</th>\n",
       "      <th>58348</th>\n",
       "      <th>506</th>\n",
       "      <th>348</th>\n",
       "      <th>153</th>\n",
       "      <th>132350</th>\n",
       "      <th>96269</th>\n",
       "      <th>34765</th>\n",
       "      <th>94745</th>\n",
       "      <th>75394</th>\n",
       "      <th>18163</th>\n",
       "      <th>12829</th>\n",
       "      <th>9912</th>\n",
       "      <th>2681</th>\n",
       "      <th>555</th>\n",
       "      <th>46947</th>\n",
       "      <th>147849</th>\n",
       "      <th>7.7.1</th>\n",
       "      <th>7.7.2</th>\n",
       "      <th>7.9</th>\n",
       "      <th>7.9.1</th>\n",
       "      <th>7.9.2</th>\n",
       "      <th>7.8</th>\n",
       "      <th>7.8.1</th>\n",
       "      <th>7.7.3</th>\n",
       "      <th>7.6</th>\n",
       "      <th>7.6.1</th>\n",
       "      <th>7.6.2</th>\n",
       "      <th>7.4</th>\n",
       "      <th>7.4.1</th>\n",
       "      <th>7.5</th>\n",
       "      <th>7.4.2</th>\n",
       "      <th>7</th>\n",
       "      <th>7.9.3</th>\n",
       "      <th>7.6.3</th>\n",
       "      <th>$35,014,192</th>\n",
       "      <th>4173591</th>\n",
       "      <th>$39,187,783</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>6</td>\n",
       "      <td>Argo (2012)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>486840</td>\n",
       "      <td>Action</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>86.0</td>\n",
       "      <td>44500000.0</td>\n",
       "      <td>120 min</td>\n",
       "      <td>43875</td>\n",
       "      <td>89490</td>\n",
       "      <td>171495</td>\n",
       "      <td>115165</td>\n",
       "      <td>37332</td>\n",
       "      <td>12630</td>\n",
       "      <td>4992</td>\n",
       "      <td>2910</td>\n",
       "      <td>2020</td>\n",
       "      <td>6941</td>\n",
       "      <td>334838</td>\n",
       "      <td>67910</td>\n",
       "      <td>971</td>\n",
       "      <td>795</td>\n",
       "      <td>162</td>\n",
       "      <td>178794</td>\n",
       "      <td>146371</td>\n",
       "      <td>30643</td>\n",
       "      <td>163795</td>\n",
       "      <td>136391</td>\n",
       "      <td>24948</td>\n",
       "      <td>36215</td>\n",
       "      <td>28817</td>\n",
       "      <td>6752</td>\n",
       "      <td>740</td>\n",
       "      <td>70110</td>\n",
       "      <td>229137.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$136,025,503</td>\n",
       "      <td>96300000</td>\n",
       "      <td>$232,325,503</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>7</td>\n",
       "      <td>Arrival (2016)</td>\n",
       "      <td>8.0</td>\n",
       "      <td>370842</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Mystery</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>81.0</td>\n",
       "      <td>47000000.0</td>\n",
       "      <td>116 min</td>\n",
       "      <td>55533</td>\n",
       "      <td>87850</td>\n",
       "      <td>109536</td>\n",
       "      <td>65440</td>\n",
       "      <td>26913</td>\n",
       "      <td>10556</td>\n",
       "      <td>5057</td>\n",
       "      <td>3083</td>\n",
       "      <td>2194</td>\n",
       "      <td>4734</td>\n",
       "      <td>237437</td>\n",
       "      <td>46272</td>\n",
       "      <td>1943</td>\n",
       "      <td>1544</td>\n",
       "      <td>376</td>\n",
       "      <td>126301</td>\n",
       "      <td>101741</td>\n",
       "      <td>23163</td>\n",
       "      <td>111985</td>\n",
       "      <td>95005</td>\n",
       "      <td>15227</td>\n",
       "      <td>24027</td>\n",
       "      <td>20118</td>\n",
       "      <td>3440</td>\n",
       "      <td>537</td>\n",
       "      <td>42062</td>\n",
       "      <td>163774.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$100,546,139</td>\n",
       "      <td>102842047</td>\n",
       "      <td>$203,388,186</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>9</td>\n",
       "      <td>Before Midnight (2013)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>106553</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>3000000.0</td>\n",
       "      <td>109 min</td>\n",
       "      <td>16953</td>\n",
       "      <td>22109</td>\n",
       "      <td>31439</td>\n",
       "      <td>19251</td>\n",
       "      <td>8142</td>\n",
       "      <td>3412</td>\n",
       "      <td>1649</td>\n",
       "      <td>1033</td>\n",
       "      <td>826</td>\n",
       "      <td>1745</td>\n",
       "      <td>67076</td>\n",
       "      <td>23823</td>\n",
       "      <td>208</td>\n",
       "      <td>138</td>\n",
       "      <td>66</td>\n",
       "      <td>43312</td>\n",
       "      <td>30016</td>\n",
       "      <td>12857</td>\n",
       "      <td>37072</td>\n",
       "      <td>28401</td>\n",
       "      <td>8189</td>\n",
       "      <td>7479</td>\n",
       "      <td>5891</td>\n",
       "      <td>1470</td>\n",
       "      <td>447</td>\n",
       "      <td>12382</td>\n",
       "      <td>59116.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.4</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.5</td>\n",
       "      <td>7.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$8,114,627</td>\n",
       "      <td>3061842</td>\n",
       "      <td>$11,176,469</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   3            50/50 (2011)      ...          4173591   $39,187,783 \n",
       "0  4       About Time (2013)      ...         71777528   $87,100,449 \n",
       "1  5            Amour (2012)      ...         13100000   $19,839,492 \n",
       "2  6             Argo (2012)      ...         96300000  $232,325,503 \n",
       "3  7          Arrival (2016)      ...        102842047  $203,388,186 \n",
       "4  9  Before Midnight (2013)      ...          3061842   $11,176,469 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 18,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=1, header=3)\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "13fd726b3eff8b3bff121aafb75a41b661ffaf81"
   },
   "source": [
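    "### 5. No header\n",
    "* Set `header=None` to tell pandas not to use any row as the header. The columns are then labeled with integers (0, 1, 2, ...), and the sheet's original header row shows up as the first row of data."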
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 19,
   "metadata": {
    "_uuid": "e141cd99952beebdfc5cf54efd844c819983da8b"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>0</th>\n",
       "      <th>1</th>\n",
       "      <th>2</th>\n",
       "      <th>3</th>\n",
       "      <th>4</th>\n",
       "      <th>5</th>\n",
       "      <th>6</th>\n",
       "      <th>7</th>\n",
       "      <th>8</th>\n",
       "      <th>9</th>\n",
       "      <th>10</th>\n",
       "      <th>11</th>\n",
       "      <th>12</th>\n",
       "      <th>13</th>\n",
       "      <th>14</th>\n",
       "      <th>15</th>\n",
       "      <th>16</th>\n",
       "      <th>17</th>\n",
       "      <th>18</th>\n",
       "      <th>19</th>\n",
       "      <th>20</th>\n",
       "      <th>21</th>\n",
       "      <th>22</th>\n",
       "      <th>23</th>\n",
       "      <th>24</th>\n",
       "      <th>25</th>\n",
       "      <th>26</th>\n",
       "      <th>27</th>\n",
       "      <th>28</th>\n",
       "      <th>29</th>\n",
       "      <th>30</th>\n",
       "      <th>31</th>\n",
       "      <th>32</th>\n",
       "      <th>33</th>\n",
       "      <th>34</th>\n",
       "      <th>35</th>\n",
       "      <th>36</th>\n",
       "      <th>37</th>\n",
       "      <th>38</th>\n",
       "      <th>39</th>\n",
       "      <th>40</th>\n",
       "      <th>41</th>\n",
       "      <th>42</th>\n",
       "      <th>43</th>\n",
       "      <th>44</th>\n",
       "      <th>45</th>\n",
       "      <th>46</th>\n",
       "      <th>47</th>\n",
       "      <th>48</th>\n",
       "      <th>49</th>\n",
       "      <th>50</th>\n",
       "      <th>51</th>\n",
       "      <th>52</th>\n",
       "      <th>53</th>\n",
       "      <th>54</th>\n",
       "      <th>55</th>\n",
       "      <th>56</th>\n",
       "      <th>57</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>X</td>\n",
       "      <td>Title</td>\n",
       "      <td>Rating</td>\n",
       "      <td>TotalVotes</td>\n",
       "      <td>Genre1</td>\n",
       "      <td>Genre2</td>\n",
       "      <td>Genre3</td>\n",
       "      <td>MetaCritic</td>\n",
       "      <td>Budget</td>\n",
       "      <td>Runtime</td>\n",
       "      <td>CVotes10</td>\n",
       "      <td>CVotes09</td>\n",
       "      <td>CVotes08</td>\n",
       "      <td>CVotes07</td>\n",
       "      <td>CVotes06</td>\n",
       "      <td>CVotes05</td>\n",
       "      <td>CVotes04</td>\n",
       "      <td>CVotes03</td>\n",
       "      <td>CVotes02</td>\n",
       "      <td>CVotes01</td>\n",
       "      <td>CVotesMale</td>\n",
       "      <td>CVotesFemale</td>\n",
       "      <td>CVotesU18</td>\n",
       "      <td>CVotesU18M</td>\n",
       "      <td>CVotesU18F</td>\n",
       "      <td>CVotes1829</td>\n",
       "      <td>CVotes1829M</td>\n",
       "      <td>CVotes1829F</td>\n",
       "      <td>CVotes3044</td>\n",
       "      <td>CVotes3044M</td>\n",
       "      <td>CVotes3044F</td>\n",
       "      <td>CVotes45A</td>\n",
       "      <td>CVotes45AM</td>\n",
       "      <td>CVotes45AF</td>\n",
       "      <td>CVotes1000</td>\n",
       "      <td>CVotesUS</td>\n",
       "      <td>CVotesnUS</td>\n",
       "      <td>VotesM</td>\n",
       "      <td>VotesF</td>\n",
       "      <td>VotesU18</td>\n",
       "      <td>VotesU18M</td>\n",
       "      <td>VotesU18F</td>\n",
       "      <td>Votes1829</td>\n",
       "      <td>Votes1829M</td>\n",
       "      <td>Votes1829F</td>\n",
       "      <td>Votes3044</td>\n",
       "      <td>Votes3044M</td>\n",
       "      <td>Votes3044F</td>\n",
       "      <td>Votes45A</td>\n",
       "      <td>Votes45AM</td>\n",
       "      <td>Votes45AF</td>\n",
       "      <td>VotesIMDB</td>\n",
       "      <td>Votes1000</td>\n",
       "      <td>VotesUS</td>\n",
       "      <td>VotesnUS</td>\n",
       "      <td>Domestic</td>\n",
       "      <td>Foreign</td>\n",
       "      <td>Worldwide</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96</td>\n",
       "      <td>20000000</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82</td>\n",
       "      <td>18000000</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72</td>\n",
       "      <td>8000000</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8</td>\n",
       "      <td>8</td>\n",
       "      <td>8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "  0                        1       ...               56             57\n",
       "0  X                    Title      ...          Foreign      Worldwide\n",
       "1  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "2  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "3  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "4  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 19,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=1, header=None)\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "de562cd865358ff0204833a4f8e39fff69f4674e"
   },
   "source": [
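    "### 6. Skip rows at the beginning of the file\n",
    "* Use the `skiprows` parameter to skip rows at the start of the sheet. The first row that is not skipped is then used as the header."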
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 20,
   "metadata": {
    "_uuid": "5efd37ea4d594331b30954a4c7ce6231ca53430e",
    "scrolled": true
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>7</th>\n",
       "      <th>Arrival (2016)</th>\n",
       "      <th>8</th>\n",
       "      <th>370842</th>\n",
       "      <th>Drama</th>\n",
       "      <th>Mystery</th>\n",
       "      <th>Sci-Fi</th>\n",
       "      <th>81</th>\n",
       "      <th>47000000</th>\n",
       "      <th>116 min</th>\n",
       "      <th>55533</th>\n",
       "      <th>87850</th>\n",
       "      <th>109536</th>\n",
       "      <th>65440</th>\n",
       "      <th>26913</th>\n",
       "      <th>10556</th>\n",
       "      <th>5057</th>\n",
       "      <th>3083</th>\n",
       "      <th>2194</th>\n",
       "      <th>4734</th>\n",
       "      <th>237437</th>\n",
       "      <th>46272</th>\n",
       "      <th>1943</th>\n",
       "      <th>1544</th>\n",
       "      <th>376</th>\n",
       "      <th>126301</th>\n",
       "      <th>101741</th>\n",
       "      <th>23163</th>\n",
       "      <th>111985</th>\n",
       "      <th>95005</th>\n",
       "      <th>15227</th>\n",
       "      <th>24027</th>\n",
       "      <th>20118</th>\n",
       "      <th>3440</th>\n",
       "      <th>537</th>\n",
       "      <th>42062</th>\n",
       "      <th>163774</th>\n",
       "      <th>7.9</th>\n",
       "      <th>8.1</th>\n",
       "      <th>8.6</th>\n",
       "      <th>8.6.1</th>\n",
       "      <th>8.4</th>\n",
       "      <th>8.2</th>\n",
       "      <th>8.2.1</th>\n",
       "      <th>8.1</th>\n",
       "      <th>7.8</th>\n",
       "      <th>7.8.1</th>\n",
       "      <th>7.8.2</th>\n",
       "      <th>7.6</th>\n",
       "      <th>7.6.1</th>\n",
       "      <th>7.7</th>\n",
       "      <th>8.3</th>\n",
       "      <th>7.3</th>\n",
       "      <th>8.2</th>\n",
       "      <th>7.9.1</th>\n",
       "      <th>$100,546,139</th>\n",
       "      <th>102842047</th>\n",
       "      <th>$203,388,186</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>9</td>\n",
       "      <td>Before Midnight (2013)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>106553</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>3000000.0</td>\n",
       "      <td>109 min</td>\n",
       "      <td>16953</td>\n",
       "      <td>22109</td>\n",
       "      <td>31439</td>\n",
       "      <td>19251</td>\n",
       "      <td>8142</td>\n",
       "      <td>3412</td>\n",
       "      <td>1649</td>\n",
       "      <td>1033</td>\n",
       "      <td>826</td>\n",
       "      <td>1745</td>\n",
       "      <td>67076</td>\n",
       "      <td>23823</td>\n",
       "      <td>208</td>\n",
       "      <td>138</td>\n",
       "      <td>66</td>\n",
       "      <td>43312</td>\n",
       "      <td>30016</td>\n",
       "      <td>12857</td>\n",
       "      <td>37072</td>\n",
       "      <td>28401</td>\n",
       "      <td>8189</td>\n",
       "      <td>7479</td>\n",
       "      <td>5891</td>\n",
       "      <td>1470</td>\n",
       "      <td>447</td>\n",
       "      <td>12382</td>\n",
       "      <td>59116.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.4</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.5</td>\n",
       "      <td>7.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$8,114,627</td>\n",
       "      <td>3061842</td>\n",
       "      <td>$11,176,469</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>10</td>\n",
       "      <td>Big Hero 6 (2014)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>315485</td>\n",
       "      <td>Animation</td>\n",
       "      <td>Action</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>74.0</td>\n",
       "      <td>165000000.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>50311</td>\n",
       "      <td>61304</td>\n",
       "      <td>103726</td>\n",
       "      <td>65681</td>\n",
       "      <td>22389</td>\n",
       "      <td>6830</td>\n",
       "      <td>2251</td>\n",
       "      <td>1036</td>\n",
       "      <td>539</td>\n",
       "      <td>1439</td>\n",
       "      <td>187383</td>\n",
       "      <td>58731</td>\n",
       "      <td>2446</td>\n",
       "      <td>1571</td>\n",
       "      <td>855</td>\n",
       "      <td>128237</td>\n",
       "      <td>91744</td>\n",
       "      <td>35122</td>\n",
       "      <td>84098</td>\n",
       "      <td>68040</td>\n",
       "      <td>14796</td>\n",
       "      <td>13974</td>\n",
       "      <td>11304</td>\n",
       "      <td>2400</td>\n",
       "      <td>525</td>\n",
       "      <td>36702</td>\n",
       "      <td>131818.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$222,527,828</td>\n",
       "      <td>435290784</td>\n",
       "      <td>$657,818,612</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>11</td>\n",
       "      <td>Birdman or (The Unexpected Virtue of Ignorance...</td>\n",
       "      <td>7.8</td>\n",
       "      <td>448725</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>NaN</td>\n",
       "      <td>88.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>119 min</td>\n",
       "      <td>60209</td>\n",
       "      <td>94476</td>\n",
       "      <td>121637</td>\n",
       "      <td>80828</td>\n",
       "      <td>38373</td>\n",
       "      <td>19161</td>\n",
       "      <td>10116</td>\n",
       "      <td>6750</td>\n",
       "      <td>5378</td>\n",
       "      <td>11807</td>\n",
       "      <td>292808</td>\n",
       "      <td>63310</td>\n",
       "      <td>1891</td>\n",
       "      <td>1538</td>\n",
       "      <td>334</td>\n",
       "      <td>178850</td>\n",
       "      <td>142244</td>\n",
       "      <td>34666</td>\n",
       "      <td>129547</td>\n",
       "      <td>108049</td>\n",
       "      <td>19457</td>\n",
       "      <td>26016</td>\n",
       "      <td>21166</td>\n",
       "      <td>4329</td>\n",
       "      <td>656</td>\n",
       "      <td>52288</td>\n",
       "      <td>203731.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.5</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.1</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$42,340,598</td>\n",
       "      <td>60874496</td>\n",
       "      <td>$103,215,094</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>12</td>\n",
       "      <td>Black Swan (2010)</td>\n",
       "      <td>8.0</td>\n",
       "      <td>587893</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Thriller</td>\n",
       "      <td>NaN</td>\n",
       "      <td>79.0</td>\n",
       "      <td>13000000.0</td>\n",
       "      <td>108 min</td>\n",
       "      <td>93798</td>\n",
       "      <td>136615</td>\n",
       "      <td>174500</td>\n",
       "      <td>97826</td>\n",
       "      <td>40319</td>\n",
       "      <td>16993</td>\n",
       "      <td>9084</td>\n",
       "      <td>6065</td>\n",
       "      <td>3981</td>\n",
       "      <td>8726</td>\n",
       "      <td>356707</td>\n",
       "      <td>143077</td>\n",
       "      <td>1112</td>\n",
       "      <td>583</td>\n",
       "      <td>516</td>\n",
       "      <td>244970</td>\n",
       "      <td>159567</td>\n",
       "      <td>82856</td>\n",
       "      <td>204465</td>\n",
       "      <td>156163</td>\n",
       "      <td>45352</td>\n",
       "      <td>35111</td>\n",
       "      <td>27022</td>\n",
       "      <td>7459</td>\n",
       "      <td>802</td>\n",
       "      <td>86552</td>\n",
       "      <td>306578.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$106,954,678</td>\n",
       "      <td>222443368</td>\n",
       "      <td>$329,398,046</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>13</td>\n",
       "      <td>Boyhood (2014)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>290327</td>\n",
       "      <td>Drama</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>100.0</td>\n",
       "      <td>4000000.0</td>\n",
       "      <td>165 min</td>\n",
       "      <td>49673</td>\n",
       "      <td>62055</td>\n",
       "      <td>76838</td>\n",
       "      <td>52238</td>\n",
       "      <td>23789</td>\n",
       "      <td>10431</td>\n",
       "      <td>4906</td>\n",
       "      <td>3071</td>\n",
       "      <td>2248</td>\n",
       "      <td>5086</td>\n",
       "      <td>183807</td>\n",
       "      <td>51558</td>\n",
       "      <td>1393</td>\n",
       "      <td>995</td>\n",
       "      <td>381</td>\n",
       "      <td>123006</td>\n",
       "      <td>92639</td>\n",
       "      <td>29076</td>\n",
       "      <td>81594</td>\n",
       "      <td>65261</td>\n",
       "      <td>15118</td>\n",
       "      <td>17881</td>\n",
       "      <td>13995</td>\n",
       "      <td>3567</td>\n",
       "      <td>559</td>\n",
       "      <td>36433</td>\n",
       "      <td>134679.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$25,352,281</td>\n",
       "      <td>19143000</td>\n",
       "      <td>$44,495,281</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>14</td>\n",
       "      <td>Bridge of Spies (2015)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>223756</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>Thriller</td>\n",
       "      <td>81.0</td>\n",
       "      <td>40000000.0</td>\n",
       "      <td>142 min</td>\n",
       "      <td>15757</td>\n",
       "      <td>32840</td>\n",
       "      <td>83322</td>\n",
       "      <td>63800</td>\n",
       "      <td>19183</td>\n",
       "      <td>5178</td>\n",
       "      <td>1657</td>\n",
       "      <td>735</td>\n",
       "      <td>419</td>\n",
       "      <td>878</td>\n",
       "      <td>152707</td>\n",
       "      <td>23978</td>\n",
       "      <td>846</td>\n",
       "      <td>732</td>\n",
       "      <td>104</td>\n",
       "      <td>76784</td>\n",
       "      <td>64810</td>\n",
       "      <td>11177</td>\n",
       "      <td>70780</td>\n",
       "      <td>61525</td>\n",
       "      <td>8196</td>\n",
       "      <td>18494</td>\n",
       "      <td>15504</td>\n",
       "      <td>2667</td>\n",
       "      <td>545</td>\n",
       "      <td>24273</td>\n",
       "      <td>105678.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>$72,313,754</td>\n",
       "      <td>93164594</td>\n",
       "      <td>$165,478,348</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>15</td>\n",
       "      <td>Captain America: Civil War (2016)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>431555</td>\n",
       "      <td>Action</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>75.0</td>\n",
       "      <td>250000000.0</td>\n",
       "      <td>147 min</td>\n",
       "      <td>81893</td>\n",
       "      <td>90156</td>\n",
       "      <td>117188</td>\n",
       "      <td>79377</td>\n",
       "      <td>32782</td>\n",
       "      <td>12322</td>\n",
       "      <td>5095</td>\n",
       "      <td>2994</td>\n",
       "      <td>1989</td>\n",
       "      <td>7786</td>\n",
       "      <td>264239</td>\n",
       "      <td>43818</td>\n",
       "      <td>3572</td>\n",
       "      <td>2865</td>\n",
       "      <td>683</td>\n",
       "      <td>148991</td>\n",
       "      <td>124124</td>\n",
       "      <td>23355</td>\n",
       "      <td>105069</td>\n",
       "      <td>91345</td>\n",
       "      <td>12135</td>\n",
       "      <td>19151</td>\n",
       "      <td>16351</td>\n",
       "      <td>2459</td>\n",
       "      <td>593</td>\n",
       "      <td>48777</td>\n",
       "      <td>153638.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$408,084,349</td>\n",
       "      <td>745220146</td>\n",
       "      <td>$1,153,304,495</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7</th>\n",
       "      <td>16</td>\n",
       "      <td>Captain America: The Winter Soldier (2014)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>552706</td>\n",
       "      <td>Action</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>70.0</td>\n",
       "      <td>170000000.0</td>\n",
       "      <td>136 min</td>\n",
       "      <td>84943</td>\n",
       "      <td>103896</td>\n",
       "      <td>169440</td>\n",
       "      <td>120197</td>\n",
       "      <td>44124</td>\n",
       "      <td>14639</td>\n",
       "      <td>5571</td>\n",
       "      <td>2735</td>\n",
       "      <td>1932</td>\n",
       "      <td>5248</td>\n",
       "      <td>360615</td>\n",
       "      <td>66751</td>\n",
       "      <td>3765</td>\n",
       "      <td>2900</td>\n",
       "      <td>844</td>\n",
       "      <td>208526</td>\n",
       "      <td>170111</td>\n",
       "      <td>36456</td>\n",
       "      <td>150264</td>\n",
       "      <td>129500</td>\n",
       "      <td>18637</td>\n",
       "      <td>28922</td>\n",
       "      <td>24313</td>\n",
       "      <td>4103</td>\n",
       "      <td>720</td>\n",
       "      <td>72120</td>\n",
       "      <td>213180.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.5</td>\n",
       "      <td>$259,766,572</td>\n",
       "      <td>454497695</td>\n",
       "      <td>$714,264,267</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>8</th>\n",
       "      <td>17</td>\n",
       "      <td>Captain Fantastic (2016)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>115194</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>NaN</td>\n",
       "      <td>72.0</td>\n",
       "      <td>5000000.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>16165</td>\n",
       "      <td>24762</td>\n",
       "      <td>39686</td>\n",
       "      <td>22429</td>\n",
       "      <td>7134</td>\n",
       "      <td>2255</td>\n",
       "      <td>982</td>\n",
       "      <td>542</td>\n",
       "      <td>419</td>\n",
       "      <td>832</td>\n",
       "      <td>71760</td>\n",
       "      <td>19138</td>\n",
       "      <td>447</td>\n",
       "      <td>329</td>\n",
       "      <td>112</td>\n",
       "      <td>40918</td>\n",
       "      <td>30740</td>\n",
       "      <td>9707</td>\n",
       "      <td>36357</td>\n",
       "      <td>29410</td>\n",
       "      <td>6414</td>\n",
       "      <td>8123</td>\n",
       "      <td>6521</td>\n",
       "      <td>1433</td>\n",
       "      <td>351</td>\n",
       "      <td>10694</td>\n",
       "      <td>56956.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.5</td>\n",
       "      <td>6.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$5,879,736</td>\n",
       "      <td>n/a</td>\n",
       "      <td>$5,879,736</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9</th>\n",
       "      <td>18</td>\n",
       "      <td>Captain Phillips (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>350818</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Thriller</td>\n",
       "      <td>83.0</td>\n",
       "      <td>55000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>37461</td>\n",
       "      <td>70216</td>\n",
       "      <td>133266</td>\n",
       "      <td>76657</td>\n",
       "      <td>21791</td>\n",
       "      <td>6099</td>\n",
       "      <td>2051</td>\n",
       "      <td>1062</td>\n",
       "      <td>707</td>\n",
       "      <td>1517</td>\n",
       "      <td>247889</td>\n",
       "      <td>41602</td>\n",
       "      <td>995</td>\n",
       "      <td>838</td>\n",
       "      <td>147</td>\n",
       "      <td>131052</td>\n",
       "      <td>110723</td>\n",
       "      <td>19092</td>\n",
       "      <td>114418</td>\n",
       "      <td>98191</td>\n",
       "      <td>14686</td>\n",
       "      <td>24670</td>\n",
       "      <td>20178</td>\n",
       "      <td>4053</td>\n",
       "      <td>633</td>\n",
       "      <td>43042</td>\n",
       "      <td>165981.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$107,100,855</td>\n",
       "      <td>111690956</td>\n",
       "      <td>$218,791,811</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    7       ...           $203,388,186 \n",
       "0   9       ...            $11,176,469 \n",
       "1  10       ...           $657,818,612 \n",
       "2  11       ...           $103,215,094 \n",
       "3  12       ...           $329,398,046 \n",
       "4  13       ...            $44,495,281 \n",
       "5  14       ...           $165,478,348 \n",
       "6  15       ...         $1,153,304,495 \n",
       "7  16       ...           $714,264,267 \n",
       "8  17       ...             $5,879,736 \n",
       "9  18       ...           $218,791,811 \n",
       "\n",
       "[10 rows x 58 columns]"
      ]
     },
     "execution_count": 20,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=1, skiprows=7)\n",
    "df.head(10)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "caea4db03dbaf4eb71c6a358bf229a64f75b6855"
   },
   "source": [
    "### 7.Skip rows from the end of the file"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 21,
   "metadata": {
    "_uuid": "2066ad264aaf7583b824c7283fcd31f0070d9588"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>107</th>\n",
       "      <td>109</td>\n",
       "      <td>True Grit (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>257670</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Western</td>\n",
       "      <td>80.0</td>\n",
       "      <td>38000000.0</td>\n",
       "      <td>110 min</td>\n",
       "      <td>21094</td>\n",
       "      <td>40901</td>\n",
       "      <td>91825</td>\n",
       "      <td>67175</td>\n",
       "      <td>23055</td>\n",
       "      <td>7191</td>\n",
       "      <td>2678</td>\n",
       "      <td>1305</td>\n",
       "      <td>779</td>\n",
       "      <td>1672</td>\n",
       "      <td>197105</td>\n",
       "      <td>27125</td>\n",
       "      <td>381</td>\n",
       "      <td>340</td>\n",
       "      <td>35</td>\n",
       "      <td>89394</td>\n",
       "      <td>76864</td>\n",
       "      <td>11720</td>\n",
       "      <td>104201</td>\n",
       "      <td>91807</td>\n",
       "      <td>11163</td>\n",
       "      <td>25641</td>\n",
       "      <td>21885</td>\n",
       "      <td>3369</td>\n",
       "      <td>747</td>\n",
       "      <td>53749</td>\n",
       "      <td>137672.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$171,243,005</td>\n",
       "      <td>81033922</td>\n",
       "      <td>$252,276,927</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>108</th>\n",
       "      <td>110</td>\n",
       "      <td>Tucker and Dale vs Evil (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>138624</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Horror</td>\n",
       "      <td>NaN</td>\n",
       "      <td>65.0</td>\n",
       "      <td>5000000.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>16572</td>\n",
       "      <td>19818</td>\n",
       "      <td>44460</td>\n",
       "      <td>35863</td>\n",
       "      <td>13456</td>\n",
       "      <td>4588</td>\n",
       "      <td>1684</td>\n",
       "      <td>855</td>\n",
       "      <td>479</td>\n",
       "      <td>848</td>\n",
       "      <td>106144</td>\n",
       "      <td>15113</td>\n",
       "      <td>219</td>\n",
       "      <td>198</td>\n",
       "      <td>20</td>\n",
       "      <td>52889</td>\n",
       "      <td>45169</td>\n",
       "      <td>7232</td>\n",
       "      <td>56379</td>\n",
       "      <td>49634</td>\n",
       "      <td>6156</td>\n",
       "      <td>8861</td>\n",
       "      <td>7645</td>\n",
       "      <td>1072</td>\n",
       "      <td>540</td>\n",
       "      <td>26213</td>\n",
       "      <td>73918.0</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.1</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>$223,838</td>\n",
       "      <td>4525678</td>\n",
       "      <td>$4,749,516</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>109</th>\n",
       "      <td>111</td>\n",
       "      <td>Tyrannosaur (2011)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>26016</td>\n",
       "      <td>Drama</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>65.0</td>\n",
       "      <td>1000000.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>2060</td>\n",
       "      <td>4083</td>\n",
       "      <td>9078</td>\n",
       "      <td>6754</td>\n",
       "      <td>2468</td>\n",
       "      <td>755</td>\n",
       "      <td>310</td>\n",
       "      <td>146</td>\n",
       "      <td>111</td>\n",
       "      <td>251</td>\n",
       "      <td>19827</td>\n",
       "      <td>3649</td>\n",
       "      <td>6</td>\n",
       "      <td>6</td>\n",
       "      <td>7314</td>\n",
       "      <td>5920</td>\n",
       "      <td>1338</td>\n",
       "      <td>12497</td>\n",
       "      <td>10628</td>\n",
       "      <td>1724</td>\n",
       "      <td>3311</td>\n",
       "      <td>2784</td>\n",
       "      <td>480</td>\n",
       "      <td>4</td>\n",
       "      <td>2231</td>\n",
       "      <td>18173</td>\n",
       "      <td>NaN</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>6.0</td>\n",
       "      <td>6.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>5.8</td>\n",
       "      <td>6.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.6</td>\n",
       "      <td>NaN</td>\n",
       "      <td>$22,321</td>\n",
       "      <td>n/a</td>\n",
       "      <td>$22,321</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>110</th>\n",
       "      <td>112</td>\n",
       "      <td>Warrior (2011)</td>\n",
       "      <td>8.2</td>\n",
       "      <td>361049</td>\n",
       "      <td>Action</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Sport</td>\n",
       "      <td>71.0</td>\n",
       "      <td>25000000.0</td>\n",
       "      <td>140 min</td>\n",
       "      <td>74983</td>\n",
       "      <td>96953</td>\n",
       "      <td>106673</td>\n",
       "      <td>52972</td>\n",
       "      <td>16668</td>\n",
       "      <td>5727</td>\n",
       "      <td>2353</td>\n",
       "      <td>1205</td>\n",
       "      <td>1050</td>\n",
       "      <td>2479</td>\n",
       "      <td>270734</td>\n",
       "      <td>31075</td>\n",
       "      <td>673</td>\n",
       "      <td>583</td>\n",
       "      <td>84</td>\n",
       "      <td>153824</td>\n",
       "      <td>136536</td>\n",
       "      <td>16000</td>\n",
       "      <td>117636</td>\n",
       "      <td>105144</td>\n",
       "      <td>11019</td>\n",
       "      <td>15201</td>\n",
       "      <td>12960</td>\n",
       "      <td>1990</td>\n",
       "      <td>586</td>\n",
       "      <td>45342</td>\n",
       "      <td>176397.0</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>$13,657,115</td>\n",
       "      <td>9400000</td>\n",
       "      <td>$23,057,115</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>111</th>\n",
       "      <td>113</td>\n",
       "      <td>What We Do in the Shadows (2014)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>87975</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Horror</td>\n",
       "      <td>NaN</td>\n",
       "      <td>76.0</td>\n",
       "      <td>1600000.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>10485</td>\n",
       "      <td>14507</td>\n",
       "      <td>28608</td>\n",
       "      <td>20735</td>\n",
       "      <td>7696</td>\n",
       "      <td>2802</td>\n",
       "      <td>1200</td>\n",
       "      <td>721</td>\n",
       "      <td>450</td>\n",
       "      <td>781</td>\n",
       "      <td>57028</td>\n",
       "      <td>15840</td>\n",
       "      <td>268</td>\n",
       "      <td>209</td>\n",
       "      <td>56</td>\n",
       "      <td>32406</td>\n",
       "      <td>23869</td>\n",
       "      <td>8125</td>\n",
       "      <td>31707</td>\n",
       "      <td>25592</td>\n",
       "      <td>5689</td>\n",
       "      <td>6013</td>\n",
       "      <td>4740</td>\n",
       "      <td>1138</td>\n",
       "      <td>389</td>\n",
       "      <td>12341</td>\n",
       "      <td>45062.0</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>$3,469,224</td>\n",
       "      <td>2794000</td>\n",
       "      <td>$6,263,224</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>112</th>\n",
       "      <td>114</td>\n",
       "      <td>Whiplash (2014)</td>\n",
       "      <td>8.5</td>\n",
       "      <td>492285</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Music</td>\n",
       "      <td>NaN</td>\n",
       "      <td>88.0</td>\n",
       "      <td>3300000.0</td>\n",
       "      <td>107 min</td>\n",
       "      <td>110404</td>\n",
       "      <td>161864</td>\n",
       "      <td>132656</td>\n",
       "      <td>56007</td>\n",
       "      <td>16577</td>\n",
       "      <td>6031</td>\n",
       "      <td>2937</td>\n",
       "      <td>1859</td>\n",
       "      <td>1263</td>\n",
       "      <td>2723</td>\n",
       "      <td>308900</td>\n",
       "      <td>71066</td>\n",
       "      <td>2878</td>\n",
       "      <td>2200</td>\n",
       "      <td>660</td>\n",
       "      <td>205839</td>\n",
       "      <td>161853</td>\n",
       "      <td>41944</td>\n",
       "      <td>123712</td>\n",
       "      <td>102839</td>\n",
       "      <td>19018</td>\n",
       "      <td>23345</td>\n",
       "      <td>19072</td>\n",
       "      <td>3812</td>\n",
       "      <td>590</td>\n",
       "      <td>49868</td>\n",
       "      <td>213952.0</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.4</td>\n",
       "      <td>9.0</td>\n",
       "      <td>9.1</td>\n",
       "      <td>8.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.4</td>\n",
       "      <td>$13,092,000</td>\n",
       "      <td>35890041</td>\n",
       "      <td>$48,982,041</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>113</th>\n",
       "      <td>115</td>\n",
       "      <td>Wreck-It Ralph (2012)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>295125</td>\n",
       "      <td>Animation</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>72.0</td>\n",
       "      <td>165000000.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>41980</td>\n",
       "      <td>50262</td>\n",
       "      <td>96477</td>\n",
       "      <td>67934</td>\n",
       "      <td>24894</td>\n",
       "      <td>7748</td>\n",
       "      <td>2724</td>\n",
       "      <td>1190</td>\n",
       "      <td>703</td>\n",
       "      <td>1226</td>\n",
       "      <td>190983</td>\n",
       "      <td>50202</td>\n",
       "      <td>1663</td>\n",
       "      <td>1182</td>\n",
       "      <td>467</td>\n",
       "      <td>120962</td>\n",
       "      <td>90759</td>\n",
       "      <td>29003</td>\n",
       "      <td>90203</td>\n",
       "      <td>74767</td>\n",
       "      <td>14148</td>\n",
       "      <td>13706</td>\n",
       "      <td>11356</td>\n",
       "      <td>2112</td>\n",
       "      <td>614</td>\n",
       "      <td>44962</td>\n",
       "      <td>129487.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$189,422,889</td>\n",
       "      <td>281800000</td>\n",
       "      <td>$471,222,889</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>114</th>\n",
       "      <td>116</td>\n",
       "      <td>X-Men: Days of Future Past (2014)</td>\n",
       "      <td>8.0</td>\n",
       "      <td>560736</td>\n",
       "      <td>Action</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>74.0</td>\n",
       "      <td>200000000.0</td>\n",
       "      <td>132 min</td>\n",
       "      <td>91765</td>\n",
       "      <td>127521</td>\n",
       "      <td>183578</td>\n",
       "      <td>104658</td>\n",
       "      <td>33027</td>\n",
       "      <td>10059</td>\n",
       "      <td>3710</td>\n",
       "      <td>1903</td>\n",
       "      <td>1225</td>\n",
       "      <td>3301</td>\n",
       "      <td>370835</td>\n",
       "      <td>71008</td>\n",
       "      <td>3038</td>\n",
       "      <td>2403</td>\n",
       "      <td>614</td>\n",
       "      <td>220178</td>\n",
       "      <td>179039</td>\n",
       "      <td>39094</td>\n",
       "      <td>158607</td>\n",
       "      <td>135392</td>\n",
       "      <td>20927</td>\n",
       "      <td>26834</td>\n",
       "      <td>22460</td>\n",
       "      <td>3884</td>\n",
       "      <td>710</td>\n",
       "      <td>67889</td>\n",
       "      <td>229049.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.2</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.9</td>\n",
       "      <td>$233,921,534</td>\n",
       "      <td>513941241</td>\n",
       "      <td>$747,862,775</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>115</th>\n",
       "      <td>117</td>\n",
       "      <td>X-Men: First Class (2011)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>556713</td>\n",
       "      <td>Action</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Sci-Fi</td>\n",
       "      <td>65.0</td>\n",
       "      <td>160000000.0</td>\n",
       "      <td>132 min</td>\n",
       "      <td>64428</td>\n",
       "      <td>96219</td>\n",
       "      <td>200144</td>\n",
       "      <td>129352</td>\n",
       "      <td>41945</td>\n",
       "      <td>12861</td>\n",
       "      <td>4799</td>\n",
       "      <td>2349</td>\n",
       "      <td>1448</td>\n",
       "      <td>3182</td>\n",
       "      <td>382107</td>\n",
       "      <td>80444</td>\n",
       "      <td>2075</td>\n",
       "      <td>1612</td>\n",
       "      <td>443</td>\n",
       "      <td>223309</td>\n",
       "      <td>176821</td>\n",
       "      <td>44428</td>\n",
       "      <td>185909</td>\n",
       "      <td>157332</td>\n",
       "      <td>26094</td>\n",
       "      <td>30217</td>\n",
       "      <td>25051</td>\n",
       "      <td>4691</td>\n",
       "      <td>780</td>\n",
       "      <td>87542</td>\n",
       "      <td>257681.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.3</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$146,408,305</td>\n",
       "      <td>207215819</td>\n",
       "      <td>$353,624,124</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>116</th>\n",
       "      <td>118</td>\n",
       "      <td>Zootopia (2016)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>309474</td>\n",
       "      <td>Animation</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>78.0</td>\n",
       "      <td>150000000.0</td>\n",
       "      <td>108 min</td>\n",
       "      <td>53626</td>\n",
       "      <td>70912</td>\n",
       "      <td>102352</td>\n",
       "      <td>57261</td>\n",
       "      <td>16719</td>\n",
       "      <td>4539</td>\n",
       "      <td>1467</td>\n",
       "      <td>733</td>\n",
       "      <td>496</td>\n",
       "      <td>1386</td>\n",
       "      <td>176202</td>\n",
       "      <td>52345</td>\n",
       "      <td>2362</td>\n",
       "      <td>1641</td>\n",
       "      <td>706</td>\n",
       "      <td>119637</td>\n",
       "      <td>87499</td>\n",
       "      <td>30813</td>\n",
       "      <td>75474</td>\n",
       "      <td>61358</td>\n",
       "      <td>13034</td>\n",
       "      <td>12353</td>\n",
       "      <td>9959</td>\n",
       "      <td>2151</td>\n",
       "      <td>518</td>\n",
       "      <td>35975</td>\n",
       "      <td>122844.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$341,268,248</td>\n",
       "      <td>682515947</td>\n",
       "      <td>$1,023,784,195</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       X       ...               Worldwide\n",
       "107  109       ...           $252,276,927 \n",
       "108  110       ...             $4,749,516 \n",
       "109  111       ...                $22,321 \n",
       "110  112       ...            $23,057,115 \n",
       "111  113       ...             $6,263,224 \n",
       "112  114       ...            $48,982,041 \n",
       "113  115       ...           $471,222,889 \n",
       "114  116       ...           $747,862,775 \n",
       "115  117       ...           $353,624,124 \n",
       "116  118       ...         $1,023,784,195 \n",
       "\n",
       "[10 rows x 58 columns]"
      ]
     },
     "execution_count": 21,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=1, ski_footer=10)\n",
    "df.tail(10)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "dd6dc7c428fd94d9dcae0e284ead219aa5fc9f4f"
   },
   "source": [
    "### 8.Choose Columns\n",
    "* we can choose column from the excel file"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 22,
   "metadata": {
    "_uuid": "c278735e5e7dba13cfd6fe404a0bf1f140183dfd"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title  Rating\n",
       "0  1  12 Years a Slave (2013)     8.1\n",
       "1  2         127 Hours (2010)     7.6\n",
       "2  3             50/50 (2011)     7.7\n",
       "3  4        About Time (2013)     7.8\n",
       "4  5             Amour (2012)     7.9"
      ]
     },
     "execution_count": 22,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name= 0, usecols=2)\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "de629441fc6b8e92d4fbfaebc65174af631ddf21"
   },
   "source": [
    "### 9.Column Names"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 23,
   "metadata": {
    "_uuid": "51f7cc408c97eeec0870ebb60b97d530c0267b64"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title  Rating\n",
       "0  1  12 Years a Slave (2013)     8.1\n",
       "1  2         127 Hours (2010)     7.6\n",
       "2  3             50/50 (2011)     7.7\n",
       "3  4        About Time (2013)     7.8\n",
       "4  5             Amour (2012)     7.9"
      ]
     },
     "execution_count": 23,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=0, usecols = 2, names=['X','Title', 'Rating'], )\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "55e85355e5ff998b77506eacdde582fa7b382d4c"
   },
   "source": [
    "### 10.Set an Index while reading data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 24,
   "metadata": {
    "_uuid": "692892ef2ab621063fd916016da68fb7f91b4b0c"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Title</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>12 Years a Slave (2013)</th>\n",
       "      <td>1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>127 Hours (2010)</th>\n",
       "      <td>2</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>50/50 (2011)</th>\n",
       "      <td>3</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>About Time (2013)</th>\n",
       "      <td>4</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Amour (2012)</th>\n",
       "      <td>5</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                         X  Rating      ...          Foreign      Worldwide\n",
       "Title                                   ...                                \n",
       "12 Years a Slave (2013)  1     8.1      ...        131061209  $187,733,202 \n",
       "127 Hours (2010)         2     7.6      ...         42403567   $60,738,797 \n",
       "50/50 (2011)             3     7.7      ...          4173591   $39,187,783 \n",
       "About Time (2013)        4     7.8      ...         71777528   $87,100,449 \n",
       "Amour (2012)             5     7.9      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 57 columns]"
      ]
     },
     "execution_count": 24,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=0, index_col='Title')\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8a1013a08809f0470c29de85fe93fc57c6bd04ec"
   },
   "source": [
    "### 11.Handle missing data while reading"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 25,
   "metadata": {
    "_uuid": "2cc691a70994aeb7563ace2057bbcca177cc0bc7"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 25,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_excel('../input/datasetsdifferent-format/IMDB.xlsx', sheet_name=0, na_values=['nan'])  # read the first sheet; treat the string 'nan' as a missing value\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "45ea1503c41eb295b902791fbeff5fbee68f6512"
   },
   "source": [
    "> ### 2.4. Reading data from other popular formats <a id=\"24\"></a>\n",
    "### 1. Reading JSON data into pandas"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 26,
   "metadata": {
    "_uuid": "a2ebcc0cc977623af067c778fbc7c4434b6c4ffa"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Budget</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Rating</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>Title</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Worldwide</th>\n",
       "      <th>X</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>20000000</td>\n",
       "      <td>4739</td>\n",
       "      <td>1785</td>\n",
       "      <td>2420</td>\n",
       "      <td>4021</td>\n",
       "      <td>9603</td>\n",
       "      <td>27231</td>\n",
       "      <td>83070</td>\n",
       "      <td>161460</td>\n",
       "      <td>126223</td>\n",
       "      <td>75556</td>\n",
       "      <td>664</td>\n",
       "      <td>200910</td>\n",
       "      <td>45301</td>\n",
       "      <td>153669</td>\n",
       "      <td>138762</td>\n",
       "      <td>23895</td>\n",
       "      <td>112943</td>\n",
       "      <td>29252</td>\n",
       "      <td>5726</td>\n",
       "      <td>23072</td>\n",
       "      <td>82012</td>\n",
       "      <td>313823</td>\n",
       "      <td>1837</td>\n",
       "      <td>457</td>\n",
       "      <td>1363</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96</td>\n",
       "      <td>8.1</td>\n",
       "      <td>134 min</td>\n",
       "      <td>12 Years a Slave�(2013)</td>\n",
       "      <td>496092</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8</td>\n",
       "      <td>$187,733,202</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>18000000</td>\n",
       "      <td>2059</td>\n",
       "      <td>1161</td>\n",
       "      <td>1930</td>\n",
       "      <td>3796</td>\n",
       "      <td>9403</td>\n",
       "      <td>28394</td>\n",
       "      <td>78451</td>\n",
       "      <td>98845</td>\n",
       "      <td>44110</td>\n",
       "      <td>28939</td>\n",
       "      <td>649</td>\n",
       "      <td>133336</td>\n",
       "      <td>26152</td>\n",
       "      <td>106007</td>\n",
       "      <td>102120</td>\n",
       "      <td>14304</td>\n",
       "      <td>86609</td>\n",
       "      <td>14895</td>\n",
       "      <td>2261</td>\n",
       "      <td>12400</td>\n",
       "      <td>44600</td>\n",
       "      <td>212866</td>\n",
       "      <td>745</td>\n",
       "      <td>170</td>\n",
       "      <td>567</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82</td>\n",
       "      <td>7.6</td>\n",
       "      <td>94 min</td>\n",
       "      <td>127 Hours�(2010)</td>\n",
       "      <td>297075</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$60,738,797</td>\n",
       "      <td>2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>8000000</td>\n",
       "      <td>1202</td>\n",
       "      <td>634</td>\n",
       "      <td>1109</td>\n",
       "      <td>2381</td>\n",
       "      <td>7545</td>\n",
       "      <td>24252</td>\n",
       "      <td>71485</td>\n",
       "      <td>99524</td>\n",
       "      <td>47501</td>\n",
       "      <td>28304</td>\n",
       "      <td>555</td>\n",
       "      <td>132350</td>\n",
       "      <td>34765</td>\n",
       "      <td>96269</td>\n",
       "      <td>94745</td>\n",
       "      <td>18163</td>\n",
       "      <td>75394</td>\n",
       "      <td>12829</td>\n",
       "      <td>2681</td>\n",
       "      <td>9912</td>\n",
       "      <td>58348</td>\n",
       "      <td>188925</td>\n",
       "      <td>506</td>\n",
       "      <td>153</td>\n",
       "      <td>348</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72</td>\n",
       "      <td>7.7</td>\n",
       "      <td>100 min</td>\n",
       "      <td>50/50�(2011)</td>\n",
       "      <td>283935</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$39,187,783</td>\n",
       "      <td>3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>12000000</td>\n",
       "      <td>1182</td>\n",
       "      <td>664</td>\n",
       "      <td>1084</td>\n",
       "      <td>2210</td>\n",
       "      <td>5673</td>\n",
       "      <td>16542</td>\n",
       "      <td>45487</td>\n",
       "      <td>70850</td>\n",
       "      <td>43170</td>\n",
       "      <td>38556</td>\n",
       "      <td>475</td>\n",
       "      <td>92940</td>\n",
       "      <td>34126</td>\n",
       "      <td>57778</td>\n",
       "      <td>67477</td>\n",
       "      <td>16222</td>\n",
       "      <td>50212</td>\n",
       "      <td>13973</td>\n",
       "      <td>3026</td>\n",
       "      <td>10690</td>\n",
       "      <td>58098</td>\n",
       "      <td>126718</td>\n",
       "      <td>654</td>\n",
       "      <td>321</td>\n",
       "      <td>325</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NA</td>\n",
       "      <td>7.8</td>\n",
       "      <td>123 min</td>\n",
       "      <td>About Time�(2013)</td>\n",
       "      <td>225412</td>\n",
       "      <td>6.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$87,100,449</td>\n",
       "      <td>4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>8900000</td>\n",
       "      <td>995</td>\n",
       "      <td>534</td>\n",
       "      <td>710</td>\n",
       "      <td>1188</td>\n",
       "      <td>2585</td>\n",
       "      <td>5945</td>\n",
       "      <td>14187</td>\n",
       "      <td>22942</td>\n",
       "      <td>15944</td>\n",
       "      <td>11093</td>\n",
       "      <td>391</td>\n",
       "      <td>28593</td>\n",
       "      <td>8167</td>\n",
       "      <td>20107</td>\n",
       "      <td>28691</td>\n",
       "      <td>6269</td>\n",
       "      <td>21990</td>\n",
       "      <td>7425</td>\n",
       "      <td>1490</td>\n",
       "      <td>5803</td>\n",
       "      <td>16719</td>\n",
       "      <td>49808</td>\n",
       "      <td>121</td>\n",
       "      <td>24</td>\n",
       "      <td>95</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td></td>\n",
       "      <td>94</td>\n",
       "      <td>7.9</td>\n",
       "      <td>127 min</td>\n",
       "      <td>Amour�(2012)</td>\n",
       "      <td>76121</td>\n",
       "      <td>7.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.1</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$19,839,492</td>\n",
       "      <td>5</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "     Budget  CVotes01  CVotes02  CVotes03 ...  VotesUS  VotesnUS     Worldwide  X\n",
       "0  20000000      4739      1785      2420 ...      8.3         8  $187,733,202  1\n",
       "1  18000000      2059      1161      1930 ...      7.7       7.6   $60,738,797  2\n",
       "2   8000000      1202       634      1109 ...      7.9       7.6   $39,187,783  3\n",
       "3  12000000      1182       664      1084 ...      7.8       7.7   $87,100,449  4\n",
       "4   8900000       995       534       710 ...      7.9       7.8   $19,839,492  5\n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 26,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "movies_json = pd.read_json('../input/datasetsdifferent-format/IMDB.json')\n",
    "movies_json.head()"
   ]
  },
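  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "In the `read_json` output above, missing values show up as the literal string `NA` (`MetaCritic`) or as an empty string (`Genre3`) rather than `NaN`. The sketch below shows one possible clean-up after loading. The tiny `records` sample and the names `sample` and `cleaned` are made up for illustration; they are not the contents of `IMDB.json`.\n",
    "\n",
    "```python\n",
    "import json\n",
    "from io import StringIO\n",
    "\n",
    "import numpy as np\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical records that mimic how missing values look in the output above.\n",
    "records = [\n",
    "    {'Title': 'About Time (2013)', 'MetaCritic': 'NA', 'Genre3': 'Fantasy'},\n",
    "    {'Title': 'Amour (2012)', 'MetaCritic': '94', 'Genre3': ''},\n",
    "]\n",
    "\n",
    "# Wrap the JSON text in StringIO so read_json treats it as file content.\n",
    "sample = pd.read_json(StringIO(json.dumps(records)), orient='records')\n",
    "\n",
    "# Turn the placeholder strings into real missing values, then fix the dtype.\n",
    "cleaned = sample.replace({'NA': np.nan, '': np.nan})\n",
    "cleaned['MetaCritic'] = pd.to_numeric(cleaned['MetaCritic'])\n",
    "\n",
    "# Select columns explicitly so the order does not depend on the parser.\n",
    "cleaned = cleaned[['Title', 'MetaCritic', 'Genre3']]\n",
    "print(cleaned)\n",
    "```\n",
    "\n",
    "`pd.to_numeric` is applied after the replacement because a column that mixes `NaN` with digit strings would otherwise keep a non-numeric dtype."
   ]
  },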
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8b4c9ad82bfe20ed21eb28183ffd751f102c9495"
   },
   "source": [
    "### 2. Reading HTML data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 27,
   "metadata": {
    "_uuid": "364096de4343fd462eebf1041823c4a1e432b35e"
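   },
   "outputs": [],
   "source": [
    "# pd.read_html returns a list of DataFrames, one per <table> element found\n",
    "tables = pd.read_html('../input/datasetsdifferent-format/IMDB.html')\n",
    "df = tables[0]  # the first table in the file\n",
    "# df.head()"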
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "5fe67038e4aa9ce51904b41cdb99a72f81632b2d"
   },
   "source": [
    "### 3. Reading a pickle file"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 28,
   "metadata": {
    "_uuid": "584732d7f0660d5d78b22a0a316c75b4b6cb53cd"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96.0</td>\n",
       "      <td>20000000.0</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82.0</td>\n",
       "      <td>18000000.0</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72.0</td>\n",
       "      <td>8000000.0</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NaN</td>\n",
       "      <td>12000000.0</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>NaN</td>\n",
       "      <td>94.0</td>\n",
       "      <td>8900000.0</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                     Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 28,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_pickle('../input/datasetsdifferent-format/IMDB.p')\n",
    "df.head()"
   ]
  },
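  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "A pickle file stores the DataFrame object itself, which is why `MetaCritic` comes back above as a float column with a real `NaN` instead of the `NA` string seen in the JSON output. The round trip below is a minimal sketch of that behaviour. The two-row `frame` and the file name `sample.p` are hypothetical stand-ins, not the `IMDB.p` file. Note that unpickling can execute arbitrary code, so `read_pickle` should only be used on files from a trusted source.\n",
    "\n",
    "```python\n",
    "import os\n",
    "import tempfile\n",
    "\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical two-row frame standing in for the IMDB data.\n",
    "frame = pd.DataFrame({\n",
    "    'Title': ['About Time (2013)', 'Amour (2012)'],\n",
    "    'MetaCritic': [float('nan'), 94.0],\n",
    "    'Budget': [12000000.0, 8900000.0],\n",
    "})\n",
    "\n",
    "# Write to a temporary directory so nothing is left behind on disk.\n",
    "with tempfile.TemporaryDirectory() as tmp_dir:\n",
    "    path = os.path.join(tmp_dir, 'sample.p')\n",
    "    frame.to_pickle(path)\n",
    "    restored = pd.read_pickle(path)\n",
    "\n",
    "# dtypes and missing values should survive the round trip unchanged.\n",
    "print(restored.dtypes)\n",
    "print(restored.equals(frame))\n",
    "```\n",
    "\n",
    "`DataFrame.equals` treats `NaN` values in the same position as equal, which makes it a convenient check for this kind of round trip."
   ]
  },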
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "7348b59b3fd1fada9e0b4b17f0a42ec5ad2dc634"
   },
   "source": [
    "### 4. Reading data from a SQL database"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 29,
   "metadata": {
    "_uuid": "b40768bf9b94c37e8faa41b64c2103bf95989b0d"
   },
   "outputs": [],
   "source": [
    "import sqlite3"
   ]
  },
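  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "With `sqlite3` imported, pandas can read query results straight into a DataFrame. The sketch below is self-contained and assumes an in-memory SQLite database rather than the notebook's own data file. The table name `movies` and the variable names are made up for illustration, and the three rows are copied from the outputs above.\n",
    "\n",
    "```python\n",
    "import sqlite3\n",
    "\n",
    "import pandas as pd\n",
    "\n",
    "# Three rows copied from the output above; the table name 'movies' is made up.\n",
    "sample = pd.DataFrame({\n",
    "    'Title': ['12 Years a Slave (2013)', 'About Time (2013)', 'Amour (2012)'],\n",
    "    'Rating': [8.1, 7.8, 7.9],\n",
    "})\n",
    "\n",
    "# An in-memory SQLite database, so no file is needed.\n",
    "conn = sqlite3.connect(':memory:')\n",
    "sample.to_sql('movies', conn, index=False)\n",
    "\n",
    "# Let the database filter and sort; '?' is SQLite's parameter placeholder.\n",
    "query = 'SELECT Title, Rating FROM movies WHERE Rating >= ? ORDER BY Rating DESC'\n",
    "top_rated = pd.read_sql_query(query, conn, params=(7.9,))\n",
    "conn.close()\n",
    "\n",
    "print(top_rated)\n",
    "```\n",
    "\n",
    "Passing values through `params` instead of formatting them into the query string leaves quoting to the database driver."
   ]
  },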
  {
   "cell_type": "code",
   "execution_count": 30,
   "metadata": {
    "_uuid": "815927487c93eea72212c13e5dd6c5c33fae7871"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>X</th>\n",
       "      <th>Title</th>\n",
       "      <th>Rating</th>\n",
       "      <th>TotalVotes</th>\n",
       "      <th>Genre1</th>\n",
       "      <th>Genre2</th>\n",
       "      <th>Genre3</th>\n",
       "      <th>MetaCritic</th>\n",
       "      <th>Budget</th>\n",
       "      <th>Runtime</th>\n",
       "      <th>CVotes10</th>\n",
       "      <th>CVotes09</th>\n",
       "      <th>CVotes08</th>\n",
       "      <th>CVotes07</th>\n",
       "      <th>CVotes06</th>\n",
       "      <th>CVotes05</th>\n",
       "      <th>CVotes04</th>\n",
       "      <th>CVotes03</th>\n",
       "      <th>CVotes02</th>\n",
       "      <th>CVotes01</th>\n",
       "      <th>CVotesMale</th>\n",
       "      <th>CVotesFemale</th>\n",
       "      <th>CVotesU18</th>\n",
       "      <th>CVotesU18M</th>\n",
       "      <th>CVotesU18F</th>\n",
       "      <th>CVotes1829</th>\n",
       "      <th>CVotes1829M</th>\n",
       "      <th>CVotes1829F</th>\n",
       "      <th>CVotes3044</th>\n",
       "      <th>CVotes3044M</th>\n",
       "      <th>CVotes3044F</th>\n",
       "      <th>CVotes45A</th>\n",
       "      <th>CVotes45AM</th>\n",
       "      <th>CVotes45AF</th>\n",
       "      <th>CVotes1000</th>\n",
       "      <th>CVotesUS</th>\n",
       "      <th>CVotesnUS</th>\n",
       "      <th>VotesM</th>\n",
       "      <th>VotesF</th>\n",
       "      <th>VotesU18</th>\n",
       "      <th>VotesU18M</th>\n",
       "      <th>VotesU18F</th>\n",
       "      <th>Votes1829</th>\n",
       "      <th>Votes1829M</th>\n",
       "      <th>Votes1829F</th>\n",
       "      <th>Votes3044</th>\n",
       "      <th>Votes3044M</th>\n",
       "      <th>Votes3044F</th>\n",
       "      <th>Votes45A</th>\n",
       "      <th>Votes45AM</th>\n",
       "      <th>Votes45AF</th>\n",
       "      <th>VotesIMDB</th>\n",
       "      <th>Votes1000</th>\n",
       "      <th>VotesUS</th>\n",
       "      <th>VotesnUS</th>\n",
       "      <th>Domestic</th>\n",
       "      <th>Foreign</th>\n",
       "      <th>Worldwide</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>12 Years a Slave (2013)</td>\n",
       "      <td>8.1</td>\n",
       "      <td>496092</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>History</td>\n",
       "      <td>96</td>\n",
       "      <td>20000000</td>\n",
       "      <td>134 min</td>\n",
       "      <td>75556</td>\n",
       "      <td>126223</td>\n",
       "      <td>161460</td>\n",
       "      <td>83070</td>\n",
       "      <td>27231</td>\n",
       "      <td>9603</td>\n",
       "      <td>4021</td>\n",
       "      <td>2420</td>\n",
       "      <td>1785</td>\n",
       "      <td>4739</td>\n",
       "      <td>313823</td>\n",
       "      <td>82012</td>\n",
       "      <td>1837</td>\n",
       "      <td>1363</td>\n",
       "      <td>457</td>\n",
       "      <td>200910</td>\n",
       "      <td>153669</td>\n",
       "      <td>45301</td>\n",
       "      <td>138762</td>\n",
       "      <td>112943</td>\n",
       "      <td>23895</td>\n",
       "      <td>29252</td>\n",
       "      <td>23072</td>\n",
       "      <td>5726</td>\n",
       "      <td>664</td>\n",
       "      <td>53328</td>\n",
       "      <td>224519</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.4</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8</td>\n",
       "      <td>$56,671,993</td>\n",
       "      <td>131061209</td>\n",
       "      <td>$187,733,202</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>127 Hours (2010)</td>\n",
       "      <td>7.6</td>\n",
       "      <td>297075</td>\n",
       "      <td>Adventure</td>\n",
       "      <td>Biography</td>\n",
       "      <td>Drama</td>\n",
       "      <td>82</td>\n",
       "      <td>18000000</td>\n",
       "      <td>94 min</td>\n",
       "      <td>28939</td>\n",
       "      <td>44110</td>\n",
       "      <td>98845</td>\n",
       "      <td>78451</td>\n",
       "      <td>28394</td>\n",
       "      <td>9403</td>\n",
       "      <td>3796</td>\n",
       "      <td>1930</td>\n",
       "      <td>1161</td>\n",
       "      <td>2059</td>\n",
       "      <td>212866</td>\n",
       "      <td>44600</td>\n",
       "      <td>745</td>\n",
       "      <td>567</td>\n",
       "      <td>170</td>\n",
       "      <td>133336</td>\n",
       "      <td>106007</td>\n",
       "      <td>26152</td>\n",
       "      <td>102120</td>\n",
       "      <td>86609</td>\n",
       "      <td>14304</td>\n",
       "      <td>14895</td>\n",
       "      <td>12400</td>\n",
       "      <td>2261</td>\n",
       "      <td>649</td>\n",
       "      <td>38478</td>\n",
       "      <td>169745</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.3</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$18,335,230</td>\n",
       "      <td>42403567</td>\n",
       "      <td>$60,738,797</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>50/50 (2011)</td>\n",
       "      <td>7.7</td>\n",
       "      <td>283935</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td>72</td>\n",
       "      <td>8000000</td>\n",
       "      <td>100 min</td>\n",
       "      <td>28304</td>\n",
       "      <td>47501</td>\n",
       "      <td>99524</td>\n",
       "      <td>71485</td>\n",
       "      <td>24252</td>\n",
       "      <td>7545</td>\n",
       "      <td>2381</td>\n",
       "      <td>1109</td>\n",
       "      <td>634</td>\n",
       "      <td>1202</td>\n",
       "      <td>188925</td>\n",
       "      <td>58348</td>\n",
       "      <td>506</td>\n",
       "      <td>348</td>\n",
       "      <td>153</td>\n",
       "      <td>132350</td>\n",
       "      <td>96269</td>\n",
       "      <td>34765</td>\n",
       "      <td>94745</td>\n",
       "      <td>75394</td>\n",
       "      <td>18163</td>\n",
       "      <td>12829</td>\n",
       "      <td>9912</td>\n",
       "      <td>2681</td>\n",
       "      <td>555</td>\n",
       "      <td>46947</td>\n",
       "      <td>147849</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.4</td>\n",
       "      <td>7.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.6</td>\n",
       "      <td>$35,014,192</td>\n",
       "      <td>4173591</td>\n",
       "      <td>$39,187,783</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>About Time (2013)</td>\n",
       "      <td>7.8</td>\n",
       "      <td>225412</td>\n",
       "      <td>Comedy</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Fantasy</td>\n",
       "      <td>NA</td>\n",
       "      <td>12000000</td>\n",
       "      <td>123 min</td>\n",
       "      <td>38556</td>\n",
       "      <td>43170</td>\n",
       "      <td>70850</td>\n",
       "      <td>45487</td>\n",
       "      <td>16542</td>\n",
       "      <td>5673</td>\n",
       "      <td>2210</td>\n",
       "      <td>1084</td>\n",
       "      <td>664</td>\n",
       "      <td>1182</td>\n",
       "      <td>126718</td>\n",
       "      <td>58098</td>\n",
       "      <td>654</td>\n",
       "      <td>325</td>\n",
       "      <td>321</td>\n",
       "      <td>92940</td>\n",
       "      <td>57778</td>\n",
       "      <td>34126</td>\n",
       "      <td>67477</td>\n",
       "      <td>50212</td>\n",
       "      <td>16222</td>\n",
       "      <td>13973</td>\n",
       "      <td>10690</td>\n",
       "      <td>3026</td>\n",
       "      <td>475</td>\n",
       "      <td>20450</td>\n",
       "      <td>111670</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.2</td>\n",
       "      <td>8.1</td>\n",
       "      <td>8.3</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.6</td>\n",
       "      <td>7.5</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>6.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.7</td>\n",
       "      <td>$15,322,921</td>\n",
       "      <td>71777528</td>\n",
       "      <td>$87,100,449</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>Amour (2012)</td>\n",
       "      <td>7.9</td>\n",
       "      <td>76121</td>\n",
       "      <td>Drama</td>\n",
       "      <td>Romance</td>\n",
       "      <td></td>\n",
       "      <td>94</td>\n",
       "      <td>8900000</td>\n",
       "      <td>127 min</td>\n",
       "      <td>11093</td>\n",
       "      <td>15944</td>\n",
       "      <td>22942</td>\n",
       "      <td>14187</td>\n",
       "      <td>5945</td>\n",
       "      <td>2585</td>\n",
       "      <td>1188</td>\n",
       "      <td>710</td>\n",
       "      <td>534</td>\n",
       "      <td>995</td>\n",
       "      <td>49808</td>\n",
       "      <td>16719</td>\n",
       "      <td>121</td>\n",
       "      <td>95</td>\n",
       "      <td>24</td>\n",
       "      <td>28593</td>\n",
       "      <td>20107</td>\n",
       "      <td>8167</td>\n",
       "      <td>28691</td>\n",
       "      <td>21990</td>\n",
       "      <td>6269</td>\n",
       "      <td>7425</td>\n",
       "      <td>5803</td>\n",
       "      <td>1490</td>\n",
       "      <td>391</td>\n",
       "      <td>7959</td>\n",
       "      <td>46138</td>\n",
       "      <td>7.8</td>\n",
       "      <td>7.9</td>\n",
       "      <td>8.6</td>\n",
       "      <td>8.7</td>\n",
       "      <td>8.5</td>\n",
       "      <td>8.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.7</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>8.1</td>\n",
       "      <td>6.6</td>\n",
       "      <td>7.2</td>\n",
       "      <td>7.9</td>\n",
       "      <td>7.8</td>\n",
       "      <td>$6,739,492</td>\n",
       "      <td>13100000</td>\n",
       "      <td>$19,839,492</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   X                    Title      ...          Foreign      Worldwide\n",
       "0  1  12 Years a Slave (2013)      ...        131061209  $187,733,202 \n",
       "1  2         127 Hours (2010)      ...         42403567   $60,738,797 \n",
       "2  3             50/50 (2011)      ...          4173591   $39,187,783 \n",
       "3  4        About Time (2013)      ...         71777528   $87,100,449 \n",
       "4  5             Amour (2012)      ...         13100000   $19,839,492 \n",
       "\n",
       "[5 rows x 58 columns]"
      ]
     },
     "execution_count": 30,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "conn = sqlite3.connect(\"../input/datasetsdifferent-format/IMDB.sqlite\")\n",
    "df = pd.read_sql_query(\"SELECT * FROM IMDB;\", conn)\n",
    "df.head()"
   ]
  },
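  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The query above reads a whole table. A minimal sketch of the same `read_sql_query` pattern with a filter is shown below. It assumes a hypothetical in-memory database and a made-up `movies` table (neither is part of the dataset), and passes the threshold through `params` rather than formatting it into the SQL string.\n",
    "\n",
    "```python\n",
    "import sqlite3\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical in-memory database standing in for IMDB.sqlite\n",
    "conn = sqlite3.connect(':memory:')\n",
    "conn.execute('CREATE TABLE movies (Title TEXT, Rating REAL)')\n",
    "conn.executemany(\n",
    "    'INSERT INTO movies VALUES (?, ?)',\n",
    "    [('Film A', 8.1), ('Film B', 7.6), ('Film C', 7.9)],\n",
    ")\n",
    "conn.commit()\n",
    "\n",
    "# Bind the threshold with a ? placeholder instead of building the SQL string by hand\n",
    "top_rated = pd.read_sql_query(\n",
    "    'SELECT Title, Rating FROM movies WHERE Rating > ? ORDER BY Rating DESC',\n",
    "    conn,\n",
    "    params=(7.8,),\n",
    ")\n",
    "conn.close()  # release the connection once the data is in the DataFrame\n",
    "print(top_rated)\n",
    "```\n",
    "\n",
    "Filtering in SQL keeps the DataFrame small when only part of a large table is needed."
   ]
  },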
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "fbe7564a083556b731fd8b0ccc972e66d7ff2315"
   },
   "source": [
    "### 5.Read data from clipboard"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 31,
   "metadata": {
    "_uuid": "0f56f90b33ffaee4a950199d8286f12a88fb4388"
   },
   "outputs": [],
   "source": [
    "# df = pd.read_clipboard()\n",
    "# # df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "17eb6411108be9e6c96fe4983826509020d66d57"
   },
   "source": [
    "# 3.Apply multiple filter criteria to a pandas DataFrame<a id=\"3\"></a>\n",
    "---\n",
    " [**Go to top**](#00)\n",
    " \n",
    " ![](https://docs.microsoft.com/en-us/dynamics365/customer-engagement/social-engagement/media/data-set-concept-social-engagement.png)\n",
    " ### In this section, you will learn\n",
    "1. Filter using `&` **AND Operator.**\n",
    "1. Filter using `|`  **OR Operator.**\n",
    "1. Filtering using *`isin`* **method**\n",
    "1. Using ***`isin` method*** with multiple conditions\n",
    " \n",
    "###  1.Read in the dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 32,
   "metadata": {
    "_uuid": "c5a1d7b807ffd9a07f768827d83025f14db09d24"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 32,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow = pd.read_table('../input/datasetsdifferent-format/data-zillow.csv', sep=',')\n",
    "data_zillow.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "413795c3fa6a2947e100b90001d2f2ac9178c651"
   },
   "source": [
    "### 2. FIlter Based on Multiple Condition"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 33,
   "metadata": {
    "_uuid": "1b41aa04401ea26bca1613144a5642ca47274793"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1132</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>18375</td>\n",
       "      <td>Great Neck</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Nassau</td>\n",
       "      <td>1132</td>\n",
       "      <td>1235800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2405</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>54333</td>\n",
       "      <td>Scarsdale</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Westchester</td>\n",
       "      <td>2405</td>\n",
       "      <td>1468100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2619</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>47495</td>\n",
       "      <td>Rye</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Westchester</td>\n",
       "      <td>2619</td>\n",
       "      <td>1736400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3032</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>25725</td>\n",
       "      <td>Manhasset</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Nassau</td>\n",
       "      <td>3032</td>\n",
       "      <td>1483400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3064</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>18955</td>\n",
       "      <td>Larchmont</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Westchester</td>\n",
       "      <td>3064</td>\n",
       "      <td>1052200</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "            Date  RegionID  RegionName   ...          County SizeRank     Zhvi\n",
       "1132  2017-05-31     18375  Great Neck   ...          Nassau     1132  1235800\n",
       "2405  2017-05-31     54333   Scarsdale   ...     Westchester     2405  1468100\n",
       "2619  2017-05-31     47495         Rye   ...     Westchester     2619  1736400\n",
       "3032  2017-05-31     25725   Manhasset   ...          Nassau     3032  1483400\n",
       "3064  2017-05-31     18955   Larchmont   ...     Westchester     3064  1052200\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 33,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow[(data_zillow['Zhvi'] > 1000000) & (data_zillow['State'] == 'NY')].head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 34,
   "metadata": {
    "_uuid": "955854dc68d314209ad4d77486b8d765521fcb2d"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>54296</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>6</td>\n",
       "      <td>572100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>8</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>33839</td>\n",
       "      <td>San Jose</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Jose</td>\n",
       "      <td>Santa Clara</td>\n",
       "      <td>8</td>\n",
       "      <td>877400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>20330</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>10</td>\n",
       "      <td>1194300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "          Date  RegionID   ...    SizeRank     Zhvi\n",
       "0   2017-05-31      6181   ...           0   672400\n",
       "1   2017-05-31     12447   ...           1   629900\n",
       "6   2017-05-31     54296   ...           6   572100\n",
       "8   2017-05-31     33839   ...           8   877400\n",
       "10  2017-05-31     20330   ...          10  1194300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 34,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow[((data_zillow['State'] == 'CA') | (data_zillow['State'] == 'NY'))].head()"
   ]
  },
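  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The two filters above can be tried on a small frame to see exactly which rows each mask keeps. The sketch below uses a made-up `homes` DataFrame (an assumption for illustration, not the Zillow data) and also shows `~`, which negates a mask.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Toy stand-in for data_zillow; the values are made up for illustration\n",
    "homes = pd.DataFrame({\n",
    "    'State': ['NY', 'CA', 'NY', 'IL'],\n",
    "    'Zhvi': [1500000, 900000, 400000, 220000],\n",
    "})\n",
    "\n",
    "# Each comparison needs its own parentheses: & and | bind tighter than > and ==\n",
    "expensive_ny = homes[(homes['Zhvi'] > 1000000) & (homes['State'] == 'NY')]\n",
    "ny_or_ca = homes[(homes['State'] == 'NY') | (homes['State'] == 'CA')]\n",
    "\n",
    "# ~ flips a boolean mask: every row that is not in NY\n",
    "not_ny = homes[~(homes['State'] == 'NY')]\n",
    "```\n",
    "\n",
    "Python's `and` and `or` do not work element-wise on a Series, which is why the bitwise operators are used here."
   ]
  },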
  {
   "cell_type": "code",
   "execution_count": 35,
   "metadata": {
    "_uuid": "56e2d0101d7f1561c1d6ce99aa1b5b5a24452ab2"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>54296</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>6</td>\n",
       "      <td>572100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>63</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12970</td>\n",
       "      <td>Newark</td>\n",
       "      <td>NJ</td>\n",
       "      <td>New York</td>\n",
       "      <td>Essex</td>\n",
       "      <td>63</td>\n",
       "      <td>232800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>72</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>25320</td>\n",
       "      <td>Jersey City</td>\n",
       "      <td>NJ</td>\n",
       "      <td>New York</td>\n",
       "      <td>Hudson</td>\n",
       "      <td>72</td>\n",
       "      <td>380000</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>85</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>51405</td>\n",
       "      <td>Chula Vista</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>85</td>\n",
       "      <td>486900</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "          Date  RegionID   RegionName   ...       County SizeRank    Zhvi\n",
       "0   2017-05-31      6181     New York   ...       Queens        0  672400\n",
       "6   2017-05-31     54296    San Diego   ...    San Diego        6  572100\n",
       "63  2017-05-31     12970       Newark   ...        Essex       63  232800\n",
       "72  2017-05-31     25320  Jersey City   ...       Hudson       72  380000\n",
       "85  2017-05-31     51405  Chula Vista   ...    San Diego       85  486900\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 35,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "zillow_filter = data_zillow['Metro'].isin(['New York','San Diego'])\n",
    "data_zillow[zillow_filter].head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 36,
   "metadata": {
    "_uuid": "24f583af9ea5055c9e8b1d0a20e895419fe3e94a"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>CA</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "  Date  RegionID RegionName State Metro County  SizeRank  Zhvi\n",
       "0  NaN       NaN        NaN   NaN   NaN    NaN       NaN   NaN\n",
       "1  NaN       NaN        NaN    CA   NaN    NaN       NaN   NaN\n",
       "2  NaN       NaN        NaN   NaN   NaN    NaN       NaN   NaN\n",
       "3  NaN       NaN        NaN   NaN   NaN    NaN       NaN   NaN\n",
       "4  NaN       NaN        NaN   NaN   NaN    NaN       NaN   NaN"
      ]
     },
     "execution_count": 36,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "zillow_filter1 = data_zillow.isin({'State': ['CA'], 'Metro': ['San Francisco']})\n",
    "data_zillow[zillow_filter1].head()"
   ]
  },
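  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The output above is mostly `NaN` because `DataFrame.isin` with a dict returns a boolean DataFrame, and indexing with a boolean DataFrame masks individual cells rather than selecting rows. One way to keep only the rows that satisfy every condition is sketched below. It uses a made-up `homes` frame and a hypothetical `criteria` dict, so treat it as an illustration rather than the notebook's own method.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Toy stand-in for data_zillow; the values are made up for illustration\n",
    "homes = pd.DataFrame({\n",
    "    'RegionName': ['San Francisco', 'Los Angeles', 'New York'],\n",
    "    'State': ['CA', 'CA', 'NY'],\n",
    "    'Metro': ['San Francisco', 'Los Angeles-Long Beach-Anaheim', 'New York'],\n",
    "})\n",
    "criteria = {'State': ['CA'], 'Metro': ['San Francisco']}\n",
    "\n",
    "# Check only the columns named in the dict, then require every check to pass per row\n",
    "row_mask = homes[list(criteria)].isin(criteria).all(axis=1)\n",
    "sf_rows = homes[row_mask]\n",
    "print(sf_rows)\n",
    "```\n",
    "\n",
    "Restricting to `list(criteria)` first matters: columns missing from the dict are all `False` in the `isin` result, so calling `.all(axis=1)` on the full frame would reject every row."
   ]
  },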
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "ea28db8b7db9d86d9f80423ed46e840f397d166a"
   },
   "source": [
    "# 4. Changing the datatype of a Pandas Series <a id=\"4\"></a>\n",
    "---\n",
    "[**Go to Top**](#00)\n",
    "\n",
    "![](https://cdn-images-1.medium.com/max/1600/1*oErPCXv1PFcuuizXqGEEbw.png)\n",
    "\n",
    "### In this section you will learn\n",
    "1. Changing a column's data type from int to float\n",
    "2. Changing the data type while reading data\n",
    "3. Converting a string column to datetime\n",
    "\n",
    "\n",
    "#### 1. Read Dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 37,
   "metadata": {
    "_uuid": "93a0db48fed56c47d60a2e9057fee6d7aa6211c9"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 37,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow = pd.read_table('../input/datasetsdifferent-format/data-zillow.csv', sep=',')\n",
    "data_zillow.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "3087f29893ef44d873e10f5fa5c687170daf91e7"
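   },
   "source": [
    "#### 2. Changing a column's data type from int to float\n",
    "* `astype()` returns a converted copy of the Series, so the result is assigned back to the column."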
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 38,
   "metadata": {
    "_uuid": "dc65e1403cd5ef423af7e4246a2b6f00dc9f2430"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Date          object\n",
       "RegionID       int64\n",
       "RegionName    object\n",
       "State         object\n",
       "Metro         object\n",
       "County        object\n",
       "SizeRank       int64\n",
       "Zhvi           int64\n",
       "dtype: object"
      ]
     },
     "execution_count": 38,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.dtypes"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 39,
   "metadata": {
    "_uuid": "fbf40f0362b01c6dd98e496b2fef071ca9655166"
   },
   "outputs": [],
   "source": [
    "data_zillow['Zhvi'] = data_zillow.Zhvi.astype(float)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 40,
   "metadata": {
    "_uuid": "e24a30d90c5426575338eb43adc16b0e55f1cb1e"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Date           object\n",
       "RegionID        int64\n",
       "RegionName     object\n",
       "State          object\n",
       "Metro          object\n",
       "County         object\n",
       "SizeRank        int64\n",
       "Zhvi          float64\n",
       "dtype: object"
      ]
     },
     "execution_count": 40,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.dtypes"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "37d880da6ffad779f60fd944081adc25fc86e78f"
   },
   "source": [
    "#### 3. Changing the data type while reading data\n",
    "* Pass the `dtype` parameter to the reading function (here `pd.read_csv`) to set the data type of any column while the file is loaded, as in the example below."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 41,
   "metadata": {
    "_uuid": "b1fcbfadb1b5d29e9e7eb78c444670c89d8dfb04"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Date           object\n",
       "RegionID        int64\n",
       "RegionName     object\n",
       "State          object\n",
       "Metro          object\n",
       "County         object\n",
       "SizeRank        int64\n",
       "Zhvi          float64\n",
       "dtype: object"
      ]
     },
     "execution_count": 41,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow1 = pd.read_csv('../input/datasetsdifferent-format/data-zillow.csv', sep=',', dtype={'Zhvi':float})\n",
    "data_zillow1.dtypes"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "20c7cc9721e0dcd1f892966016ee83e9ff281e6d"
   },
   "source": [
    "#### 4. Converting string to datetime\n",
    "* The `Date` column is read as a string (`object` dtype); `pd.to_datetime()` converts it to `datetime64`."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 42,
   "metadata": {
    "_uuid": "e0627d753f78ab8b0c0863de30984639b9b79ca3"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0   2017-05-31\n",
       "1   2017-05-31\n",
       "2   2017-05-31\n",
       "3   2017-05-31\n",
       "4   2017-05-31\n",
       "Name: Date, dtype: datetime64[ns]"
      ]
     },
     "execution_count": 42,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "pd.to_datetime(data_zillow1.Date,infer_datetime_format=True).head()"
   ]
  },
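  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Note that `infer_datetime_format` is deprecated in recent pandas releases (2.0 and later), where a consistent format is inferred by default. A minimal sketch, assuming the dates are ISO-style strings like those in the `Date` column, is to pass an explicit `format` and use `errors='coerce'` so that unparseable values become `NaT` instead of raising. The `raw` Series is made up for illustration.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical input: two valid dates and one bad value\n",
    "raw = pd.Series(['2017-05-31', '2017-06-30', 'not a date'])\n",
    "\n",
    "# Explicit format; invalid strings become NaT instead of raising an error\n",
    "parsed = pd.to_datetime(raw, format='%Y-%m-%d', errors='coerce')\n",
    "print(parsed)\n",
    "```\n",
    "\n",
    "To keep the converted values, assign the result back to the column; `to_datetime` does not modify its input."
   ]
  },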
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8a447e68c5abd9d1b2158fa2aa651b9606894d2d"
   },
   "source": [
    "# 5. Filter rows of a pandas DataFrame by column value <a id=\"5\"></a>\n",
    "---\n",
    " [**Go to top**](#00)\n",
    "\n",
    "![](http://104.236.88.249/wp-content/uploads/2016/10/Pandas-selections-and-indexing.png)\n",
    "\n",
    "### In this section, you will learn\n",
    "1. Selecting columns by label with `filter()`\n",
    "2. Selecting columns by regular expression with `filter()`\n",
    "3. Filtering rows using boolean indexing\n",
    "4. An alternative way to filter rows\n",
    "\n",
    "#### 1. Read Dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 43,
   "metadata": {
    "_uuid": "c6d6c02657f0d6ae2603c6b18fb313176b831cc4"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 43,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_table('../input/datasetsdifferent-format/data-zillow.csv', sep=',')\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "f74b48baf55db3db6dfdbb2cecb212b53cf86810"
   },
   "source": [
    "#### 2. Filter columns using `filter()`\n",
    "* `filter(items=[...])` keeps only the columns whose labels are listed.\n",
    "* `filter()` works on labels, not on the values stored in the rows; filtering rows by value is done with boolean indexing."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 44,
   "metadata": {
    "_uuid": "8fa14aea076261376b133eb4069bae03b365b24c"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>NV</td>\n",
       "      <td>Las Vegas</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "  State                           Metro\n",
       "0    NY                        New York\n",
       "1    CA  Los Angeles-Long Beach-Anaheim\n",
       "2    IL                         Chicago\n",
       "3    PA                    Philadelphia\n",
       "4    AZ                         Phoenix\n",
       "5    NV                       Las Vegas"
      ]
     },
     "execution_count": 44,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "filtered_data = data.filter(items=['State', 'Metro'])\n",
    "filtered_data.head(6)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "2e093e9211d2739bdcb5300acda873bb52930c6f"
   },
   "source": [
    "#### 3. Filter columns by regular expression using `filter()`\n",
    "* With `regex='Region'` and `axis=1`, every column whose label contains `Region` is kept."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 45,
   "metadata": {
    "_uuid": "8c35fce48a46ee20f3f226e5f6120772b7e9d369"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   RegionID    RegionName\n",
       "0      6181      New York\n",
       "1     12447   Los Angeles\n",
       "2     17426       Chicago\n",
       "3     13271  Philadelphia\n",
       "4     40326       Phoenix"
      ]
     },
     "execution_count": 45,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "filtered_data = data.filter(regex='Region', axis=1)\n",
    "filtered_data.head()"
   ]
  },
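  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "`filter()` offers three mutually exclusive ways to match labels: `items`, `like`, and `regex`. A short sketch on a made-up frame (column names borrowed from the Zillow data, values hypothetical) compares them:\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy frame; only the column labels matter here\n",
    "toy = pd.DataFrame({\n",
    "    'RegionID': [1, 2],\n",
    "    'RegionName': ['A', 'B'],\n",
    "    'State': ['CA', 'NY'],\n",
    "    'SizeRank': [0, 1],\n",
    "})\n",
    "\n",
    "by_items = toy.filter(items=['State', 'SizeRank'])  # exact labels\n",
    "by_like = toy.filter(like='Region')                  # labels containing a substring\n",
    "by_regex = toy.filter(regex='^S')                    # labels starting with S\n",
    "print(list(by_items.columns), list(by_like.columns), list(by_regex.columns))\n",
    "```\n",
    "\n",
    "For a DataFrame the default axis of `filter()` is the columns, which is why `axis=1` can be omitted."
   ]
  },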
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "9b46d2af12421b848afdfe9bf8f9455ff7ae1cc2"
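   },
   "source": [
    "#### 4. Filter rows using boolean indexing\n",
    "* Comparing a column with a value returns a boolean Series; indexing the DataFrame with that Series keeps only the rows where it is `True`."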
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 46,
   "metadata": {
    "_uuid": "141892021173267ed9cd2edbe7c0c7231a4f37bd"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0     True\n",
       "1     True\n",
       "2    False\n",
       "3    False\n",
       "4    False\n",
       "Name: Zhvi, dtype: bool"
      ]
     },
     "execution_count": 46,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "price_filter_series = data['Zhvi'] > 500000\n",
    "price_filter_series.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 47,
   "metadata": {
    "_uuid": "7784a5c079b2340c38bcbc77d71d306bdd804e45"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>54296</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>San Diego</td>\n",
       "      <td>6</td>\n",
       "      <td>572100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>8</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>33839</td>\n",
       "      <td>San Jose</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Jose</td>\n",
       "      <td>Santa Clara</td>\n",
       "      <td>8</td>\n",
       "      <td>877400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>20330</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>10</td>\n",
       "      <td>1194300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "          Date  RegionID   ...    SizeRank     Zhvi\n",
       "0   2017-05-31      6181   ...           0   672400\n",
       "1   2017-05-31     12447   ...           1   629900\n",
       "6   2017-05-31     54296   ...           6   572100\n",
       "8   2017-05-31     33839   ...           8   877400\n",
       "10  2017-05-31     20330   ...          10  1194300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 47,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data[price_filter_series].head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "e678f9ce3a5e3ae498c4144bb5ce20bb06bcc2f1"
   },
   "source": [
    "#### 5. An alternative way to filter\n",
    "* The condition can be written inline instead of being stored in a variable first."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 48,
   "metadata": {
    "_uuid": "d6a04e810b53ef415c45b0fc154cfe7d0173bc72"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>10</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>20330</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>10</td>\n",
       "      <td>1194300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>181</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>54626</td>\n",
       "      <td>Sunnyvale</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Jose</td>\n",
       "      <td>Santa Clara</td>\n",
       "      <td>181</td>\n",
       "      <td>1509300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>234</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13713</td>\n",
       "      <td>Santa Clara</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Jose</td>\n",
       "      <td>Santa Clara</td>\n",
       "      <td>234</td>\n",
       "      <td>1071500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>238</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>16992</td>\n",
       "      <td>Berkeley</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>Alameda</td>\n",
       "      <td>238</td>\n",
       "      <td>1102000</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>308</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13699</td>\n",
       "      <td>San Mateo</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>San Mateo</td>\n",
       "      <td>308</td>\n",
       "      <td>1198300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "           Date  RegionID   ...    SizeRank     Zhvi\n",
       "10   2017-05-31     20330   ...          10  1194300\n",
       "181  2017-05-31     54626   ...         181  1509300\n",
       "234  2017-05-31     13713   ...         234  1071500\n",
       "238  2017-05-31     16992   ...         238  1102000\n",
       "308  2017-05-31     13699   ...         308  1198300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 48,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data[data.Zhvi >= 1000000].head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "62bdcf35796634d18a388848e0bd9dd571470e84"
   },
   "source": [
    "# 6. Selecting multiple rows and columns from a pandas DataFrame <a id=\"6\"> </a>\n",
    "---\n",
    " [**Go to top**](#00)\n",
    " \n",
    " \n",
    "### In this section you will learn\n",
    "\n",
    "1. Select single row, single column\n",
    "1. Select single row, multiple columns\n",
    "1. Select single row, all columns\n",
    "1. Select multiple rows, single column\n",
    "1. Select multiple rows and multiple contiguous columns\n",
    "1. Select multiple rows and multiple non-contiguous columns\n",
    "1. Select multiple rows and all columns\n",
    "1. Select non-contiguous rows\n",
    "1. Select rows based on a specific column's value\n",
    "1. Select all rows for a specific column based on the value of another column\n",
    "\n",
    "### 1. Read dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 49,
   "metadata": {
    "_uuid": "62a4ff323d8368f403fdaad9c7adf68f9a5f6360"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 49,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow = pd.read_table('../input/datasetsdifferent-format/data-zillow.csv', sep=',')\n",
    "data_zillow.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0171cdcb1d3eb13990638e38aacaa1972fc1de51"
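   },
   "source": [
    "### 2. Select single row, single column\n",
    "* `loc` takes the row label and the column label; `iloc` takes integer positions, so `'Metro'` corresponds to column position 4."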
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 50,
   "metadata": {
    "_uuid": "2c0a3e9aaba52d4f07dce5bb28945a8e14ece0d0"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'Dallas-Fort Worth'"
      ]
     },
     "execution_count": 50,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[7, 'Metro']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 51,
   "metadata": {
    "_uuid": "beb3d57d8ba0cffbc47aaa6a86cf2c51c0f881b9"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "'Dallas-Fort Worth'"
      ]
     },
     "execution_count": 51,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.iloc[7,4]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "593675eedf7c58de6effa7f28be47b92ed9b55f2"
   },
   "source": [
    "### 3. Select single row, multiple columns\n",
    "* Pass a list of labels to `loc`, or a list of positions to `iloc`, to pick several columns."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 52,
   "metadata": {
    "_uuid": "59da74f120a44865d5cc6bc66e034b4c02d60b2c"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Metro     Dallas-Fort Worth\n",
       "County               Dallas\n",
       "Name: 7, dtype: object"
      ]
     },
     "execution_count": 52,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[7, ['Metro', 'County']]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 53,
   "metadata": {
    "_uuid": "74ce54f0cee66073d0de3b225258160123a9552d"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Metro     Dallas-Fort Worth\n",
       "County               Dallas\n",
       "Name: 7, dtype: object"
      ]
     },
     "execution_count": 53,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.iloc[7, [4,5]]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b77ccba774d2d0197b35164dd332377392ed824a"
   },
   "source": [
    "### 4. Select single row, all columns\n",
    "* A bare `:` in the column position selects every column."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 54,
   "metadata": {
    "_uuid": "76ebe4edd971cb606987152e428a19afd4035172"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Date          2017-05-31\n",
       "RegionID           10221\n",
       "RegionName        Austin\n",
       "State                 TX\n",
       "Metro             Austin\n",
       "County            Travis\n",
       "SizeRank              11\n",
       "Zhvi              321600\n",
       "Name: 11, dtype: object"
      ]
     },
     "execution_count": 54,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[11, :]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "28850cf25cf9a821d30336841253d722a6a4db1e"
   },
   "source": [
    "### 5.Select multiple rows, single column"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 55,
   "metadata": {
    "_uuid": "1e2c1c80b775f7259ca073d8d011bf3d542fa4db"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "101                     Winston-Salem\n",
       "102    Los Angeles-Long Beach-Anaheim\n",
       "103                          Richmond\n",
       "104             Miami-Fort Lauderdale\n",
       "105                           Ventura\n",
       "Name: Metro, dtype: object"
      ]
     },
     "execution_count": 55,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[101:105, 'Metro']"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "d314c190a15e3cf9353fd324602b50c46fa9b66c"
   },
   "source": [
    "### 6.Select multiple rows and multiple contiguous columns\n",
    "\n",
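    "* **In `loc`** we pass row and column labels, and a label slice includes both endpoints, so `201:204` returns rows 201 through 204.\n",
    "* **In `iloc`** we pass integer positions, and a position slice excludes the stop, so `201:205` and `3:6` are needed to return the same rows and columns."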
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 56,
   "metadata": {
    "_uuid": "a705c5d817ed2153211c4a1125eeba7a0b80e338"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>201</th>\n",
       "      <td>OH</td>\n",
       "      <td>Canton</td>\n",
       "      <td>Stark</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>202</th>\n",
       "      <td>LA</td>\n",
       "      <td>New Orleans</td>\n",
       "      <td>Jefferson</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>203</th>\n",
       "      <td>CA</td>\n",
       "      <td>Santa Maria-Santa Barbara</td>\n",
       "      <td>Santa Barbara</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>204</th>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    State                           Metro         County\n",
       "201    OH                          Canton          Stark\n",
       "202    LA                     New Orleans      Jefferson\n",
       "203    CA       Santa Maria-Santa Barbara  Santa Barbara\n",
       "204    CA  Los Angeles-Long Beach-Anaheim    Los Angeles"
      ]
     },
     "execution_count": 56,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[201:204, \"State\":\"County\"]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 57,
   "metadata": {
    "_uuid": "3d1d15375f4ebc1f3fd899ebe55eb1604174d202"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>201</th>\n",
       "      <td>OH</td>\n",
       "      <td>Canton</td>\n",
       "      <td>Stark</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>202</th>\n",
       "      <td>LA</td>\n",
       "      <td>New Orleans</td>\n",
       "      <td>Jefferson</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>203</th>\n",
       "      <td>CA</td>\n",
       "      <td>Santa Maria-Santa Barbara</td>\n",
       "      <td>Santa Barbara</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>204</th>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    State                           Metro         County\n",
       "201    OH                          Canton          Stark\n",
       "202    LA                     New Orleans      Jefferson\n",
       "203    CA       Santa Maria-Santa Barbara  Santa Barbara\n",
       "204    CA  Los Angeles-Long Beach-Anaheim    Los Angeles"
      ]
     },
     "execution_count": 57,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.iloc[201:205, 3:6]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "a8966749d269438d65df99633fed40840507a819"
   },
   "source": [
    "### 7.Select multiple rows and multiple non-contiguous columns"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 58,
   "metadata": {
    "_uuid": "dfb6a748589c4cde8ad1820cbbdfc29f56ee0664"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>201</th>\n",
       "      <td>Canton</td>\n",
       "      <td>OH</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>202</th>\n",
       "      <td>Metairie</td>\n",
       "      <td>LA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>203</th>\n",
       "      <td>Santa Maria</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>204</th>\n",
       "      <td>Inglewood</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>205</th>\n",
       "      <td>Orange</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "      RegionName State\n",
       "201       Canton    OH\n",
       "202     Metairie    LA\n",
       "203  Santa Maria    CA\n",
       "204    Inglewood    CA\n",
       "205       Orange    CA"
      ]
     },
     "execution_count": 58,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[201:205, ['RegionName', 'State']]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "84529df0072f54388a4018afe81f7213283684c2"
   },
   "source": [
    "### 8.Select multiple rows and all columns"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 59,
   "metadata": {
    "_uuid": "6824c8aa1473fa11c7082474566ed223449fa0aa"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>201</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>51260</td>\n",
       "      <td>Canton</td>\n",
       "      <td>OH</td>\n",
       "      <td>Canton</td>\n",
       "      <td>Stark</td>\n",
       "      <td>201</td>\n",
       "      <td>94400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>202</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>5914</td>\n",
       "      <td>Metairie</td>\n",
       "      <td>LA</td>\n",
       "      <td>New Orleans</td>\n",
       "      <td>Jefferson</td>\n",
       "      <td>202</td>\n",
       "      <td>232700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>203</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>47570</td>\n",
       "      <td>Santa Maria</td>\n",
       "      <td>CA</td>\n",
       "      <td>Santa Maria-Santa Barbara</td>\n",
       "      <td>Santa Barbara</td>\n",
       "      <td>203</td>\n",
       "      <td>354600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>204</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>45888</td>\n",
       "      <td>Inglewood</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>204</td>\n",
       "      <td>470600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>205</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>33252</td>\n",
       "      <td>Orange</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Orange</td>\n",
       "      <td>205</td>\n",
       "      <td>652000</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "           Date  RegionID   RegionName   ...           County SizeRank    Zhvi\n",
       "201  2017-05-31     51260       Canton   ...            Stark      201   94400\n",
       "202  2017-05-31      5914     Metairie   ...        Jefferson      202  232700\n",
       "203  2017-05-31     47570  Santa Maria   ...    Santa Barbara      203  354600\n",
       "204  2017-05-31     45888    Inglewood   ...      Los Angeles      204  470600\n",
       "205  2017-05-31     33252       Orange   ...           Orange      205  652000\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 59,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[201:205, :]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "7144be8f55095681af806b286a9f2499d6ae9d4e"
   },
   "source": [
    "### 9.Select non-contiguous rows"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 60,
   "metadata": {
    "_uuid": "c5685535872165c7e49c70c2e29320cf580ef6b7"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>18959</td>\n",
       "      <td>Las Vegas</td>\n",
       "      <td>NV</td>\n",
       "      <td>Las Vegas</td>\n",
       "      <td>Clark</td>\n",
       "      <td>5</td>\n",
       "      <td>216500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>20330</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>CA</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>San Francisco</td>\n",
       "      <td>10</td>\n",
       "      <td>1194300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "          Date  RegionID   ...    SizeRank     Zhvi\n",
       "0   2017-05-31      6181   ...           0   672400\n",
       "5   2017-05-31     18959   ...           5   216500\n",
       "10  2017-05-31     20330   ...          10  1194300\n",
       "\n",
       "[3 rows x 8 columns]"
      ]
     },
     "execution_count": 60,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[[0,5,10], :]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "585ffc5545c57c1059aabee355875fd098ae03b5"
   },
   "source": [
    "### 10.Selecting rows based on a specific column's value"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 61,
   "metadata": {
    "_uuid": "2ed5994091954d738fc8579cd49562ac3705d204"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID RegionName   ...    County SizeRank    Zhvi\n",
       "0  2017-05-31      6181   New York   ...    Queens        0  672400\n",
       "\n",
       "[1 rows x 8 columns]"
      ]
     },
     "execution_count": 61,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[data_zillow.County==\"Queens\"]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b50e6cfebafdd7410b9c1ba4fa75d89cad72db68"
   },
   "source": [
    "### 11.Selecting all rows for a specific column based on a value of another column"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 62,
   "metadata": {
    "_uuid": "18b0420a6eaf28da301625a37fe871c1fb89fcd8"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0           Queens\n",
       "63           Essex\n",
       "72          Hudson\n",
       "138    Westchester\n",
       "176        Passaic\n",
       "Name: County, dtype: object"
      ]
     },
     "execution_count": 62,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.loc[data_zillow.Metro==\"New York\", \"County\"].head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "2f7dc2e8ca7d307e7bc13f3052c1bac5f530f436"
   },
   "source": [
    "# 7.Sorting a pandas DataFrame or a Series <a id=\"7\"></a>\n",
    "---\n",
    "[**Go to top**](#00)\n",
    "\n",
    "![](https://www.notquitesusie.com/wp-content/uploads/2012/10/farmers-market-coloring-sorting-set.jpg)\n",
    "\n",
    "### In this section you can learn:\n",
    "\n",
    "1. Simple sort\n",
    "1. Changing the sort order\n",
    "1. Sort by more than one column\n",
    "1. Sort by multiple columns and mixed ascending order\n",
    "1. Sort a Series\n",
    "\n",
    "### 1.Read dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 63,
   "metadata": {
    "_uuid": "a589700f06ff02dddae557afba57609b23ef8e82"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 63,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow = pd.read_csv('../input/datasetsdifferent-format/data-zillow.csv')\n",
    "data_zillow.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "004616e4f857e4d1cd5c7a31833fa9954dccfe8e"
   },
   "source": [
    "### 2.Simple sort\n",
    "* Pass a column name to `sort_values` to sort the rows by that column (ascending by default)."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 64,
   "metadata": {
    "_uuid": "6a547e6f11ca4f14707c366e30bb81d062f96e80"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>9851</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>48458</td>\n",
       "      <td>Westport</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>9851</td>\n",
       "      <td>144600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4996</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>36873</td>\n",
       "      <td>Elma</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>4996</td>\n",
       "      <td>175200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5090</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>35514</td>\n",
       "      <td>Hoquiam</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>5090</td>\n",
       "      <td>95700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9401</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>33215</td>\n",
       "      <td>Ocean Shores</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>9401</td>\n",
       "      <td>152400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9149</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>18370</td>\n",
       "      <td>Grayland</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>9149</td>\n",
       "      <td>143900</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "            Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "9851  2017-05-31     48458      Westport   ...    Grays Harbor     9851  144600\n",
       "4996  2017-05-31     36873          Elma   ...    Grays Harbor     4996  175200\n",
       "5090  2017-05-31     35514       Hoquiam   ...    Grays Harbor     5090   95700\n",
       "9401  2017-05-31     33215  Ocean Shores   ...    Grays Harbor     9401  152400\n",
       "9149  2017-05-31     18370      Grayland   ...    Grays Harbor     9149  143900\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 64,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_zillow.sort_values('Metro').head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "7b7b9b398f259c8b9361709a7405f1bf70424fcf"
   },
   "source": [
    "### 3.Changing the sort order\n",
    "* Pass `ascending=False` to sort in descending order."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 65,
   "metadata": {
    "_uuid": "74060e2001dbfb9eb8f63d4818d57f8c8c4a9951"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>8064</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>19538</td>\n",
       "      <td>Nashport</td>\n",
       "      <td>OH</td>\n",
       "      <td>Zanesville</td>\n",
       "      <td>Muskingum</td>\n",
       "      <td>8064</td>\n",
       "      <td>153800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10271</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>15262</td>\n",
       "      <td>Hopewell</td>\n",
       "      <td>OH</td>\n",
       "      <td>Zanesville</td>\n",
       "      <td>Muskingum</td>\n",
       "      <td>10271</td>\n",
       "      <td>138700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10373</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>49730</td>\n",
       "      <td>Norwich</td>\n",
       "      <td>OH</td>\n",
       "      <td>Zanesville</td>\n",
       "      <td>Muskingum</td>\n",
       "      <td>10373</td>\n",
       "      <td>145100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5423</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>53527</td>\n",
       "      <td>New Concord</td>\n",
       "      <td>OH</td>\n",
       "      <td>Zanesville</td>\n",
       "      <td>Muskingum</td>\n",
       "      <td>5423</td>\n",
       "      <td>138300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7595</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17815</td>\n",
       "      <td>Dresden</td>\n",
       "      <td>OH</td>\n",
       "      <td>Zanesville</td>\n",
       "      <td>Muskingum</td>\n",
       "      <td>7595</td>\n",
       "      <td>118400</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "             Date  RegionID   RegionName   ...       County SizeRank    Zhvi\n",
       "8064   2017-05-31     19538     Nashport   ...    Muskingum     8064  153800\n",
       "10271  2017-05-31     15262     Hopewell   ...    Muskingum    10271  138700\n",
       "10373  2017-05-31     49730      Norwich   ...    Muskingum    10373  145100\n",
       "5423   2017-05-31     53527  New Concord   ...    Muskingum     5423  138300\n",
       "7595   2017-05-31     17815      Dresden   ...    Muskingum     7595  118400\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 65,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Use a name other than `sorted` so the Python built-in is not shadowed\n",
    "sorted_desc = data_zillow.sort_values('Metro', ascending=False)\n",
    "sorted_desc.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "a28385554e6e61a0e5909b0c975f2075a2958695"
   },
   "source": [
    "### 4.Sort by more than one column"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 66,
   "metadata": {
    "_uuid": "fb15d1ebc7d0406ae6a6201a2681804768059882"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>2073</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>30116</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>2073</td>\n",
       "      <td>127800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4568</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>56078</td>\n",
       "      <td>Montesano</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>4568</td>\n",
       "      <td>182000</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4996</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>36873</td>\n",
       "      <td>Elma</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>4996</td>\n",
       "      <td>175200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5090</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>35514</td>\n",
       "      <td>Hoquiam</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>5090</td>\n",
       "      <td>95700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7108</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6275</td>\n",
       "      <td>Oakville</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>7108</td>\n",
       "      <td>186900</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "            Date  RegionID RegionName   ...          County SizeRank    Zhvi\n",
       "2073  2017-05-31     30116   Aberdeen   ...    Grays Harbor     2073  127800\n",
       "4568  2017-05-31     56078  Montesano   ...    Grays Harbor     4568  182000\n",
       "4996  2017-05-31     36873       Elma   ...    Grays Harbor     4996  175200\n",
       "5090  2017-05-31     35514    Hoquiam   ...    Grays Harbor     5090   95700\n",
       "7108  2017-05-31      6275   Oakville   ...    Grays Harbor     7108  186900\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 66,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "sorted = data_zillow.sort_values(by=['Metro','County'])\n",
    "sorted.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "18dd9b184d34bf47a3129c3caab1852ec74c32eb"
   },
   "source": [
    "### 5.Sort by multiple columns and mixed ascending order"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 67,
   "metadata": {
    "_uuid": "871b4d89addc1741d79ab58dda35798fa369a2a4"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>7108</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6275</td>\n",
       "      <td>Oakville</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>7108</td>\n",
       "      <td>186900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4568</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>56078</td>\n",
       "      <td>Montesano</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>4568</td>\n",
       "      <td>182000</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4996</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>36873</td>\n",
       "      <td>Elma</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>4996</td>\n",
       "      <td>175200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>8420</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>19269</td>\n",
       "      <td>McCleary</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>8420</td>\n",
       "      <td>170700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9401</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>33215</td>\n",
       "      <td>Ocean Shores</td>\n",
       "      <td>WA</td>\n",
       "      <td>Aberdeen</td>\n",
       "      <td>Grays Harbor</td>\n",
       "      <td>9401</td>\n",
       "      <td>152400</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "            Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "7108  2017-05-31      6275      Oakville   ...    Grays Harbor     7108  186900\n",
       "4568  2017-05-31     56078     Montesano   ...    Grays Harbor     4568  182000\n",
       "4996  2017-05-31     36873          Elma   ...    Grays Harbor     4996  175200\n",
       "8420  2017-05-31     19269      McCleary   ...    Grays Harbor     8420  170700\n",
       "9401  2017-05-31     33215  Ocean Shores   ...    Grays Harbor     9401  152400\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 67,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "sorted = data_zillow.sort_values(by=['Metro','County', 'Zhvi'], \n",
    "                            ascending=[True, True, False])\n",
    "sorted.head()"
   ]
  },
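  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Each entry in `ascending` applies to the column at the same position in `by`. The sketch below is a minimal, self-contained illustration on a made-up frame; the `toy` name and its values are assumptions for the example, not rows from the Zillow file.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Toy data (hypothetical values) to show how `by` and `ascending` line up.\n",
    "toy = pd.DataFrame({\n",
    "    'Metro': ['B', 'A', 'A', 'B'],\n",
    "    'Zhvi': [100, 300, 200, 400],\n",
    "})\n",
    "\n",
    "# Metro ascending, then Zhvi descending within each Metro.\n",
    "toy_sorted = toy.sort_values(by=['Metro', 'Zhvi'], ascending=[True, False])\n",
    "print(toy_sorted)\n",
    "```\n",
    "\n",
    "Rows are ordered by `Metro` first, and `Zhvi` runs from high to low only within each `Metro` value."
   ]
  },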
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "9acb5996c97db4fe17dc08300bdf824116f6033e"
   },
   "source": [
    "### 6.Sort a Series\n",
    "\n",
    "* 1.Let's create a Series object"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 68,
   "metadata": {
    "_uuid": "a8e2f43674423ca056544339ced4d67c55281867"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "pandas.core.series.Series"
      ]
     },
     "execution_count": 68,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "regions = data_zillow.RegionID\n",
    "type(regions)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "12794c9bdfc00907d20a5ff05a983597845dde5b"
   },
   "source": [
    "**Let's sort the series¶**\n",
    "* **1.Original Series**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 69,
   "metadata": {
    "_uuid": "e9877f445f344622bbeb484e3705ce2a955788a0"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0     6181\n",
       "1    12447\n",
       "2    17426\n",
       "3    13271\n",
       "4    40326\n",
       "Name: RegionID, dtype: int64"
      ]
     },
     "execution_count": 69,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "regions.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "87a2abb1524e7585f6b3e88e1eea6f8e7b10ef0c"
   },
   "source": [
    "* **2.Sorted**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 70,
   "metadata": {
    "_uuid": "0b80c1a3bf6b8b32f32c90aff466af290fa1f3e1"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "3043    3301\n",
       "4159    3304\n",
       "4986    3305\n",
       "1762    3310\n",
       "3116    3312\n",
       "Name: RegionID, dtype: int64"
      ]
     },
     "execution_count": 70,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "regions.sort_values().head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "84158ba6779216a8939efaba339167dd92a2544d"
   },
   "source": [
    "# 8.Using pandas Series data structure to select a subset of the data <a id=\"8\"></a>\n",
    "---\n",
    "[**Go to top**](#00)\n",
    "\n",
    "![](https://image.slidesharecdn.com/talk-120111102959-phpapp01/95/a-look-inside-pandas-design-and-development-23-728.jpg)\n",
    "\n",
    "### In this Section, you will learn below topics\n",
    "\n",
    "1. Select data\n",
    "    * Select a Series with bracket notation\n",
    "2. DataFrame vs Series\n",
    "    * Multi Column Selection - Series or DataFrame\n",
    "    * Select using dot notation\n",
    "3. Creating a new series by selection"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "946e7c43d811837fae81b0668a6986dd26dd7550"
   },
   "source": [
    "### 1.Read Dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 71,
   "metadata": {
    "_uuid": "323a868a7b32c613cc6a340833972dbc81e69435"
   },
   "outputs": [],
   "source": [
    "data = pd.read_table('../input/datasetsdifferent-format/data-zillow.csv', sep=',')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 72,
   "metadata": {
    "_uuid": "75823353510522d724e4f5ab61e3831fd26f6f9e"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 72,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "23ed758912e499e705c7fce070c3545dc9af1ca4"
   },
   "source": [
    "### 2.Select data\n",
    "* **Select a Series with bracket notation**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 73,
   "metadata": {
    "_uuid": "3fca5ee8f633d53b0a7b83cdb34b3dd9e552db1d"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "pandas.core.series.Series"
      ]
     },
     "execution_count": 73,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "regions = data['RegionName']\n",
    "type(regions)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 74,
   "metadata": {
    "_uuid": "bba401187ba6815a159538a3ead2afe03a0eb7e0"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0        New York\n",
       "1     Los Angeles\n",
       "2         Chicago\n",
       "3    Philadelphia\n",
       "4         Phoenix\n",
       "Name: RegionName, dtype: object"
      ]
     },
     "execution_count": 74,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "regions.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "5336e7deb3f385de384b71d962f98487fe328ad7"
   },
   "source": [
    "### 3.DataFrame vs Series\n",
    "* **Multi Column Selection - Series or DataFrame**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 75,
   "metadata": {
    "_uuid": "afb08b96760753602cd00c7cd92efe043f7ba93a"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "     RegionName State\n",
       "0      New York    NY\n",
       "1   Los Angeles    CA\n",
       "2       Chicago    IL\n",
       "3  Philadelphia    PA\n",
       "4       Phoenix    AZ"
      ]
     },
     "execution_count": 75,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "region_n_state = data[['RegionName', 'State']]\n",
    "region_n_state.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 76,
   "metadata": {
    "_uuid": "7f7137cc9766d9f714e3d0cb5aee3d7c5a5ce5a7"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "pandas.core.frame.DataFrame"
      ]
     },
     "execution_count": 76,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "type(region_n_state)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "3b280401fb54a2caaaa28595cb789afea4ec880a"
   },
   "source": [
    "* **Select using dot notation**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 77,
   "metadata": {
    "_uuid": "2c7b9b1f1151d8bf28e0c3d90901f067b2c9662d"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0    NY\n",
       "1    CA\n",
       "2    IL\n",
       "3    PA\n",
       "4    AZ\n",
       "Name: State, dtype: object"
      ]
     },
     "execution_count": 77,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.State.head()"
   ]
  },
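  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Dot notation is a shortcut for bracket notation, but it only works when the column name is a valid Python identifier and does not collide with an existing DataFrame attribute or method. A minimal sketch, assuming a made-up frame with a column called `Region Name` (that column name is hypothetical and is not in the Zillow file):\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Toy frame (hypothetical) with one awkward column name.\n",
    "df = pd.DataFrame({\n",
    "    'State': ['NY', 'CA'],\n",
    "    'Region Name': ['New York', 'Los Angeles'],\n",
    "})\n",
    "\n",
    "# Dot notation works because 'State' is a valid identifier.\n",
    "states_dot = df.State\n",
    "\n",
    "# Bracket notation works for any column name, including ones with spaces.\n",
    "names = df['Region Name']\n",
    "\n",
    "# For a simple column name both forms return the same Series.\n",
    "same = states_dot.equals(df['State'])\n",
    "print(same)\n",
    "```\n",
    "\n",
    "Bracket notation is the safer default in reusable code, since it keeps working if a column is later renamed to something that is not a valid identifier."
   ]
  },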
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b60b50c6f145871b82b983ecf22dfc05e438d6a1"
   },
   "source": [
    "### 4.Creating a new series by selection"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 78,
   "metadata": {
    "_uuid": "d19d04e9c13ba9d3e091cb13bc22d4bdd4334a28"
   },
   "outputs": [],
   "source": [
    "data['Address'] = data.County + ', ' + data.Metro + ', ' + data.State"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 79,
   "metadata": {
    "_uuid": "82b3975782a8bf36a205efd8df590ef501d7c5b3"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0                               Queens, New York, NY\n",
       "1    Los Angeles, Los Angeles-Long Beach-Anaheim, CA\n",
       "2                                  Cook, Chicago, IL\n",
       "3                     Philadelphia, Philadelphia, PA\n",
       "4                              Maricopa, Phoenix, AZ\n",
       "Name: Address, dtype: object"
      ]
     },
     "execution_count": 79,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.Address.head()"
   ]
  },
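  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "One caveat when building a column this way: if any operand is missing for a row, the concatenated result for that row is missing too. A minimal sketch on a made-up frame (the `toy` values and the `Unknown` placeholder are assumptions for illustration, not taken from the Zillow file):\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Toy frame (hypothetical values); the second row has no Metro.\n",
    "toy = pd.DataFrame({\n",
    "    'County': ['Queens', 'Grays Harbor'],\n",
    "    'Metro': ['New York', None],\n",
    "    'State': ['NY', 'WA'],\n",
    "})\n",
    "\n",
    "# Plain concatenation: a missing piece makes the whole result missing.\n",
    "plain = toy.County + ', ' + toy.Metro + ', ' + toy.State\n",
    "\n",
    "# Filling the gap first keeps the row usable.\n",
    "filled = toy.County + ', ' + toy.Metro.fillna('Unknown') + ', ' + toy.State\n",
    "\n",
    "print(plain.isna().tolist())\n",
    "print(filled.tolist())\n",
    "```\n",
    "\n",
    "Whether to fill, drop, or keep such rows depends on how the new column will be used."
   ]
  },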
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "56b7705c00a973b0051b6336101a506c7c8347f6"
   },
   "source": [
    "# 9.Using string methods in pandas <a id=\"9\"></a>\n",
    "---\n",
    "[**Go To Top**](#00)\n",
    "\n",
    "### In this section, you will learn\n",
    "1. Check for a substring\n",
    "2. Make values of a series or column uppercase\n",
    "3. Make values lowercase\n",
    "4. Get the length of each value in a column\n",
    "5. Remove all whitespace from the beginning\n",
    "6. Replace parts of a column's values"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b1e2434a65c2f9ff8143c8b78e2e67970a7e7fb8"
   },
   "source": [
    "### 1. Read dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 80,
   "metadata": {
    "_uuid": "76386e29ed05257ce80d9222717cebdafb366a1b"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 80,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_table('../input/datasetsdifferent-format/data-zillow.csv', sep=',')\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "42bf238215d5ff64632d733160c71d478de6ec41"
   },
   "source": [
    "### 2.Check for a substring"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 81,
   "metadata": {
    "_uuid": "ade0cd23b629fe45650d933eabdec1107116b81d"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0     True\n",
       "1    False\n",
       "2    False\n",
       "3    False\n",
       "4    False\n",
       "Name: RegionName, dtype: bool"
      ]
     },
     "execution_count": 81,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.RegionName.str.contains('New').head()"
   ]
  },
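  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The boolean Series returned by `str.contains` is most useful as a row filter. A minimal sketch on a made-up Series (the `names` values are assumptions for the example); `na=False` makes missing values count as non-matches so the mask stays purely boolean:\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Toy Series (hypothetical values) with one missing entry.\n",
    "names = pd.Series(['New York', 'Chicago', None, 'New Concord'])\n",
    "\n",
    "# na=False treats missing values as non-matches.\n",
    "mask = names.str.contains('New', na=False)\n",
    "\n",
    "# A boolean mask can index a Series (or a DataFrame) directly.\n",
    "matches = names[mask]\n",
    "print(matches.tolist())\n",
    "```\n",
    "\n",
    "The same pattern applies to a DataFrame: build the mask from one column and pass it inside the brackets to keep only the matching rows."
   ]
  },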
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "013788b12a95c8342f46b981b8723aba96adc5d0"
   },
   "source": [
    "### 3.Make values of a series or column uppercase"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 82,
   "metadata": {
    "_uuid": "329f44fdaf6c41a2a3cf26c6f2bb74b2899925df"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0        NEW YORK\n",
       "1     LOS ANGELES\n",
       "2         CHICAGO\n",
       "3    PHILADELPHIA\n",
       "4         PHOENIX\n",
       "Name: RegionName, dtype: object"
      ]
     },
     "execution_count": 82,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.RegionName.str.upper().head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "45d353dfef0585bf8214ff34682d0254a2ad09b7"
   },
   "source": [
    "### 4.Make values lowercase\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 83,
   "metadata": {
    "_uuid": "83d33939208be8f3b4475c19c64731e7fb22309a"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0        new york\n",
       "1     los angeles\n",
       "2         chicago\n",
       "3    philadelphia\n",
       "4         phoenix\n",
       "Name: RegionName, dtype: object"
      ]
     },
     "execution_count": 83,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.RegionName.str.lower().head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "78117d56357462279874977341b3345241c52b52"
   },
   "source": [
    "### 5.Get the length of each value in a column"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 84,
   "metadata": {
    "_uuid": "decbb956f0df489935f3c5569bd9320f09458bc2"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0     6\n",
       "1    11\n",
       "2     4\n",
       "3    12\n",
       "4     8\n",
       "Name: County, dtype: int64"
      ]
     },
     "execution_count": 84,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.County.str.len().head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8fc1665b41988b6f642373311e749ecd3b4cdf5e"
   },
   "source": [
    "### 6.Remove all whitespace from the beginning"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 85,
   "metadata": {
    "_uuid": "292284730d084ff3b9363afe24e9fc9d9b34e2b3"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0        New York\n",
       "1     Los Angeles\n",
       "2         Chicago\n",
       "3    Philadelphia\n",
       "4         Phoenix\n",
       "Name: RegionName, dtype: object"
      ]
     },
     "execution_count": 85,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.RegionName.str.lstrip().head()"
   ]
  },
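  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The first few `RegionName` values have no leading whitespace, so the output above looks unchanged. A minimal sketch on a made-up Series (the `padded` values are assumptions for the example) makes the effect visible and compares `lstrip` with `rstrip` and `strip`:\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Toy Series (hypothetical values) with padding on different sides.\n",
    "padded = pd.Series(['  New York ', ' Chicago', 'Phoenix  '])\n",
    "\n",
    "left = padded.str.lstrip()   # removes leading whitespace only\n",
    "right = padded.str.rstrip()  # removes trailing whitespace only\n",
    "both = padded.str.strip()    # removes whitespace on both ends\n",
    "\n",
    "print(both.tolist())\n",
    "```\n",
    "\n",
    "`str.strip()` is usually the one to reach for when cleaning text read from a file, since stray padding can appear on either side."
   ]
  },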
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "95d22b4ca2591826ade6be510375391515e4b471"
   },
   "source": [
    "### 7.Replace parts of a column's values"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 86,
   "metadata": {
    "_uuid": "0c38dbad17108078192d3e10c62b28619fdc6278"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0         NewYork\n",
       "1      LosAngeles\n",
       "2         Chicago\n",
       "3    Philadelphia\n",
       "4         Phoenix\n",
       "Name: RegionName, dtype: object"
      ]
     },
     "execution_count": 86,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.RegionName.str.replace(' ', '').head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "a913de2eb8531a68ab989277f963d7ddc40d54c1"
   },
   "source": [
    "# 10.Using the axis parameter in pandas<a id=\"10\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://www.dataquest.io/blog/content/images/2017/12/axis_diagram.jpg)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Usage of axis parameter\n",
    "2. axis usage examples\n",
    "    * axis = 0\n",
    "    * axis = 1\n",
    "    * use labels instead of 0 and 1"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "5b848b4dfd71b4e71c324be89258bbcba44da8ce"
   },
   "source": [
    "### 1.Read Dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 87,
   "metadata": {
    "_uuid": "7acf3f24c653cc4c05515dd9b69852d5ae32ef73"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 87,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_csv('../input/datasetsdifferent-format/data-zillow.csv')\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "650fbe79df832ab3f9b929cc8dbb3009f956e5ea"
   },
   "source": [
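    "### 2.Usage of the axis parameter\n",
    "\n",
    "A DataFrame has two axes: axis 0 is the row index and axis 1 is the column labels. `data.axes` returns them in that order."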
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 88,
   "metadata": {
    "_uuid": "8fdabcdc1b4bbb1316406a5249c3276d83b51c32"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 88,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 89,
   "metadata": {
    "_uuid": "9f6516b7ee88954c7e01b83988059277ac82a2db"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[RangeIndex(start=0, stop=10830, step=1),\n",
       " Index(['Date', 'RegionID', 'RegionName', 'State', 'Metro', 'County',\n",
       "        'SizeRank', 'Zhvi'],\n",
       "       dtype='object')]"
      ]
     },
     "execution_count": 89,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.axes"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b5748ea09979ed1f104c0e1510f7ec7a8c378af5"
   },
   "source": [
    "### 1.axis = 0\n",
    "\n",
    "`axis=0` aggregates down the rows, so `mean` returns one value per numeric column."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 90,
   "metadata": {
    "_uuid": "0362bfd8a85f9f94b37408f1453390e8a8f0b401"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "RegionID     84344.818837\n",
       "SizeRank      5414.500000\n",
       "Zhvi        250307.590028\n",
       "dtype: float64"
      ]
     },
     "execution_count": 90,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
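    "# numeric_only=True skips the text columns; pandas 2.x raises a TypeError without it\n",
    "data.mean(axis=0, numeric_only=True)"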
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "3cb2c4cf0b13166cdc2485e899c267f9db731a13"
   },
   "source": [
    "### 2.axis = 1\n",
    "\n",
    "`axis=1` aggregates across the columns, so `mean` returns one value per row."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 91,
   "metadata": {
    "_uuid": "19b7f672d56b307f39f1f851d50e25de4a166f86"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0    226193.666667\n",
       "1    214116.000000\n",
       "2     80042.666667\n",
       "3     50191.333333\n",
       "4     83876.666667\n",
       "dtype: float64"
      ]
     },
     "execution_count": 91,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
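    "# numeric_only=True skips the text columns; pandas 2.x raises a TypeError without it\n",
    "data.mean(axis=1, numeric_only=True).head()"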
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "90d27f42336c1349fcc8b32e7c852c7694c9c40f"
   },
   "source": [
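    "### 3.Use labels instead of 0 and 1\n",
    "\n",
    "`axis='rows'` (or `axis='index'`) is equivalent to `axis=0`, and `axis='columns'` is equivalent to `axis=1`."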
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 92,
   "metadata": {
    "_uuid": "42aecaf3aba29bee84c38f6b51adfbbc2a8ab2e3"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "RegionID     84344.818837\n",
       "SizeRank      5414.500000\n",
       "Zhvi        250307.590028\n",
       "dtype: float64"
      ]
     },
     "execution_count": 92,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.mean(axis='rows', numeric_only=True)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 93,
   "metadata": {
    "_uuid": "d6f0b08af0b3ef63d5904e999210550d81e0a86f"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0    226193.666667\n",
       "1    214116.000000\n",
       "2     80042.666667\n",
       "3     50191.333333\n",
       "4     83876.666667\n",
       "dtype: float64"
      ]
     },
     "execution_count": 93,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.mean(axis='columns', numeric_only=True).head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 94,
   "metadata": {
    "_uuid": "83420020fc839458af19316d5da310e082ebe4ed"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>18959</td>\n",
       "      <td>Las Vegas</td>\n",
       "      <td>NV</td>\n",
       "      <td>Las Vegas</td>\n",
       "      <td>Clark</td>\n",
       "      <td>5</td>\n",
       "      <td>216500</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank    Zhvi\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "5  2017-05-31     18959     Las Vegas   ...           Clark        5  216500\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 94,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.drop(0, axis=0).head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 95,
   "metadata": {
    "_uuid": "dd51d2e3bbba27b39138b3c8a4d5a20626ebd6cd"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Zhvi</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   RegionID    RegionName State   ...          County SizeRank    Zhvi\n",
       "0      6181      New York    NY   ...          Queens        0  672400\n",
       "1     12447   Los Angeles    CA   ...     Los Angeles        1  629900\n",
       "2     17426       Chicago    IL   ...            Cook        2  222700\n",
       "3     13271  Philadelphia    PA   ...    Philadelphia        3  137300\n",
       "4     40326       Phoenix    AZ   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 7 columns]"
      ]
     },
     "execution_count": 95,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.drop('Date', axis=1).head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b780dad1ddbb809bb4970064e285b7294c4f30f6"
   },
   "source": [
    "# 11.Applying a function to a pandas Series or DataFrame<a id=\"11\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://i.stack.imgur.com/AqYhv.png)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Apply functions using apply()\n",
    "2. Apply functions using applymap()\n",
    "3. Applying our own functions"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "62729c46bdde975e817d8cd5f559eaa25a0d0069"
   },
   "source": [
    "### 1.Read dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 97,
   "metadata": {
    "_uuid": "f084a0bb05334f3d87d25550691da33203670fdf"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Name</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>Braund, Mr. Owen Harris</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>A/5 21171</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>Cumings, Mrs. John Bradley (Florence Briggs Th...</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>PC 17599</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>Heikkinen, Miss. Laina</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>STON/O2. 3101282</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>Futrelle, Mrs. Jacques Heath (Lily May Peel)</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>113803</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>Allen, Mr. William Henry</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>373450</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   PassengerId  Survived  Pclass    ...        Fare Cabin  Embarked\n",
       "0            1         0       3    ...      7.2500   NaN         S\n",
       "1            2         1       1    ...     71.2833   C85         C\n",
       "2            3         1       3    ...      7.9250   NaN         S\n",
       "3            4         1       1    ...     53.1000  C123         S\n",
       "4            5         0       3    ...      8.0500   NaN         S\n",
       "\n",
       "[5 rows x 12 columns]"
      ]
     },
     "execution_count": 97,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_csv('../input/datasetsdifferent-format/data-titanic.csv')\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "74d7fa3380758da28468e34abb395a24c266a1ae"
   },
   "source": [
    "### 2.Apply functions using apply()\n",
    "\n",
    "`Series.apply` calls the given function on each element and returns a new Series."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 98,
   "metadata": {
    "_uuid": "187ec6b6f4dbf781aebd24ba413ecb8eeb36db6e"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0                              braund, mr. owen harris\n",
       "1    cumings, mrs. john bradley (florence briggs th...\n",
       "2                               heikkinen, miss. laina\n",
       "3         futrelle, mrs. jacques heath (lily may peel)\n",
       "4                             allen, mr. william henry\n",
       "Name: Name, dtype: object"
      ]
     },
     "execution_count": 98,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Lower-case every passenger name\n",
    "data.Name.apply(str.lower).head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8bfb7e56dcf1cfdd18b67f3966a2064837e031fb"
   },
   "source": [
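    "### 3.Apply functions using applymap()\n",
    "\n",
    "`DataFrame.applymap` applies a function to every element of a DataFrame. Since pandas 2.1 it is deprecated in favour of `DataFrame.map`, which behaves the same way."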
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 99,
   "metadata": {
    "_uuid": "3a2d5a252cefc2354e4b0bc591b20e5625d5c104"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Age</th>\n",
       "      <th>Pclass</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>484.0</td>\n",
       "      <td>9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>1444.0</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>676.0</td>\n",
       "      <td>9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>1225.0</td>\n",
       "      <td>1</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>1225.0</td>\n",
       "      <td>9</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "      Age  Pclass\n",
       "0   484.0       9\n",
       "1  1444.0       1\n",
       "2   676.0       9\n",
       "3  1225.0       1\n",
       "4  1225.0       9"
      ]
     },
     "execution_count": 99,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data[['Age', 'Pclass']].applymap(np.square).head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "c327a4138c18d2efa5aeff0c4ab5483a34d50c60"
   },
   "source": [
    "### 4.Applying our own functions"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 100,
   "metadata": {
    "_uuid": "a2a3dca203741e47cc3f38627d5721ba269c0350"
   },
   "outputs": [],
   "source": [
    "def my_func(i):\n",
    "    return i + 20"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 101,
   "metadata": {
    "_uuid": "3ade8dff119165079e8b4ac524de87d49955941b"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Age</th>\n",
       "      <th>Pclass</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>42.0</td>\n",
       "      <td>23</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>58.0</td>\n",
       "      <td>21</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>46.0</td>\n",
       "      <td>23</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>55.0</td>\n",
       "      <td>21</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>55.0</td>\n",
       "      <td>23</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    Age  Pclass\n",
       "0  42.0      23\n",
       "1  58.0      21\n",
       "2  46.0      23\n",
       "3  55.0      21\n",
       "4  55.0      23"
      ]
     },
     "execution_count": 101,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data[['Age', 'Pclass']].applymap(my_func).head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b219146024ecc80dac4a12aeb0a307647a117409"
   },
   "source": [
    "# 12.Handling SettingWithCopyWarning<a id=\"12\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://www.dataquest.io/blog/content/images/view-vs-copy.png)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. A SettingWithCopyWarning scenario\n",
    "2. Handling the SettingWithCopyWarning"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "044a8a5f763fd288b8f66159a86eb6fd56d83c2d"
   },
   "source": [
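    "### 1.A SettingWithCopyWarning scenario\n",
    "\n",
    "Chained indexing such as `data[mask].Age = value` assigns to the temporary object returned by `data[mask]`, so `data` itself is left unchanged and pandas emits the warning."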
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 102,
   "metadata": {
    "_uuid": "8bec8adf2f445f714e4a7381cec0cbfa9316fbee"
   },
   "outputs": [
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "/opt/conda/lib/python3.6/site-packages/pandas/core/generic.py:4405: SettingWithCopyWarning: \n",
      "A value is trying to be set on a copy of a slice from a DataFrame.\n",
      "Try using .loc[row_indexer,col_indexer] = value instead\n",
      "\n",
      "See the caveats in the documentation: http://pandas.pydata.org/pandas-docs/stable/indexing.html#indexing-view-versus-copy\n",
      "  self[name] = value\n"
     ]
    }
   ],
   "source": [
    "data[data.Age.isnull()].Age = data.Age.mean()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "939cfa644669a17da37532518abc0db59c59cbb3"
   },
   "source": [
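    "### 2.Handling the SettingWithCopyWarning\n",
    "\n",
    "The rows with a missing `Age` are still `NaN`, which confirms that the chained assignment had no effect. Selecting the rows and the column in a single `.loc` call assigns to the original DataFrame."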
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 103,
   "metadata": {
    "_uuid": "50faa074dd2c4c1431ad4a32ca119a537795b2a0"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "5    NaN\n",
       "17   NaN\n",
       "19   NaN\n",
       "26   NaN\n",
       "28   NaN\n",
       "Name: Age, dtype: float64"
      ]
     },
     "execution_count": 103,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data[data.Age.isnull()].Age.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 104,
   "metadata": {
    "_uuid": "233ad0d276a2b9ed32462011ef12881a80213ee5"
   },
   "outputs": [],
   "source": [
    "# Call mean() so the numeric mean is assigned, not the method object\n",
    "data.loc[data.Age.isnull(), 'Age'] = data.Age.mean()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 105,
   "metadata": {
    "_uuid": "7978b6f3b9a9819bfa7c91e42ac2f3f7507b0a19"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Name</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "Empty DataFrame\n",
       "Columns: [PassengerId, Survived, Pclass, Name, Sex, Age, SibSp, Parch, Ticket, Fare, Cabin, Embarked]\n",
       "Index: []"
      ]
     },
     "execution_count": 105,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data[data.Age.isnull()]"
   ]
  },
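  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The warning shown earlier arises because `data[mask].Age = value` first builds a filtered object and then assigns into it, so the original frame may be left unchanged. Below is a minimal sketch of the single-step `.loc` assignment, assuming a small hypothetical `toy` frame (not the Titanic data):\n",
    "\n",
    "```python\n",
    "import numpy as np\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy frame, used only for illustration\n",
    "toy = pd.DataFrame({'Age': [22.0, np.nan, 30.0, np.nan]})\n",
    "\n",
    "# Compute the mean first; NaN values are skipped by default\n",
    "mean_age = toy['Age'].mean()\n",
    "\n",
    "# One .loc call selects the rows and the column together, then assigns\n",
    "toy.loc[toy['Age'].isnull(), 'Age'] = mean_age\n",
    "\n",
    "print(toy['Age'].tolist())\n",
    "```\n",
    "\n",
    "Because the row mask and the column label go through one `.loc` call, pandas writes to `toy` itself rather than to a temporary object."
   ]
  },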
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "19f59d3c81aa929297708dbc3bb2ec98e3417d2d"
   },
   "source": [
    "# 13.Handling missing values in pandas<a id=\"13\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://cdn-images-1.medium.com/max/1600/1*_RA3mCS30Pr0vUxbp25Yxw.png)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Missing Records \n",
    "    1. Find out total records in the dataset\n",
    "    1. Number of valid records per column\n",
    "2. Dropping missing records\n",
    "    1. Drop all records that have one or more missing values\n",
    "    1. Drop only those rows in which every value is missing\n",
    "3. Fill in missing data\n",
    "    1. Fill in missing data with zeros\n",
    "    1. Fill in missing data with the mean of the column's non-missing values"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 106,
   "metadata": {
    "_uuid": "ea32197e3c192da841ce31923793e9b3052a62e4"
   },
   "outputs": [],
   "source": [
    "data = pd.read_csv(\"../input/datasetsdifferent-format/data-titanic.csv\")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "c7ca85d63e5972f24a4b87b065fa4bafc178d225"
   },
   "source": [
    "### 1. Missing Records \n",
    "1. **Find out total records in the dataset**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 107,
   "metadata": {
    "_uuid": "3e6271ae4ff42998a2eedb87ee7f02f7def78089"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(891, 12)"
      ]
     },
     "execution_count": 107,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.shape"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "3cd684c000fdb34a0ad644a3492d07c0548b0247"
   },
   "source": [
    "2. **Number of valid records per column**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 108,
   "metadata": {
    "_uuid": "180e92074bcd20e0f33826ba1ba95c8362a94962"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "PassengerId    891\n",
       "Survived       891\n",
       "Pclass         891\n",
       "Name           891\n",
       "Sex            891\n",
       "Age            714\n",
       "SibSp          891\n",
       "Parch          891\n",
       "Ticket         891\n",
       "Fare           891\n",
       "Cabin          204\n",
       "Embarked       889\n",
       "dtype: int64"
      ]
     },
     "execution_count": 108,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.count()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "05885e972eb1aaa034504a193721425b96cb248c"
   },
   "source": [
    "### 2. Dropping missing records\n",
    "\n",
    "1. **Drop all records that have one or more missing values**\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 109,
   "metadata": {
    "_uuid": "f7de03f21884b47dca8d7f938756dea84baab6e2"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(183, 12)"
      ]
     },
     "execution_count": 109,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_missing_dropped = data.dropna()\n",
    "data_missing_dropped.shape"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "037775db411f832eadc6f24e8eb41af794bb0c42"
   },
   "source": [
    "2. **Drop only those rows in which every value is missing**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 110,
   "metadata": {
    "_uuid": "89253caae0e10ca237cccb9233aaae886828f3b9"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(891, 12)"
      ]
     },
     "execution_count": 110,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_all_missing_dropped = data.dropna(how=\"all\")\n",
    "data_all_missing_dropped.shape"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "3c30936e71f13c7c575ffef77d9f0a0d6c5c1c1e"
   },
   "source": [
    "### 3. Fill in missing data\n",
    "    \n",
    "1. **Fill in missing data with zeros**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 111,
   "metadata": {
    "_uuid": "7b086ead1e2c6056ff2ab2c89f7d650adb273577"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "PassengerId    891\n",
       "Survived       891\n",
       "Pclass         891\n",
       "Name           891\n",
       "Sex            891\n",
       "Age            891\n",
       "SibSp          891\n",
       "Parch          891\n",
       "Ticket         891\n",
       "Fare           891\n",
       "Cabin          891\n",
       "Embarked       891\n",
       "dtype: int64"
      ]
     },
     "execution_count": 111,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_filled_zeros =  data.fillna(0)\n",
    "data_filled_zeros.count()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "72acb9bbbc7b6e0b4d4da444c233eb94a0a74eac"
   },
   "source": [
    "2. **Fill in missing data with the mean of the column's non-missing values**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 112,
   "metadata": {
    "_uuid": "9707b243646e60627633dd97799c553ca8f24d48"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "PassengerId    891\n",
       "Survived       891\n",
       "Pclass         891\n",
       "Name           891\n",
       "Sex            891\n",
       "Age            891\n",
       "SibSp          891\n",
       "Parch          891\n",
       "Ticket         891\n",
       "Fare           891\n",
       "Cabin          204\n",
       "Embarked       889\n",
       "dtype: int64"
      ]
     },
     "execution_count": 112,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data_filled_in_mean = data.copy()\n",
    "data_filled_in_mean['Age'] = data_filled_in_mean['Age'].fillna(data['Age'].mean())\n",
    "data_filled_in_mean.count()"
   ]
  },
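  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "`count()` reports valid values per column. The complementary view (missing values per column) and two narrower `dropna` options, `subset` and `thresh`, can be sketched as follows. The `toy` frame and its values are assumptions made up for illustration:\n",
    "\n",
    "```python\n",
    "import numpy as np\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy frame with gaps in two columns\n",
    "toy = pd.DataFrame({'Age': [22.0, np.nan, 30.0],\n",
    "                    'Cabin': [np.nan, np.nan, 'C85'],\n",
    "                    'Fare': [7.25, 8.05, 53.1]})\n",
    "\n",
    "# Number of missing values in each column\n",
    "missing_per_column = toy.isnull().sum()\n",
    "\n",
    "# Drop a row only when 'Age' is missing\n",
    "age_known = toy.dropna(subset=['Age'])\n",
    "\n",
    "# Keep rows that have at least 2 non-missing values\n",
    "mostly_complete = toy.dropna(thresh=2)\n",
    "\n",
    "print(missing_per_column)\n",
    "```\n",
    "\n",
    "`subset` and `thresh` are useful when a sparse column such as `Cabin` would otherwise cause most rows to be dropped."
   ]
  },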
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "240939ed7a03b5e831dac34a8d1f2e7d48face2c"
   },
   "source": [
    "# 14.Indexing in pandas dataframes<a id=\"14\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://bookdata.readthedocs.io/en/latest/_images/base_01_pandas_5_0.png)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Default Index\n",
    "2. Set an Index after reading the data\n",
    "3. Set an Index while reading data\n",
    "4. Selection using Index\n",
    "5. Reset Index"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 113,
   "metadata": {
    "_uuid": "c91dcf2965d02c0b8047688974b7385e88288b11"
   },
   "outputs": [],
   "source": [
    "data = pd.read_csv('../input/datasetsdifferent-format/data-titanic.csv')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "d94121fb0d915104cc60ff6c9bf37306991723e0"
   },
   "source": [
    "### 1. Default Index"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 114,
   "metadata": {
    "_uuid": "bd4fdaad8ff83b3eced90fb79504d52195cdbac4"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Name</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>Braund, Mr. Owen Harris</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>A/5 21171</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>Cumings, Mrs. John Bradley (Florence Briggs Th...</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>PC 17599</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>Heikkinen, Miss. Laina</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>STON/O2. 3101282</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>Futrelle, Mrs. Jacques Heath (Lily May Peel)</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>113803</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>Allen, Mr. William Henry</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>373450</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   PassengerId  Survived  Pclass    ...        Fare Cabin  Embarked\n",
       "0            1         0       3    ...      7.2500   NaN         S\n",
       "1            2         1       1    ...     71.2833   C85         C\n",
       "2            3         1       3    ...      7.9250   NaN         S\n",
       "3            4         1       1    ...     53.1000  C123         S\n",
       "4            5         0       3    ...      8.0500   NaN         S\n",
       "\n",
       "[5 rows x 12 columns]"
      ]
     },
     "execution_count": 114,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "6ae91401ff9f46e352172d4fb34d087da10911a7"
   },
   "source": [
    "### 2. Set an Index after reading the data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 115,
   "metadata": {
    "_uuid": "49f04023c64b165e23454840fd89e350e56ab264"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Name</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Braund, Mr. Owen Harris</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>A/5 21171</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cumings, Mrs. John Bradley (Florence Briggs Thayer)</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>PC 17599</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Heikkinen, Miss. Laina</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>STON/O2. 3101282</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Futrelle, Mrs. Jacques Heath (Lily May Peel)</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>113803</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Allen, Mr. William Henry</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>373450</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                    PassengerId    ...     Embarked\n",
       "Name                                                               ...             \n",
       "Braund, Mr. Owen Harris                                       1    ...            S\n",
       "Cumings, Mrs. John Bradley (Florence Briggs Tha...            2    ...            C\n",
       "Heikkinen, Miss. Laina                                        3    ...            S\n",
       "Futrelle, Mrs. Jacques Heath (Lily May Peel)                  4    ...            S\n",
       "Allen, Mr. William Henry                                      5    ...            S\n",
       "\n",
       "[5 rows x 11 columns]"
      ]
     },
     "execution_count": 115,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.set_index('Name').head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "94ae9a5a794ef9462fa03e1ceccb4306656d5016"
   },
   "source": [
    "### 3. Set an Index while reading data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 116,
   "metadata": {
    "_uuid": "7e0fe8838eb66fa2696ff4a59fad80bdc68176c9"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Name</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Braund, Mr. Owen Harris</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>A/5 21171</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cumings, Mrs. John Bradley (Florence Briggs Thayer)</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>PC 17599</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Heikkinen, Miss. Laina</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>STON/O2. 3101282</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Futrelle, Mrs. Jacques Heath (Lily May Peel)</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>113803</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Allen, Mr. William Henry</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>373450</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                    PassengerId    ...     Embarked\n",
       "Name                                                               ...             \n",
       "Braund, Mr. Owen Harris                                       1    ...            S\n",
       "Cumings, Mrs. John Bradley (Florence Briggs Tha...            2    ...            C\n",
       "Heikkinen, Miss. Laina                                        3    ...            S\n",
       "Futrelle, Mrs. Jacques Heath (Lily May Peel)                  4    ...            S\n",
       "Allen, Mr. William Henry                                      5    ...            S\n",
       "\n",
       "[5 rows x 11 columns]"
      ]
     },
     "execution_count": 116,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_csv('../input/datasetsdifferent-format/data-titanic.csv', index_col=3)\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "4633edbff177eafc27d28f8b4a8431b5171a2d3a"
   },
   "source": [
    "### 4. Selection using Index"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 117,
   "metadata": {
    "_uuid": "4d1264022da5ca65d3f84d26eae66bfe04201b1f"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "PassengerId            1\n",
       "Survived               0\n",
       "Pclass                 3\n",
       "Sex                 male\n",
       "Age                   22\n",
       "SibSp                  1\n",
       "Parch                  0\n",
       "Ticket         A/5 21171\n",
       "Fare                7.25\n",
       "Cabin                NaN\n",
       "Embarked               S\n",
       "Name: Braund, Mr. Owen Harris, dtype: object"
      ]
     },
     "execution_count": 117,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.loc['Braund, Mr. Owen Harris',:]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0c96097582281be77a84791c2eac5b933c626ab8"
   },
   "source": [
    "### 5. Reset Index"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 118,
   "metadata": {
    "_uuid": "59f597ce7a1f4f97e0ede149035252d9a54232f4"
   },
   "outputs": [],
   "source": [
    "data.reset_index(inplace=True)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 119,
   "metadata": {
    "_uuid": "5bec37b4ea3aee8149751505cacbe8b246efd4d7"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Name</th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Braund, Mr. Owen Harris</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>A/5 21171</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Cumings, Mrs. John Bradley (Florence Briggs Th...</td>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>PC 17599</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Heikkinen, Miss. Laina</td>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>STON/O2. 3101282</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Futrelle, Mrs. Jacques Heath (Lily May Peel)</td>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>113803</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Allen, Mr. William Henry</td>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>373450</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                Name    ...     Embarked\n",
       "0                            Braund, Mr. Owen Harris    ...            S\n",
       "1  Cumings, Mrs. John Bradley (Florence Briggs Th...    ...            C\n",
       "2                             Heikkinen, Miss. Laina    ...            S\n",
       "3       Futrelle, Mrs. Jacques Heath (Lily May Peel)    ...            S\n",
       "4                           Allen, Mr. William Henry    ...            S\n",
       "\n",
       "[5 rows x 12 columns]"
      ]
     },
     "execution_count": 119,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.head()"
   ]
  },
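  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The index steps above form a round trip: set a column as the index, select by label, then reset. Here is a minimal sketch on a hypothetical `toy` frame (names and ages are made up), which also shows `reset_index(drop=True)` discarding the old index instead of restoring it as a column:\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy frame, used only for illustration\n",
    "toy = pd.DataFrame({'Name': ['Ann', 'Bob', 'Cy'],\n",
    "                    'Age': [31, 45, 27]})\n",
    "\n",
    "# set_index returns a new frame; toy itself is unchanged\n",
    "by_name = toy.set_index('Name')\n",
    "\n",
    "# Label-based lookup of one cell through the new index\n",
    "bob_age = by_name.loc['Bob', 'Age']\n",
    "\n",
    "# reset_index moves 'Name' back into a regular column\n",
    "restored = by_name.reset_index()\n",
    "\n",
    "# drop=True throws the old index away instead\n",
    "dropped = by_name.reset_index(drop=True)\n",
    "\n",
    "print(restored.columns.tolist(), dropped.columns.tolist())\n",
    "```\n",
    "\n",
    "Assigning the result to a new name, rather than using `inplace=True`, keeps the original frame available for comparison."
   ]
  },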
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "c877b36e639fb37d0cb33c47a75051ac927070d9"
   },
   "source": [
    "# 15.Merging and concatenating multiple data frames into one<a id=\"15\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://cdn-images-1.medium.com/max/1600/1*uG1vjoSQj7gMm8craCj2xA.png)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Concatenate DataFrames\n",
    "2. Concatenate using append()\n",
    "3. Concatenate on columns\n",
    "4. Merging DataFrames\n",
    "5. Left outer merge\n",
    "6. Right outer merge\n",
    "7. Full outer merge"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "d4778912dcb549a9628d5c9631d31f1e4a9b6cee"
   },
   "source": [
    "### 1. Concatenate DataFrames\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 120,
   "metadata": {
    "_uuid": "64a4397c872c4da6c87e687af394126cb6db70e6"
   },
   "outputs": [],
   "source": [
    "dataset1 = pd.DataFrame({'Age': ['32', '26', '29'],\n",
    "                         'Sex': ['F', 'M', 'F'],\n",
    "                         'State': ['CA', 'NY', 'OH']},\n",
    "                         index=['Jane', 'John', 'Cathy'])\n",
    "    \n",
    "dataset2 = pd.DataFrame({'Age': ['34', '23', '24', '21'],\n",
    "                         'Sex': ['M', 'F', 'F', 'F'],\n",
    "                         'State': ['AZ', 'OR', 'CA', 'WA']},\n",
    "                         index=['Dave', 'Kris', 'Xi', 'Jo'])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 121,
   "metadata": {
    "_uuid": "bf3f6d94018ee8d6957fe59f67e8c0db779ef4d5"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Age</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Jane</th>\n",
       "      <td>32</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>John</th>\n",
       "      <td>26</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cathy</th>\n",
       "      <td>29</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Dave</th>\n",
       "      <td>34</td>\n",
       "      <td>M</td>\n",
       "      <td>AZ</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Kris</th>\n",
       "      <td>23</td>\n",
       "      <td>F</td>\n",
       "      <td>OR</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Xi</th>\n",
       "      <td>24</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Jo</th>\n",
       "      <td>21</td>\n",
       "      <td>F</td>\n",
       "      <td>WA</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "      Age Sex State\n",
       "Jane   32   F    CA\n",
       "John   26   M    NY\n",
       "Cathy  29   F    OH\n",
       "Dave   34   M    AZ\n",
       "Kris   23   F    OR\n",
       "Xi     24   F    CA\n",
       "Jo     21   F    WA"
      ]
     },
     "execution_count": 121,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "pd.concat([dataset1, dataset2])"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "ebca9041c04ba97a409b10ca05c7e9d63fb8ea76"
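   },
   "source": [
    "### 2. Concatenate using append()\n",
    "\n",
    "**Note:** `DataFrame.append()` was deprecated in pandas 1.4 and removed in pandas 2.0. On newer versions, use `pd.concat([dataset1, dataset2])` instead."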
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 122,
   "metadata": {
    "_uuid": "fec8d4c508f50d2909e6e2722227330c09043383"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Age</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Jane</th>\n",
       "      <td>32</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>John</th>\n",
       "      <td>26</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cathy</th>\n",
       "      <td>29</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Dave</th>\n",
       "      <td>34</td>\n",
       "      <td>M</td>\n",
       "      <td>AZ</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Kris</th>\n",
       "      <td>23</td>\n",
       "      <td>F</td>\n",
       "      <td>OR</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Xi</th>\n",
       "      <td>24</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Jo</th>\n",
       "      <td>21</td>\n",
       "      <td>F</td>\n",
       "      <td>WA</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "      Age Sex State\n",
       "Jane   32   F    CA\n",
       "John   26   M    NY\n",
       "Cathy  29   F    OH\n",
       "Dave   34   M    AZ\n",
       "Kris   23   F    OR\n",
       "Xi     24   F    CA\n",
       "Jo     21   F    WA"
      ]
     },
     "execution_count": 122,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset1.append(dataset2)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b742b03b80842138fe6b0d94c18f80308fa851cd"
   },
   "source": [
    "### 3. Concatenate on columns\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 123,
   "metadata": {
    "_uuid": "c00a9eefe0597423206ad605b3ab3081a6fe488d"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Age</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>City</th>\n",
       "      <th>Work Status</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Jane</th>\n",
       "      <td>32</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>SF</td>\n",
       "      <td>No</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>John</th>\n",
       "      <td>26</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>NY</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cathy</th>\n",
       "      <td>29</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Columbus</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "      Age Sex State      City Work Status\n",
       "Jane   32   F    CA        SF          No\n",
       "John   26   M    NY        NY         Yes\n",
       "Cathy  29   F    OH  Columbus         Yes"
      ]
     },
     "execution_count": 123,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset1 = pd.DataFrame({'Age': ['32', '26', '29'],\n",
    "                         'Sex': ['F', 'M', 'F'],\n",
    "                         'State': ['CA', 'NY', 'OH']},\n",
    "                         index=['Jane', 'John', 'Cathy'])\n",
    "\n",
    "dataset2 = pd.DataFrame({'City': ['SF', 'NY', 'Columbus'],\n",
    "                         'Work Status': ['No', 'Yes', 'Yes']},\n",
    "                         index=['Jane', 'John', 'Cathy'])\n",
    "\n",
    "# axis=1 aligns rows by index label and places the columns side by side\n",
    "pd.concat([dataset1, dataset2], axis=1)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "9b00fda156ce029913258a97c359684589e740f8"
   },
   "source": [
    "### 4. Merging DataFrames (inner merge)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 124,
   "metadata": {
    "_uuid": "be24ce790b850058039192352401eba19387123b"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Name</th>\n",
       "      <th>Age</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>City</th>\n",
       "      <th>Work Status</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Jane</td>\n",
       "      <td>32</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>SF</td>\n",
       "      <td>No</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>John</td>\n",
       "      <td>26</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>NY</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Cathy</td>\n",
       "      <td>29</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Columbus</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    Name Age Sex State      City Work Status\n",
       "0   Jane  32   F    CA        SF          No\n",
       "1   John  26   M    NY        NY         Yes\n",
       "2  Cathy  29   F    OH  Columbus         Yes"
      ]
     },
     "execution_count": 124,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset1 = pd.DataFrame({'Name': ['Jane', 'John', 'Cathy', 'Sarah'],\n",
    "                         'Age': ['32', '26', '29', '23'],\n",
    "                         'Sex': ['F', 'M', 'F', 'F'],\n",
    "                         'State': ['CA', 'NY', 'OH', 'TX']})\n",
    "\n",
    "dataset2 = pd.DataFrame({'Name': ['Jane', 'John', 'Cathy', 'Rob'],\n",
    "                         'City': ['SF', 'NY', 'Columbus', 'Austin'],\n",
    "                         'Work Status': ['No', 'Yes', 'Yes', 'Yes']})\n",
    "\n",
    "# how='inner' keeps only the names present in both frames (Sarah and Rob are dropped)\n",
    "pd.merge(dataset1, dataset2, on='Name', how='inner')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "af2d844c0322be00375ad36cd06204874797bf68"
   },
   "source": [
    "### 5. Left outer merge"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 125,
   "metadata": {
    "_uuid": "65cee7d88741a07d6c91debd85c99f44d8d34ad0"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Name</th>\n",
       "      <th>Age</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>City</th>\n",
       "      <th>Work Status</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Jane</td>\n",
       "      <td>32</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>SF</td>\n",
       "      <td>No</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>John</td>\n",
       "      <td>26</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>NY</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Cathy</td>\n",
       "      <td>29</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Columbus</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Sarah</td>\n",
       "      <td>23</td>\n",
       "      <td>F</td>\n",
       "      <td>TX</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    Name Age Sex State      City Work Status\n",
       "0   Jane  32   F    CA        SF          No\n",
       "1   John  26   M    NY        NY         Yes\n",
       "2  Cathy  29   F    OH  Columbus         Yes\n",
       "3  Sarah  23   F    TX       NaN         NaN"
      ]
     },
     "execution_count": 125,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# how='left' keeps every row of dataset1; unmatched rows get NaN in the dataset2 columns\n",
    "pd.merge(dataset1, dataset2, on='Name', how='left')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8f3eb0a6dc738ecbcb0a632874649f7bc22617f3"
   },
   "source": [
    "### 6. Right outer merge"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 126,
   "metadata": {
    "_uuid": "6444c3a81886e643817b6ef4b6915449e738c8f0"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Name</th>\n",
       "      <th>Age</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>City</th>\n",
       "      <th>Work Status</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Jane</td>\n",
       "      <td>32</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>SF</td>\n",
       "      <td>No</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>John</td>\n",
       "      <td>26</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>NY</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Cathy</td>\n",
       "      <td>29</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Columbus</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Rob</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>Austin</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    Name  Age  Sex State      City Work Status\n",
       "0   Jane   32    F    CA        SF          No\n",
       "1   John   26    M    NY        NY         Yes\n",
       "2  Cathy   29    F    OH  Columbus         Yes\n",
       "3    Rob  NaN  NaN   NaN    Austin         Yes"
      ]
     },
     "execution_count": 126,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# how='right' keeps every row of dataset2; unmatched rows get NaN in the dataset1 columns\n",
    "pd.merge(dataset1, dataset2, on='Name', how='right')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "934197307fc81866012f68cfba9cb2a79601d062"
   },
   "source": [
    "### 7. Full outer merge"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 127,
   "metadata": {
    "_uuid": "ed462abb8da36b6db5c3359002634cb5990c96e0"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Name</th>\n",
       "      <th>Age</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>City</th>\n",
       "      <th>Work Status</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Jane</td>\n",
       "      <td>32</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>SF</td>\n",
       "      <td>No</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>John</td>\n",
       "      <td>26</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>NY</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Cathy</td>\n",
       "      <td>29</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Columbus</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Sarah</td>\n",
       "      <td>23</td>\n",
       "      <td>F</td>\n",
       "      <td>TX</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Rob</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>Austin</td>\n",
       "      <td>Yes</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    Name  Age  Sex State      City Work Status\n",
       "0   Jane   32    F    CA        SF          No\n",
       "1   John   26    M    NY        NY         Yes\n",
       "2  Cathy   29    F    OH  Columbus         Yes\n",
       "3  Sarah   23    F    TX       NaN         NaN\n",
       "4    Rob  NaN  NaN   NaN    Austin         Yes"
      ]
     },
     "execution_count": 127,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# how='outer' keeps the rows of both frames, filling NaN wherever one side has no match\n",
    "pd.merge(dataset1, dataset2, on='Name', how='outer')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "80a9f24149a1b275f338fb790a428fa65c40a474"
   },
   "source": [
    "# 16.Modifying a Pandas DataFrame inplace<a id=\"16\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Modify without inplace\n",
    "2. Modify inplace\n",
    "3. inplace not required for every method"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 128,
   "metadata": {
    "_uuid": "2be34010ca01b82e196940e77b6ba7fa891eb58f"
   },
   "outputs": [],
   "source": [
    "top_movies = pd.read_csv('../input/datasetsdifferent-format/data-movies-top-grossing.csv')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 129,
   "metadata": {
    "_uuid": "31f2f42bdc95d6f0df94d9b2d9f72bd2572c52b4"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Rank</th>\n",
       "      <th>Title</th>\n",
       "      <th>Worldwide gross</th>\n",
       "      <th>Year</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>Avatar</td>\n",
       "      <td>$2,787,965,087</td>\n",
       "      <td>2009</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>Titanic</td>\n",
       "      <td>$2,186,772,302</td>\n",
       "      <td>1997</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>Star Wars: The Force Awakens</td>\n",
       "      <td>$2,068,223,624</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>Jurassic World</td>\n",
       "      <td>$1,671,713,208</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>The Avengers</td>\n",
       "      <td>$1,518,812,988</td>\n",
       "      <td>2012</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   Rank                         Title Worldwide gross  Year\n",
       "0     1                        Avatar  $2,787,965,087  2009\n",
       "1     2                       Titanic  $2,186,772,302  1997\n",
       "2     3  Star Wars: The Force Awakens  $2,068,223,624  2015\n",
       "3     4                Jurassic World  $1,671,713,208  2015\n",
       "4     5                  The Avengers  $1,518,812,988  2012"
      ]
     },
     "execution_count": 129,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "top_movies.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "dbe6de69de84aebe54c9e5b90137686af2367698"
   },
   "source": [
    "### 1.Modify without inplace"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 130,
   "metadata": {
    "_uuid": "aa3a589d3cacd173160e4ab41ed319bb4c0b6646"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Title</th>\n",
       "      <th>Worldwide gross</th>\n",
       "      <th>Year</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Rank</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Avatar</td>\n",
       "      <td>$2,787,965,087</td>\n",
       "      <td>2009</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Titanic</td>\n",
       "      <td>$2,186,772,302</td>\n",
       "      <td>1997</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Star Wars: The Force Awakens</td>\n",
       "      <td>$2,068,223,624</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Jurassic World</td>\n",
       "      <td>$1,671,713,208</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>The Avengers</td>\n",
       "      <td>$1,518,812,988</td>\n",
       "      <td>2012</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                             Title Worldwide gross  Year\n",
       "Rank                                                    \n",
       "1                           Avatar  $2,787,965,087  2009\n",
       "2                          Titanic  $2,186,772,302  1997\n",
       "3     Star Wars: The Force Awakens  $2,068,223,624  2015\n",
       "4                   Jurassic World  $1,671,713,208  2015\n",
       "5                     The Avengers  $1,518,812,988  2012"
      ]
     },
     "execution_count": 130,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# set_index returns a new DataFrame; top_movies itself is left unchanged\n",
    "top_movies.set_index('Rank').head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 131,
   "metadata": {
    "_uuid": "78f2f099e83d56e611f5e72e2d4bea4890f2fa55"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Rank</th>\n",
       "      <th>Title</th>\n",
       "      <th>Worldwide gross</th>\n",
       "      <th>Year</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>Avatar</td>\n",
       "      <td>$2,787,965,087</td>\n",
       "      <td>2009</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>Titanic</td>\n",
       "      <td>$2,186,772,302</td>\n",
       "      <td>1997</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>Star Wars: The Force Awakens</td>\n",
       "      <td>$2,068,223,624</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>Jurassic World</td>\n",
       "      <td>$1,671,713,208</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>The Avengers</td>\n",
       "      <td>$1,518,812,988</td>\n",
       "      <td>2012</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   Rank                         Title Worldwide gross  Year\n",
       "0     1                        Avatar  $2,787,965,087  2009\n",
       "1     2                       Titanic  $2,186,772,302  1997\n",
       "2     3  Star Wars: The Force Awakens  $2,068,223,624  2015\n",
       "3     4                Jurassic World  $1,671,713,208  2015\n",
       "4     5                  The Avengers  $1,518,812,988  2012"
      ]
     },
     "execution_count": 131,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "top_movies.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "d7f287b63175e029edb46e3752d15d39b0298cb3"
   },
   "source": [
    "### 2.Modify inplace"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 132,
   "metadata": {
    "_uuid": "9cd60cf21b731e33d02621363dd1696ed781ceac"
   },
   "outputs": [],
   "source": [
    "# inplace=True modifies top_movies directly and returns None\n",
    "top_movies.set_index('Rank', inplace=True)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 133,
   "metadata": {
    "_uuid": "cf006af8d2caa042f7a7d77760c7678dbf22ad78"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Title</th>\n",
       "      <th>Worldwide gross</th>\n",
       "      <th>Year</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Rank</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Avatar</td>\n",
       "      <td>$2,787,965,087</td>\n",
       "      <td>2009</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Titanic</td>\n",
       "      <td>$2,186,772,302</td>\n",
       "      <td>1997</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Star Wars: The Force Awakens</td>\n",
       "      <td>$2,068,223,624</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Jurassic World</td>\n",
       "      <td>$1,671,713,208</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>The Avengers</td>\n",
       "      <td>$1,518,812,988</td>\n",
       "      <td>2012</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                             Title Worldwide gross  Year\n",
       "Rank                                                    \n",
       "1                           Avatar  $2,787,965,087  2009\n",
       "2                          Titanic  $2,186,772,302  1997\n",
       "3     Star Wars: The Force Awakens  $2,068,223,624  2015\n",
       "4                   Jurassic World  $1,671,713,208  2015\n",
       "5                     The Avengers  $1,518,812,988  2012"
      ]
     },
     "execution_count": 133,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "top_movies.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "e7590d95d8c4bd92a3a81f7ddadbf9edb561d0d1"
   },
   "source": [
    "\n",
    "### 3.inplace not required for every method"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 134,
   "metadata": {
    "_uuid": "c6ba1222cd5d7ca86baa4e3738dc1ffbf4d124e7"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Title</th>\n",
       "      <th>Worldwide gross</th>\n",
       "      <th>Release Year</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Rank</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Avatar</td>\n",
       "      <td>$2,787,965,087</td>\n",
       "      <td>2009</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Titanic</td>\n",
       "      <td>$2,186,772,302</td>\n",
       "      <td>1997</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Star Wars: The Force Awakens</td>\n",
       "      <td>$2,068,223,624</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Jurassic World</td>\n",
       "      <td>$1,671,713,208</td>\n",
       "      <td>2015</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>The Avengers</td>\n",
       "      <td>$1,518,812,988</td>\n",
       "      <td>2012</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                             Title Worldwide gross  Release Year\n",
       "Rank                                                            \n",
       "1                           Avatar  $2,787,965,087          2009\n",
       "2                          Titanic  $2,186,772,302          1997\n",
       "3     Star Wars: The Force Awakens  $2,068,223,624          2015\n",
       "4                   Jurassic World  $1,671,713,208          2015\n",
       "5                     The Avengers  $1,518,812,988          2012"
      ]
     },
     "execution_count": 134,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "top_movies.rename(columns = {'Year': 'Release Year'}).head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "6c0231e4bc9d9046c6ff70c0169416784f680a69"
   },
   "source": [
    "# 17.Removing columns from a pandas DataFrame <a id=\"17\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://i1.wp.com/cmdlinetips.com/wp-content/uploads/2018/04/How_To_Drop_Columns_in_Pandas.jpg)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Remove one column\n",
    "2. Remove more than one column\n",
    "3. Remove row(s)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0c27babb013188fb3bf535129a6b26fb32af1c07"
   },
   "source": [
    "### 1.Remove one column"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 135,
   "metadata": {
    "_uuid": "f3b8046794477494b6015b11b86eb6354b4c3e8b"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Name</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Braund, Mr. Owen Harris</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>A/5 21171</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cumings, Mrs. John Bradley (Florence Briggs Thayer)</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>PC 17599</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Heikkinen, Miss. Laina</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>STON/O2. 3101282</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Futrelle, Mrs. Jacques Heath (Lily May Peel)</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>113803</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Allen, Mr. William Henry</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>373450</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                    PassengerId    ...     Embarked\n",
       "Name                                                               ...             \n",
       "Braund, Mr. Owen Harris                                       1    ...            S\n",
       "Cumings, Mrs. John Bradley (Florence Briggs Tha...            2    ...            C\n",
       "Heikkinen, Miss. Laina                                        3    ...            S\n",
       "Futrelle, Mrs. Jacques Heath (Lily May Peel)                  4    ...            S\n",
       "Allen, Mr. William Henry                                      5    ...            S\n",
       "\n",
       "[5 rows x 11 columns]"
      ]
     },
     "execution_count": 135,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_csv('../input/datasetsdifferent-format/data-titanic.csv', index_col=3)\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 136,
   "metadata": {
    "_uuid": "62cd9b619226d98730e0b707b171496a48be0faf"
   },
   "outputs": [],
   "source": [
    "data.drop('Ticket', axis=1, inplace=True)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 137,
   "metadata": {
    "_uuid": "24ed3693a49e899f38a4d3ca1aee2089447eb5d4"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Name</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Braund, Mr. Owen Harris</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cumings, Mrs. John Bradley (Florence Briggs Thayer)</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Heikkinen, Miss. Laina</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Futrelle, Mrs. Jacques Heath (Lily May Peel)</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Allen, Mr. William Henry</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                    PassengerId    ...     Embarked\n",
       "Name                                                               ...             \n",
       "Braund, Mr. Owen Harris                                       1    ...            S\n",
       "Cumings, Mrs. John Bradley (Florence Briggs Tha...            2    ...            C\n",
       "Heikkinen, Miss. Laina                                        3    ...            S\n",
       "Futrelle, Mrs. Jacques Heath (Lily May Peel)                  4    ...            S\n",
       "Allen, Mr. William Henry                                      5    ...            S\n",
       "\n",
       "[5 rows x 10 columns]"
      ]
     },
     "execution_count": 137,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "92f05b2c741cb0af27fc3c4fc4bc384dddb77928"
   },
   "source": [
    "### 2.Remove more than one column"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 138,
   "metadata": {
    "_uuid": "8a01198341eaf828007f2a221fb53c2642669d3c"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Name</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Braund, Mr. Owen Harris</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Cumings, Mrs. John Bradley (Florence Briggs Thayer)</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Heikkinen, Miss. Laina</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Futrelle, Mrs. Jacques Heath (Lily May Peel)</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Allen, Mr. William Henry</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                    PassengerId    ...     Embarked\n",
       "Name                                                               ...             \n",
       "Braund, Mr. Owen Harris                                       1    ...            S\n",
       "Cumings, Mrs. John Bradley (Florence Briggs Tha...            2    ...            C\n",
       "Heikkinen, Miss. Laina                                        3    ...            S\n",
       "Futrelle, Mrs. Jacques Heath (Lily May Peel)                  4    ...            S\n",
       "Allen, Mr. William Henry                                      5    ...            S\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 138,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.drop(['Parch', 'Fare'], axis=1, inplace=True)\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "16a5e3418d703eacc3f080c35af0789d9f585b3f"
   },
   "source": [
    "### 3.Remove row(s)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 139,
   "metadata": {
    "_uuid": "132146b2e3471bb8f186e34e1133ca36c3aa5f73"
   },
   "outputs": [],
   "source": [
    "data.drop(['Braund, Mr. Owen Harris', 'Heikkinen, Miss. Laina'], inplace=True)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 140,
   "metadata": {
    "_uuid": "9ac61480f4ccb1e3ee1d6827cf014cd142bb0007"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Name</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>Cumings, Mrs. John Bradley (Florence Briggs Thayer)</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Futrelle, Mrs. Jacques Heath (Lily May Peel)</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Allen, Mr. William Henry</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Moran, Mr. James</th>\n",
       "      <td>6</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>male</td>\n",
       "      <td>NaN</td>\n",
       "      <td>0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>Q</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>McCarthy, Mr. Timothy J</th>\n",
       "      <td>7</td>\n",
       "      <td>0</td>\n",
       "      <td>1</td>\n",
       "      <td>male</td>\n",
       "      <td>54.0</td>\n",
       "      <td>0</td>\n",
       "      <td>E46</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                                                    PassengerId    ...     Embarked\n",
       "Name                                                               ...             \n",
       "Cumings, Mrs. John Bradley (Florence Briggs Tha...            2    ...            C\n",
       "Futrelle, Mrs. Jacques Heath (Lily May Peel)                  4    ...            S\n",
       "Allen, Mr. William Henry                                      5    ...            S\n",
       "Moran, Mr. James                                              6    ...            Q\n",
       "McCarthy, Mr. Timothy J                                       7    ...            S\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 140,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "52579047844c6045f74bc88afe0c296f83505733"
   },
   "source": [
    "# 18.Renaming columns in a pandas DataFrame <a id=\"18\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://image.slidesharecdn.com/datamanagementinpython-170925110242/95/data-management-in-python-19-638.jpg)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Rename columns while reading the data\n",
    "2. Rename columns using rename method \n",
    "    1. Read in the dataset again \n",
    "    2. Rename\n",
    "3. Rename all columns"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "f251685179b1422f9d701bb491fcf9a0f2bdf2ef"
   },
   "source": [
    "### 1.Rename columns while reading the data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 141,
   "metadata": {
    "_uuid": "054a502e1f47fc5623057f6336a38b610742adbf"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>Region ID</th>\n",
       "      <th>Region Name</th>\n",
       "      <th>State</th>\n",
       "      <th>City</th>\n",
       "      <th>County</th>\n",
       "      <th>Size Rank</th>\n",
       "      <th>Price</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Date</td>\n",
       "      <td>RegionID</td>\n",
       "      <td>RegionName</td>\n",
       "      <td>State</td>\n",
       "      <td>Metro</td>\n",
       "      <td>County</td>\n",
       "      <td>SizeRank</td>\n",
       "      <td>Price</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date Region ID   Region Name   ...          County Size Rank   Price\n",
       "0        Date  RegionID    RegionName   ...          County  SizeRank   Price\n",
       "1  2017-05-31      6181      New York   ...          Queens         0  672400\n",
       "2  2017-05-31     12447   Los Angeles   ...     Los Angeles         1  629900\n",
       "3  2017-05-31     17426       Chicago   ...            Cook         2  222700\n",
       "4  2017-05-31     13271  Philadelphia   ...    Philadelphia         3  137300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 141,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "list_columns = ['Date', 'Region ID', 'Region Name', 'State',\n",
    "             'City', 'County', 'Size Rank','Price']\n",
    "data = pd.read_csv('../input/datasetsdifferent-format/data-zillow1.csv', names = list_columns)\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "b3b904d313bafc80cd2d7bddc0c7905316ea63ac"
   },
   "source": [
    "### 2.Rename columns using rename method\n",
    "1. **Read in the dataset again**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 142,
   "metadata": {
    "_uuid": "18797cfa4e90c95b519a2553a7bfe3c6986143cb"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Price</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank   Price\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 142,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_csv('../input/datasetsdifferent-format/data-zillow1.csv')\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "881b4c87f918cc88090a4de2a204c326f76d799b"
   },
   "source": [
    "2. **Rename**"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 143,
   "metadata": {
    "_uuid": "d6b8eedbe9aea64f15200c1a8ebd986b0e278311"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Index(['Date', 'RegionID', 'RegionName', 'State', 'Metro', 'County',\n",
       "       'SizeRank', 'Price'],\n",
       "      dtype='object')"
      ]
     },
     "execution_count": 143,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.columns"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 144,
   "metadata": {
    "_uuid": "93bc39e18df54d181af4a41c107e4ab63b71fbd7"
   },
   "outputs": [],
   "source": [
    "data.rename(columns={'RegionName':'Region', 'Metro':'City'}, inplace=True)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 145,
   "metadata": {
    "_uuid": "0594eea707a3f91da75efe6fe8e6c0104bf6304f"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Index(['Date', 'RegionID', 'Region', 'State', 'City', 'County', 'SizeRank',\n",
       "       'Price'],\n",
       "      dtype='object')"
      ]
     },
     "execution_count": 145,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data.columns"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "62fcd30a3893e1c41c5c750e354af6381db43cbb"
   },
   "source": [
    "### 3.Rename all columns"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 146,
   "metadata": {
    "_uuid": "efdeb10a186cc08baf7b3707a4b7450b645f43fa"
   },
   "outputs": [],
   "source": [
    "data.columns = ['Date', 'Region ID', 'Region Name', 'State',\n",
    "             'City', 'County', 'Size Rank','Price']"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "91b6c91ff2c68acd847c7e17faf58cbfedad66af"
   },
   "source": [
    "# 19.Using groupby method <a id=\"19\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://i.stack.imgur.com/sgCn1.jpg)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Get Mean price for every State\n",
    "2. Split the data into groups\n",
    "3. Apply a function on each group and combine the results\n",
    "4. Get Descriptive statistics by Groups(States)\n",
    "5. Group by data on State and Region\n",
    "6. Get the number of records per State\n",
    "7. Group by Columns\n",
    "8. Iterate over Groups"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 147,
   "metadata": {
    "_uuid": "680791116e8b32a8b1113ee61f112daa78f5415c"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Date</th>\n",
       "      <th>RegionID</th>\n",
       "      <th>RegionName</th>\n",
       "      <th>State</th>\n",
       "      <th>Metro</th>\n",
       "      <th>County</th>\n",
       "      <th>SizeRank</th>\n",
       "      <th>Price</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>6181</td>\n",
       "      <td>New York</td>\n",
       "      <td>NY</td>\n",
       "      <td>New York</td>\n",
       "      <td>Queens</td>\n",
       "      <td>0</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>12447</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>CA</td>\n",
       "      <td>Los Angeles-Long Beach-Anaheim</td>\n",
       "      <td>Los Angeles</td>\n",
       "      <td>1</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>17426</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>IL</td>\n",
       "      <td>Chicago</td>\n",
       "      <td>Cook</td>\n",
       "      <td>2</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>13271</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>PA</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>Philadelphia</td>\n",
       "      <td>3</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>2017-05-31</td>\n",
       "      <td>40326</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>AZ</td>\n",
       "      <td>Phoenix</td>\n",
       "      <td>Maricopa</td>\n",
       "      <td>4</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "         Date  RegionID    RegionName   ...          County SizeRank   Price\n",
       "0  2017-05-31      6181      New York   ...          Queens        0  672400\n",
       "1  2017-05-31     12447   Los Angeles   ...     Los Angeles        1  629900\n",
       "2  2017-05-31     17426       Chicago   ...            Cook        2  222700\n",
       "3  2017-05-31     13271  Philadelphia   ...    Philadelphia        3  137300\n",
       "4  2017-05-31     40326       Phoenix   ...        Maricopa        4  211300\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 147,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "data = pd.read_csv('../input/datasetsdifferent-format/data-zillow1.csv')\n",
    "data.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "40f98ea89a62a40263102bbedee918c4c0fd4581"
   },
   "source": [
    "### 1.Get Mean price for every State"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 148,
   "metadata": {
    "_uuid": "3ca4482d813565afbce6cd830f7f709d5844a35e"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Price</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>State</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>AK</th>\n",
       "      <td>237783.333333</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AL</th>\n",
       "      <td>137645.637584</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AR</th>\n",
       "      <td>136331.707317</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AZ</th>\n",
       "      <td>232353.921569</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>CA</th>\n",
       "      <td>617425.392297</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "               Price\n",
       "State               \n",
       "AK     237783.333333\n",
       "AL     137645.637584\n",
       "AR     136331.707317\n",
       "AZ     232353.921569\n",
       "CA     617425.392297"
      ]
     },
     "execution_count": 148,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "grouped_data = data[['State', 'Price']].groupby('State').mean()\n",
    "grouped_data.head()"
   ]
  },
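  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The same aggregation can be tried on a small frame where the result is easy to check by hand. This is a minimal sketch: `toy` and its values are made up for illustration and are not part of the Zillow data.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy data; the column names mirror the frame used above.\n",
    "toy = pd.DataFrame({'State': ['CA', 'CA', 'NY', 'NY', 'TX'],\n",
    "                    'Price': [600000, 400000, 700000, 500000, 200000]})\n",
    "\n",
    "# Selecting the column before aggregating returns a Series indexed by State.\n",
    "mean_price = toy.groupby('State')['Price'].mean()\n",
    "\n",
    "# as_index=False keeps State as a regular column instead of the index.\n",
    "mean_price_flat = toy.groupby('State', as_index=False)['Price'].mean()\n",
    "print(mean_price)\n",
    "```\n",
    "\n",
    "Selecting `Price` before calling `mean()` avoids aggregating columns that are not needed."
   ]
  },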
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "e5c503ac2d0aa70f7bf78eca9b129afb9efc02e7"
   },
   "source": [
    "### 2.Split the data into groups"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 149,
   "metadata": {
    "_kg_hide-output": true,
    "_uuid": "83d1f18fa0cea7b12c84a8fd388e000e32b32aed"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>State</th>\n",
       "      <th>Price</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>NY</td>\n",
       "      <td>672400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>CA</td>\n",
       "      <td>629900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>IL</td>\n",
       "      <td>222700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>PA</td>\n",
       "      <td>137300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>AZ</td>\n",
       "      <td>211300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>NV</td>\n",
       "      <td>216500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>CA</td>\n",
       "      <td>572100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7</th>\n",
       "      <td>TX</td>\n",
       "      <td>164700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>9</th>\n",
       "      <td>FL</td>\n",
       "      <td>152300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>11</th>\n",
       "      <td>TX</td>\n",
       "      <td>321600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>12</th>\n",
       "      <td>MI</td>\n",
       "      <td>41500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>13</th>\n",
       "      <td>OH</td>\n",
       "      <td>128300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>14</th>\n",
       "      <td>TN</td>\n",
       "      <td>81100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>15</th>\n",
       "      <td>NC</td>\n",
       "      <td>183800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>17</th>\n",
       "      <td>MA</td>\n",
       "      <td>554600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>18</th>\n",
       "      <td>WA</td>\n",
       "      <td>670300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>19</th>\n",
       "      <td>MD</td>\n",
       "      <td>121100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>20</th>\n",
       "      <td>CO</td>\n",
       "      <td>383200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>21</th>\n",
       "      <td>DC</td>\n",
       "      <td>555900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>22</th>\n",
       "      <td>TN</td>\n",
       "      <td>228500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>23</th>\n",
       "      <td>WI</td>\n",
       "      <td>107900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>24</th>\n",
       "      <td>AZ</td>\n",
       "      <td>164800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>25</th>\n",
       "      <td>OR</td>\n",
       "      <td>417900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>26</th>\n",
       "      <td>OK</td>\n",
       "      <td>132700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>27</th>\n",
       "      <td>NE</td>\n",
       "      <td>152100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>28</th>\n",
       "      <td>NM</td>\n",
       "      <td>189600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>33</th>\n",
       "      <td>MO</td>\n",
       "      <td>121600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>34</th>\n",
       "      <td>VA</td>\n",
       "      <td>259500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>35</th>\n",
       "      <td>CO</td>\n",
       "      <td>251200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>36</th>\n",
       "      <td>GA</td>\n",
       "      <td>208100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>137</th>\n",
       "      <td>IN</td>\n",
       "      <td>108800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>138</th>\n",
       "      <td>NY</td>\n",
       "      <td>439700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>145</th>\n",
       "      <td>MS</td>\n",
       "      <td>52800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>148</th>\n",
       "      <td>KS</td>\n",
       "      <td>262600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>174</th>\n",
       "      <td>OR</td>\n",
       "      <td>232900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>183</th>\n",
       "      <td>NM</td>\n",
       "      <td>140300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>185</th>\n",
       "      <td>CT</td>\n",
       "      <td>164100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>189</th>\n",
       "      <td>DE</td>\n",
       "      <td>223200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>194</th>\n",
       "      <td>SC</td>\n",
       "      <td>129900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>199</th>\n",
       "      <td>UT</td>\n",
       "      <td>202100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>206</th>\n",
       "      <td>CT</td>\n",
       "      <td>161200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>229</th>\n",
       "      <td>MT</td>\n",
       "      <td>204800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>255</th>\n",
       "      <td>NH</td>\n",
       "      <td>215500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>275</th>\n",
       "      <td>ID</td>\n",
       "      <td>155100</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>290</th>\n",
       "      <td>ND</td>\n",
       "      <td>217800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>302</th>\n",
       "      <td>IA</td>\n",
       "      <td>131300</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>368</th>\n",
       "      <td>WY</td>\n",
       "      <td>205000</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>390</th>\n",
       "      <td>NH</td>\n",
       "      <td>251600</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>417</th>\n",
       "      <td>MS</td>\n",
       "      <td>92500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>453</th>\n",
       "      <td>AR</td>\n",
       "      <td>111500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>454</th>\n",
       "      <td>RI</td>\n",
       "      <td>223000</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>519</th>\n",
       "      <td>DE</td>\n",
       "      <td>135500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>547</th>\n",
       "      <td>HI</td>\n",
       "      <td>633700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>550</th>\n",
       "      <td>ND</td>\n",
       "      <td>263500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>561</th>\n",
       "      <td>MT</td>\n",
       "      <td>160500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>572</th>\n",
       "      <td>WY</td>\n",
       "      <td>178900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>842</th>\n",
       "      <td>AK</td>\n",
       "      <td>221000</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1007</th>\n",
       "      <td>WV</td>\n",
       "      <td>99400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3545</th>\n",
       "      <td>WV</td>\n",
       "      <td>96200</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7646</th>\n",
       "      <td>ME</td>\n",
       "      <td>73900</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>96 rows × 2 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "     State   Price\n",
       "0       NY  672400\n",
       "1       CA  629900\n",
       "2       IL  222700\n",
       "3       PA  137300\n",
       "4       AZ  211300\n",
       "5       NV  216500\n",
       "6       CA  572100\n",
       "7       TX  164700\n",
       "9       FL  152300\n",
       "11      TX  321600\n",
       "12      MI   41500\n",
       "13      OH  128300\n",
       "14      TN   81100\n",
       "15      NC  183800\n",
       "17      MA  554600\n",
       "18      WA  670300\n",
       "19      MD  121100\n",
       "20      CO  383200\n",
       "21      DC  555900\n",
       "22      TN  228500\n",
       "23      WI  107900\n",
       "24      AZ  164800\n",
       "25      OR  417900\n",
       "26      OK  132700\n",
       "27      NE  152100\n",
       "28      NM  189600\n",
       "33      MO  121600\n",
       "34      VA  259500\n",
       "35      CO  251200\n",
       "36      GA  208100\n",
       "...    ...     ...\n",
       "137     IN  108800\n",
       "138     NY  439700\n",
       "145     MS   52800\n",
       "148     KS  262600\n",
       "174     OR  232900\n",
       "183     NM  140300\n",
       "185     CT  164100\n",
       "189     DE  223200\n",
       "194     SC  129900\n",
       "199     UT  202100\n",
       "206     CT  161200\n",
       "229     MT  204800\n",
       "255     NH  215500\n",
       "275     ID  155100\n",
       "290     ND  217800\n",
       "302     IA  131300\n",
       "368     WY  205000\n",
       "390     NH  251600\n",
       "417     MS   92500\n",
       "453     AR  111500\n",
       "454     RI  223000\n",
       "519     DE  135500\n",
       "547     HI  633700\n",
       "550     ND  263500\n",
       "561     MT  160500\n",
       "572     WY  178900\n",
       "842     AK  221000\n",
       "1007    WV   99400\n",
       "3545    WV   96200\n",
       "7646    ME   73900\n",
       "\n",
       "[96 rows x 2 columns]"
      ]
     },
     "execution_count": 149,
     "metadata": {},
     "output_type": "execute_result"
    }
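   ],
   "source": [
    "# groupby without an aggregation returns a GroupBy object, not a DataFrame.\n",
    "grouped_data = data[['State', 'Price']].groupby('State')\n",
    "# head(2) on a GroupBy returns the first two rows of every group.\n",
    "grouped_data.head(2)"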
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8781b6bdac3c8297550211212502a70d7993363f"
   },
   "source": [
    "### 3.Apply a function on each group and combine the results"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 150,
   "metadata": {
    "_uuid": "f294ed2bc08f427ec94117ee933850b6bd352543"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Price</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>State</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>AK</th>\n",
       "      <td>237783.333333</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AL</th>\n",
       "      <td>137645.637584</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AR</th>\n",
       "      <td>136331.707317</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AZ</th>\n",
       "      <td>232353.921569</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>CA</th>\n",
       "      <td>617425.392297</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "               Price\n",
       "State               \n",
       "AK     237783.333333\n",
       "AL     137645.637584\n",
       "AR     136331.707317\n",
       "AZ     232353.921569\n",
       "CA     617425.392297"
      ]
     },
     "execution_count": 150,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "grouped_data.mean().head()"
   ]
  },
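  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Computing the mean is only one way to apply a function to each group. The sketch below uses a made-up `toy` frame (an assumption for illustration, not the Zillow data) to show three standard options: `agg` for several reductions at once, `apply` for an arbitrary callable, and `transform` for a result aligned with the original rows.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy data for illustration only.\n",
    "toy = pd.DataFrame({'State': ['CA', 'CA', 'NY', 'NY', 'TX'],\n",
    "                    'Price': [600000, 400000, 700000, 500000, 200000]})\n",
    "groups = toy.groupby('State')['Price']\n",
    "\n",
    "# agg applies several reductions at once: one row per group.\n",
    "summary = groups.agg(['mean', 'min', 'max'])\n",
    "\n",
    "# apply accepts any callable; here, the price range within each state.\n",
    "price_range = groups.apply(lambda s: s.max() - s.min())\n",
    "\n",
    "# transform returns a value for every original row, aligned by index.\n",
    "toy['StateMean'] = groups.transform('mean')\n",
    "print(summary)\n",
    "```\n",
    "\n",
    "`agg` and `apply` shrink the data to one row per group, while `transform` keeps the original shape, which is handy for adding a per-group statistic as a new column."
   ]
  },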
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "c5ec0b023db358e4f3e64be250186883946a949f"
   },
   "source": [
    "### 4.Get descriptive statistics by group (State)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 151,
   "metadata": {
    "_uuid": "5e6a4a59532dcb79e17c7a656d4e7f0c513a7e7e"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead tr th {\n",
       "        text-align: left;\n",
       "    }\n",
       "\n",
       "    .dataframe thead tr:last-of-type th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr>\n",
       "      <th></th>\n",
       "      <th colspan=\"8\" halign=\"left\">Price</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th></th>\n",
       "      <th>count</th>\n",
       "      <th>mean</th>\n",
       "      <th>std</th>\n",
       "      <th>min</th>\n",
       "      <th>25%</th>\n",
       "      <th>50%</th>\n",
       "      <th>75%</th>\n",
       "      <th>max</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>State</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>AK</th>\n",
       "      <td>12.0</td>\n",
       "      <td>237783.333333</td>\n",
       "      <td>41433.711205</td>\n",
       "      <td>175800.0</td>\n",
       "      <td>211700.0</td>\n",
       "      <td>222850.0</td>\n",
       "      <td>254950.0</td>\n",
       "      <td>323100.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AL</th>\n",
       "      <td>149.0</td>\n",
       "      <td>137645.637584</td>\n",
       "      <td>72538.539135</td>\n",
       "      <td>44700.0</td>\n",
       "      <td>103900.0</td>\n",
       "      <td>126400.0</td>\n",
       "      <td>155800.0</td>\n",
       "      <td>598900.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AR</th>\n",
       "      <td>82.0</td>\n",
       "      <td>136331.707317</td>\n",
       "      <td>42370.537394</td>\n",
       "      <td>65300.0</td>\n",
       "      <td>108175.0</td>\n",
       "      <td>128750.0</td>\n",
       "      <td>155050.0</td>\n",
       "      <td>268800.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>AZ</th>\n",
       "      <td>102.0</td>\n",
       "      <td>232353.921569</td>\n",
       "      <td>173068.589203</td>\n",
       "      <td>81500.0</td>\n",
       "      <td>148875.0</td>\n",
       "      <td>211950.0</td>\n",
       "      <td>258425.0</td>\n",
       "      <td>1611700.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>CA</th>\n",
       "      <td>701.0</td>\n",
       "      <td>617425.392297</td>\n",
       "      <td>604628.412673</td>\n",
       "      <td>74400.0</td>\n",
       "      <td>277000.0</td>\n",
       "      <td>453500.0</td>\n",
       "      <td>720200.0</td>\n",
       "      <td>6343800.0</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       Price                   ...                         \n",
       "       count           mean    ...           75%        max\n",
       "State                          ...                         \n",
       "AK      12.0  237783.333333    ...      254950.0   323100.0\n",
       "AL     149.0  137645.637584    ...      155800.0   598900.0\n",
       "AR      82.0  136331.707317    ...      155050.0   268800.0\n",
       "AZ     102.0  232353.921569    ...      258425.0  1611700.0\n",
       "CA     701.0  617425.392297    ...      720200.0  6343800.0\n",
       "\n",
       "[5 rows x 8 columns]"
      ]
     },
     "execution_count": 151,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "grouped_data.describe().head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "f9dd60e9e45889cd484d393e86f2998070308557"
   },
   "source": [
    "### 5.Group the data by State and Region"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 152,
   "metadata": {
    "_uuid": "c593862cc80afa26bb467c28284c4290fab23873"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th>Price</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>State</th>\n",
       "      <th>RegionName</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th rowspan=\"5\" valign=\"top\">AK</th>\n",
       "      <th>Anchor Point</th>\n",
       "      <td>175800.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Anchorage</th>\n",
       "      <td>293900.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Fairbanks</th>\n",
       "      <td>221000.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Juneau</th>\n",
       "      <td>323100.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Kenai</th>\n",
       "      <td>206500.0</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                       Price\n",
       "State RegionName            \n",
       "AK    Anchor Point  175800.0\n",
       "      Anchorage     293900.0\n",
       "      Fairbanks     221000.0\n",
       "      Juneau        323100.0\n",
       "      Kenai         206500.0"
      ]
     },
     "execution_count": 152,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "grouped_data = data[['State',\n",
    "                     'RegionName', \n",
    "                     'Price']].groupby(['State','RegionName']).mean()\n",
    "grouped_data.head()"
   ]
  },
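  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Grouping by two keys produces a result with a two-level `MultiIndex`. The sketch below, on a made-up `toy` frame (an assumption for illustration), shows how to look up values in such a result and how to flatten it back into ordinary columns.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy data; the column names mirror the frame used above.\n",
    "toy = pd.DataFrame({'State': ['CA', 'CA', 'CA', 'NY'],\n",
    "                    'RegionName': ['Los Angeles', 'Los Angeles', 'San Diego', 'New York'],\n",
    "                    'Price': [600000, 700000, 500000, 650000]})\n",
    "\n",
    "by_state_region = toy.groupby(['State', 'RegionName'])['Price'].mean()\n",
    "\n",
    "# Look up one (State, RegionName) pair with a tuple.\n",
    "la_mean = by_state_region.loc[('CA', 'Los Angeles')]\n",
    "# Passing only the first level returns every region within that state.\n",
    "ca_only = by_state_region.loc['CA']\n",
    "# reset_index turns the index levels back into regular columns.\n",
    "flat = by_state_region.reset_index()\n",
    "print(flat)\n",
    "```"
   ]
  },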
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "ad75a696ad9ad900080591936f83ca156547dfbb"
   },
   "source": [
    "### 6.Get the number of records per State"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 153,
   "metadata": {
    "_uuid": "daa4a9c149b05adb143c5f9fdf79383f422a228b"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "State\n",
       "AK     12\n",
       "AL    149\n",
       "AR     82\n",
       "AZ    102\n",
       "CA    701\n",
       "dtype: int64"
      ]
     },
     "execution_count": 153,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "grouped_data = data.groupby(['State']).size()\n",
    "grouped_data.head()"
   ]
  },
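  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "`size()` counts rows, so missing values are included. The sketch below contrasts it with `count()` and `value_counts()` on a made-up `toy` frame that contains one missing price (an assumption for illustration).\n",
    "\n",
    "```python\n",
    "import numpy as np\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy data with one missing Price.\n",
    "toy = pd.DataFrame({'State': ['CA', 'CA', 'NY', 'TX'],\n",
    "                    'Price': [600000, np.nan, 700000, 200000]})\n",
    "\n",
    "# size counts every row in the group, NaN included.\n",
    "rows_per_state = toy.groupby('State').size()\n",
    "# count only counts the non-null values of the selected column.\n",
    "prices_per_state = toy.groupby('State')['Price'].count()\n",
    "# value_counts gives the row counts sorted from most to least frequent.\n",
    "by_frequency = toy['State'].value_counts()\n",
    "print(rows_per_state)\n",
    "```"
   ]
  },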
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "3bf8f0e279a32206c1bc6f6d560b799b5761bc8e"
   },
   "source": [
    "### 7.Group by Columns"
   ]
  },
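  {
   "cell_type": "code",
   "execution_count": 154,
   "metadata": {
    "_kg_hide-output": true,
    "_uuid": "22e15f3bd5e04e9d6095f27bf1402a3c9ad258df"
   },
   "outputs": [],
   "source": [
    "# axis=1 groups the columns (here by dtype) instead of the rows.\n",
    "# Note: groupby(axis=1) is deprecated since pandas 2.1.\n",
    "grouped_data = data.groupby(data.dtypes, axis=1)\n",
    "# list(grouped_data)"
   ]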
  },
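  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Splitting the columns by dtype does not strictly require `groupby`. The sketch below is one possible alternative that avoids the `axis` argument: it builds a plain dictionary from `dtypes` and also uses `select_dtypes`. The `toy` frame and the variable names are assumptions for illustration.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy data with three different column dtypes.\n",
    "toy = pd.DataFrame({'State': ['CA', 'NY'],\n",
    "                    'SizeRank': [1, 0],\n",
    "                    'Price': [629900.0, 672400.0]})\n",
    "\n",
    "# Collect the column names that share each dtype.\n",
    "columns_by_dtype = {}\n",
    "for name, dtype in toy.dtypes.items():\n",
    "    columns_by_dtype.setdefault(str(dtype), []).append(name)\n",
    "\n",
    "# One sub-DataFrame per dtype.\n",
    "frames_by_dtype = {dtype: toy[cols] for dtype, cols in columns_by_dtype.items()}\n",
    "\n",
    "# select_dtypes is a shortcut when only one family of dtypes is needed.\n",
    "numeric = toy.select_dtypes(include='number')\n",
    "print(columns_by_dtype)\n",
    "```"
   ]
  },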
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "7ec5956bca29419aeec5405eb15aec3ef6f90df8"
   },
   "source": [
    "### 8.Iterate over Groups"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 155,
   "metadata": {
    "_uuid": "40d1f0fb681cf1361c8313cbcec4f5313030b23b"
   },
   "outputs": [],
   "source": [
    "# for state, grouped_data in data.groupby('State'):\n",
    "#     print(state, '\\n', grouped_data)"
   ]
  },
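  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Iterating over a GroupBy yields one `(key, sub-DataFrame)` pair per group. The sketch below runs the loop on a made-up `toy` frame (an assumption for illustration) so the output stays short, and uses `get_group` to fetch a single group without looping.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical toy data for illustration only.\n",
    "toy = pd.DataFrame({'State': ['CA', 'CA', 'NY', 'TX'],\n",
    "                    'Price': [600000, 400000, 700000, 200000]})\n",
    "\n",
    "# Each iteration yields the group key and the rows that belong to it.\n",
    "group_sizes = {}\n",
    "for state, group in toy.groupby('State'):\n",
    "    group_sizes[state] = len(group)\n",
    "    print(state, group['Price'].tolist())\n",
    "\n",
    "# get_group pulls out a single group by its key.\n",
    "ca_rows = toy.groupby('State').get_group('CA')\n",
    "```"
   ]
  },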
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "406201335624a1eecb0ba756eb1dfc8ce086e637"
   },
   "source": [
    "# 20.Work with dates and times data <a id=\"20\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://i.stack.imgur.com/Zfni3.jpg)\n",
    "\n",
    "### In this section, you will learn how to\n",
    "\n",
    "1. Convert the date column to datetime\n",
    "2. Set the date column as the index\n",
    "3. Filter and select time series data\n",
    "4. Get properties of date-time series data"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "ad9db367ed7ce00168d0e75ccdc11bffa7da6fd2"
   },
   "source": [
    "### 1.Convert the date column to datetime"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 156,
   "metadata": {
    "_uuid": "3cfa298bfacb5edf42263707a1de63ee04f2d686"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>DOB</th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>Name</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1976-06-01</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>Jane</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>1980-09-23</td>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>John</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>1984-03-30</td>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Cathy</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>1991-12-31</td>\n",
       "      <td>M</td>\n",
       "      <td>OR</td>\n",
       "      <td>Jo</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>1994-10-2</td>\n",
       "      <td>M</td>\n",
       "      <td>TX</td>\n",
       "      <td>Sam</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5</th>\n",
       "      <td>1973-11-11</td>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>Tai</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "          DOB Sex State   Name\n",
       "0  1976-06-01   F    CA   Jane\n",
       "1  1980-09-23   M    NY   John\n",
       "2  1984-03-30   F    OH  Cathy\n",
       "3  1991-12-31   M    OR     Jo\n",
       "4   1994-10-2   M    TX    Sam\n",
       "5  1973-11-11   F    CA    Tai"
      ]
     },
     "execution_count": 156,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset = pd.DataFrame({'DOB': ['1976-06-01', '1980-09-23', '1984-03-30', '1991-12-31', '1994-10-2', '1973-11-11'],\n",
    "                        'Sex': ['F', 'M', 'F', 'M', 'M', 'F'],\n",
    "                        'State': ['CA', 'NY', 'OH', 'OR', 'TX', 'CA'],\n",
    "                        'Name': ['Jane', 'John', 'Cathy', 'Jo', 'Sam', 'Tai']})\n",
    "dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 157,
   "metadata": {
    "_uuid": "00e01cd3d8ff09aeb8eb5a0b3b88525d69d4fde4"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "DOB      object\n",
       "Sex      object\n",
       "State    object\n",
       "Name     object\n",
       "dtype: object"
      ]
     },
     "execution_count": 157,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset.dtypes"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 158,
   "metadata": {
    "_uuid": "9a25af8566ccaa80905ba24754b045d7b3a442da"
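   },
   "outputs": [],
   "source": [
    "# Parse the DOB strings into datetime values so that date-based indexing and the .dt accessor work\n",
    "dataset['DOB'] = pd.to_datetime(dataset['DOB'])"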
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 159,
   "metadata": {
    "_uuid": "3fa6192a0fa60f4e38ce96854136703534da9a8b"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "DOB      datetime64[ns]\n",
       "Sex              object\n",
       "State            object\n",
       "Name             object\n",
       "dtype: object"
      ]
     },
     "execution_count": 159,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset.dtypes"
   ]
  },
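  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "When the input strings may be malformed, it can help to state the expected format and to decide what happens to values that do not parse. The sketch below is a minimal illustration on made-up data (the `raw` Series is hypothetical and not part of the dataset above); `format` and `errors` are standard `pd.to_datetime` parameters.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Hypothetical raw strings; the last one is not a valid date.\n",
    "raw = pd.Series(['1976-06-01', '1980-09-23', 'not a date'])\n",
    "\n",
    "# errors='coerce' turns unparseable entries into NaT instead of raising.\n",
    "parsed = pd.to_datetime(raw, format='%Y-%m-%d', errors='coerce')\n",
    "\n",
    "print(parsed.dtype)         # a datetime64 dtype\n",
    "print(parsed.isna().sum())  # number of values that failed to parse\n",
    "```\n",
    "\n",
    "Coercing to `NaT` keeps the column in a datetime dtype, so the bad rows can be inspected or dropped afterwards instead of stopping the conversion."
   ]
  },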
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "201617691e6b61854160fdfc79341cf8cca39657"
   },
   "source": [
    "### 2. Set the date column as the index"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 160,
   "metadata": {
    "_uuid": "d6d8b0ddaca8e65d4474a6c1ecc2c5b480af3c08"
   },
   "outputs": [],
   "source": [
    "dataset.set_index('DOB', inplace=True)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 161,
   "metadata": {
    "_uuid": "baac800ecdc0abf0f3b92f39b296992e16b5a5ff"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>Name</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>DOB</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1976-06-01</th>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>Jane</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1980-09-23</th>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>John</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1984-03-30</th>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Cathy</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1991-12-31</th>\n",
       "      <td>M</td>\n",
       "      <td>OR</td>\n",
       "      <td>Jo</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1994-10-02</th>\n",
       "      <td>M</td>\n",
       "      <td>TX</td>\n",
       "      <td>Sam</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1973-11-11</th>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>Tai</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "           Sex State   Name\n",
       "DOB                        \n",
       "1976-06-01   F    CA   Jane\n",
       "1980-09-23   M    NY   John\n",
       "1984-03-30   F    OH  Cathy\n",
       "1991-12-31   M    OR     Jo\n",
       "1994-10-02   M    TX    Sam\n",
       "1973-11-11   F    CA    Tai"
      ]
     },
     "execution_count": 161,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset"
   ]
  },
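  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The cell above modifies `dataset` in place. As a hedged alternative, `set_index` can also return a new DataFrame and leave the original untouched, which makes it easy to switch back with `reset_index`. The `frame`, `indexed` and `restored` names below are illustrative only.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "# Small stand-in for the dataset used in this section.\n",
    "frame = pd.DataFrame({\n",
    "    'DOB': pd.to_datetime(['1976-06-01', '1980-09-23']),\n",
    "    'Name': ['Jane', 'John'],\n",
    "})\n",
    "\n",
    "indexed = frame.set_index('DOB')   # returns a new DataFrame; frame is unchanged\n",
    "restored = indexed.reset_index()   # DOB becomes a regular column again\n",
    "\n",
    "print(type(indexed.index).__name__)\n",
    "print(list(restored.columns))\n",
    "```\n",
    "\n",
    "Because the column already holds datetime values, the resulting index is a `DatetimeIndex`, which is what enables the date-based selection in the next step."
   ]
  },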
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "46a8b030bc9c118b188c6528fe8f09764331a258"
   },
   "source": [
    "### 3. Filter and select time series data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 162,
   "metadata": {
    "_uuid": "c6dc828b3306a4533665df839897b912d00f305f"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>Name</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>DOB</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1980-09-23</th>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>John</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "           Sex State  Name\n",
       "DOB                       \n",
       "1980-09-23   M    NY  John"
      ]
     },
     "execution_count": 162,
     "metadata": {},
     "output_type": "execute_result"
    }
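   ],
   "source": [
    "# Partial string indexing: select every row whose DOB falls in 1980.\n",
    "# Use .loc for this; newer pandas versions no longer accept dataset['1980'] for row selection.\n",
    "dataset.loc['1980']"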
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 163,
   "metadata": {
    "_uuid": "bc4112803f8e0ac0c6ab4af7e917f970d207203f"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>Name</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>DOB</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1980-09-23</th>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>John</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1984-03-30</th>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Cathy</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1991-12-31</th>\n",
       "      <td>M</td>\n",
       "      <td>OR</td>\n",
       "      <td>Jo</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1994-10-02</th>\n",
       "      <td>M</td>\n",
       "      <td>TX</td>\n",
       "      <td>Sam</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "           Sex State   Name\n",
       "DOB                        \n",
       "1980-09-23   M    NY   John\n",
       "1984-03-30   F    OH  Cathy\n",
       "1991-12-31   M    OR     Jo\n",
       "1994-10-02   M    TX    Sam"
      ]
     },
     "execution_count": 163,
     "metadata": {},
     "output_type": "execute_result"
    }
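   ],
   "source": [
    "# Everything from the start of 1980 onward.\n",
    "# Note: the DOB index is not sorted here; recent pandas versions may raise a KeyError\n",
    "# for such slices unless the index is sorted first (dataset.sort_index()).\n",
    "dataset.loc['1980':]"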
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 164,
   "metadata": {
    "_uuid": "bd0dd8ffec076cd9bf0ae07d46d00d0d2ef4691a"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>Name</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>DOB</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1976-06-01</th>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>Jane</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1980-09-23</th>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>John</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1973-11-11</th>\n",
       "      <td>F</td>\n",
       "      <td>CA</td>\n",
       "      <td>Tai</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "           Sex State  Name\n",
       "DOB                       \n",
       "1976-06-01   F    CA  Jane\n",
       "1980-09-23   M    NY  John\n",
       "1973-11-11   F    CA   Tai"
      ]
     },
     "execution_count": 164,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Everything up to and including the end of 1980\n",
    "dataset.loc[:'1980']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 165,
   "metadata": {
    "_uuid": "6db3cf6c78056e896de48a27f9eb42129bfbaeef"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Sex</th>\n",
       "      <th>State</th>\n",
       "      <th>Name</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>DOB</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1980-09-23</th>\n",
       "      <td>M</td>\n",
       "      <td>NY</td>\n",
       "      <td>John</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1984-03-30</th>\n",
       "      <td>F</td>\n",
       "      <td>OH</td>\n",
       "      <td>Cathy</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "           Sex State   Name\n",
       "DOB                        \n",
       "1980-09-23   M    NY   John\n",
       "1984-03-30   F    OH  Cathy"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
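   ],
   "source": [
    "# Both endpoints of a label slice are inclusive: 1980 through the end of 1984\n",
    "display(dataset.loc['1980':'1984'])\n",
    "# Turn DOB back into a regular column for the .dt examples below\n",
    "dataset.reset_index(inplace=True)"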
   ]
  },
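  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Label slicing on dates is most predictable when the `DatetimeIndex` is sorted. The following is a minimal sketch on a small made-up frame (`people` is a hypothetical name), assuming the index is sorted before any slicing.\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "people = pd.DataFrame(\n",
    "    {'Name': ['Jane', 'John', 'Cathy', 'Tai']},\n",
    "    index=pd.to_datetime(['1976-06-01', '1980-09-23', '1984-03-30', '1973-11-11']),\n",
    ")\n",
    "\n",
    "# Sorting makes the DatetimeIndex monotonic, which label slicing relies on.\n",
    "people = people.sort_index()\n",
    "\n",
    "born_1980 = people.loc['1980']         # every row in calendar year 1980\n",
    "up_to_1980 = people.loc[:'1980']       # start of the data through the end of 1980\n",
    "eighties = people.loc['1980':'1984']   # both endpoints are inclusive\n",
    "\n",
    "print(born_1980)\n",
    "print(up_to_1980)\n",
    "print(eighties)\n",
    "```\n",
    "\n",
    "With a sorted index the rows come back in chronological order, so the 1973 entry appears first in `up_to_1980`."
   ]
  },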
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "a7ee8354ea6a871c0771f742c21b9a20a735ec7c"
   },
   "source": [
    "### 4. Get properties of datetime data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 166,
   "metadata": {
    "_uuid": "66770eb5818b751ea0e4a5f39f2ff00e849983b8"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0    153\n",
       "1    267\n",
       "2     90\n",
       "3    365\n",
       "4    275\n",
       "5    315\n",
       "Name: DOB, dtype: int64"
      ]
     },
     "execution_count": 166,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset.DOB.dt.dayofyear"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 167,
   "metadata": {
    "_uuid": "4a3c170f7ce93185ca02e63c72d39ab975376709"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0    Tuesday\n",
       "1    Tuesday\n",
       "2     Friday\n",
       "3    Tuesday\n",
       "4     Sunday\n",
       "5     Sunday\n",
       "Name: DOB, dtype: object"
      ]
     },
     "execution_count": 167,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Series.dt.weekday_name was removed in pandas 1.0; day_name() returns the same labels\n",
    "dataset.DOB.dt.day_name()"
   ]
  },
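  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The `.dt` accessor exposes many more components than the two shown above. As a hedged sketch on made-up dates, the block below collects a few of them and derives an age in whole years relative to a fixed reference date (`ref` is an arbitrary value chosen for illustration).\n",
    "\n",
    "```python\n",
    "import pandas as pd\n",
    "\n",
    "dob = pd.Series(pd.to_datetime(['1976-06-01', '1980-09-23', '1984-03-30']))\n",
    "\n",
    "# A few components available through the .dt accessor.\n",
    "parts = pd.DataFrame({\n",
    "    'year': dob.dt.year,\n",
    "    'month': dob.dt.month,\n",
    "    'day_name': dob.dt.day_name(),\n",
    "    'is_leap_year': dob.dt.is_leap_year,\n",
    "})\n",
    "\n",
    "# Age in whole years on a fixed, made-up reference date.\n",
    "ref = pd.Timestamp('2020-01-01')\n",
    "had_birthday = (dob.dt.month < ref.month) | (\n",
    "    (dob.dt.month == ref.month) & (dob.dt.day <= ref.day)\n",
    ")\n",
    "# Subtract one year for anyone whose birthday has not yet occurred in ref's year.\n",
    "age = ref.year - dob.dt.year - (~had_birthday).astype(int)\n",
    "\n",
    "print(parts)\n",
    "print(age)\n",
    "```\n",
    "\n",
    "Comparing month and day explicitly avoids dividing a day count by 365, which drifts across leap years."
   ]
  },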
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "7ca27b11d576c37de6a858959d766cb4ca318ce5"
   },
   "source": [
    "# 21. Choosing the colors for the plots <a id=\"211\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://i.stack.imgur.com/dLUh4.png)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Color Palettes\n",
    "2. See how these colors look on a plot\n",
    "3. Change the color palette\n",
    "4. Impact on the plot\n",
    "5. seaborn palettes\n",
    "6. matplotlib colormaps as color palettes\n",
    "7. Let's set the palette to a matplotlib colormap\n",
    "8. Impact on the plot\n",
    "9. Building custom color palettes\n",
    "10. Let's see how the plot has changed"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 168,
   "metadata": {
    "_uuid": "3f01ce87ab64d504a3f3f2c78c811c19c309783a"
   },
   "outputs": [],
   "source": [
    "import pandas as pd\n",
    "from matplotlib import pyplot as plt\n",
    "%matplotlib inline\n",
    "import seaborn as sns"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 169,
   "metadata": {
    "_uuid": "1905d0955e8c6ac278ec93bcc21072ade4af5ee1"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>country</th>\n",
       "      <th>beer_servings</th>\n",
       "      <th>spirit_servings</th>\n",
       "      <th>wine_servings</th>\n",
       "      <th>total_litres_of_pure_alcohol</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Afghanistan</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Albania</td>\n",
       "      <td>89</td>\n",
       "      <td>132</td>\n",
       "      <td>54</td>\n",
       "      <td>4.9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Algeria</td>\n",
       "      <td>25</td>\n",
       "      <td>0</td>\n",
       "      <td>14</td>\n",
       "      <td>0.7</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Andorra</td>\n",
       "      <td>245</td>\n",
       "      <td>138</td>\n",
       "      <td>312</td>\n",
       "      <td>12.4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Angola</td>\n",
       "      <td>217</td>\n",
       "      <td>57</td>\n",
       "      <td>45</td>\n",
       "      <td>5.9</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       country              ...               total_litres_of_pure_alcohol\n",
       "0  Afghanistan              ...                                        0.0\n",
       "1      Albania              ...                                        4.9\n",
       "2      Algeria              ...                                        0.7\n",
       "3      Andorra              ...                                       12.4\n",
       "4       Angola              ...                                        5.9\n",
       "\n",
       "[5 rows x 5 columns]"
      ]
     },
     "execution_count": 169,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/data-alcohol.csv')\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "6fe753936d55df6c5f52499ae791941ad4a2a5fe"
   },
   "source": [
    "### 1. Color Palettes"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 170,
   "metadata": {
    "_uuid": "58d4db02168f058ba50c0a33754ad0ccffa6718d"
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAkMAAABLCAYAAABz9YPfAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAArBJREFUeJzt3aFqlmEYxvH7FedWFkQHDkQZw+aCuKqIJyFosO0UPASbWCxiswkegh6BNkUwDJsMJsKSYfJYLBY/xnh49r3X79cGT7jutD+8g02ttQIASHVu9AAAgJHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHOL3owTdNeVe1VVU0ra7dXLl3tPmqUnWl/9ISuPq9eGD2hm+3voxf0dbR+bfSErtrvg9ETurq4emX0hK4Op6PRE7pZX/8xekJX+7U9ekJXx1+/HLbWNha9m07y7zhWN2+0zcfPTzXsLPu29nD0hK52tub7C/XN0+PRE7p6f+/F6Ald/fr5bPSErh5sPRk9oatXa+9GT+jmzt3Xoyd09Wh6O3pCVwf3b31sre0ueuczGQAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANHEEAAQTQwBANGm1tr/H0zTXlXt/f3xZlV96j1qoMtVdTh6RCdzvq3KfcvOfctrzrdVuW/ZXW+tbSx6tDCG/nk8TR9aa7unmnWGzfm+Od9W5b5l577lNefbqtyXwmcyACCaGAIAop00hl52WXF2zPm+Od9W5b5l577lNefbqtwX4UR/MwQAMDc+kwEA0cQQABBNDAEA0cQQABBNDAEA0f4Az7tyiUpEylQAAAAASUVORK5CYII=\n",
      "text/plain": [
       "<Figure size 720x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.palplot(sns.color_palette())"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "500d473161642155f8a2a92e4840ac114bfc1afd"
   },
   "source": [
    "### 2. See how these colors look on a plot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 171,
   "metadata": {
    "_uuid": "39e43519a1a60effe578d8ec7aebdba9c5bc86ac"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1080x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize = (15,8))\n",
    "sns.set()\n",
    "sns.boxplot(data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "ea47638fc9f2602c00ebfb1864fc3a52bbffa084"
   },
   "source": [
    "### 3. Change the color palette"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 172,
   "metadata": {
    "_uuid": "9dd35f4075898f525a08d767cbec18803ed04561"
   },
   "outputs": [],
   "source": [
    "sns.set_palette(\"bright\")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "749f3b7bd11d7fd8a75ad57d3dd39a90cf823ec6"
   },
   "source": [
    "### 4. Impact on the plot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 173,
   "metadata": {
    "_uuid": "6b66473d19da8a439373cea4fd662b56fb11e769"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1080x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize = (15,8))\n",
    "sns.boxplot(data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8f543d3b341609a616ef67e751a18e9a68debc04"
   },
   "source": [
    "### 5. seaborn palettes"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 174,
   "metadata": {
    "_uuid": "7f20a42913a493573c39f1fc5b849c268428ea70"
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAjRJREFUeJzt2MGKjXEcx+F3dJpBw2AaO27Bxh0oOzdgY6UUZSMWZslC2Qg1kYVrsLax5R6U5UyDQzFTU8cdqDnN6zON59n/Ot/6d/rUuzCbzWYDAPxjx+oBAPyfBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEpN5D+8//zBsT3cOcsuh8Wb96vDlxa16xmgu3tkYbr97WM8Yzctrj4dPN4/m+11+vTE8e/S+njGau+tXhrdP7tUzRnPjwdPh86uP9YxRTJYXhwvXL+3vZt4f257uDJvffs97fujtTbfqCaPa+vW1njCq3c2j+37TI/y/G4Zh+Pl9u54wqr0fu/WEQ8MnOAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASEzmPVxdOX6QOw6dycpaPWFUayfP1RNGtXT+6L7fytkT9YRRnTqzWk8Y1eT0Uj1hFJPlxX3fLMxms9kIWwDgr3yCAyAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQ+AMuEjiDXA7XCQAAAABJRU5ErkJggg==\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAjNJREFUeJzt2L9uzmEch+Gf/tEYtKKNGIwmicEpOAejQcRkVGEREYuExCQGg4jROZjtRhabSNu0NQgVrzOQtOnjbriu/Zt8nulOnmOz2Ww2AcBfNlcPAOD/JEAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkFg46OGt5x+mjd29w9xyZLy6c2HaeXatnjHMys2X0733d+sZwzy8+Gj6uL5ezxji/JMn0+v7b+sZw1x9cHl6cftKPWOYG4/fTJ+evqtnDDG/vDSdu35pXzcHDtDG7t70ZfvfDNA0TdOvnS/1hKG2fmzWE4ba29ioJwzzdetbPWGo3c3P9YShfm5/ryccGb7gAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgsXDQw7XlxcPcceTMrZypJwx1+vhqPWGoxbW1esIwJ0+fqCcMtbx6tp4w1MKppXrCEPPL+3/XsdlsNhuwBQD+yBccAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECIDEbxKgOIoYpMmkAAAAAElFTkSuQmCC\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAh1JREFUeJzt3D1OVkEAhtFBwHwg0cpYuQErF0As3AIWLMDaxVi7AAutTSysWAMbsNFSw19Act2BCcTxIZ/n9JO8N1M8yRR3Y1mWZQDAP3avHgDA/0mAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiCxdduDn47Px9nlev7H9OD57hhf3tYz5nn5Zrz7/rFeMc3rJwdjfHhfz5jj1eE4PqpHzPNsf4yvR2t6d2OMp/uH4+rzt3rGHKvNsf3i8Y2O3DpAZ5fLOF3TAI0xxjj/US+Y6uf1aT1hrpOTesE0Vxf1grl+Xazv3Y0xxji/rhfcGZ7gAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgsXXbg7v3N/7mjrtn51G9YKqHmw/qCXPt7dULptle1Qvm2lqt792NMcbY2awXzLG6+XdtLMuyTJgCAH/kCQ6AhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQOI3jbwxliF7xuUAAAAASUVORK5CYII=\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAiVJREFUeJzt2LFKV2Ecx+FXE1skAlHCIp2bampqbQ/aG4Kmtu4i6AbaGoOgvTtwa2oMjZTQJaqloE53EPjH08fsefYffOHw8oGzNE3TNADgL1uuBwDwfxIgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEisLHp45/EYB8enOeXseP9yjPFsp54xnyd74/bbe/WK2ezefD2Od27UM2axsfduPL97WM+YzaM3W+PVg516xmzuv9gbXx/u1jNmsbS+Otae3jrRzcIBOjgeY//Totf/gM/79YJZffxxnj/eGL/2P9QTZvPl8Gc9YVbfjs7325uOvtcTzgy/4ABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECILGy6OHVjdOccQZd3q4XzOra6pV6wqyWt6/XE2ZzaetCPWFWa5vn++0tbV6sJ8xiaX315DfTNE0zbAGAP/ILDoCEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRA4jfoMDKA6IgrpQAAAABJRU5ErkJggg==\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAjFJREFUeJzt2D9LlmEchuHHcMnQEMT+YNbW0tAatQbREm2tDa3SF3DyG7gKzU2NLc3h2uDSpvZSBoIlROPbNwgUn06x49h/cC03J9wz0+l0OgDAP3apHgDA/0mAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAxe9rDR8/fDJOD47Pccm7sbr8ePry4V88YzeO3O8P9jQf1jNF8Wt8etu7eqWeM4tXn3WHt4UY9YzSbH9eHtacX9+1tvt8Z3j3brGeMYm55fniy9fJEN6cO0OTgeNib/Djt+bn3+/t+PWFUX44m9YRRHe/v1RNGczg5qieM6vDrxX57v779rCecG77gAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgMXvaw5XrC2e549y5fG21njCqW4sr9YRRLazerieMZmllsZ4wqqWbF/vtXblxtZ4wirnl+RPfzEyn0+kIWwDgr3zBAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABI/AF9czh/Zv/UFwAAAABJRU5ErkJggg==\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAjBJREFUeJzt2M+KTmEAx/Ez5jVN8jesJFKzlAsgK7fgArgFyWLWFpKdNRfgDqy5AFlOYdKUBY1BTGP0WtmqeZvjO70+n/1Tv9Pp9D09C9PpdDoAwD92qB4AwP9JgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgMZn14LXHL4aNre393HJgvF29Mbx/slLPGM3522vDpWf36xmjeXNzdVi7c7GeMYqVR++G1w9e1jNGc/ne1eHV07v1jNFcufVw2H3+oZ4xjuXFYXL97J6OzBygja3tYX3zx6zHD7zdL+v1hFGtf9usJ4zq58f5fX87n+fzx++Pna+f6gnj+v6rXnBguIIDICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECIDEZNaD504s7+eOA2dy/EI9YVQXjp6qJ4zq8Jn5fX9LJ+f721s6drqeMK4ji/WCcSzv/bkWptPpdIQpAPBXruAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACR+A0IQN28HjP48AAAAAElFTkSuQmCC\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.palplot(sns.color_palette(\"deep\", 7))\n",
    "sns.palplot(sns.color_palette(\"muted\", 7))\n",
    "sns.palplot(sns.color_palette(\"pastel\", 7))\n",
    "\n",
    "sns.palplot(sns.color_palette(\"bright\", 7))\n",
    "sns.palplot(sns.color_palette(\"dark\", 7))\n",
    "sns.palplot(sns.color_palette(\"colorblind\", 7))"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "dc1def773ac8561ab75b5c890168ce95f428fd0f"
   },
   "source": [
    "### 6. matplotlib colormaps as color palettes"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 175,
   "metadata": {
    "_uuid": "7494f9284274a1974f9286864cbde3d1af4fb7f4"
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAjFJREFUeJzt3LFqU2EcxuHvtLE9ugQyGOjSXW/AQURw9BbcnIsX0K2QK+g99AaKiy6OBWfdXSLpEOhST6L1eAdCQj/fcHye/Q/v8vE7ZEjT931fAOAf20sPAOD/JEAARAgQABECBECEAAEQIUAARAgQABECBECEAAEQIUAARAgQABECBEDEaNvDz2/eltXi+j637IznHy/L4vw0PaOa6cms/PzyKT2jmgdPX5au69Izqmjbtlx9W6ZnVPPseFJO339Nz6hm9vpJeXX2IT2jium4LRfvXmx0s3WAVovr0s2/b3u+8+5uhvvISymlrH+kF1Q15D957379Tk+oanm7Tk+oar4c9tvbhJ/gAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiACAECIEKAAIgQIAAiBAiAiNG2h4fTx/e5Y+fsjyfpCXUdPEwvqKppmvSEatrRsL8bJ48O0hOqOpoM8+1Nx+3GN03f932FLQDwV8P+lAJgZwkQABECBECEAAEQIUAARAgQABECBECEAAEQIUAARAgQABECBECEAAEQ8Qe1ETeQS+UBxAAAAABJRU5ErkJggg==\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAaAAAABQCAYAAABf06H0AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAjZJREFUeJzt2D2rznEcx/H/4XCkk7sOdcpuVRaPwYLFYLYwUiYsTAaT2JTFYHEzeAgyWo2mU8fpuElyV5dnoFz5eV/l9dq/9Zl+7/otzWaz2QQA/9iOegAA/ycBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECIDE8ryHF67cmDa3tv/mloXx/OHd6fTl2/WMYZ7duTqdufW4njHM02vnprP3X9Uzhnhy8eR06dHresYw984fn26+eFPPGOb6qWPTg5dv6xlDrK7snM6dOPpHN3MHaHNre9rYfDfv+cLb2PpQTxhq4/3nesJQGx+/1ROGeff5ez1hqPdfftQThvr09Wc9YWH4ggMgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEBCgABICBAACQECICFAACQECICEAAGQECAAEgIEQEKAAEgIEAAJAQIgIUAAJAQIgMTyvIdH1g79zR0LZ33tQD1hqPWDq/WEodb3r9QThjm8urueMNTBvbvqCUPt2zP3s7vQVld2/vHN0mw2mw3YAgC/5QsOgIQAAZAQIAASAgRAQoAASAgQAAkBAiAhQAAkBAiAhAABkBAgABICBEDiF2r0OG+JC+JEAAAAAElFTkSuQmCC\n",
      "text/plain": [
       "<Figure size 504x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.palplot(sns.color_palette(\"RdBu\", 7))\n",
    "sns.palplot(sns.color_palette(\"Blues_d\", 7))"
   ]
  },
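  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "A colormap is continuous, so turning one into a palette means picking a fixed number of colors along it. The snippet below is only a rough sketch of that idea and is not seaborn's implementation: it assumes a plain linear ramp between two made-up endpoint colors and samples evenly spaced interior points. The name `sample_ramp` and the white-to-blue endpoints are hypothetical.\n",
    "\n",
    "```python\n",
    "def sample_ramp(start, end, n_colors):\n",
    "    '''Return n_colors RGB tuples evenly spaced between two RGB endpoints.'''\n",
    "    # Interior positions only: 0 and 1 are skipped so the extremes are left out.\n",
    "    positions = [(i + 1) / (n_colors + 1) for i in range(n_colors)]\n",
    "    return [\n",
    "        tuple(s + (e - s) * t for s, e in zip(start, end))\n",
    "        for t in positions\n",
    "    ]\n",
    "\n",
    "# Hypothetical endpoints: white to pure blue.\n",
    "ramp = sample_ramp((1.0, 1.0, 1.0), (0.0, 0.0, 1.0), 7)\n",
    "print(len(ramp))\n",
    "```\n",
    "\n",
    "Asking for more colors simply places the sample points closer together along the same ramp."
   ]
  },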
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0785273fc541cdb2c442b5693e8ee055899c1923"
   },
   "source": [
    "### 7. Let's set the palette to a Matplotlib colormap"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 176,
   "metadata": {
    "_uuid": "737791f09cfa858a87602fcb17a96953ffc735ee"
   },
   "outputs": [],
   "source": [
    "sns.set_palette(\"Blues_d\")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "04fd24c235edfe8b903abd441ff387a8afc68fc1"
   },
   "source": [
    "### 8. Impact on the plot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 177,
   "metadata": {
    "_uuid": "68ead4cfe19783677e31bf4b7ba19f486113e290"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1080x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize = (15,8))\n",
    "sns.boxplot(data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0eaf3c037139d44d978aa5be97ed5be1567afeff"
   },
   "source": [
    "### 9. Building custom color palettes"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 178,
   "metadata": {
    "_uuid": "519e93cf69bda9f295852d0691c5d7df1de6a160"
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAWkAAABQCAYAAADbeYSfAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAAdVJREFUeJzt2DFq3VAQQNHn4DZgSCv4WkjWFNJkH1mBvRkvRB9cGQwJXoDcpUth8YUu4pxC3cBM8W6hu3Vd1wFA0pejFwDg/0QaIEykAcJEGiBMpAHCRBogTKQBwkQaIEykAcJEGiBMpAHCRBog7H7r4K/vT+Pt5f2Wu2Q8Lj/GPB+9xX6WZYx5zEevsYvl33c+cIs9LWP8no9eYj8/l3HaxzdNYzw/f3psc6TfXt7H6/Xv1vG86/XoDfZ1HSc/8Mz3/TnxbWOc//F9kt8dAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBhIg0QJtIAYSINECbSAGEiDRAm0gBh91sHv01fb7lHzuVy9Ab7uoyTH3jm+x5OfNsY531807Rp7G5d1/XGqwBwI353AISJNECYSAOEiTRAmEgDhIk0QJhIA4SJNECYSAOEiTRAmEgDhIk0QNgHDp4mkcDM5bwAAAAASUVORK5CYII=\n",
      "text/plain": [
       "<Figure size 432x72 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "my_palette = ['#4B0082', '#0000FF', '#00FF00', '#FFFF00', '#FF7F00', '#FF0000']\n",
    "sns.set_palette(my_palette)\n",
    "sns.palplot(sns.color_palette())"
   ]
  },
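  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Each hex code in `my_palette` packs the red, green and blue channels into two hex digits apiece, while seaborn palettes hold RGB tuples with values between 0 and 1. The sketch below shows the conversion by hand, assuming plain `#RRGGBB` input; `hex_to_rgb` is a hypothetical helper written for illustration, not a seaborn function.\n",
    "\n",
    "```python\n",
    "def hex_to_rgb(hex_code):\n",
    "    '''Convert a '#RRGGBB' string to an (r, g, b) tuple of floats in [0, 1].'''\n",
    "    digits = hex_code.lstrip('#')\n",
    "    if len(digits) != 6:\n",
    "        raise ValueError('expected a color of the form #RRGGBB')\n",
    "    # Each pair of hex digits is one channel in the range 0-255.\n",
    "    return tuple(int(digits[i:i + 2], 16) / 255 for i in (0, 2, 4))\n",
    "\n",
    "my_palette = ['#4B0082', '#0000FF', '#00FF00', '#FFFF00', '#FF7F00', '#FF0000']\n",
    "rgb_palette = [hex_to_rgb(code) for code in my_palette]\n",
    "print(rgb_palette[0])\n",
    "```\n",
    "\n",
    "In real code there is no need to write this yourself: `matplotlib.colors.to_rgb` performs the same conversion."
   ]
  },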
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "33dd2673277c1f11b4c09511c255c8e92098a2c6"
   },
   "source": [
    "### 10. Let's see how the plot has changed"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 179,
   "metadata": {
    "_uuid": "6fbf116079bbf7d4d858dc73ed79d15e91e4cf37"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1080x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize = (15,8))\n",
    "sns.boxplot(data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "cc3381713177b3c63d695118a53ccca7916fbc17"
   },
   "source": [
    "# 22. Controlling plot aesthetics <a id=\"221\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://tgmstat.files.wordpress.com/2013/11/tips1.png)\n",
    "\n",
    "### In this section, you will learn\n",
    "\n",
    "1. First plot with seaborn\n",
    "2. Changing the plot style with set_style\n",
    "\t1. Set plot background to a white grid\n",
    "\t1. Set the plot background to dark\n",
    "\t1. Set the background to white\n",
    "\t1. Adding 'ticks'\n",
    "3. Customizing the styles\n",
    "\t1. Style parameters\n",
    "4. Plotting Context Presets\n",
    "\t1. Plotting Context Preset - paper\n",
    "\t1. Plotting Context Preset - talk\n",
    "\t1. Plotting Context Preset - poster"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "22e56ecfc3f49dd1787b3b4e21089f152188e1ed"
   },
   "source": [
    "### 1. First plot with seaborn"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 180,
   "metadata": {
    "_uuid": "d3724c36033c696346882ddf49fa2ad3e034cf71"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>country</th>\n",
       "      <th>beer_servings</th>\n",
       "      <th>spirit_servings</th>\n",
       "      <th>wine_servings</th>\n",
       "      <th>total_litres_of_pure_alcohol</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Afghanistan</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Albania</td>\n",
       "      <td>89</td>\n",
       "      <td>132</td>\n",
       "      <td>54</td>\n",
       "      <td>4.9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Algeria</td>\n",
       "      <td>25</td>\n",
       "      <td>0</td>\n",
       "      <td>14</td>\n",
       "      <td>0.7</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Andorra</td>\n",
       "      <td>245</td>\n",
       "      <td>138</td>\n",
       "      <td>312</td>\n",
       "      <td>12.4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Angola</td>\n",
       "      <td>217</td>\n",
       "      <td>57</td>\n",
       "      <td>45</td>\n",
       "      <td>5.9</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       country              ...               total_litres_of_pure_alcohol\n",
       "0  Afghanistan              ...                                        0.0\n",
       "1      Albania              ...                                        4.9\n",
       "2      Algeria              ...                                        0.7\n",
       "3      Andorra              ...                                       12.4\n",
       "4       Angola              ...                                        5.9\n",
       "\n",
       "[5 rows x 5 columns]"
      ]
     },
     "execution_count": 180,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "import pandas as pd\n",
    "from matplotlib import pyplot as plt\n",
    "%matplotlib inline\n",
    "import seaborn as sns\n",
    "df = pd.read_csv('../input/datasetsdifferent-format/data-alcohol.csv')\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 181,
   "metadata": {
    "_uuid": "f8b4307ad537d4245c7f2192ee5b1de0897ea78e"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "<matplotlib.axes._subplots.AxesSubplot at 0x7fb885df9dd8>"
      ]
     },
     "execution_count": 181,
     "metadata": {},
     "output_type": "execute_result"
    },
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAYwAAAESCAYAAADuVeJ5AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAIABJREFUeJzt3Xl8VNX9+P/XZJKQbdjCsARIWELe7CCLLIqAFKEUl7ohRelqq+3Hrnb5dPFLtVo/7efTfmqlP/1IF0VFRakrCm4ooiJbkPXNmoSdEAImBMj6+2Nu7JhCckMmmSXv5+PBg5l7zr33nLmTed9zz73neGpqajDGGGMaEhfuAhhjjIkOFjCMMca4YgHDGGOMKxYwjDHGuGIBwxhjjCsWMIwxxrhiAcMYY4wrFjCMMca4YgHDGGOMKxYwjDHGuGIBwxhjjCvx4S5AE7UBRgOHgKowl8UYY6KFF+gGrAHOul0p2gPGaGBluAthjDFRagLwntvM0R4wDgEUF5+iujp0o+6mp6dRVFQasu2Fk9UlMsVKXWKlHtC66hIX56FDh1RwfkPdivaAUQVQXV0T0oBRu81YYXWJTLFSl1ipB7TKujTqUr51ehtjjHHFAoYxxhhXLGAYY4xxxQKGMcYYVyxgGGOMccUChjHGGFcsYBhjjHEl2p/DMA0oKT7NqWOnW2RfbVITiE+2r5Qxscr+umPc6U/OsmbprhbZ1+gZ2RYwjIlhrv66RSQHeBRIB4qAuaq6s04eL/AAMB2oAe5X1QUu0joDfwd6AgnA28B3VbWyybUzxhgTMm77MB4C5qtqDjAfePgceeYA2UA/YBwwT0R6uUj7ObBNVYcCQ4GRwLWNrokxxphm1WDAcFoAI4BFzqJFwAgR8dfJOgt4RFWrVbUQeB64wUVaDeATkTgCw5UnAgeaUCdjjDHNwM0lqZ7AAVWtAlDVKhE56CwvDMqXCeQHvS9w8jSUdg/wHIFRE1OBB1V1VWMqkZ6e1pjsrvj9vpBvMxyO5p/A50tqkX2lpCQ2++cWK8cFYqcusVIPsLo0JBJ6KG8APgamAD7gVRG5XlWfdbuBoqLSkI4y6ff7KCwsCdn2wq2k5EyL7KesrLxZP7dYOi6xUpdYqQe0rrrExXku6ETbTR/GPqC703Fd24Gd4SwPVgBkBb3PDMpTX9odwBPO5aqTwAvA5MZUwhhjTPNrMGCo6lEgF5jtLJoNbHD6IoItBm4VkTinf+Ma4FkXaXsJ3D2FiCQCnwM2X3iVjDHGNAe3d0ndBtwhIjsItAhuAxCRpSIyysmzENgD7AQ+BO5W1b0u0r4PTBCRTQQC0w7gkSbVyhhjTMi56sNQ1e3AmHMsnxH0ugq4/Tzr15e2G5jqphzGGGPCx8aSMsYY44oFDGOMMa5YwDDGGOOKBQxjjDGuWMAwxhjjigUMY4wxrljAMMYY44oFDGOMMa5YwDDGGOOKBQxjjDGuWMAwxhjjigUMY4wxrljAMMYY44oFDGOMMa5YwDDGGOOKBQxjjDGuuJpASURygEeBdKAImKuqO+vk8QIPEJhutQa4X1UXuEh7DBgatKmhwDWq+mIT6mWMMSbE3LYwHgLmq2oOMB94+Bx55gDZQD9gHDBPRHo1lKaqc1V1uKoOB74MFAPLLqg2xhhjmk2DAUNEOgMjgEXOokXACBHx18k6C3hEVatVtRB4HrjBRVqwrwNPqOrZxlfFGGNMc3LTwugJHHDm5a6dn/ugszxYJpAf9L4gKE99aQCISCLwJeBvbgtvjDGm5bjqw2gh1wAFqprb2BXT09NCXhi/3xfybYbD0fwT+HxJLbKvlJTEZv/cYuW4QOzUJVbqAVaXhrgJGPuA7iLiVdUqpwM7w1kerADIAtY474NbFfWl1foaF9i6KCoqpbq65kJWPSe/30dhYUnIthduJSVnWmQ/ZWXlzfq5xdJxiZW6xEo9oHXVJS7Oc0En
2g1eklLVo0AuMNtZNBvY4PRFBFsM3CoicU7/xjXAsy7SEJEewATgiUbXwBhjTItwe5fUbcAdIrIDuMN5j4gsFZFRTp6FwB5gJ/AhcLeq7nWRBoG7o15S1eKmVMYYY0zzcdWHoarbgTHnWD4j6HUVcPt51j9vmpN+r5tyGGOMCR970tsYY4wrFjCMMca4YgHDGGOMKxYwjDHGuGIBwxhjjCsWMIwxxrhiAcMYY4wrFjCMMca4YgHDGGOMKxYwjDHGuGIBwxhjjCsWMIwxxrhiAcMYY4wrFjCMMca4YgHDGGOMKxYwjDHGuOJqAiURyQEeBdKBImCuqu6sk8cLPABMB2qA+1V1QUNpTvqNwK8Aj5P+OVU90rSqGWOMCSW3LYyHgPmqmgPMBx4+R545QDbQDxgHzBORXg2lOVO8zgOmqupg4FLg5AXUxRhjTDNqMGCISGdgBLDIWbQIGCEi/jpZZwGPqGq1qhYCzwM3uEj7AfDfqnoYQFVPquqZplTKGGNM6Lm5JNUTOODMy42qVonIQWd5YVC+TCA/6H2Bk6ehtIHAXhF5F0gDlgD3qmpNI+tijDGmGbnqw2hmXmAoMBVIBF4jEFAec7uB9PS0kBfK7/eFfJvhcDT/BD5fUovsKyUlsdk/t1g5LhA7dYmVeoDVpSFuAsY+oLuIeJ3WhRfIcJYHKwCygDXO++BWRUNpz6rqWeCsiLwAXEwjAkZRUSnV1aFrkPj9PgoLS0K2vXArKWmZK3xlZeXN+rnF0nGJlbrESj2gddUlLs5zQSfaDfZhqOpRIBeY7SyaDWxw+iKCLQZuFZE4p3/jGuBZF2lPAleIiEdEEoApwMZG18QYY0yzcnuX1G3AHSKyA7jDeY+ILHXucgJYCOwBdgIfAner6l4XaU8BR4GtBALTFuCvTamUMcaY0HPVh6Gq24Ex51g+I+h1FXD7edavL60a+KHzzxhjTISyJ72NMca4YgHDGGOMKxYwjDHGuGIBwxhjjCsWMIwxxrhiAcMYY4wrFjCMMca4YgHDGGOMKxYwjDHGuGIBwxhjjCsWMIwxxrhiAcMYY4wrFjCMMca4YgHDGGOMKxYwjDHGuGIBwxhjjCuuJlASkRzgUSAdKALmqurOOnm8wAPAdKAGuF9VF7hImwd8GzjobGqVqn6nadUyxhgTam5bGA8B81U1B5gPPHyOPHOAbKAfMA6YJyK9XKQBPKaqw51/FiyMMSYCNRgwRKQzMAJY5CxaBIwQEX+drLOAR1S1WlULgeeBG1ykGWOMiQJuWhg9gQPOvNy183MfdJYHywTyg94XBOWpLw3gJhH5WESWi8i4RpTfGGNMC3HVh9HMHgLuVdUKEZkKvCAiA1S1yO0G0tPTQl4ov98X8m2Gw9H8E/h8SS2yr5SUxGb/3GLluEDs1CVW6gFWl4a4CRj7gO4i4lXVKqcDO8NZHqwAyALWOO+DWxXnTVPVw7UbUNXXRWQfMBh4x20liopKqa6ucZu9QX6/j8LCkpBtL9xKSs64zltdVc3pknJOf1KOx+uhY0YaHo/H1bplZeXN+rnF0nGJlbrESj2gddUlLs5zQSfaDQYMVT0qIrnAbOBx5/8NTl9EsMXArSKyhMDdVNcAExpKE5HuqnrAeT0c6AVoo2timqSqspodHx5g4+t7OXuq4tPlqe3bkDmkM4MnZ5HStk0YS2iMCTe3l6RuAx4VkbuAYmAugIgsBe5S1bXAQmAMUHu77d2qutd5XV/afSIyEqgCyoFbglsdpvkVHSjh3YWb+OTYabpmd6DXsM6ktG3D2bIK8jcVou/vZ8+6w4y/cQCZg+ve62CMaS08NTWhu5QTBr2AvXZJ6vxqyqp459mt500/frCE5Q+tJz7Ry9jr+tO9f/q/XYI6ceQUK5/cwvEDJQyY0JPRV/U752Wq0TOySe2UHPI61Iql4xIrdYmVekDrqkvQJaneQJ7b7UZCp7cJk+LDpSx/eAPeBC/Tbh+JL/3cP/btu6Qy445RrH1p
J9tW7sPjgVFXnjtoGGNilwWMVqribCVv/W0jcXEept024rzBopY3Po6Lr8nB44Gt7+7Dm+BlxOf7tlBpjTGRwAJGK7XulV2UFp9h+u0jaOtPcbWOx+Nh9NU5VFZUs+nNPNp2SiF7dLdmLqkxJlLY4IOt0KGdx9H3DzBwQk+69OnQqHU9Hg9jrxW69m3Ph0u2U3yotJlKaYyJNBYwWpmKs1WsemYbbf0pXDT9wi4pxXnjuGzOYBKT4lnx2CYqzlSGuJTGmEhkAaOV2b5qH6eKzzD+hv7EJ3oveDvJbdtw2ZzBlBwrY/XzO0JYQmNMpLKA0YqUn65k89v59BiQ3uhLUefSNbsDgydnsXvtIQ64H8nFGBOlLGC0IlvfLaD8dCXDp/UJ2TaHTe1Nu84pfLB4O2dOlYdsu8aYyGMBo5U4c6qCre8WkDXET3qPtiHbrjfBy/gbB3Lq5Ble+MPqkG3XGBN5LGC0ElvfLaCivIphIWxd1Orcqx0DLu3Je09tZc8GG9XFmFhlAaMVqKqoYseHB+g5sBMduoZ+KHiA4dP6kNYxmSf+8+2QDtNijIkcFjBagb25Rzh7qoIBl9ad8yp0EpPi+eKPx7Jn/RFWPX3+sauMMdHLAkaMq6mpYdt7+2nfJZWu2U2/M6o+o6/sR7+LM1h8z3uUnXQ/B4cxJjpYwIhxu9cf5viBEvpf0qPZBwv0eDzM+e0kSotO89If1zS8gjEmqljAiHFv/GMjCUnx9BnZtUX2lzWkM5fMGsgbC3I5VnCyRfZpjGkZFjBiWEnRada9tpvs0d1IaNNy40x+8WfjiPN6eO6+91tsn8aY5mcBI4atfl6pqqhu8RFlO2b4uOJbF/HhEmVv7pEW3bcxpvm4Ou0UkRzgUQLzcRcBc1V1Z508XuABYDpQA9yvqgsaSgtaX4ANwF9U9c6mVMoErHp6G5kDO9Exw9fi+55xxyjeeXwzz/x6JT9Zcp1NtmRMDHDbwngImK+qOcB84OFz5JkDZAP9gHHAPBHp5SKtNqA8DDzf+CqYczmgReTlHmH8tf3Dsv9kXxuu/tFYtq/az8bX9za8gjEm4jUYMESkMzACWOQsWgSMEBF/nayzgEdUtVpVCwn8+N/gIg3gZ8DLgA17GiLvP7ONOK+HMVdL2Mowce5guvbtwOK736Oqsjps5TDGhIabS1I9gQOqWgWgqlUictBZXhiULxPID3pf4OSpN01EhgHTgMnAry6gDrWTmYeU39/yl3FCpaqqmg+fU0Z+Ppt2/hR8vqQW2W9KSuK/fW5f+/3nuO/axeS+tJvp3xzR5H1E83GpK1bqEiv1AKtLQ8I6RauIJAD/B3zVCUQXtJ2iotKQDkfh9/soLCwJ2fZa2pZ3Cjh+sISb7rkMgJKSlnmIrqys/N8+t+xLMuh3cQaP/fJtBk3NIikt8YK3H+3HJVis1CVW6gGtqy5xcZ4LOtF204exD+ju9DPU9jdkOMuDFQBZQe8zg/KcL60b0BdYKiJ5wPeBW0Xk/xpVC/MZH72wg6S0RIZP7R3uouDxeJg1bwKfFJax7KH14S6OMaYJGmxhqOpREckFZgOPO/9vcPoigi0m8GO/hMDdVNcAE+pLU9UCoFPtBkRkHpBmd0lduMqKKta9sovh0/qQkBRPTVlVuItE31HdGPmFbF6bv47JXx5KW39KuItkjLkAbu+Sug24Q0R2AHc47xGRpSIyysmzENgD7AQ+BO5W1b0u0kwIbX9vP6eKzzD6qn7hLspnXPeL8ZSfqeSF/7E5M4yJVq76MFR1OzDmHMtnBL2uAm4/z/rnTauTb56b8pjzW/PSTpLSEhkyOavhzC2oW3ZHJt48mHce28TUbw6nawimiDXGtCx70juG1L0cFWmuvnMs8YlelvzWhgwxJhpZwIghkXo5qla7LqlMu30Ea17YaTPzGROFLGDEkI9e3BGRl6OCTf/OSNp2SmHx3e9RU2Mz8xkTTSxgxIjqqmo2vLaHYVf0jsjLUbWS0xK5
6kdj2L5qPx+/mRfu4hhjGsECRozYteYQpUWnGfH5vuEuSoMmzh1M597tWXyPDRliTDSxgBEj1r+6m/hEL0Muj9zLUbXiE7zc8MtLOLCtiHcWbgp3cYwxLlnAiAE1NTVseHU3Ayb0JNnXJtzFcWXkzGxkfHf++V8fcOqEzf9tTDSwgBEDDmwv4mjeSS6a3ifcRXHN4/HwpXsmcurEWV60h/mMiQoWMGLA+ld3A3DR9MjvvwiWOaQzl80ZxJt/3cihncfDXRxjTAMsYMSADa/upu/IrrTvkhruojTatT8bT2JyPIvuejfcRTHGNMACRpQ7frCEvI1HuSgK7o46l7b+FK6+cyyb3sxj4xs2vJgxkcwCRpSrnf50+LTo6b+oa8rXh9G1bweeuutdKivCP7quMebcLGBEuY2v78Wf2ZaMnI7hLsoFi0/0MuvXEzi8q5g3/7ox3MUxxpyHBYwodrasgq3vFjDsij54PJ5wF6dJhk3tzZDLs3j+dx9SfKg03MUxxpyDBYwotn3VfirOVDEsAmbWayqPx8PNv51MVWUVT/5iRbiLY4w5BwsYUWzj8j20SUlAxncPd1FConPv9lz1wzGsfXnXp30zxpjI4WqUOhHJAR4lML1qETBXVXfWyeMFHgCmAzXA/aq6wEXaV4EfANWAF3hEVR9oetViW01NDbmv72XQpEwS2kTuYIONNf3bI/ngue0s/NlbyLhbSEpLDHeRjDEOty2Mh4D5qpoDzAcePkeeOUA20A8YB8wTkV4u0p4DhqnqcGA88CMRGdr4qrQu+7Yco/hgKcOuiKy7o04dO92kf2c/KWfWLy/l+P4SFv3ync+kHc0/8Zn3lacrw11dY1qVBk9NRaQzMAKY6ixaBDwoIn5VLQzKOotA66AaKBSR54EbgN/Xl6aqnwRtIwVIINAKMfWovWQzbEqv8BYkSHlZJRtX5IVkW/0v7cm7T24hqW0i3bIDd4D5fEmUlPxr3KnRM7KJT46d1pUxkc5NC6MncMCZl7t2fu6DzvJgmUB+0PuCoDz1pSEiV4nIFifP71XVhjBtwMble+h9URfaReHT3W6M+HxffOnJvP/MNirOWkvCmEgQEadnqvoi8KKIZALPi8hSVVW366enp4W8TH6/L+TbDJUTR0+xZ/1hvjRvYoPlPJp/Ap8vqUXKlZDgDem+pn51OEv+5wM2vraXyTcPAfjM9lNSEiP6ODUkmsseLFbqAVaXhrgJGPuA7iLiVdUqpwM7w1kerADIAtY474NbFfWlfUpVC0TkI2Am4DpgFBWVUl0duqtYfr+PwsKSkG0v1N57eis1NZB9SXdX5Qy+jNOcKiqqQrovX9cUBk3MZMuKAvx92jF4fOZntl9WVh7Rx6k+kf4dcytW6gGtqy5xcZ4LOtFu8JKUqh4FcoHZzqLZwIY6/RcAi4FbRSRORPzANcCzDaWJyIDaDYhIJ2AyYJek6rHx9b2075pK1hB/uIvS7C6a3pf0Hj4+WLyN0uLT4S6OMa2a27ukbgPuEJEdwB3Oe0RkqYiMcvIsBPYAO4EPgbtVda+LtG+KyBYRyQXeBB5U1eVNrFfMqiyvYvPb+Qyb2jvqn+52wxsfx2VzBlNVWcPyv+VSXWVTuhoTLq76MFR1OzDmHMtnBL2uAm4/z/r1pf3AVUkNADs+PMCZ0nKGR9jttM2prT+FsdcJ7y3ayvpXdzNqZr9wF8mYVikiOr2Nextf30tCkpcBE+repBbb+o7sxomDpWxeUUCnnm3pNaxLuItkTKtjASOK1NTUkLtsDwMu7UmblIRwF6fFTbhxEEfyTrDqmW206xybtxMbE8lsLKkocmhnMUfzTsbEYIMXwhsfx8S5Q0hoE8+bf93IycJT4S6SMa2KBYwokrtsD0DEDQfSklLbJTHl68M4W1bBQ99+jbOnKsJdJGNaDQsYUSR3+R4yh/hJ7x47DxddiPTuPibeMpj924qY/41XqCy3WfqMaQkWMKJESdFpdq051KrujqpP
jwGdmP3ry9j0Zh4PfetVqirtdltjmpsFjCjx8Rt7qamuieq5u0Nt/HX9+dK9E1n3yi4W/McyCxrGNDO7SypK5C53nu4e2jncRYkoU2+9iPLTlTz7m1WUn6nktoc+T0KSfa2NaQ7WwogCFWcr2fxWHsOm9iYuLvaf7m6sL3x3NLN/M5H1S3fzv3Ne4HRpebiLZExMsoARBfT9A5w5VWGXo+pxxTcv4ht/voLt7+/ntzOf4VjByXAXyZiYYwEjCuQu30NicjwDJ2SGuygR7ZJZA/n+E1dzbP8n/HraU+gH+8NdJGNiigWMCFf7dPfAiZkk2uxyDRpyeS/uWjabtPZJ/O7a53jxD6ttwEJjQsQCRoTbv/UYRftL7HbaRujatwO/WnYTF1+dwz/v/4DfXfscR/PsEpUxTWUBI8LlLg+MAj+8lQ4HcqFS2rbhm//fdG59cBr5mwr51cSFvPLAGior7CE/Yy6UBYwIl7tsN31GxO7c3c3J4/Ew/sYB3PveLQyenMWzv1nFXZMeZ/3SXdTUhG6GRmNaCwsYEezkkVPsWX+E4dP6hrsoUa1jho87/nEl31t4FTU18OevvMy9X3iG3OV7Qjq1rzGxzlUvqojkAI8C6UARMFdVd9bJ4wUeAKYDNcD9qrrARdqvgJuAKqAC+LmqLmt61aLfxjecy1FX2OWoUBg+rQ9DpvRi5ZNbePl/P+JPN79I9/7pTPnaMMZeJyT72oS7iMZENLctjIeA+aqaA8wHHj5HnjlANtAPGAfME5FeLtI+Akar6lDga8DTIpLc+KrEng3L9pDew0ePgZ3CXZSY4Y2PY9LcIdy/+ivcOn8acV4Pj/3kLb4/5BH+79uvseG13VScqQx3MY2JSA22MESkMzACmOosWgQ8KCJ+VS0MyjoLeERVq4FCEXkeuAH4fX1pdVoTHwMeAi2ZVn0T/enScja/nc/kLw9pFXN3t7T4BC/jbxjAuOv7k7fxKO8s3MTal3bywbPbSUpNoP+lPRk0MZOcMRlk9E8nPsEb7iIbE3ZuLkn1BA4483KjqlUictBZHhwwMoH8oPcFTp6G0oLNBXaraqsOFhAYbLDybJXNX93MPB4PvYd3offwLtx8/2S2v7ef9Ut3seWdgk/nH4lP9NJjYCd6De1M1tDOdM3uQKeebenQLQ1vvHUDmtYjYp4EE5GJwD38qyXjWnp6WsjL4/eHd86Jza/n0b5zKmNn5OD1XviP0tH8E/h8SSEs2fklJHibfV/B209JSQz5cep2Y3sm3zgYgMN7i9HVB9i97jC71h9izYs7WfHYpk/zxnk9+Hu2w5/Zlrb+FNqmp+BLT8aXnvzp67bOe196Cmkdkj5zLMP9HQuVWKkHWF0a4iZg7AO6i4jXaV14gQxnebACIAtY47wPblXUl4aIjAMeB65WVW1sJYqKSkN6t4vf76OwsCRk22us8tOVrHllF+Ou78/x402fhrSk5EwIStWwioqqZt2Xz5f0me2XlZU363HypsUzcEoWA6dkAYGn7o8VfEJh/kmO7fsk8K/gE4r2l5C36Silx89w6sSZ8w6z7vFASvsk0jok0b5zKkltE+nQNY1OmW3xZ7WjU2ZbOme1I61j9HThhftvJZRaU13i4jwXdKLdYMBQ1aMikgvMJvCjPhvYUKf/AmAxcKuILCHQB3ENMKGhNBEZDTwNXK+q6xtdgxi0eUU+Z8sqGDUzO9xFiXinjp1ukf20SU0gPjkef1Y7/FntzpuvpqaG0yXllB4/Q2nxaU4Vn/n0denxM5QeP01p8RnKSys4frCU3WsPUXr8s0G2fZdUMgf5yRzsJ3NQJzKHdCatffO03GrrZYwbbr8ptwGPishdQDGBvgZEZClwl6quBRYCY4Da223vVtW9zuv60v4CJAMPi0jt/m5R1X+1/VuZdS/vIrV9G+SSHuEuSkQrL6tk44q8FtnX6BnZrn5YPR4PKW3bkNK2DZ17nT+wBJ8Bni4t51j+SQoLPqEw7yS71xxE
Vx/k47fyPs3fMSONDEmnx8BOdM5qhydEw9y7rZcx4DJgqOp2Aj/4dZfPCHpdBdx+nvXrSxvtqqStRGV5FRuW7WbkjGy7MyfChLI1U1NWRVnZv+bt6NgljY5d0pDRGYy/Rlj3+h7KT1dSdCAQRA7uOM6WdwrY/HY+yb5EsoZ2pu+obqT38NlddKbF2KlFhNm8Ip/Tn5Qz6kq7OyqShLo1U7c/JtiwSb0ASEyOp1t2R7pld2To53pTfqaS/duOkf/xUXasPsj2Vfvp0C2N/pf0oO/IrnjtBMM0MwsYEWb1EiW1QxKDJtrcF+azEpPi6XNRV/pc1JXy0xXs3XCEHasP8sGz29nw2m76X9ITGd+DpNSEcBfVxCgLGBHkbFkFG17bw9jrhPhEO1s055eYnICM70HOuO4c2X2CzSvyyV22h81v5ZE9JoMhl/cipa0NdWJCywJGBMldvoezZRWMvVYazmwMgU72rtkd6JrdgeLDpWx9p4AdHxxg1+qDDJqUxaCJmSQk2Z+5CQ37JkWQ1UuU9l1TyRnbPdxFMVGoQ9c0Lpk1kCFTerHhtd1sfH0v+sF+hl3Rh5wxGcQ14QFQY8CGN48YZSfPsOmtfC6+Osf+sE2TtO2UwsSbhzDju6No1zmV1UuUF/57NYd2Hg930UyUs1+mCLH2ld1Ullcxxi5HmRDxZ7Zj2u0juPyrQ6mprmH5wxt494nNnP7kbLiLZqKUXZKKEKue2kqXPu3pPbxLuItiYojH46HnID/dcjqy+a18Nr2Vx/5tRYz4fB9yxtmDoaZxrIURAQ7vKWbHhweY8KVB9hCWaRbxCV6GT+vDVXeOpVNPH6v/uYOlf17LAS0Kd9FMFLGAEQFWPbUVT1xg/mljmlM7fwpTv3kRE740iFPHT/NfNyzhuftW2aRRxhW7JBVm1VXVrHp6G0Muz6JD19AP025MXR6Phz4jupIh6eRtOMzL/7uGtS/t4it/mILYZSpTD2thhNmWdwooPlTKhNmDwl0U08okpSZwy32TufOZL1JZUcX9Vz/Lo3e+SZl1ipvzsIARZiuf3EJaejLDp/UJd1FMKzVoUha/eecWpt0+gnce38wvLn2M9Ut3hbtYJgJZwAijk0dOsf7V3Yy/vr8NBWLCqk0tMi0HAAATvElEQVRqAjf9+jJ+9dpN+Dom8+evvMz8r73MiSNNn8DLxA4LGGH09mObqKqoZvJXhoa7KMYA0Ht4F+56fTbX/WI8ua/v5ReXPsa7T2ympiZ0M1qa6GUBI0wqy6tY8ejHDLk8i659O4S7OMZ8Kj7By8zvXcw9K26m56BO/P0Hb/C7657j8J7icBfNhJmru6REJAd4lMD0qkXAXFXdWSePF3gAmA7UAPer6gIXaVcA9wFDgD+r6p0hqFfEW/vyTk4eLWPKN4aHuyjGnFPXvh34yZLrWfnEZp7+9UrumvQ4V985lmm3j7DJvVopty2Mh4D5qpoDzAcePkeeOUA20A8YB8wTkV4u0vYA3wB+fwHlj1pvLNhI597tGXJ5r3AXxZjziovzMPGWIdz73lyGTunNs79ZxT3TniJv45FwF82EQYMBQ0Q6AyOARc6iRcAIEfHXyToLeERVq1W1EHgeuKGhNFXdpaq5QKt5cmhv7hF2rz3ElK8NJS5EczMb05w6dE3jP/4+k+/8bSafFJZx97SneHreSs6cKm94ZRMz3LQwegIHnHm5a+fnPugsD5YJ5Ae9LwjKU19aq/PKn9aQ3DaRS+3ZCxNlRs3M5t73buGyOYN47S/r+Jb8hVVPb6W62jrFW4OYeNI7PT30T0j7/b6QbxMgf8tR1r2yi5t+NYGsvp2aZR/BjuafwOdLavb9ACQkeJt9X8Hbb4n9Nee+zre9lqxXSkpi47/rfh8/fuyLzPjmSBb88HUW3LGct//+Md/4wxUMmZjVPAVtIc31dx8OzVEXNwFjH9BdRLyqWuV0YGc4y4MVAFnAGud9cKuivrQmKyoqDekZjt/vo7CwJGTb
C/bYXStISk3gkjkDm20fdZWUnGmR/VRUVDXrvny+pM9sv7n3FyzU+6pbl+bcV33KysrJ23b0gtZNTU/ip09fy8rFW3nhj6v5z0mPMXRKL6783sV0yz73nX9tUhOIT47M89Tm/LtvaQ3VJS7Oc0En2g0eOVU9KiK5wGzgcef/DU5fRLDFwK0isoTA3VTXABNcpLUah3cX89HzO5j+7RGkdUwOd3GMobysko0r8i54fZ8vCbwevvC90Wx9dx+b3srj47fy6D28C8Om9qZd59TP5B89IztiA4ZpmNsjdxvwqIjcBRQDcwFEZClwl6quBRYCY4Da223vVtW9zuvzponIpcBTQFvAIyI3AV9X1WVNqlkEeumPH5HQxsu020aEuyjGhFR8gpehU3qRMzaDLSsK2L5qP3m5R+h9UVeGTu1NO39KuItoQsBVwFDV7QR+8OsunxH0ugq4/Tzr15f2HhDzQ2TmbzrKB4u3Mf3bI//trMuYWJGUmsjIL2QzaGImm1fko6v2s3fDYTKHdGbQpMxwF880kbUNW0BNTQ1P/b93Se2YzMzvjw53cYxpdklpiYya2Y9BEzPZunIf+v4B8j8+iq7azxe+N5phV/SxW8qjkAWMFpC7bA/b39vPzb+dTEq7lrn7xZhIkOxrw8gZ2Qy9vBc7PzrIrjWHeGDuS3Tt24HJXxnCJbMGktre/iaihY0l1cwqzlbyzK/fo1u/DkycOzjcxTEmLBKS4hl4WSbzls3mWw9NJ7VDEot+9S4/HLaAv//wDfI+vrA7tUzLshZGM3vpjx9xeHcxP1h0tY2/Y1o9b3wcQy7LYshlWezbeoyVT2/hw+e28+7jm+k1rDMTbhrEiOl9SGjT9J+mSL6FN1rZp9mM8j4+yit/WsP4GwcwdErvcBfHmLCrextvvzHdyRramd1rD7P9/f0s/M+3eeaelfQZ2Y2cMRm0b8K0xXYLb+jZp9lMKsur+Nt3l9O2Uwpf+s3EcBfHmIiVmJzAgAk96X9pDw7vKkY/OIC+v59tK/fh79WOnDEZZA3tQkIba6GHmwWMZrLk/vfZt/UY31t4lXXqGeOCx+OhW7+OdOvXkdMl5exZd4gdqw+y6ultfPTCDnpf1JWcMRmk92gb7qK2WhYwmsFHL+zg1QfXMWnuEJur25gLkOxLZNCkLAZOzOTo3hPsXH2Q3WsOseODA3Ts7qPfmAz6XNSVRLvk1KLs0w6xfVuP8dfvLSd7dDfm3Dcp3MUxJqp5PB669OlAlz4duPiaHPasP8yO1QdZvURZ+9JOeg3rQs6YDPy92uHx2HMdzc0CRggdP1jCA3NfJMXXhu/8bSbxiXbN1ZhQSUxOoP8lPZHxPSjaX8LO1QfYsyEwt0y7Lqn0G5NB35FdSUpNDHdRY5YFjBA5ceQUv7v2OU4Vn+HHz11H+y42/IcxzcHj8dCpZ1s69WzLqCv7kZd7lJ0fHWDtiztZ/8ouMof4yRnT3eboaAYWMEKg+FApv7/+OU4cPsWPnvkivYd3CXeRjGkVEtrE029MBv3GZFB8qJSdqw+we91h8nKPsn7pbibeMphxNwwgvXvszHMRThYwmmjP+sP8+csvcaa0nO8/eTX9Ls4Id5GMaZU6dEvj4muEkV/IJn9TIYd3FfPcfe+z5LfvI5f0YPwNAxg1M5tkX5twFzVqWcC4QDU1NaxctJWFP32L9l1S+cUzs+gxoPln0DPG1M+b4KXPiK7M+uWlnCo5y/vPbueDxdv42/de5/Gfvc3waX0YfWU/hkzpRZuUhHAXN6pYwLgARfs/4R8/epPNb+fT/9IefPuRL+BLtwmRjIk0nXu355ofj+XqO8ewe+0h3l+8nTUv7eSj53eQmBzPkMt7MerKbIZN7Q0xND1rc7GA0Qilx0/z6l/W8caCXDx4mPPbSVz+1WE2TLMxEc7j8ZA9OoPs0RnMuW8S+sEB1r28k3Wv7GLdK7vwxscxYHwPci7pweDJWWQN7Wx/1+dgAcOFgs2FrHxy
C+8t2sLZsgrGfFG47ufj6ZTZLtxFM8Y0kjc+joETejJwQk/m/HYyu9ccIvf1PejK/Sz5baDPI61jEjKuB31HdaPvyK70GtbFHhLEZcAQkRzgUQLzcRcBc1V1Z508XuABYDpQA9yvqguakhYuFWcr2bn6IB+/uZeNy/eyb+sx4tt4GTWzHzO/P5rukh7O4hljQiQuzvPpXVZ+v4/dW4+w5Z0CNq/IZ+fqg6x7ZRcQCDI9B/vpOagT3XPSyZCOZEg6HTPSWtUDg25D5kPAfFV9XERuBh4GLq+TZw6QDfQjEFg2iMgbqprXhLRmVXr8NEfzTlKYH/h3NP8k+7ceY9/mQiorqonzBr5Mc+6bxNjr+pPWwcaEMiaWtfWnMO76/oy7vj8AnxSWsWf9YXatPcSe9YfJXbaHlU9s+TR/UmoC6ZltSe/uI727j449fHTs5sPXKZm0jsn40pPxdUwmMSU+JgJLgwFDRDoDI4CpzqJFwIMi4lfVwqCss4BHVLUaKBSR54EbgN83Ia0hXuCCrjXu+OAAC+5YRk3Qsz1p6cl06d2ey24cSOfsDvQd1TXqb8Gr8caR0rZl6hCf6G3WfSWnJlLl+dcBa+79BQv1vurWpTn3VZ+m7qu+ejTH/hojPiGu0b8NdfO375LKiM/3ZcTn+366rLT4NEf3nuTw7mIK805SfKiU4iOnyMs9wpYVBefcrjc+joRkL4lJCSQkxZOYHE+i8398Gy9xHg+eOE9g/3EePB7wxHnwAFVVNVRXVFNZWUVVRTWV5dWcLaugprqG2fdcRrd+HV3V5TxpjRqOwk0LoydwQFWrAFS1SkQOOsuDA0YmkB/0vsDJ05S0hnQD6NCh8U9Vj5spjJspjV4vGn3h1pEttq8+Q1r2ocWW3J/tKzr31xjp6Q3Pv5GenkZWtv9fp9ARyk1dCPyG7na7zWjvxVkDTAAOAVVhLosxxkQLL4FgsaYxK7kJGPuA7iLidVoXXiDDWR6sAMgKKkBwy+FC0xpyFnjPZV5jjDH/4rplUSuuoQyqehTIBWY7i2YDG+r0XwAsBm4VkTgR8QPXAM82Mc0YY0yEcHtJ6jbgURG5CygG5gKIyFLgLlVdCywExgC1t9verap7ndcXmmaMMSZCeGpqbAhgY4wxDWvwkpQxxhgDFjCMMca4ZAHDGGOMKxYwjDHGuBLtD+41iTMu1k+AgcD3VfXBoLQU4O/ASKASuFNVX24oLVK4GTAykojIfwPXAb2AIaq62Vl+3npEYh1FJJ3AnX99gXICd/99S1ULRWQsgXHYkoE84GbntnXqSwsnZ6ie3kA1UArcoaq50XZcaonI/wPm4XzHovSY5AFnnH8AP1XVZS1Rl9bewsgFbgKePEfancAnqpoNXAksEJE0F2mRonbAyBxgPoEvSyR7HriMf39os756RGIda4Dfqaqo6hACD0fdLyJxwOPAd5zyvgvcD1BfWgT4sqoOU9WLgP8G/uYsj7bjgoiMAMbifMei+JgAXK+qw51/y1qqLq06YKjqZlXdSuDsqa5ZOF905+xoLfB5F2lhFzRg5CJn0SJghPNgZERS1fdU9TOjB9RXj0ito6oeV9UVQYs+JDCSwUjgjKrWjkzwEHCj87q+tLBS1ZNBb9sB1dF4XESkDYHgdXvQ4qg8JufRInVp1QGjAc01YGJL+LcBI4HaASOjSX31iPg6Omd2twMvUuc7o6rHgDgR6dhAWtiJyAIRKQDuBb5MdB6Xu4HH60ybELXHBHhCRD4Wkb+ISHtaqC4x3YchIusJfFjn0qX2S21MM/kzgev+DwJfDHNZLpiqfgNARG4hMO3Ar8JbosYRkXHAKOBn4S5LiExQ1X1Oq+l/CXy//tkSO47pFoaqjlDVTuf511CwqB0UsVYm/xpwsb60SPDpgJHw6ayG5xowMtLVV4+IrqPTid8PmOXM9fKZ74yIdAKqVfV4A2kRQ1UXApOB/UTXcZkIDAD2Oh3GPYBlBCZui7pjUnvpVlXPAn8B
LqGFvl8xHTCaaDHwLQAR6QeMBl5zkRZ2jRgwMqLVV49IrqOI3EfguvE1zh81wDogWUQudd7fRuB71FBa2IhImoj0DHp/JXAciKrjoqr3q2qGqvZS1V4EAt40Aq2laDsmqSLSznntIXDTTi4t9P1q1WNJichsAl+aDgRugTwFXKGqW0UkFfgHcBGBuTZ+oqovOOudNy1SiEh/Arc2dsAZMFJVNbylOj8ReQC4FugKHAOKVHVQffWIxDqKyCBgM7ADOO0s3quqXxSR8QRulkjiX7c2HnHWO29auIhIF+AFIJXA9/w4gVvI10fbcQnmtDJmOrfVRtsx6QM8R2A+Cy+wFfiuqh5qibq06oBhjDHGPbskZYwxxhULGMYYY1yxgGGMMcYVCxjGGGNcsYBhjDHGFQsYJiqJSJ6IfC7c5WhuIvKQiETVk9UmdsX00CDGRDtVvS3cZTCmlrUwjDkPEWnWEyoR8TgDFBoTFayFYaLZaOcJ8W4E5tO4XVXPiMhM4DcEJmPaCtymqh8DiEgGgUEBLyMwMOAfVfUBJ20eMJjAxDRXAT8EFpxrxyJyMYFxfHIIPNH9hKr+0EkbC/yBwMRc+cD3aoc8F5EVwCpgEoFhwOeJyA2qOipo2z8AJqvqVSLyD2C/qv5SRCYRmNfgj8BPCTx9/XNV/buzXjqBEQgmAkpgvKRJqnqpM4zEH4A5BJ72zQdmqzNRlTFu2NmNiWZzCIwJ1JfAD/cvReQiApP8fIvAjG8PAy+KSBvnbP4lYCPQHZgCfF9EpgVt82rgWaA98EQ9+/4T8CdVbevs/xkAEekOvEIgYHUkMNnWc3Xmg7gF+CbgIzA3gThjktX6Euee1AsCQ6e0c8r/dWC+iHRw0uYTGN6mK4FhyL8ctN4VBIJkjrP+jQRmwzPGNWthmGj2YO3InSJyL4GWQzrwsKqudvI8KiI/JzDT2hnAr6p3O2l7ROQRAgO4LXOWfaCqzzuva8eCOpcKIFtEOjnzC3zoLL8ZWKqqS533r4vIWmAGgfGVAP6hqluc1ydF5AUCg/Td7QSO/gTm0Djffu9W1UpgqYiUBqovawhMcTtYVcuArSLyKIGWTO16PmfbH6nqtnrqZsw5WcAw0Sx4yOx8AsNoZwFfFpE7gtISnbQqIENETgSleYGV59lmfb5OYFKe7SKyF/i1BuZ1zwJucEZ2rZUAvF3PPp4E/sfZ3peA550f/XMpcoJFrTIgDfAT+HsO3vanr1X1LRF5kEArJEtElhAYSPATV7U1BgsYJroFz+KWSWBmt33Avap6b93MzkQ6e1W1X920IK5G43Sm5p3tXOa6FnjW6UPYByxU1VsbsY/XAb+IDCfQ0viBmzLUUQhUEpjrYYez7DOz3Dl9NQ8406g+A/yYKJsMyYSXBQwTzb4jIi8TOMv+BfA0sAT4p4i8AXwEpBC4LPOu875ERH4KPEBgSPsBQLKqrmnMjkXkZmCZqhYGtViqCXRKr3H6Rd4g0LoYC+xS1f3n2paqVojIYgJD7XckEEAaRVWrnFbDPBH5BoEAOpfA5DmIyGgCfZbrCfRznOHcc9kbc17W6W2i2ZPAcmAPsBv4jaquBW4lMG1lMbAL+Ap8Or/0TGA4sJfAvBsLCHQCN9Z0YIvTh/An4CZVPe30qVwN/JzAWf8+AmfyDf2tPQl8Dlhc55JTY/wHgbocBhYCi4DaCZzaAo8Q+EzyCXR4//4C92NaKZsPw5gYJSL/BXRV1S83mNkYF+ySlDExwpnpLhHYRGDa4K8D3whroUxMsYBhzHmIyKvAhHMk3aeq97V0eVzwEbgMlQEcIXDnVURNHWyim12SMsYY44p1ehtjjHHFAoYxxhhXLGAYY4xxxQKGMcYYVyxgGGOMccUChjHGGFf+fzh9n+cPFQWTAAAAAElFTkSuQmCC\n",
      "text/plain": [
       "<Figure size 432x288 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.distplot(df.beer_servings)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "995ae29fb621054d29813cc4ede7079690306412"
   },
   "source": [
    "### 2. Changing the plot style with set_style\n",
    "#### 1. Set plot background to a white grid"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 182,
   "metadata": {
    "_uuid": "1ea0d67b849067246516084d59e0d8e0ef5279a4"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 360x360 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.set()\n",
    "sns.set_style(\"whitegrid\")\n",
    "sns.lmplot(x='beer_servings', y='wine_servings', data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "973689716f722ccfd20a8e4c38bc943a1247f3d7"
   },
   "source": [
    "#### 2. Set the plot background to dark"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 183,
   "metadata": {
    "_uuid": "f52477138027967aca725ff0acff98128532fc86"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 360x360 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.set()\n",
    "sns.set_style(\"dark\")\n",
    "sns.lmplot(x='beer_servings', y='wine_servings', data=df, fit_reg=False);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0a0bb68407d3b995188f47d834d7b5afcdc0885c"
   },
   "source": [
    "#### 3. Set the background to white"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 184,
   "metadata": {
    "_uuid": "850e0d73f88dac879f226bdde3b1517348d24655"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1080x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# sns.set()\n",
    "sns.set_style(\"white\")\n",
    "plt.figure(figsize=(15,8))\n",
    "sns.swarmplot(x='country', y='wine_servings', data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8677bc95a455d6e0d804aa366a7dd709f4fb03aa"
   },
   "source": [
    "#### 4. Adding 'ticks'"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 185,
   "metadata": {
    "_uuid": "33f9b624f590f7f993c6c46cfa35e1761daace77"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1080x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(15,8))\n",
    "sns.set_style(\"ticks\")\n",
    "sns.boxplot(data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "3aa34f5278090d078eafe0faad52188d2e67c3cb"
   },
   "source": [
    "### 3. Customizing the styles\n",
    "#### 1. Style parameters"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 186,
   "metadata": {
    "_uuid": "1ad852fb636a7a16f4c9ea39b19fed8e60bf7f1c"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "{'axes.facecolor': 'white',\n",
       " 'axes.edgecolor': '.15',\n",
       " 'axes.grid': False,\n",
       " 'axes.axisbelow': True,\n",
       " 'axes.labelcolor': '.15',\n",
       " 'figure.facecolor': 'white',\n",
       " 'grid.color': '.8',\n",
       " 'grid.linestyle': '-',\n",
       " 'text.color': '.15',\n",
       " 'xtick.color': '.15',\n",
       " 'ytick.color': '.15',\n",
       " 'xtick.direction': 'out',\n",
       " 'ytick.direction': 'out',\n",
       " 'lines.solid_capstyle': 'round',\n",
       " 'patch.edgecolor': 'w',\n",
       " 'image.cmap': 'rocket',\n",
       " 'font.family': ['sans-serif'],\n",
       " 'font.sans-serif': ['Arial',\n",
       "  'DejaVu Sans',\n",
       "  'Liberation Sans',\n",
       "  'Bitstream Vera Sans',\n",
       "  'sans-serif'],\n",
       " 'patch.force_edgecolor': True,\n",
       " 'xtick.bottom': True,\n",
       " 'xtick.top': False,\n",
       " 'ytick.left': True,\n",
       " 'ytick.right': False,\n",
       " 'axes.spines.left': True,\n",
       " 'axes.spines.bottom': True,\n",
       " 'axes.spines.right': True,\n",
       " 'axes.spines.top': True}"
      ]
     },
     "execution_count": 186,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "sns.axes_style()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 187,
   "metadata": {
    "_uuid": "e842cd83a6c71930e5db29364c83790f6997aae2"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1080x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(15,8))\n",
    "sns.set_style(\"ticks\", {\"axes.facecolor\": \".1\"})\n",
    "sns.boxplot(data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "9b88183d1c4b6e5354c95cad1d6a69f19c1392de"
   },
   "source": [
    "### 4.Plotting Context Presets\n",
    "#### 1.Plotting Context Preset - paper"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 188,
   "metadata": {
    "_uuid": "5f817e3d91922aee9283ecc8af87a6216d326a1f"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "<Figure size 1080x576 with 0 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 360x360 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.set()\n",
    "sns.set_context(\"paper\")\n",
    "plt.figure(figsize=(15, 8))\n",
    "sns.lmplot(x='beer_servings', y='wine_servings', data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "7582447457a970b35af417e7bfe7fb9e13045e9d"
   },
   "source": [
    "#### 2.Plotting Preset - talk"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 189,
   "metadata": {
    "_uuid": "9ba82357ef06175b55b23b252f861026631acbde"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "<Figure size 576x432 with 0 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 360x360 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.set()\n",
    "sns.set_context(\"talk\")\n",
    "plt.figure(figsize=(8, 6))\n",
    "sns.lmplot(x='beer_servings', y='wine_servings', data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "cb41c074e1dea6e8a1deb766776bfed9cc2f89e8"
   },
   "source": [
    "#### 3.Plotting Preset - poster"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 190,
   "metadata": {
    "_uuid": "b7c7e9d5f617de2c464e7617dea90c3f519bc702"
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "<Figure size 576x432 with 0 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 360x360 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.set()\n",
    "sns.set_context(\"poster\")\n",
    "plt.figure(figsize=(8, 6))\n",
    "sns.lmplot(x='beer_servings', y='wine_servings', data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "7e8b1c2c4d44cde925e2c9baf4e0d54c29adbca5"
   },
   "source": [
    "# 23.Plotting categorical data <a id=\"231\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://i.stack.imgur.com/IsxzL.png)\n",
    "\n",
    "### In this section, you can learn\n",
    "\n",
    "1. Scatterplots\n",
    "2. Swarmplot\n",
    "3. Boxplot\n",
    "4. Violinplot\n",
    "5. Barplot\n",
    "6. Countplot\n",
    "7. Wide form plots"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8705d5125d49dd7a92a7dfc10d49656e21f51bda"
   },
   "source": [
    "#### 1.Scatterplots"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 191,
   "metadata": {
    "_uuid": "0706495b152a080a101347bd95ef2da581dd72b2"
   },
   "outputs": [],
   "source": [
    "import pandas as pd\n",
    "from matplotlib import pyplot as plt\n",
    "%matplotlib inline\n",
    "import seaborn as sns"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 192,
   "metadata": {
    "_uuid": "9d0aaf8edb925c4c6de5184a8c22a0effe7b50b4"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>id</th>\n",
       "      <th>title</th>\n",
       "      <th>original_air_date</th>\n",
       "      <th>production_code</th>\n",
       "      <th>season</th>\n",
       "      <th>number_in_season</th>\n",
       "      <th>number_in_series</th>\n",
       "      <th>us_viewers_in_millions</th>\n",
       "      <th>views</th>\n",
       "      <th>imdb_rating</th>\n",
       "      <th>imdb_votes</th>\n",
       "      <th>image_url</th>\n",
       "      <th>video_url</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>10</td>\n",
       "      <td>Homer's Night Out</td>\n",
       "      <td>1990-03-25</td>\n",
       "      <td>7G10</td>\n",
       "      <td>1</td>\n",
       "      <td>10</td>\n",
       "      <td>10</td>\n",
       "      <td>30.3</td>\n",
       "      <td>50816.0</td>\n",
       "      <td>7.4</td>\n",
       "      <td>1511.0</td>\n",
       "      <td>http://static-media.fxx.com/img/FX_Networks_-_...</td>\n",
       "      <td>http://www.simpsonsworld.com/video/275197507879</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>12</td>\n",
       "      <td>Krusty Gets Busted</td>\n",
       "      <td>1990-04-29</td>\n",
       "      <td>7G12</td>\n",
       "      <td>1</td>\n",
       "      <td>12</td>\n",
       "      <td>12</td>\n",
       "      <td>30.4</td>\n",
       "      <td>62561.0</td>\n",
       "      <td>8.3</td>\n",
       "      <td>1716.0</td>\n",
       "      <td>http://static-media.fxx.com/img/FX_Networks_-_...</td>\n",
       "      <td>http://www.simpsonsworld.com/video/288019523914</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>14</td>\n",
       "      <td>Bart Gets an \"F\"</td>\n",
       "      <td>1990-10-11</td>\n",
       "      <td>7F03</td>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>14</td>\n",
       "      <td>33.6</td>\n",
       "      <td>59575.0</td>\n",
       "      <td>8.2</td>\n",
       "      <td>1638.0</td>\n",
       "      <td>http://static-media.fxx.com/img/FX_Networks_-_...</td>\n",
       "      <td>http://www.simpsonsworld.com/video/260539459671</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>17</td>\n",
       "      <td>Two Cars in Every Garage and Three Eyes on Eve...</td>\n",
       "      <td>1990-11-01</td>\n",
       "      <td>7F01</td>\n",
       "      <td>2</td>\n",
       "      <td>4</td>\n",
       "      <td>17</td>\n",
       "      <td>26.1</td>\n",
       "      <td>64959.0</td>\n",
       "      <td>8.1</td>\n",
       "      <td>1457.0</td>\n",
       "      <td>http://static-media.fxx.com/img/FX_Networks_-_...</td>\n",
       "      <td>http://www.simpsonsworld.com/video/260537411822</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>19</td>\n",
       "      <td>Dead Putting Society</td>\n",
       "      <td>1990-11-15</td>\n",
       "      <td>7F08</td>\n",
       "      <td>2</td>\n",
       "      <td>6</td>\n",
       "      <td>19</td>\n",
       "      <td>25.4</td>\n",
       "      <td>50691.0</td>\n",
       "      <td>8.0</td>\n",
       "      <td>1366.0</td>\n",
       "      <td>http://static-media.fxx.com/img/FX_Networks_-_...</td>\n",
       "      <td>http://www.simpsonsworld.com/video/260539459670</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   id                       ...                                                               video_url\n",
       "0  10                       ...                         http://www.simpsonsworld.com/video/275197507879\n",
       "1  12                       ...                         http://www.simpsonsworld.com/video/288019523914\n",
       "2  14                       ...                         http://www.simpsonsworld.com/video/260539459671\n",
       "3  17                       ...                         http://www.simpsonsworld.com/video/260537411822\n",
       "4  19                       ...                         http://www.simpsonsworld.com/video/260539459670\n",
       "\n",
       "[5 rows x 13 columns]"
      ]
     },
     "execution_count": 192,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/data_simpsons_episodes.csv')\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 193,
   "metadata": {
    "_uuid": "f05550ce139766da3c76323d65c09356ae0d8613"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1440x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(20,8))\n",
    "sns.stripplot(x=\"season\", y=\"us_viewers_in_millions\", data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "c2b6ebb786dbd98dd546362a2346b8f0e4dca1fa"
   },
   "source": [
    "#### 2.Swarmplot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 194,
   "metadata": {
    "_uuid": "111880843157b6edc22a5ebbf9d06e5686254eeb"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1440x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(20,8))\n",
    "sns.swarmplot(x=\"season\", y=\"us_viewers_in_millions\", data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "4d909c671038041d58a4e4b3dca3d0d5bc3bf883"
   },
   "source": [
    "#### 3.Boxplot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 195,
   "metadata": {
    "_uuid": "2922ba2d47c810f362c7a1b8e6bb4dc0181e18d2"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1440x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(20,8))\n",
    "sns.boxplot(x=\"season\", y=\"us_viewers_in_millions\", data=df);\n",
    "# sns.boxenplot(x=\"season\", y=\"us_viewers_in_millions\", data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "0cb7f658bc504b36ad23d51daae2ec472a6bdff4"
   },
   "source": [
    "#### 4.Violinplot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 196,
   "metadata": {
    "_uuid": "7972ea45bc24a3470acf6c430820456bc39f617a"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1440x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(20,8))\n",
    "sns.violinplot(x=\"season\", y=\"us_viewers_in_millions\", data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "939f3aa8ba0318087eb295ad06aada3fd7b94980"
   },
   "source": [
    "#### 5.Barplot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 197,
   "metadata": {
    "_uuid": "79725418a2049974d70b2e0b595bedf63023c5de"
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAABL8AAAIMCAYAAAAU8g52AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAIABJREFUeJzs3Xt8nFW97/HPTNo0bZq0AQNsbhuO4rKgqCiKqKCgAioqgoriEVDcBRWrPSLbs7fKOd6ouqNsrxxvPSp6VBDxxvaCXERQEVSwxN9L9laRaEuQ0A6lpW0m54+ZtiFMmpknk8wln/frldeT/GY9z/pVhdpv17NWbmxsDEmSJEmSJKkd5RvdgCRJkiRJkjRTDL8kSZIkSZLUtgy/JEmSJEmS1LYMvyRJkiRJktS2DL8kSZIkSZLUtgy/JEmSJEmS1LYMvyRJkiRJktS2DL8kSZIkSZLUtgy/JEmSJEmS1LYMvyRJkiRJktS2DL8kSZIkSZLUtgy/JEmSJEmS1LYMvyRJkiRJktS2DL8kSZIkSZLUtgy/JEmSJEmS1LYMvyRJkiRJktS25jW6gRb3a+BA4H7gjgb3IkmSJEmS1A4eBSwG/gg8cboPy42NjU27oznsPmBJo5uQJEmSJElqQ+uBpdN9iCu/pud+YEmxOMa2baON7kWSJEmSJKnlzZvXQT6fg1LuMv3n1eMhc9gdwD7bto2yfv2mRvciSZIkSZLU8pYsWUhn5zyo0xZTbngvSZIkSZKktmX4JUmSJEmSpLZl+CVJkiRJkqS21dR7fqWUzgWeCTwO2APopXTC4m+B1cAlEfGw4ypTSnngHOBM4DHAKHAr8MmI+OqsNC9JkiRJkqSGa/aVX+cDLwE2ATcAl1Ha7OwY4EvA5eWga4eUUgdwOfBx4CDgh8D1wOHAV1JKF81a95IkSZIkSWqopl75BZwK/DoiNo4vppQOAa4CXgycDnxh3MdvAV4E3A4cExHryvccBPwUeHNK6ScRccUs9C9JkiRJkqQGauqVXxFx/cTgq1xfA3yi/ONzt9fLq77eXv7xnO3BV/meP1BaSQbwLzPTsSRJkiRJkppJU4dfU9hWvj44rvY0SnuD3RUR11W45xvAVuDwlNI+M9yfJEmSJEmSGqwlw6+U0oHA2eUfvz3uoyeWrzdVui8iHgDWlH98wsx0J0mSJEmSpGbR7Ht+AZBSOhM4GpgP7AscSSm4e39EXD5u6IHl65938bg7KQVfB+5iTE06O+fR399Tr8dJkiRJkiSpTloi/AKeTmlj++22Ae8EBiaMW1y+PmyfsHHuL19NqyRJkiRJktpcS4RfEXEWcFZKaSGlFVtnAhcAL08pPT8i/trI/rZs2cb69Zsa2YIkSZIkSVJbWLJkIZ2d9YusWiL82i4iNgG3A+ellNYCHwY+Dry0PGT7qq7uXTxm++qwwow0KUmSJEmSpKbRkhvel60uX09MKc0vf/+n8vUfd3HffhPGSpIkSZIkqU21cvg1Qmnvr3nAbuXaLeXr4ZVuSCktAh5b/vHXM9qdJEmSJEmSGq6Vw6+jKAVf9wH3lGs3AsPAvimloyrc8zJKJ0beFBFDs9JlDYrFUdatW0uxONroViRJkiRJktpC04ZfKaVnpJRemFJ62L5kKaWnA58r//i5iBgFKF8/WK5/KqW0x7h7DgIuLP/4vpnrPJvBwTUsX34mK1aczfLlZzI4uKbRLUmSJEmSJLW8Zt7w/lHAF4D7Ukq3AGuBHuCRwMHlMd8D3jnhvo9QWhV2IvCHlNJVlFZ7PQfoAj4WEVfMfPvVKxZHGRhYRaGwAYBCYQMDA6u4+OLV5PNNm09KkiRJkiQ1vWZOVq4F3gP8BjiI0omOz6N0kuNlwEkR8cLyCZA7lFd/vQQ4F7gDOA44GrgZOC0i3jxrv4IqDQ0N7Qi+tisUNjA0dFeDOpIkSZIkSWoPTbvyKyL+CLwr471F
4OPlr6a3deuWmuqSJEmSJEmqTjOv/JIkSZIkSZKmxfBLkiRJkiRJbcvwS5IkSZIkSW3L8EuSJEmSJElty/BLkiRJkiRJbcvwS5IkSZIkSW3L8EuSJEmSJElty/BLkiRJkiRJbcvwS5IkSZIkSW3L8EuSJEmSJElty/BLkiRJkiRJbcvwS5IkSZIkSW3L8EuSJEmSJElty/BLkiRJkiRJbcvwS5IkSZIkSW3L8EuSJEmSJElty/BLkiRJkiRJbcvwS5IkSZIkSW3L8EuSJEmSJElty/BLkiRJkiRJbWteoxtoJ/39PZnuGxnprljv6+vO9Mzh4UKmPiRJkiRJktqNK78kSZIkSZLUtlz5NQOGP/XlmsaPjNxTuX7p9xnu+2XVz+k/59U1zStJkiRJktTuXPklSZIkSZKktmX4JUmSJEmSpLZl+CVJkiRJkqS2ZfglSZIkSZKktmX4JUmSJEmSpLZl+CVJkiRJkqS2ZfglSZIkSZKktmX4JUmSJEmSpLZl+CVJkiRJkqS2ZfglSZIkSZKktmX4JUmSJEmSpLZl+CVJkiRJkqS2ZfglSZIkSZKktmX4JUmSJEmSpLZl+CVJkiRJkqS2ZfglSZIkSZKktmX4JUmSJEmSpLZl+CVJkiRJkqS2ZfglSZIkSZKktmX4JUmSJEmSpLZl+KWqFYujrFu3lmJxtNGtSJIkSZIkVWVeoxtQaxgcXMPAwCoKhQ309PSycuX5LFt2SKPbkiRJkiRJ2iVXfmlKxeLojuALoFDYwMDAKorFYoM7kyRJkiRJ2jXDL01paGhoR/C1XaGwgaGhuxrUkSRJkiRJUnUMvzSlrVu31FSXJEmSJElqFoZfTWB+vqOmuiRJkiRJkqpj+NUE9uldQk/ngofUejoXsE/v0gZ1JEmSJEmS1B4Mv5pAPpdn5dOO2RGA9XQuYOXTjiGfyzW4M0mSJEmSpNY2r9ENqGRZ/15cfOKpDG/cSH/3YoMvSZIkSZKkOjD8aiL5XJ49F/c0ug1JkiRJkqS20bThV0ppPnAU8HzgaODRQBcwDNwIfDwirqlw32rg9F08OiLiMfXuV5IkSZIkSc2nacMvSoHXj8rfrwWuAzYCBwMnAyenlN4TEe+a5P6fAXdUqP+t3o1KkiRJkiSpOTVz+FUELgMuioifjv8gpfQK4BLgnSmlqyPi6gr3fzYiVs98m5IkSZIkSWpWTRt+RcRPgJ9M8tnXUkrPBV4HvBqoFH5JkiRJkiRpjss3uoFp+HX5um9Du5AkSZIkSVLTatqVX1U4qHydbA+vZ6eUDgUWA+uA64EfRURxNpqTJEmSJElS47Vk+JVS2gs4o/zjZZMMe02F2u0ppVMj4rYZaUySJEmSJElNpeXCr5TSPODLwBLgqoj4zoQhvwFuBn4M3An0AocB7wMeD/w4pXRYRAzVq6fOznn09/fU63HTVu9eRka6K9b7+rqb6tctSZIkSZI0UcuFX8CngWOBv1Da7P4hIuKjE0obge+llH4EXAscAbwDeNMM9ylJkiRJkqQGa6nwK6V0EaUTHtcCx0bE2mrvjYgtKaUPAFcAz69nX1u2bGP9+k1NswpqeLhQ1+eNjGyctF7vuSRJkiRJ0ty2ZMlCOjvrF1m1zGmPKaV/A94MDFMKvv6Q4TG/L1/3qVtjkiRJkiRJalotEX6llD4IrAT+DjwnIm7P+Kjdy9f769KYJEmSJEmSmlrmNWQppU6gGBHbJtRzwNnA0cAC4D+Az0REMeM8FwLnASPAcyPi1qw9Ay8vX2+axjMkSZIkSZLUIjKFXymlfwI+BXyVh286/x3ghPL3OeBFwAvK11rneS9wPnAfpeDr11OMfwKwL3BlRIyOq88DVlB6bRLgI7X2IkmSJEmSpNaTdeXX9nDri+OLKaUTKW0mPwZ8DdgEnAa8IKX0qoj4SrUTpJReBPxL+cc7gHNTSpWG/j4iLix/fwBwOXBvSukW4G5Krzo+DtgbKAJvj4gfVNuHJEmSJEmSWlfW
8OuQ8vWXE+r/nVLw9YGI+FeAlNLPgYuB1wBVh1/AbuO+f3L5q5Jrge3h12+Bi4CnAAcDzyz3cxfwBeATEXFzDT20laynUY6MdFes9/V1Z3qmJ0RKkiRJkqTZkjX82gPYGBH3TagfU75+Zlzty8CngSfWMkFErAZW13jPH4G31HKPJEmSJEmS2lfW8GshsGV8IZXeSdwN+M+I+PP2ekRsSindByzN3KXq6u5Pf7Sm8feOTMw4y/XLvsLdfdX/17rH2eaSkiRJkiRpduUz3nc3sCiltM+42vZ9wK6vML4LWJ9xLkmSJEmSJCmTrOHXL8rXd6eUcimlRwBvorS/1g/HD0wp7U9ppdhfM3cpSZIkSZIkZZA1/PoYkANeR2lF11+A/wYMAd+cMPZ55estGeeSJEmSJEmSMskUfkXEtcDZwEZgMbAA+ANwUkQ8OGH4a8vXH2dtUpIkSZIkScoi68ovIuL/AHsCTwWWAcsi4ubxY1JK84FVwEnAt6fRpyRJkiRJklSzrKc9AqWTHIGbdvH5VuCK6cwhSZIkSZIkZZV55ZckSZIkSZLU7Ka18gsgpbQv8FigD5i/q7ER8cXpzie1omJxlOHhYfr7+8nnOxrdjiRJkiRJc0bm8Cul9FTgIuDwGm4z/NKcMzi4hoGBVRQKG+jp6WXlyvNZtuyQRrclSZIkSdKckCn8Sik9CfgJ0AXkgLuAIWBz/VqTWl+xOLoj+AIoFDYwMLCKiy9eTT7vW8eSJEmSJM20rCu/LgAWArcBZ0bELXXrSGojQ0NDO4Kv7QqFDQwN3cV+++3foK4kSZIkSZo7soZfRwJjwGkR8bs69iO1la1bt9RUlyRJkiRJ9ZX1vasu4H6DL0mSJEmSJDWzrOHXHcCClNK0T4uUJEmSJEmSZkrW8OsLQCfw4jr2IkmSJEmSJNVV1vDr34EfAp9OKT2tjv1IkiRJkiRJdZP1tcV/BX4JPAW4PqX0U+AmoLCrmyLif2ecT5IkSZIkSapZ1vDrAkqnPebKPx8FPHMX43Pl8YZfkiRJkiRJmjVZw68vUgqzJEmSJEmSpKaVKfyKiDPq3Ic0LcXiKMPDw/T395PPdzS6HUmSJEmS1CSyrvzSHDJ/kjBpsvpsGxxcw8DAKgqFDfT09LJy5fksW3ZIo9uSJEmSJElNIOtpj5pD9uldTE9n50NqPZ2d7NO7uEEd7VQsju4IvgAKhQ0MDKyiWCw2uDNJkiRJktQMpr3yK6X0LODlwGFAf7k8DNwCfD0irpnuHGqsfC7HW498Mh+54VcUtmyhp7OTtx75ZPK53NQ3z7ChoaEdwdd2hcIGhobuYr/99m9QV5IkSZIkqVlkDr9SSo8ALgGeUy6NT0IOBA4HlqeUfgS8OiLuydylGm5Z/+58+kXPY3jjA/R3L2qK4Atg69YtNdUlSZIkSdLckin8Sil1Aj8CDqUUet0I/AS4qzxkX+AY4GnAc4EfppSOiAgTiRaWz+XYc3F3o9uQJEmSJEmqWtaVX28CHg/cC7wyIn5UYcw7U0rPA75aHvtG4CMZ55Maqr+/J9N9IyOVw8K+vu5MzxweLmTqQ5IkSZKkuSrrhvevAMaAf5ok+AIgIn4I/BOl1WGnZpxLkiRJkiRJyiTryq8EbAYur2Ls5eWxj8k4l9Q0fvfJF9U0/i8joxXr//mNt7Klr6Pq5zz2Dd+uaV5JkiRJklSSdeXXfGBrRIxNNTAiisBW6nCypCRJkiRJklSLrOHXnUBPSumwqQamlJ4E9JTvkSRJkiRJkmZN1vDr+5T28fpcSql/skEppT2Bz1HaH+x7GeeSJEmSJEmSMsn6KuIq4HTgUOD3KaXPANcAQ0AXsD/wbOAMYBGlUyE/OM1eJUmSJEmSpJpkCr8i4u6U0vOBbwF7AeeVvybKAX8DXhIRd2fuUpIkSZIkScog62uPRMQvgYOBdwO3UXq1MVf+GivX3gUcEhE3Tb9VSZIkSZIkqTbTOoExIu4D3gO8J6U0H9it/NG9
EbF1us1pbunv76n5npGR7or1vr7uTM8bHi7UfI8kSZIkSWpe0wq/xiuHXevq9TxJkiRJkiRpuuoWfkn1MvSJN1Y99u6RzZXrX19Fd19X1c/Z542fqHqsJEmSJElqHVOGXymlo8rfPhARv5pQq0lEXJflPkmSJEmSJCmLalZ+XUNpA/ugtMH9+FotxqqcT5IkSZIkSaqLasKoOykFV3+tUJMkSZIkSZKa1pThV0QcUE1NkiRJkiRJajb5RjcgSZIkSZIkzRTDL0mSJEmSJLUtwy9pBs3ryNVUlyRJkiRJ9TXlnl8ppc/Xaa6xiHhdnZ4ltYQ9e3J0d8LGLTtr3Z2luiRJkiRJmnnVnPZ4BqWTHaf7p/UxwPBLc0o+l+O1R3Tx+Z9vZuOWUvD12iO6yOcMvyRJkiRJmg3VhF//a8a7kNrYo/o7eO8LF3HvxjF2684ZfEmSJEmSNIumDL8iwvBLmqZ8LscjFht6SZIkSZI029zwXpIkSZIkSW3L8Estbf4kpyZOVpckSZIkSXOL4Zda2t49nfR0djyk1tPZwd49nQ3qSJIkSZIkNZMp9/xKKY3Waa6xiKhmg32pavlcjnOP2IeP/XyIwpZRejo7OPeIfdxUXpIkSZIkAdWd9livFME0QjPiMY9YxMde8CjueWArj1g03+BLkiRJkiTtUE34deCMd1FBSmk+cBTwfOBo4NFAFzAM3Ah8PCKu2cX9rwLOAQ4FOoDfA18APhURxRltXrMun8uxR7evOkqSJEmSpIeaMvyKiD/PRiMVHA38qPz9WuA6YCNwMHAycHJK6T0R8a6JN6aUPgG8AdgMXAVsBY4FPg4cm1I6xQBMkiRJkiSp/TXzHlxF4DLgooj46fgPUkqvAC4B3plSujoirh732cmUgq+1wFER8YdyfU/gauAk4Fzgoln5VUiSJEmSJKlhmva0x4j4SUScMjH4Kn/2NWB1+cdXT/j4HeXr+duDr/I96yi9Bgnwzymlpv21S5IkSZIkqT6qOe3xNeVv10fEFRNqNYmIL2a5bxK/Ll/33V5IKe0LPAnYAnyjwvzXppSGgH2AI4Ab6tiPJEmSJEmSmkw1rz2uBsaAAK6YUKtVPcOvg8rXv42rPbF8XRMRmya57yZK4dcTMfySJEmSJElqa9WEX9dRCrrurFBriJTSXsAZ5R8vG/fR9pMpd7VJ//ZfR91OsezsnEd/f0+9HjdtzdRLJc3cXzP3BjPX3+joKOvWrWPPPfeko6NjRuaQJEmSJKkRqjnt8VnV1GZLSmke8GVgCXBVRHxn3MeLy9eNu3jE/eVrc6cc0iy59dZbueCCC1i/fj1Llizhggsu4NBDD210W5IkSZIk1UUzn/Y4mU8DxwJ/4eGb3TfEli3bWL9+U9OsGhoeLlSs29/Umrk3mLy/rIrFUd71rndTKGwAYP369bzrXe/m4otXk897JoQkSZIkafYtWbKQzs76RVYt9afblNJFwOuAtcCxEbF2wpDtq7q6d/GY7avD6psiSC1oaGhoR/C1XaGwgaGhuxrUkSRJkiRJ9dUy4VdK6d+ANwPDlIKvP1QY9qfy9R938aj9JoyV5qytW7fUVJckSZIkqdVkXkNW3nvrLOAU4LFA3xTPG4uITPOllD4IrAT+DjwnIm6fZOivy9dDUkoLJznx8fAJYyVJkiRJktSmMq38Sin1AT8HPgEcA+wBzAdyu/jKOteFwHnACPDciLh1srER8RfgFqATeFmFZx0N7Evptckbs/QjSZIkSZKk1pF15dcHgMMo7Zv1IeAqYB0wWqe+AEgpvRc4H7iPUvBVzWqtDwDfAFallG6IiDvKz9oD+GR5zIURUaxnr5IkSZIkSWo+WcOvlwBjwGkR8d069rNDSulFwL+Uf7wDODelVGno7yPiwu0/RMSlKaVPAecAt6WUfgxspXRCZC/wLeDjM9GzJEmSJEmSmkvW8KsH2AR8r469TLTbuO+fXP6q5FrgwvGFiHhDSul64I3A0UAH8Hvg
88CnXPUlSZIkSZI0N2QNv/4IHFjPRiaKiNXA6mnc/xXgK/XqR5IkSZIkSa0n0yb0wJeALuC4OvYiSZIkSZIk1VXW8GsAuA74XErp6XXsR5IkSZIkSaqbTK89RsTWlNLxwIeB61JKNwC/A/42xX3/O8t8kiRJkiRJUhZZ9/wCeCHwYiAHPB04chdjc5ROhzT8kiRJkiRJ0qzJFH6llE4AvkbptckNwM+Bu4HR+rUmqVr9/T2Z7hsZ6a5Y7+vrzvTM4eFCpj4kSZIkSZopWVd+/Sul4OtbwKsj4oH6tSRJkiRJkiTVR9bw63GUXmN8vcGX1Dx++pkX1jT+r/cWK9Z/fflbWLdb9edhPPP1361pXkmSJEmSZkvW8GszsC0i/l7PZiRJkiRJkqR6qn5px0PdCPSmlPrr2YwkSZIkSZJUT1nDr/dR2tz+vXXsRZIkSZIkSaqrTOFXRPwSeBnw8pTSj1JKz0kp7Vnf1iRJkiRJkqTpybTnV0ppdNyPx5S/SCnt6raxiMi6x5gkSZIkSZJUs6xhVG6W7pEkSZIkSZIyyxp+HVjXLiRJkiRJkqQZkCn8iog/17sRSZIkSZIkqd6ynvZYFymlv6WUtjWyB0mSJEmSJLWvhoZfZe4FJkmSJEmSpBnRDOGXJEmSJEmSNCMMvyRJkiRJktS2DL+kOWxeR211SZIkSZJajeGXNIc9oifHogUPrS1aAI/odSs+SZIkSVJ7MPyS5rB8PsfLn965IwBbtABe/vRO8jnDL0mSJElSe5jX6AYkNdYBe3Twthd3cd8DYyztzhl8SZIkSZLaiuGXJPL5HLstNvSSJEmSJLUfX3uUJEmSJElS2zL8kiRJkiRJUttqdPjle1aSJEmSJEmaMY3e8+tDwOIG9yBJkiRJkqQ21dDwKyL+rZHzS5IkSZIkqb1NK/xKKS0DTgYeC/QB83cxfCwijp3OfJIkSZIkSVItModfKaUB4M2U9u2qZu+usaxzSZIkSZIkSVlkCr9SSm8E3lL+8TbgCmAI2FynviRJkiRJkqRpy7ry6/WUVnJ9LCLeMtVgSZIkSZIkqRHyGe97dPn6rno1IkmSJEmSJNVb1pVfG4HNEbGhns1IkiRJkiRJ9ZR15dcvgN6UUn89m5GkSorFUdatW0uxONroViRJkiRJLSZr+PUBSnt+/Usde5GkhxkcXMPy5WeyYsXZLF9+JoODaxrdkiRJkiSphWQKvyLiZ8BZwPKU0qdTSgfUtStJorTia2BgFYVC6Q3rQmEDAwOrKBaLDe5MkiRJktQqMu35lVL6r/K3o5ROfnx9SuleoLCL28Yi4pFZ5pM0Nw0NDe0IvrYrFDYwNHQX++23f4O6kiRJkiS1kqwb3h9QobZ7+WsyYxnnkjRHbd26paa6JEmSJEkTZQ2/nl3XLiRJkiRJkqQZkCn8iohr692IJElqL8XiKMPDw/T395PPdzS6HUmSJM1RWVd+SZIkTWpwcM2OAyt6enpZufJ8li07pNFtSZIkaQ7KdNqjJEnSZDypVZIkSc1kypVfKaXXlL9dHxFXTKjVJCK+mOU+SZLUOjypVZIkSc2kmtceV1M6qTGAKybUamX4JUlSm/OkVkmSJDWTasKv6ygFXXdWqEmSJEmSJElNa8rwKyKeVU1NkiRJkiRJajYN3fA+pXRESumoRvYgSe2uWBxl3bq1FIujjW5FkiRJkmZdNa89zqTLgf4m6EOS2tLg4Jodp+719PSycuX5LFt2SKPbklSFYnGU4eFh+vv7yec7Gt2OJElSy2royq+yXKMbkKR2VCyO7gi+oHTa3sDAKorFYoM7kzSVwcE1LF9+JitWnM3y5WcyOLim0S1JkiS1LFdcSZpx/f09me4bGemuWO/r6870zOHhQqY+WtXQ0NCO4Gu7QmEDQ0N3sd9++zeoK0lTmSy4vvji1eTzzfD3lpIkSa3F8EuSpqlZX03aunVLTXVpIoPrxjC4liRJqq+mDr9SSgk4HjgceDLwaEqvSb4s
Ii6d5J7VwOm7eGxExGPq3KqkKnzv8yfUNH7d3yu/nnf9Fefyh92rX/3wgtdeWdO8tXBPLUn1ZnAtSZJUX00dfgHnACsy3vsz4I4K9b9lb0eSdvLVJM0Vz7/8gprGF++uvFLrzVdfTH5N9Su/vn9SbfNKkiRJlTR7+PU74EPAr4Cbgc8BR1d572cjYvUM9SVJvpokSZIkSS2gqcOviPjs+J9Lb0FKUnPw1SRJkiRJan6+lyNJkiRJkqS21dQrv6bp2SmlQ4HFwDrgeuBHEVF5B21JkiRJkiS1nXYOv15ToXZ7SunUiLitnhN1ds7LfBz8TGimXipp5v6auTewv+mqd38jI90V63193U3xn0Wz9ydVa67979V/diVJkuqr0a895mbgmb8B3gwcTGnV197AC4Hflms/TintMwPzSpIkSZIkqck0euXXSUBnPR8YER+dUNoIfC+l9CPgWuAI4B3Am+o155Yt21i/flPT/G3s8HDlI+btb2rN3BvY33RN1l9WIyMbJ63Xe64smr0/Nb92/We32fnPriRJmuuWLFlIZ2f9IqtMT0opdQH7A5sj4s4Jny0C3g0cDSwA/gN4X0TcP/E5EXFjlvmziIgtKaUPAFcAz5+teSVJkiRJktQ4WV97XA4MAv9zfDGl1AFcB7wNeArweODtlF41bPQqM4Dfl6++9ihJkiRJkjQHZA2kji9fvzShfipwGLAJuKh8XQkcDrwe+FTG+epl9/L1YavQJM1dWV/tqvem1L7OpLbRMcnfrU1WlyRJkmZQ1vDr0eXrbyfUXwmMAe+MiAGAlNLtwDcoBWONDr9eXr7e1NAuJKkGhnNqNbm+RdA1DzZv21nsmkdut0WNa0qSJElzVtbwqx/YMH4fr5RSHjiq/OOXx429AigCh2Scq2oppScA+wJXRsTouPo8YAWlUyABPjLTvUhqPV9dfVxN4+/5+1jF+g++8yZu3r36w2xfecYPappXana5fI75JxzM1itvLwVgXfOYf8LB5HIzccizJEmStGtZw69OSoHWeI8DFgO3R8Td24sRsS2ldB9Q8zKDlNJhwCfHlQ4uX9+fUnrbuDmOKH97AHA5cG9K6RbgbkqvOj4O2Lvc89sjwj9pSi2go6O2eru7+Eu1hXN74QzsAAAgAElEQVT33lM5nLvse29it19UH0Is/+/+K1O1y++zhM7XHgGFzdDbZfAlSZKkhskafv0N2D+l9MiI+M9y7YXl608rjO8G/p5hnl7gqRXqB00y/reU9hp7CqWg7JmUXsO8C/gC8ImIuDlDH5IaYLfeHAsXwKYHd9YWLoDdl/iHaKkV5PI5WLKw0W00jK8sS5IkNYes4ddPgdOAf0spnUVpVdW5lIKmK8cPTCkdBCwA/lrrJBFxDVD1n3Ij4o/AW2qdR1JzyudznPjMeXznp9vY9GAp+DrxmfNcQSJJkiRJqlrW8OvDwCuAE4F15VoOuB347oSxzy9ff5FxLklz2L575ln+0vls2AhLFmPwJanlvOCyz9Y0vnj3vRXrK676Fvnbdqv6Od87+aya5pUkSWpXmc4cj4hbgZOAP1MKvcaAq4ETI2LiXmCvL19/nLVJSXNbPp9jaU/O4EuSJEmSVLOsK7+IiO8D/y2l1A/cHxGbJo4pn7J4WvnHwaxzSZIkSZIkSVlkCr9SSm8uf3tpREy6l1dEbKO0Cb0kSZIkSZI067Ku/PoIMAp8uo69SJIkSZIkSXWVNfy6B5gXEVvq2YwkSZIkSZJUT5k2vAduAZaU9/uSpDmpo6O2uiRJkiRp9mUNv/69fO8769iLJLWUJb3QteChta4FsHRJY/qRJEmSJD1cpvArIq4E3gacnVL6Ukrp8fVtS5KaXz6f49ijO3YEYF0L4NijO8jlco1tTJIkSZK0Q9bTHv+r/O024FXAq1JKm4C/U9oIv5KxiHhklvkkqVn9w545XnVKB/dvhJ7FGHxJkiRJUpPJuuH9ARVqi8pfkxnLOJckNbV8PkdvT6O7
eDj3JJMkSZKk7OHXs+vahSSp7nqWwIIF8OCDO2sLFkDv0sb1JEmSJEmzLVP4FRHX1rsRSVJ95fM5jnx2BzdcPcqDD5aCryOf7Z5kkiRJkuaWrCu/JEktoH+vHCe+ooMHNkK3e5JJkiRJmoOmHX6llOYBTwL2AxZFxBen3ZUkqW7y+RyLm3BPMkmSJEmaDdMKv1JK5wPnAX3jyl8c9/lS4AagEzgqIv46nfkkSZLqpVgcZXh4mP7+fvL5JjoJYt4kvUxWlyRJ0i7ls96YUroEeD+l4OuPwLaJYyLiPuBa4EDg1KxzSZIk1dPg4BqWLz+TFSvOZvnyMxkcXNPolnbILe2BrgUPLXYtINfX25iGJEmSWlym8CuldCrwSmAtcGREPAq4d5LhlwA54DmZOpQkSaqjYnGUgYFVFAobACgUNjAwsIpisdjgzkpy+Tydxz9jZwDWtYDO45/hnn2SJEkZZX3t8XXAGPCWiPjFFGN/BRSBx2acS5IkqW6GhoZ2BF/bFQobGBq6i/32279BXT1Ufu89WHDGSxgrPECut9vgS5IkaRqyvvb4RErh17enGhgRm4H1QH/GuSRJkupm69YtNdUbJZfPk1+y2OBLkiRpmrKGX4uBQkQ8WOX4TmA041ySJEmSJElSJlnDr2GgN6XUM9XAlNJBQDdwV8a5JEmSJEmSpEyyhl8/K19fVsXY8yi9Inl1xrkkSZIkSZKkTLKGXx+jdILje1NKFTeyTyktSCm9DziLUvj18YxzSZIkSZIkSZlkOu0xIn6WUvoQpVVdv0gp/RjoAUgpDQD7A88C+sq3vCsi1ky/XUmSJEmSJKl6mcIvgIg4P6X0V+A9wInjPlpBaVUYwEbgHRHhqi9JklRX/f1Tbj1a0chId8V6X193pmcODxcy9SFJkqTZkTn8AoiIi1JKq4GTgSOBf6D0KuU64EbgGxFx73SblCRJkiRJkrKYVvgFEBHrgc+XvyRJkmbVC7750ZrGF+++r2J9xU++Qv53S6t+zvde+paa5pUkSVJjZN3wXpIkSZIkSWp6mVZ+pZR+AlwDXAfcGBEP1rMpSZIkSZIkqR6yvvb4LODo8vdbUko3AddSCsN+FhEP1KE3SZIkNalicZTh4WH6+/vJ5zsa3Y4kSdKksoZf5wBHUQrA9gaeATwd+J/AtpTSLZTCsGuB6yPCY5AkSZLaxODgGgYGVlEobKCnp5eVK89n2bJDGt3WQxjOSZKk7TLt+RURF0fEaRGxL3AQcBbwZeAvwHzgqcDbge8C96aUbkopfbhOPUuSJKlBisXRHcEXQKGwgYGBVRSLxQZ3ttPg4BqWLz+TFSvOZvnyMxkcXNPoliRJUgNNe8P7iPjPiPh8RJweEQcABwJnAF8A/gR0AE8C3jrduSRJktRYQ0NDO4Kv7QqFDQwN3dWgjh6qFcI5SZI0u2bitMfecV+LZ+D5kiRJapCtW7fUVJ9tzR7OSZKk2Zd1zy8AUko54AmU9v46Gngm0Ff+OAfcD/yQnft/SZIkNVbHJPs/TVZXS2n2cE6SJM2+TOFXSuk8ShveP4PSCq9c+aP1wPfZefLjzRExWoc+JUmS6iLXtxi6OmHzuDCkq5Pcbi5YlyRJakdZV36tAsaAAvAt4KeUAq/fRMRYnXqTJEmqu1w+R+cJT2bLlb8qBWBdnXSe8GRyudzUN0uSJKnlTOe1xxzQAyTgb8BQ+evuOvQlSZI0Y/J7786CM5/HWOEBcr2LDL4kSZLaWNbw6xR27vP1OOBg4ByAlFIA11BaCXZNRKybfpuSJEn1lcvnyC3pbnQbkiRJmmGZwq+I+CbwTYCUUh+lje6PBp4FPB54DLC8/PkfKIdhEfHVaXcsSZIkSZIkVWlapz0CRMQI8O3yFymlXkob4R8NHAccChwEnAUYfkmSJEmSJGnWTDv8Gi+l1AU8CXhK+evRlDbGdyMNSZIkSZIkzbpphV8ppUXA09m5/9fhwPzyx9sDr3vYeRqkJEmSJEmSNGsy
hV8ppVXAUcBh456xPez6G3Bd+evaiLh9uk1KkiRJkiRJWWRd+XXeuO/vpBx0UQq77ph2V5IkSZIkSVIdZA2/Pk8p7LouIv5cx34kSZI0C/r7ezLdNzLSXbHe19ed6ZnDw4VMfUiSJFUrU/gVEWfVuxFJkiRJkiSp3up62qMkSbUqFkcZHh6mv7+ffL6j0e1Ic84LL72kpvHF4Xsq1t9y1ZXkb72p6ud895TTappXkiQpq+me9ngg8FbgucB+QFdEzBv3+VLgzcAYcGFEbJ3OfJKk9jI4uIaBgVUUChvo6ell5crzWbbskEa3JUmSJKmN5LPemFI6CbgVeCOQgEXsPPERgIi4DzgGuAB4UeYuJUltp1gc3RF8ARQKGxgYWEWxWGxwZ5IkSZLaSabwK6X0GOASoBv4P8BRQOU18PAZSqHYC7PMJUlqT0NDQzuCr+0KhQ0MDd3VoI4kSZIktaOsrz2eB3QBH4mI/wGQUhqdZOyPy9enZJxLktSGtm7dUlNdkiRJkrLIGn4dS2kfrw9ONTAi1qWUNlLaE6wmKaUEHA8cDjwZeDSlVWQvi4hLp7j3VcA5wKFAB/B74AvApyLCd2okSZIkSZLmgKzh115AISLWVTn+QWBxhnnOAVbUelNK6RPAG4DNwFXAVkqB3ceBY1NKpxiASZIkNa/+/p5M942MdFes9/V1Z3rm8HAhUx+SJKl5ZN3wfiPQnVKa8kz6lFIPsBS4N8M8vwM+BLwCeBRwbRXznUwp+FoLHBoRL4yIk4CDgEHgJODcDL1IkiRJkiSpxWRd+bUGeDrwJOCXU4x9BaWQ7eZaJ4mIz47/ufQW5JTeUb6eHxF/GPesdSmlc4BrgH9OKX3M1V+SJEnN7cRLL69pfHH47or1t151Nflb11T9nO+cclJN80qSpOaVdeXX1yntvfWelNKkz0gpPQ64kNL+YJdknKtqKaV9KQVyW4BvTPw8Iq4Fhii9tnnETPcjSZIkTVQsjrJu3VqKxcnOi5IkSfWUdeXXxcBZwHOAq1JK/779WeXA6x+BE4AzgIXA9cDXpttsFZ5Yvq6JiE2TjLkJ2Kc89oZZ6EmSJEkCYHBwDQMDqygUNtDT08vKleezbNkhjW7rIYrFUYaHh+nv7yefn3KXE0mSml6mlV8RsZXSKYw3A0cDlwK7lT/+DXAFcDal4OvnwEsjYmza3U7twPL1z7sYc+eEsZIkSdKMKxZHdwRfAIXCBgYGVlEsNs9OHIODa1i+/ExWrDib5cvPZHCw+ldFJUlqVllXfhERa1NKR1Ja3XU6cDjQWf54FPgVsBr4XERsm16bVdt+ouTGXYy5v3zNdoRQBZ2d8zKfSDQTmqmXSpq5v2buDexvuuxveurdX71PZFP7avb/Pcy5/jomWQk0WX0Kc+k/vz/96U87gq/tCoUNbNo0wgEHHFC3ebIaHR3lox/94EPCuY9+9INceuml5PNZd0uRJKnxModfAOVQ67PAZ8snP+5GaTXZ32cx8JIkSdIsyS1dAl1dsHnzzmJXF7m+pY1rqkVs2bKlpvps+8tf/sL69esfUlu/fj133nlnU4RzkiRlNa3wa7yIGAWG6/W8jLav6qq8nKBk++qwQr0m3bJlG+vXb2qav7kcHq78S7O/qTVzb2B/02V/0zNVf6v+33E1PW/DcOW34f/vD95E7y25qp9z/qk/2GV/yq5V/rfXaHOtv1w+T+dxx7DlBz8pBWBdXXQedwy5XPX/3M5kf/VWz3+3jIxUfjlhZGRjU/w7bN26kUnr3d27z3I3kqS5bMmShXR21i2yql/41ST+VL7+4y7G7DdhrCRJFbnps1RZx9570XX6qYzdfz+5np7MwZckSdJsmDL8SikdVf72gYj41YRaTSLiuiz31eDX5eshKaWFk5z4ePiEsZIkPUwrnMgmNVIunyfX29voNiRJkqZUzcqva4AxIICDJ9RqMVblfJlFxF9SSrcAhwEvA744/vOU0tHAvsBa4MaZ7EWS1Lom
O5Ht4otXu+mzJEmS1GKqCaPupBRc/bVCrRl9APgGsCqldENE3AGQUtoD+GR5zIUR0TxnSkuSmsrQ0FDFE9mGhu5iv/32b1BXkppJlj3J6n3KbTPsEyZJUiuYMvyKiAOqqc2ElNJh7AysYOfKs/enlN42rp8jxn1/aUrpU8A5wG0ppR8DW4FjgV7gW8DHZ7p3SVLr2rq18slrk9UlSZIkNa9m3/C+F3hqhfpBu7opIt6QUroeeCNwNNAB/B74PPApV31JUuNNtn+8+8pXzw35pcZ78aU/qHrs6PDaivWVV91Ix61/rPo5V5xS2+m6kiTNdZnCr5RSfjYCpIi4Bsh0fFBEfAX4Sl0bkiTVTfdSmN8FWzfvrM3vgu6+xvXUStyQX5IkSapO1l17/5pS+veU0tPq2o0kac7I5XM8/jkdzO8q/Ty/Cx7/nA5yuUx/5zGnTLYhf7HowmaJjklWQU5WlyRJbS/ra497UHql8I0ppT8BXwW+GhFr6tSXJGkO2O0fcjzrtA423Q8LezD4qpIb8kuTyy3tg64u2DxuWWlXF7m+3RrXlCRJaqis4ddpwCuB5wEHAu8A3pFS+h1wCfD/IuLO+rQoSWpnuXyORb2N7qK1uCG/NLlcPs+C407gwR9cWQrAurpYcNwJcypcz3JyJHgapSSpfWUKvyLiq8BXU0pLgZOBV1HaWP5xwAconcZ4A6U9t74REX+vU7+SJEnSLnXsvQ8LT38tY/cXyPX0zqngay7wsA9JUq2mddpjRNwHfA74XEppL+DU8tdTgGcATwcuSin9GLikvAm9JEmSNKNy+Ty53iWNbuOhOib5v96T1afppZfdUNP40buHKtbPu+q3dNx2T9XP+ebJR9Y0by087EOSlEXWDe8fJiLWRsRHI+II4FHAO4HbgfnACcCX6jWXJEmS1GryS3eDroUPLXYtJN+3e2MaajEe9iFJyqpu4dd4EfFfwIXA+cAtMzGHJEmS1Epy+TwLj3vJzgCsayELj3uJr2VWaVeHfUiStCt1X2OdUnompc3wTwHG/zXW3+o9lyRJU8mySfNsbfrsptTS3NOx9350n/5Gxu7fQK5nicFXDTzsQ5KUVV3Cr5TSEyhtev8KYN9yOQfcB3yT0sb3V9djLkmSJKmVlfYjW9roNiRJmjMyh18ppUdRWuH1SiCVyzlgM/BdSoHX9yPCv4qRJDXcisuOr3rsprvHKtY/fNW5LLyt+lUaF538H1WNO+GK06t+JkDx7sq/tZ577bvJD3ZW/ZwrX/x/a5pXkiRJakWZwq+U0i+BJ5V/zAGjwFWUAq9vRsT99WlPkiRJkiRJyi7ryq8nl68/pxR4fS0ihuvTkiRJkiRJklQfWcOvfwW+EhF/qmMvkiRJkiRJUl1lCr8i4v31mLz8+uTuEfHIejxPkiRJUkbzJvmjwWR1SZJaRKN/J9sP2KPBPUiSJElzXn5pP7muRYxtfmBHLde1iHxff13n6e/vyXTfyEh3xXpfX3emZw4PFzL10eqKxVGGh4fp7+8nn+9odDuSNCvyjW5AkiTVqGOSEycnq0tSFXL5PIuOfyW5rkWln7sWlX7O+UeGdjE4uIbly89kxYqzWb78TAYH1zS6JUmaFY1e+SVJkmqU65sHXXnYXNxZ7MqT283f1iVNz7y9D6DnjPMZK9xHrnfpjAZfL7/s9zWN33b3nRXr/3zVn5h3W7HiZ5V8/eTH1DRvuygWRxkYWEWhsAGAQmEDAwOruPji1eTzBpyS2pv/lpMkqcXk8jnmvWD3UgAG0JVn3gt2J5dz5Zek6cvl8+SX7OaKrzYzNDS0I/jarlDYwNDQXQ3qSJJmj39FLElSC8rvs4D5Z/0DFEaht8PgS5K0S1u3bqmp3ijuSSZpJhh+SZI0QW6S3x0nqzdKLp+DJU3WlCRJGQ0OrtnxamZPTy8rV57PsmWHNLotSW3AtcySJE2wYCl0dD201tEFC/oa048kqXUUi6OsW7eWYnG00a20lMn2JCsW
q9/PTZImY/glSdIEuXyO/Y/v2BGAdXTB/sf7aqEkadc8TTE79ySTNJMMvyRJqqB77zyPOWMejz5tHo85cx7de/tbpiRpcq5cmp5W2ZNMUmvy/8lLkjSJXD5H55KcK74kSVNy5ZIkNS/DL0mSJEnNb94kB3xMVp9lrlySpObV6N8pvg70NrgHSZIkSU2uY+le5LoWM7b5/h21XNdiOvr2amBXkqRWMCPhV0rpscAzgAXA/2/vzuMkqev7j79m9mDZ3VlYYFhdQDRqPi78PBBQoj8FJShEvKKoPzQBjccPD0Q80CReMR5Es4KCqPFYr/xiEhVNDCqKIngkCp4In6DxYnWXARYYzl139vdH1cAwzsxOV3dX9da+no/HPGqmurrr3dXT9Zn5dNW3zs/Mn8y0XGa+tB/rlyRJktQuQ8PDjBz9XMa/8AG23XYTQ0uWM3L0cxka8mQWSdLcKjW/IuKxwOuBizPzVdNuezXwJu48pXJbRPxVZp7eVVJJkiRJO7VFq+/LyhPfysT4dQyv2MPGlyRpXqpWi6cBDwV+NHVmRDwIeDOwAFgP/KJcx1si4uHVY0qSJElScQTYgt32svElSZq3qhXjoeX0S9PmPx8YAj4N3DMz7w2cVc57YcV1SZIkSZIkSZVUHfNrb2BzZm6cNv9oYBvw1sycKOf9LfBiwCO/JEmSJKkLo6Mjle63adOyGeevXLms0mOOjY1XyiFJTaja/NoduGnqjIi4O3BP4JrMvGRyfmZeHRHjwKqqISVJkiSpDjaXuuP2kzSIqja/bgRWRsSyzLy5nPfocnrxDMtvA26vuC5JkiRJ0hQf+fRYR8tfe/WmGef/+wWb2PPH83+sE/50tKP1ts3ExFbGxsYYHR1leHhB03EkzVPV5tcPgcOB5wDvjoghivG+tgFfnbpgRKwEVgDZRU5JkiRJqs3rP/Objpa/6eqZG0jv++oYyy9bPu/HeeOTV3e03kF13iev6Wj5DWPXzzj/G1+6np99b/6PdczT9+povZ24/PLLWLv2dMbHb2RkZAWnnnoaa9Yc2Lf1SeqdqgPef5RiEPu/j4jPA/8FPAK4Ffinacs+spxeXnFdkiRJkiQ1ZmJi6x2NL4Dx8RtZu/Z0JiYmtnNPSYOgavPrI8D/ozhy7BjgYGAz8OLMnP6Rx7PK6VcqrkuSJEmSpMasX7/+jsbXpPHxG1m//qqGEknqRKXTHjNzG/DMiHgvcBjFGGBfycyfTl0uIhYBvwDOBD7XXVRJkiRJkuq3ZcvmjuZLGixVx/wCIDMvAi6a4/YtwCu7WYckSZIkSZJUVdXTHiVJkiRJkqSBZ/NLkiRJkiRJrVXptMeI2Frhbtsys6vTLCVJkiRpEA0tWNTRfElSfaoe+TVU4cujzCRJkiS10tKVq1m4ZOQu8xYuGWHpHqsbSrRjWThLk3C2+ZLUiaoNqXtt5+tBwPOAy4DrgCeW8yVJkiSpdYaGh1lzzMl3NMAWLhlhzTEnMzQ0GMcALJiliTTb/Lrtsfs+7DqtebjrkhH2XLlPQ4kktUml0xAz85fzWOyHEfEx4Dzgg8DBVdYlSZIkSTuC3fa5Hw99zlncPn4Nu6zYa2AaXwArVu7DLktGuP228Tvm7bJkhN32GIzm0vDwME846lQ+d/5abr1tnF2XjPCEo04dqG2o7kxMbGVsbIzR0VGGhxc0HUc7mb6OwZWZmyPiZOBHwOuB5/ZzfZIkSZLUpKHhYZbstnfTMX7P8PAwRxxzKl87by233zbOLktGOOKYwWou7Xf3AzjpWe/nxpvG2G1ktC/ZRkdHtr/QDDZtWjbj/JUrl1V6zLGx8e0v1KFBbi5dfvllrF17OuPjNzIysoJTTz2NNWsObDqWdiJ9H4A+My+LiBuBo/u9LkmSJEnSzFatXsNxz34fN4+PsXxFf5pL3RoeHmb3FauajrHDGeTm0sTE1juyAYyP38jatafzvvetY3h48H4H1U59b35FxGJgKbBLv9clSZIkSZrd
8PAwI7vZXPr2uqs7Wn79tdfNOP9H/3Yd1+05/8c67MTeHxU46M2l9evX35Ft0vj4jaxffxX77XePhlJpZ1PHO+F4iibbb2pYlyRJkiRJO425mkuDYMuWzR3Nb8rExFY2btzAxMTWpqOoDyod+RUR22vPLgH2pbjK4/OAbcC/VFmXJEmSJEma2Y7SXBpkg3zaqHqj6mmPP+9g2SHgP4E3VVyXJEmSJElSzw36aaPqjaqv5NB2viaA64ALgRcCj8jMm7tOK0mSJEmS1CODftqoeqPSkV+ZaftTkiRJkiTt0DxtdOdgE0uSJEmSJEmtZfNLkiRJkiRJrVV1wPuBFhHrgBPmWCQz8341xZEkSZIk7cAWLljU0fyqRkdHOr7Ppk3LZpy/cuWySo83Njbe8X2kQdfK5tcU3wB+OsP839YdRJIkSZK0YxpdsQ9LdxnhltvvbAwt3WWE0d32aTCVpPlqe/PrA5m5rukQkiRJkqQd1/DwMMcf/jL+8cJ3csvt4yzdZYTjD38Zw0P9GUnov8/eOO9lf7np2pnn//O1sHL+j/OHL1o172WlHU3bm1+SJEmSJHXtXqvW8Jqnvpfrbx5j9+WjfWt8Seo9m1+SJEmSJM3D8PAwe4x4hJS0o2l78+tREfEAYDmwEbgYOD8zJ5qNJUmSJEmSpDq0vfn15zPM+0lEPCMzf9SrlSxevLDSVTT6ZZCyzGSQ8w1yNjBft8zXHfNVN8jZwHzdMl93zFfdIGcD83XLfN0xX3W9ztbrq1H22qDnU2+09STl7wMnAwdQHPW1GjgW+EE578sR4WU5JEmSJEmSWq6VR35l5hnTZt0MfD4izgcuBA4DXgO8uBfr27z5d9xww60D0xUeGxufcb75tm+Qs4H5umW+7pivukHOBubrlvm6Y77qBjkbmK9b5uuO+arbXrYNb/9lR4937Q2/nXn+R3/Lht12mffj3O2V+8+Zr6pNm26edX6v19WNiYmtjI2NMTo6yvDwgqbj9N1uu+3K4sW9a1m1svk1m8zcHBFvBT4L/EnTeSRJkiRJ6saiBYs7mt9WVRuHvT7tsR8Ns8svv4y1a09nfPxGRkZWcOqpp7FmzYE9X0+btfW0x7lcUU497VGSJEmStEO728hqli++a5Nm+eIR7jayuqFE6qWJia13NL4AxsdvZO3a05mY8Dp+ndipjvwq7VlOb2o0hSRJkiRJXRoeGub5h53C+799BjdtHmf54hGef9gpDA/tjMe6wMZ3fr+j5a+9/tczz/9EsnH3W+b9OKte9qCO1jtf69evv6PxNWl8/EbWr7+K/fa7R1/W2UY7Y/PraeX0O42mkCRJkiSpB+671xpOf9w5XHvLGHsuHd1pG19ttGXL5o7ma2ata35FxIOAfYHzMnPrlPkLgZdSXAUS4J0NxJMkSZIkqeeGh4YZXbaq6RjSQGpd8wu4J/AZ4LqIuBS4muJUx/sDq4EJ4FWZ+cXGEkqSJEmSJKkWbWx+/QA4E3gIcADwCGAbcBXwYeDszLykuXiSJEmSJEmqS+uaX5n5c+CUpnNIkiRJkiSpeY6CJ0mSJEmS+mLR8KKO5kv9YPNLkiRJkiT1xerld2dk0fK7zBtZtJzVy+/eUCLtjGx+SZIkSZKkvhgeGualB73ojgbYyKLlvPSgFzE8ZDtC9WndmF+SJEmSJGlwrNkjeM+jz+CaW69hr1332ikbX6OjI5Xut2nTshnnr1y5rNJjjo2NV8qxo7P5JUmSJEmS+mp4aJi9l+7ddAztpGx+SZIkSZKkndLiBTMPvD/b/G5tfNfXO1r+2ut/M/P8f7qUjbtvmPfjrDr5kR2tt212vmMNJUmSJEmSgNXL92Zk8V1PLRxZvIzVyz1KrU1sfkmSJEmSpJ3S8NAwpxx8wh0NsJHFyzjl4BN2ynHJ2szTHiVJkiRJ0k5rzZ735pyj3sDYLZsYXbrSxlcL2fySJEmSJEk7teGhYVYt27PpGOoT25mSJEmSJElqLZtfkiRJkiRJ
ai1Pe5QkSZIkSdqJjY6ONB0BgLGx8b48rkd+SZIkSZIkDaDFwzMfszTbfM3MrSVJkiRJkjSAVo/sxVxaJvQAABoCSURBVMjipYxvvuWOeSOLl7J6ZLQv67v6rH/vy+Nuz94vPravj++RX5IkSZIkSQNoeGiYUw59GiOLlwJF4+uUQ5/G8NBQw8l2LB75JUmSJEmSNKDW7LU/5xz9csZuuYHRpbvb+KrA5pckSZIkSdIAGx4aZtWylU3H2GF52qMkSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWsvmlyRJkiRJklrL5pckSZIkSZJay+aXJEmSJEmSWmth0wH6KSKOB04CHgAsAK4APgyck5kTTWaTJEmSJElS/7X2yK+IOBv4BHAIcBFwPvCHwFnAv0ZEa5+7JEmSJEmSCq1sAEXEU4AXAhuAB2TmsZn5ZOC+wOXAk4GXNBhRkiRJkiRJNWhl8wt4TTk9LTOvnJyZmRspToMEeLVHf0mSJEmSJLVb65o/EbEvcDCwGfiX6bdn5oXAeuBuwGH1ppMkSZIkSVKdWtf8Ag4qp5dl5q2zLPOdactKkiRJkiSphYa2bdvWdIaeioiTgTOBc8txvmZa5kzgZODvM/MVXazuKmCfLu4vSZIkSZKkma0H9u32Qdp45NfycnrzHMvcVE5HerQuSZIkSZIk9VZP+i4Le/EgO7GfA/eiaKb9tOEskiRJkiRJbXAfisbXz3vxYG1sfk0e1bVsjmUmO4fjXa7LMcMkSZIkSZIGWBtPe/xFOd1/jmX2m7asJEmSJEmSWqiNza/vldMDI2LXWZY5dNqykiRJkiRJaqHWNb8y89fApcBi4Ljpt0fE4RRXCtgAfKvedJIkSZIkSapT65pfpbeW09Mj4j6TMyNib+A95Y9vy8yJ2pNJkiRJkiSpNkPbtm1rOkNfRMR7gJOA24AvA1uAI4EVwLnAUzNza3MJJUmSJEmS1G+tbX4BRMTxwIuA+wMLgCuADwHneNSXJEmSJElS+7W6+SVJkiRJkqSdW1vH/JIkSZIkSZJsfkmSJEmSJKm9bH5JkiRJkiSptWx+SZIkSZIkqbVsfkmSJEmSJKm1bH5JkiRJkiSptWx+SZIkSZIkqbUWNh1gZxYRARwNHAocAvwhMAQcl5n/2nC2RcAjgT8BDi+zLQHGgG8BZ2Xm1xoLCETES4BHAPcH9gZWANcDPwDWAZ/IzG2NBZwmIt4CvKb88ZWZ+Y6G86wDTphjkczM+9UUZ1YRsSvwEuA44L7AYmAj8F3gjMz8RgOZjgC+Os/F98/MX/UxzowiYl/gNOAxwD0o9i2/Br4C/F1m/k/dmaaKiP0o8h0D7AuMA5cA78rMz9ew/sr734g4HjgJeACwALgC+DBwTmZONJWvzprS6brqrikVt18tNaVXr1O/akrFbbeOmupJl+/dvteTCu+NI6ixnlTdfnXVlC7y9b2mdLsf63ftqJqvjtpRJVuddaOLbVdX3ejZtuhH7ehi+62jhtrRg/duX2tHxffHEdRUO7rZfoP2/4jNr2adBLy06RCzOBw4v/x+A/B14GbgAOApwFMi4k2Z+bqG8kHxRtob+DHwzTLf/sCjgSOBp0bEn/bq
n9FuRMShwKuAbRRv+kHyDeCnM8z/bd1BpouIewFfAu5DkeerwO8oXucnUfxxUXvzi+I98ZE5bn8IsAb4GcUOvlYRcRBwAbA7cBXwxfKmQ4AXAM+MiMdm5jfrzlbmOxT4ArAH8Evg88DdKN67j4mIv8nM1/c5RqX9b0ScDbwQuI2icG+h2N+cBRwZEU/t0T6nSr46a0qn66q7plTZFnXVlK5fpz7XlG7y1VFPqr5366onnearu550vP1qrilV8tVVUyrvx2qqHVXz1VE7qmSrs25UXVdddaMn26KPtaPbfP2uHd28d+uoHVXy1Vk7Km2/Qfx/xOZXs34MvJ2ia3wJ8EGKX65BMAF8CjgzMy+aekNEPB34BPDaiPhqZs6369xrzwC+l5k3T50ZEQdS/GHxRIpPEz7cQLapeXah2DltBP6LYkc5SD6QmeuaDjFdRCyj2NH+AfBq4B2ZuXXK7XsCezaRLTOvAE6c7faI+En57YcaOvrwbIpC8w/AizJzS5lrEfBe4DnAOcAD6w4WEUso9i17AO8GTs3M35W3PYzin5bXRcTFmXn+7I/UtY73vxHxFIp/XjYAj8zMK8v5qyj+GHoyxSeDZzaRr+J96spXd02psi3qqildvU411JRu8tVRT6q8d+usJx3la6CeVHl966wpHeWruaZU2o/VWDuq7mfrqB1VstVZN6quq6660fW26HPt6DZfv2tH1fduXbWj43w1146qr+/A/T9i86tBmfmBqT8XRx0Phsy8gKJTO9Ntn4yIo4C/AJ7F/A+57KnMvHiW+ZeVn7D9DXAUDTe/yhxrgCdQdMc1P38N3JviUNrTp9+YmdcC19aeajsi4o8oXu+tFIe8173+JcAflT++frLQAGTmloj4a4pi84CIWJqZt9Qc8cnAfhSfRL188p+UMt83I+LNFH+Ev447P2XquYr738nTBE6b/OelfKyNEXES8DXg1RHx7m4/5a2Sr86a0um66q4pFbdfLTWlB69TX2vKIP9tApXz1VZPern9+lFPOs1Xd02psP1qqyld7MdqqR1V89Xxnq+Src660cW2q6tu9GJb9K12DPr/jV3kq6V29Hr79bp2VMk3qP+POOC9qvpeOd230RSzm/zj5/YmQ0TEQ4GXA/+Ymf/WZJYdSUQsBp5X/ri2ySwVPKecfiEzf9PA+rdy5+//XG4Gbu1zlpkcWk4vnFoIp/hSOX14RNytpkzbFcWYBQcDm4F/mX57Zl4IrKc41eawetO1gjVlHqwpnbOedM2aMn+/tx8bsNoxyPvZKtnqfD5V1lVn3Zgz3wDUjkH+3YOZ37uDVDs63X51146Z8g1k7fDIL1V133La+LhQ05XnZv/f8sfPNZhjCcXhxdcxuGO7ATwqIh4ALKc4FPpi4Pxuj1zp0sEUhxGvz8yfR8SDKT7d3bvM+KXZPm1rUkQsBZ5e/vjBJjKUn6Z8BXgs8MaImH6Y8Zsm8zV0SubycnrNLLdPzh8CHgz8R98Tzc9B5fSyzJytSH8H2KdctpHx1HZg1pTt59gRaor1pEcGoZ6ANaVDM+3HBql2DOx+lmrZ6nw+Ha2rgboxa74BqR3b235N146Z8g1S7Zj3719DteP38g1q7bD5pY6Vn5ydWP74qQajABARz6YYm2ARRcf5YRRHNb4lMz/TYLQ3AwE8IzNn+6NsEPz5DPN+EhHPyMwf1Z6mcP9yuj4i3kHxadVUr42Ic4FnTR9noWHHASPA1cC/N5jjhRSD/z4POCYivlvOPxRYCZxBMeBpE64up38wy+33nvL9vfqcpROTWX45xzKTV9IZpNwDz5oybztCTbGe9M6g1BOwpmzXHPuxgagdg7afnapKtjqfz3zW1WTdmEe+RmvHPF+rxmrHHPkGonZU+F2vtXZsJ9/A1Q5Pe1RHImIh8HFgN+ArA3LaxcMpBpM8nuIyrACv5c6Ocu2iGGT1FODczPxkUzm24/vAyRRX6lgOrAaOpbhqyQHAlyNin4ay7VFOD6Io
NmdQXGVlJcXgoespBup8TyPpZjd5mPFHZzn9ohZZXDb4YcB5FH+EPan82gf4CXBRg/kmxwx4XHk6yHQnTfl+RQ155mvy6IK5/sC5qZyO9DlLa1hT5mcHqCnWk94biHoC1pTt2c5+rPHaMaD7WaBatjqfTwfraqRubC9f07VjHtuv0dqxnXyN146Kv+u11Y7t5RvE2mHzS516L8Wle39NMahd4zLzuZk5BCwFDqTYOb0B+HZErK47T0TsSjG44I0UHe+BlJlnZOa7M/PyzLw5M3+bmZ+nuDTutykO633N3I/SN5P7pkXAxzPzZZn5s8y8PjM/R7Hj3Ab8WUTce9ZHqVFE3Ic7/+D5UMNZHkZx9ab7UBTo0fLrSRRF+1MR0YtLg3esHDTz68CuwPkRcWREjETEfSLi3cD/obgEPBRXl1G7WVO2Y0eoKdaT3hqkegLWlHkYuP3YNIOcr0q2Op/PvNbVYN2YNd+A1I45t98A1I658g1C7ejod72B2jFnvkGsHTa/NG8RcSbFlRw2AEdm5oaGI91FZt6amT/JzFdS7CgfCJzVQJS3UJz7fGpmDuK4CnPKzM3AW8sf/6ShGONTvv+H6Tdm5uTluIfo/eW4q5r8pOVbmXl5UyEiYnfgXIpPkI/OzM9l5jXl12eBoykGlnxtRNx3rsfqo+OAbwD3A75M8YfZlcCLKf5g/EG53HWNpJvZ5Cfzy+ZYZvIT/vE5llHJmjJvO2xNsZ5UNhD1BKwp2zOP/VijtWOQ97NVstX5fKqsq866MY98jdaObl6rOmrHPPI1Wjsqbr/aasf28g1q7XDML81LRPw9xWGpYxS/4Fdu5y5NWwe8A3h8RCyq+ZDKJ1N8unhCRJww7bb7ldOTIuJY4KeZ+dwas83XFeW0qdNUfj7L99OXOYTiCkmNiogF3DleQWMDE5ceR/GpygXl4cZ3kZk/jYj/BI4ov2p/L2fm1RHxCOCPgUdTDCi6EfhsZn43IiavTNPUGEEz+UU53X+OZfabtqxmYU3pyI5eU6wnHRiwegLWlFnNcz/2i3Jae+0Y5P1slWx1Pp8erWsdfaob88zXWO3o0fbrW+2YZ77GakfF90dttWOe+Qaydtj80nZFxN8BpwLXAn+cmT9pONJ8bKK4vOpCinO2N9a8/mHm/hTgD8qv3euJ07E9y+lNcy7VP9+b8v2eFIfTTrdXOW0q41SPpSjONwFNj8dzj3J6wxzLXF9O95hjmb7K4sou55dfdygPHb87xf7m0gaizWbyd/LAiNg1Z75q16HTltUMrCmV7Mg1xXrSmUGqJ2BNmVEH+7FGascg72erZKvz+fRwXX2pGx3mq7129HD79aV2VHjvTmappXZ0sf1qqR0d5BvI2mHzS3OKiLcBr6TYgR+VmT9sONJ8PZLi9/t6Zr/8dV9k5j1nuy0i1lEMiPnKzHxHXZkqeFo5/U4TK8/M9eWnAQ+lOJf8+1Nvj4iVFJcsB/guzfuLcvrPmdn0P0+Tn3AfPNMnjVFcXvjg8sfZPslq0ivK6fvLw94HQmb+OiIupfi9Ow746NTbI+JwisE8NwDfqj/hjsGa0rkW1BTrSWcGqZ6ANeX3dLIfa6J2DPJ+tkq2Op9Pj9fV87rR4e/ePed4nHX0oXb0ePv1vHZ0uP1qrx1dbr++144O8w1k7XDML80qIv4WOI1ip31UZg7M0QwR8b8j4tjyKhPTb3s4dx7u+cHM3FpvusEXEQ8qt9+CafMXRsTLKQ5lBXhn/enu8OZy+pcRccjkzIhYApxDcWWRS2i40RARewGPL38chFNUzgNuofjE5Z0RscvkDeX376I4xWIT8MUmAkbE/SNi2bR5CyPir4AXAD/lztd/kEyOP3F6OagoABGxN3de7edtmelA/TOwprST9aR3BrCegDVl+rqq7Mdqqx0Dvp/tOFudz6fTddVdNwb5tYVK26/W2lFx+9VWO7p5feuoHRXyDWTtGNq2bVtd69I0EfFg7np51AMoBoW7kimD
cmbmYTVHIyKeAHy2/PG7wGWzLHpFZr6tnlR3iogTgQ9TvAEvpfjEbAS4N8V2BPg8cNwsh5g3YlA+pY+IJwGfofg9uxS4muKQ3vtTXGZ4Anh1Zr69qYwAEfEOissLb6G46su1FFeAWU1xieFHNT2GRUS8DFhL8V5Y02SWSeXYDh8EFlB88jJ5qsfBFKd/3A48IzPPbSjfOopPwC+leB13BQ6juKrPlcBjMvMXfc5Qaf8bEe8BTgJuoxhYeQvFJ4IrKAb2fGqP/sjtOF+dNaXTddVdUyrkO5GaakovX6d+1JQK267WetLFe7eWetLN61tHPam4b6mtplTMt44aako3+7E6akfVfHXUjirZ6qwbFfOdSH11o6fbote1o+L2q612dPne7Xvt6Pb17Xft6GLfMnD/j3jaY7NWUBxKOV1TV8uZauq5t4eUXzO5EKi9+VWu903AIyi218MorraxAfgUxSVpG/nHfgfxA+BMip33ARTbcRtwFUUhPzszL2kuXiEzXxER36S4YtNBFJeQ/hXFDv5tmTnWZL7Ss8tp45ejn5SZH4mIHwGnULy2R5U3racoQmsbHv/jXIpxEh5IMdbJbUACf0fxu3dbDRkq7X8z84URcTHwIopxNBZQDMr6IeCcHh71VSVfnTWl03XVXVM6zVdnTRnk2g+d56u7nlR979ZVT7p5feuoJx3nq7mmVNl+ddWUyvuxmmpH1Xx17JOqZKuzblRZV511Y9D/L6uSr87a0c17t47a0e3r2+/aUSnfIP4/4pFfkiRJkiRJai3H/JIkSZIkSVJr2fySJEmSJElSa9n8kiRJkiRJUmvZ/JIkSZIkSVJr2fySJEmSJElSa9n8kiRJkiRJUmvZ/JIkSZIkSVJr2fySJEmSJElSa9n8kiRJkiRJUmvZ/JIkSZIkSVJr2fySJEmSJElSa9n8kiRJkiRJUmstbDqAJElSW0TEYuAk4OnAAcAy4DpgA3Ax8PHM/NYM93l+eZ8Dy/tsAL4CvD0zL59hPbsATwAeDzwQ2AdYDmws17M2My/pVcbyfquAVwGPA+4BbAES+CRwVmbePsN91gEnAG8E3gS8BHg2cF/gNuCbwBsy87szZZUkSeqFoW3btjWdQZIkaYcXEQuBLwOHl7O2ATcAI8CCct4nM/MZU+5zd+A8igYWwARwc3kfKBpEz8zMT09b17HAv01Zz/XArsCSct7vgOdk5se6zVje7yFlzj3KWePAoinr+wHwmMy8etr91lE0v94MHAI8lqJpdjtFs27yOT56poabJElSL3jaoyRJUm8cT9FUugX4M2BpZq4EdgH2B15M0SQCICIWAZ+laHx9BXgYsCQzVwCrgTMomksfi4h7T1vXTcC7gEcCyzNzj8zctVzPGRRH978/Iu7RTcYy50rgXIrG14+Ah5QZlwPHAZvK5/CJObbNi4BDKY42W56ZI+V9flw+xzPnuK8kSVJXPPJLkiSpByLiPRSnE743M0+ax/LPBf4BuAg4MjO3zLDMe4EXAGdn5os7yPJB4DkUpxS+sWrG8j6vBf6G4uiyNZm5YdrtjwG+WP54ZGZeMOW2dRRHfgE8IjMvnnbfg4HJUx73z8xfze8ZSpIkzZ9HfkmSJPXGjeX07vNcfrIpdOZMja/S5NFUR3WYZfKUyIdPm99pRoCnltMPTG98AWTml4DJUxafNstjXDS98VXe9xLgqvLH/9VBJkmSpHlzwHtJkqTeOA84DXhiRHwOWAdcmJnXTl+wHHvrIeWP74uIs2d5zMlxuPab4TH2oDid8BgggN2mLD9pddWM5ToWc2dT6quzZAS4APgj4MGz3P6dOe67HtgXWDnHMpIkSZXZ/JIkSeqBzLwwIl4HvI7iKoyPB4iIK4DPA+/LzCvLxfcAFpff7zmPh9916g8RcQBFw2nVlNnjwK0Ug9gvpmgmLesi42TOyTMF1s+Rb/LordFZbh+f4763ldNFcywjSZJUmc0vSZKkHsnMN0XExykGdj+C4mio
+5VfL42Iv8jMj3LXoScOyszvd7iqD1M0vi4F/hL4RmbeNHljRBxJcVXHoS4yTrdkhnmSJEkDzzG/JEmSeigzf56Zb8vMoymOnHoU8HWKDx3fExF7A9cCW8u7TL8i45zKKzg+pLz/EzLzi1MbX6VVv3/PjjMCXAdMzCPnvuV0bP7PRJIkqR42vyRJkvokM7dm5teAY4EtFKchHlIOcD95lcNjOnzYOxpNmTnbqYh/3G3G8rbNwI/LRR81x8M8upxeOt/1SpIk1cXmlyRJUg+Ug8PPZjN3Hum1SzldV05PjIgHbuexpw4Gf0M5XTXlCK2py94fOL5HGQH+dUrO37tKZEQ8huLUSYB/nuPxJUmSGmHzS5IkqTc+GhEfjojHRsTI5MyIuCfwEYoxs24FLipv+iDw7XL+BRHxvIhYMeV+d4uIZ0bEhcBLp6zncooB5oeAT0bEfcrlF0XEnwLnA9NPg6yaEeAs4LcUg+5/ISIOKe+zICKeAvxTudyXM/OC+WwoSZKkOjngvSRJUm8soRhE/kRgW0TcQHHVxaXl7VuBF2TmNQCZuSUingh8Gng48H7gvRFxPcWRV1Ov1HhHUykzJyLiZIojso4AroyI8fI+i4FfAa8APtZtxnJ9myLiScAXgAcA3ynXt4g7B8H/IfDM+W4oSZKkOnnklyRJUm+8GngVRZPofyiaSguAn1FcnfHBmXmXhlRmXg0cTtE4+g+KAeMnj8i6Avgo8DTgbdPu9xmKcbbOByYbUb8E3gEcRHFkWE8yluv7L+AA4J3Af5fr+x3FuGWvBB5aPhdJkqSBM7Rt27amM0iSJEmSJEl94ZFfkiRJkiRJai2bX5IkSZIkSWotm1+SJEmSJElqLZtfkiRJkiRJai2bX5IkSZIkSWotm1+SJEmSJElqLZtfkiRJkiRJai2bX5IkSZIkSWotm1+SJEmSJElqLZtfkiRJkiRJaq3/D11wE+j2aKrQAAAAAElFTkSuQmCC\n",
      "text/plain": [
       "<Figure size 1440x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(20,8))\n",
    "sns.barplot(x=\"season\", y=\"us_viewers_in_millions\", data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "4f09caf12ffa7c248bdbf444a66706a151bef27b"
   },
   "source": [
    "#### 6.Count Plot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 198,
   "metadata": {
    "_uuid": "8dffc91ab3a4682d8f7a755f2495727d9ad22deb"
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAABL4AAAIMCAYAAAD7MGVIAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvIxREBQAAIABJREFUeJzs3Xu8b3VdJ/7XkeNJFMRLRw0ws9CP6cioQDGVopFlRhohao6l0+XXoHnJMnXKnK5gaUqCNJOXU2kzJYRdyAwV8ZLNqCB5ic9PGq8kcNIgIBqEc+aPtbacdnvvs/f6XtY6H57Px+M81v5+v+v7Xa/93Xt/3/u89vqutW3v3r0BAAAAgNbcYewAAAAAALAIii8AAAAAmqT4AgAAAKBJii8AAAAAmqT4AgAAAKBJii8AAAAAmqT4AgAAAKBJii8AAAAAmqT4AgAAAKBJii8AAAAAmqT4AgAAAKBJii8AAAAAmqT4AgAAAKBJii8AAAAAmqT4AgAAAKBJii8AAAAAmrR97AAHsEuT3D/JDUmuGDkLAAAAQAuOSnJIkk8lefisD7Zt7969Mye6nbo2yWFjhwAAAABo0HVJ7jbrg9jja7gbkhy2Z8/e3HLLrWNnAQAAADjgbd9+UO5wh21J17vM/njzeJDbqSuSHHHLLbfmuutuGjsLAAAAwAHvsMMOzo4d25M5HVbKwe0BAAAAaJLiCwAAAIAmKb4AAAAAaJLiCwAAAIAmKb4AAAAAaJLiCwAAAIAmKb4AAAAAaJLiCwAAAIAmKb4AAAAAaJLiCwAAAIAmKb4AAAAAaNL2sQOspZRyxySPSvL4JCckeWCSOyXZneQDSc6qtb57jfvtSvKMDR661lofNO+8AAAAAEzPJIuvdGXXhf3HVyV5T5Ibkzw4ySlJTiml/FKt9efXuf/7k1yxxvVfmHdQAAAAAKZpqsXXniTnJTmz1vrefW8opTwlyZuTvLSUclGt9aI17v+6WuuuxccEAAAAYKomWXzVWt+V5F3r3PYHpZTHJvmRJE9PslbxBQAAAMDt3IF6cPtL++WRo6YAAAAAYLImucfXJjygX653zK7HlFKOTnJIkquTvC/JhbXWPcsIBwAAAMD4Drjiq5RynyTP7C+et85qP7TGdZ8opTy11vrReebZsWN7du48dJ4PCQAAAMAcHFBvdSylbE/ypiSHJXlnrfVPV63ykSTPTXf2x0OSHJ7kpCSX9de9o5RyxPISAwAAADCWA22Pr99KcmKSz6U7sP2/Umt99aqrbkxyQSnlwiQXJzk+yUuS/MS8At188y257rqb5vVw0JyVPSIveMN3j7L97/nhtyVJdu++fs3bV/L9j13ftbRM+/qBZ749yf7z/bffGyffj//g5vK9/H+Ok+9FT904HwAAcGA57LCDs2PH/OqqA2aPr1LKmenO5HhVkhNrrVdt9r611puTnN5ffPwC4gEAAAAwMQdE8VVKeWW6tzDuTld6fXLAw1zeL73VEQAAAOB2YPLFVynl15K8IMkXk3xHrfUTAx/qnv3yhrkEAwAAAGDSJl18lVLOSPLCJP+Y5LG11r+Z4eGe3C8/OHMwAAAAACZvsge3L6X8cpIXJbk2Xel16X7Wf1iSI5O8rdZ66z7Xb0/yvHRvlUySVy0mMQAAAABTMsniq5TyhCQ/21+8IslzSilrrXp5rfWM/uOvS3J+ki+VUi5Jck26tzc+NMnhSfYk+Zla69sXGB0AAACAiZhk8ZXkHvt8fGz/by0XJ1kpvi5LcmaSb0ry4CSPTLI3yeeTvDHJ2bXWDy8kLQAAAACTM8niq9a6K8muLd7nU0mev4g8AAAAABx4Jn1wewAAAAAYSvEFAAAAQJMUXwAAAAA0SfEFAAAAQJMUXwAAAAA0SfEFAAAAQJO2jx0AkmTnzkPHjpDdu68fOwIAAAAwR/b4AgAAAKBJ9vhiUq48+9lL3+YRzz576dsE
AAAAFs8eXwAAAAA0SfEFAAAAQJMUXwAAAAA0SfEFAAAAQJMUXwAAAAA0SfEFAAAAQJMUXwAAAAA0SfEFAAAAQJMUXwAAAAA0SfEFAAAAQJMUXwAAAAA0SfEFAAAAQJMUXwAAAAA0afvYAQBgUXbuPHTsCNm9+/qxIwDQiCnMtcRsAw4s9vgCAAAAoEn2+AKgec8773FL3+aZp/zF0rcJwO3D7/zR7lG2+4zv3znKdgFmYY8vAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSdvHDsBy7Nx56NgRkiS7d18/doQtm/pzN/V8wNr87M5m6s+ffJvj+284X1umaOrff/Lt35SzJfLN6vb62myPLwAAAACaZI+v25lrfuvVo2z3Xv/5+aNsd54+9tonjLLdf/esP9nUeu/97ZMWnGRtj/yxPxtlu9CK7/7jZ4yy3bc98XdG2e68fc8fjTPXLvj+zc217znvdQtOsrYLTvnRTa130rlvXnCStf3Zk/7jKNudtyee+/alb/OPn/Rdm1rv+8/7qwUnWdsfnfIto2yXaXnbH/zDKNv97qd89abW++td1yw4ydqOf+a9NrXe/3/21QtO8m898Nn33tR6V/36ZxacZG33eeH9NrXe1a/6yIKTrO3eP/mwTa139W++Z8FJ1nbv5z5qlO1OhT2+AAAAAGiS4gsAAACAJim+AAAAAGiS4gsAAACAJim+AAAAAGiS4gsAAACAJim+AAAAAGiS4gsAAACAJim+AAAAAGiS4gsAAACAJim+AAAAAGiS4gsAAACAJim+AAAAAGjS9rEDAADTs3PnoWNHSJLs3n392BEYge8/xuJ7D6A99vgCAAAAoEn2+AIA1vX48//rKNv985PH2S7T8r3nnj/Kdv/0SSePsl2m42Xn//0o2/2Fkw8fZbsALbPHFwAAAABNUnwBAAAA0CTFFwAAAABNUnwBAAAA0CTFFwAAAABNUnwBAAAA0CTFFwAAAABNUnwBAAAA0KTtYwdYSynljkkeleTxSU5I8sAkd0qyO8kHkpxVa333Bvd/WpLTkhyd5KAklyd5Y5Jzaq17FhoeAAAAgEmY6h5fJyR5R5IXJDkiyXuSnJ/kS0lOSXJRKeUX17pjKeXsJG9OcmyS9ya5MF1xdlaSc0spU/2cAQAAAJijSe7xlWRPkvOSnFlrfe++N5RSnpKu2HppKeWiWutF+9x2SpJnJbkqyaNqrZ/sr793kouSnJzkOUnOXMpnAQAAAMBoJrn3U631XbXWJ60uvfrb/iDJrv7i01fd/JJ++aKV0qu/z9Xp3vqYJC+21xcAAABA+w7UAujSfnnkyhWllCOTHJPk5iRvWX2HWuvFSa5Mcp8kxy8hIwAAAAAjOlCLrwf0yy/sc93D++XHa603rXO/D65aFwAAAIBGTfUYX+sqpdwnyTP7i+ftc9P9++VnNrj7Z1etO7MdO7Zn585D5/VwzZvyczXlbIl8s5JvNvINN+VsiXyzkm828g035WzJ9PNN3dSfP/lmM+V8U86WyDerqedblANqj69SyvYkb0pyWJJ31lr/dJ+bD+mXN27wEDf0y9vnVxsAAADgduRA2+Prt5KcmORz+bcHth/FzTffkuuuW++dldMxlWZ39+7r17x+CvmmnC2Rb1byzUa+4aacLZFvVvLNRr7hppwt2X++J593+TLjfMUfnvKgJAf+8zc2+WYz5XxTzpbIN6v18k3NYYcdnB075ldXHTB7fJVSzkzyI0muSnJirfWqVaus7M11lw0eZmWvsAPjqw0AAADAYAdE8VVKeWWS5ybZna70+uQaq326X95vg4e676p1AQAAAGjU5IuvUsqvJXlBki8m+Y5a6yfWWfXSfvmQUsrB66xz3Kp1AQAAAGjUpIuvUsoZSV6Y5B+T
PLbW+jfrrVtr/VySS5LsSHLqGo91QpIj071V8gMLCQwAAADAZEy2+Cql/HKSFyW5Nl3ptZm9tE7vly8vpRy1z2PdK8lr+4tn1Fr3zDUsAAAAAJMzybM6llKekORn+4tXJHlOKWWtVS+vtZ6xcqHWem4p5ZwkpyX5aCnlHUm+nO5MkHdN8tYkZy0yOwAAAADTMMniK8k99vn42P7fWi5Ocsa+V9Ran1VKeV+SZyc5IclBSS5P8oYk59jbCwAAAOD2YZLFV611V5JdM9z/95P8/rzyAAAAAHDgmewxvgAAAABgFoovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJqk+AIAAACgSYovAAAAAJq0fewA6ymllCSPS3JckmOTPDDJtiSn1lrPXec+u5I8Y4OHrbXWB805KgAAAAATNNniK8lpSZ438L7vT3LFGtd/YXgcAAAAAA4kUy6+Ppbk15N8KMmHk7w+yQmbvO/raq27FpQLAAAAgAPAZIuvWuvr9r3cvfMRAAAAADbHwe0BAAAAaNJk9/ia0WNKKUcnOSTJ1Unel+TCWuuecWMBAAAAsCytFl8/tMZ1nyilPLXW+tF5bmjHju3ZufPQeT5k06b8XE05WyLfrOSbjXzDTTlbIt+s5JuNfMNNOVsi36zkm418w005WyLfrKaeb1Fae6vjR5I8N8mD0+3tdXiSk5Jc1l/3jlLKEePFAwAAAGBZBu3xVUr52iS31lqv3OT6hyfZXmv97JDtbVat9dWrrroxyQWllAuTXJzk+CQvSfIT89rmzTffkuuuu2leD7cwU2l2d+++fs3rp5BvytkS+WYl32zkG27K2RL5ZiXfbOQbbsrZEvlmJd9s5BtuytkS+Wa1Xr6pOeywg7Njx/zeoDh0j69PJ/nfW1j//Un+z8BtzazWenOS0/uLjx8rBwAAAADLM8tbHbcteP15u7xfeqsjAAAAwO3Aso7xdecktyxpW+u5Z7+8YdQUAAAAACzFwouvUspRSb46yVWL3tZ+PLlffnDUFAAAAAAsxaaOFlZKeWKSJ666+rBSyhs2uNu2JHdL8m395Yu2Hm/zSikPS3JkkrfVWm/d5/rtSZ6X7myPSfKqReYAAAAAYBo2e5j8hyV55qrrDl7juvX8XZKXbnLdJEkp5RFJXrvPVQ/ul79aSvnplStrrcf3H35dkvOTfKmUckmSa9K9vfGhSQ5PsifJz9Ra376VHAAAAAAcmDZbfL171eWXpTtW1is3uM+eJP+U5ONJ3l1r3eoxvu6a5JvXuP4B66x/WZIzk3xTupLskUn2Jvl8kjcmObvW+uEtZgAAAADgALWp4qvWenGSi1cul1JeluSGWusvLCpYrfXd2cKZIGutn0ry/EXlAQAAAODAstk9vla7f5Jb97sWAAAAAIxk
UPFVa/3MvIMAAAAAwDwN3ePrK/qzJh6V5O5J7rjRurXW98y6PQAAAADYjMHFVynl/klOT/KEJF+1ibvsnWV7AAAAALAVg4qoUspRST6Q5B7pDkC/N8k1Sf5lftEAAAAAYLihe2D9UpJ7Jvl8ujMp/kmt9Za5pQIAAACAGQ0tvr493V5eP1Brff8c8wAAAADAXNxh4P0OTXKT0gsAAACAqRpafH02yR1KKdvmGQYAAAAA5mVo8fU/053J8cQ5ZgEAAACAuRlafJ2R5LIk/62Ucv855gEAAACAuRh6cPsnJ3ljkl9I8tFSyrlJPpjk+o3uVGv93YHbAwAAAIAtGVp87Up3VseVY3z9YP9vfxRfAAAAACzF0OLrPemKLwAAAACYpEHFV6310XPOAQAAAABzNfTg9gAAAAAwaYovAAAAAJqk+AIAAACgSYOO8VVKuXXA3fbWWoceTB8AAAAAtmRoEbVtSfcBAAAAgEGGFl/338/thyU5Lsnzk3xNkv+U5G8GbgsAAAAAtmxQ8VVr/cwmVvubUsrvJXlbktcnOWbItgAAAABgiIUec6vWenMp5blJPprkZUl+dJHbG9POnYeOHSFJsnv39WNHAAAAAJiEhZ/Vsdb68ST/lORxi94WAAAAAKxY+FkWSyk7ktw5yVcteltTsPucN42y3Z2nPX2U7QIAAABM1cL3+ErytHQF298vYVsAAAAAkGTgHl+llK/dzyp3SnJkkicm+bEke5O8Zci2AAAAAGCIoW91/NQW1t2W5H8l+aWB2wIAAACALRtafG3bz+23Jrk23dkc/zDJ62qttwzcFgAAAABs2aDiq9a6jGODAQAAAMBgCiwAAAAAmqT4AgAAAKBJQ4/x9RWllEOSPD7JI5Ls7K/eneSSJH9ea71h1m0AAAAAwFYNLr5KKduSvCTJi5Icss5qN5RSTk/y8lrr3qHbAgAAAICtmmWPr11Jnp7uDI//kuTDST7f33ZkkmOSHJrkV5J8Y5JnzLAtAAAAANiSQcVXKeX7k/xgkr1JVvbo+qdV69w1yYvT7RH29FLKW2ut58+YFwAAAAA2ZegeX/9futLr52qtp6+1Ql+E/ZdSyg1Jfrm/j+ILAAAAgKUYelbHY5LcmuTMTax7Zr/usQO3BQAAAABbNrT4OjTJ9bXWf97firXWG5P8U38fAAAAAFiKocXXNUnuVko5fH8rllKOSHK3JLsHbgsAAAAAtmxo8fWefvkbpZRt+1n3N/rluwduCwAAAAC2bOjB7V+R5KlJTk3yNaWU05O8Z+Wtj6WUeyZ5TLozOj4iyZ4kr5w9LgAAAABszqDiq9b6kVLKs5K8Nsm3Jbkgyd5SynVJvirJwf2q29KVXs+utX5kDnkBAAAAYFOGvtUxtdb/nuRRue0tjHdIcvckd05XeCXJu5I8sl8XAAAAAJZm6FsdkyS11r9KcmIp5e5JHp5kZ3/T7iSX1lr/ccZ8AAAAADDITMXXir7getc8HgsAAAAA5mFQ8VVKeUS6A9x/uNb6wv2se2aShyb5yVrrZUO2BwAAAABbNfQYX89IckKSSzax7seSPDrJDw3cFgAAAABs2dDi6zH98m2bWPfcfvntA7cFAAAAAFs2tPi6b5Jra63X7m/F/vhf1/b3AQAAAIClGHpw+x1Jbt3iduZyIH0AAAAA2Iyhe3x9PsldSillfyv26xyS5AsDtwUAAAAAWza0+LooybYkv7CJdX8xyd7+PgAAAACwFEOLr1ene6vjqaWU3yulfM3qFUopX1NKeVOSU5Ps6e8DAAAAAEsx6LhbtdbLSykvSHJmkqcleUop5bIkn+1XuV+So5Mc1F9+Ya31Y7OGBQAAAIDNGrrHV2qtr0nylHTH7tqe5JgkJ/f/HtFf9/dJnlprtbcXAAAAAEs105kWa61vKaWcn+TEJMcnuXd/09VJ/jrJO2utt8wWEQAAAAC2bqbiK0n6Yuvt/T8AAAAAmITBb3UEAAAAgClTfAEAAADQJMUXAAAA
AE1SfAEAAADQJMUXAAAAAE1SfAEAAADQJMUXAAAAAE3aPnaA9ZRSSpLHJTkuybFJHphkW5JTa63n7ue+T0tyWpKjkxyU5PIkb0xyTq11zyJzAwAAADANky2+0hVXz9vqnUopZyd5VpJ/SfLOJF9OcmKSs5KcWEp5kvILAAAAoH1Tfqvjx5L8epKnJDkqycX7u0Mp5ZR0pddVSY6utZ5Uaz05yQOS/G2Sk5M8Z2GJAQAAAJiMye7xVWt93b6Xu3c+7tdL+uWLaq2f3Oexri6lnJbk3UleXEp5jb2+AAAAANo25T2+tqSUcmSSY5LcnOQtq2+vtV6c5Mok90ly/HLTAQAAALBszRRfSR7eLz9ea71pnXU+uGpdAAAAABrVUvF1/375mQ3W+eyqdQEAAABo1GSP8TXAIf3yxg3WuaFfHjqvje7YsT07d87t4WY2pSxrmXK+KWdL5JuVfLORb7gpZ0vkm5V8s5FvuClnS+SblXyzkW+4KWdL5JvV1PMtSkt7fAEAAADAV7S0x9fK3lx32WCdlb3Crp/XRm+++ZZcd91Nk2lOd+9e+1OTb/+mnC2Rb1byzUa+4aacLZFvVvLNRr7hppwtkW9W8s1GvuGmnC2Rb1br5Zuaww47ODt2zK+uammPr0/3y/ttsM59V60LAAAAQKNaKr4u7ZcPKaUcvM46x61aFwAAAIBGNVN81Vo/l+SSJDuSnLr69lLKCUmOTHJVkg8sNx0AAAAAy9ZM8dU7vV++vJRy1MqVpZR7JXltf/GMWuuepScDAAAAYKkme3D7UsojcltZlSQP7pe/Wkr56ZUra63H7/PxuaWUc5KcluSjpZR3JPlykhOT3DXJW5OctejsAAAAAIxvssVXuqLqm9e4/gEb3anW+qxSyvuSPDvJCUkOSnJ5kjckOcfeXgAAAAC3D5Mtvmqt706ybeB9fz/J7881EAAAAAAHlNaO8QUAAAAASRRfAAAAADRK8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRJ8QUAAABAkxRfAAAAADRp+9gB5q2UsivJMzZYpdZaH7SkOAAAAACMpLniax/vT3LFGtd/YdlBAAAAAFi+louv19Vad40dAgAAAIBxOMYXAAAAAE1SfAEAAADQpJbf6viYUsrRSQ5JcnWS9yW5sNa6Z9xYAAAAACxDy8XXD61x3SdKKU+ttX50XhvZsWN7du48dF4PN7MpZVnLlPNNOVsi36zkm418w005WyLfrOSbjXzDTTlbIt+s5JuNfMNNOVsi36ymnm9RWnyr40eSPDfJg9Pt7XV4kpOSXNZf945SyhHjxQMAAABgGZrb46vW+upVV92Y5IJSyoVJLk5yfJKXJPmJeWzv5ptvyXXX3TSZ5nT37uvXvF6+/ZtytkS+Wck3G/mGm3K2RL5ZyTcb
+YabcrZEvlnJNxv5hptytkS+Wa2Xb2oOO+zg7Ngxv7qqxT2+1lRrvTnJ6f3Fx4+ZBQAAAIDFu90UX73L+6W3OgIAAAA07vZWfN2zX94wagoAAAAAFu72Vnw9uV9+cNQUAAAAACxcUwe3L6U8LMmRSd5Wa711n+u3J3leurM9JsmrRogHAAAAwBI1VXwl+bok5yf5UinlkiTXpHt740OTHJ5kT5KfqbW+fbSEAAAAACxFa8XXZUnOTPJNSR6c5JFJ9ib5fJI3Jjm71vrh8eIBAAAAsCxNFV+11k8lef7YOQAAAAAY3+3t4PYAAAAA3E4ovgAAAABokuILAAAAgCYpvgAAAABokuILAAAAgCYpvgAAAABo0vaxAwAAAAAwjp07Dx07QpJk9+7rF/K49vgCAAAAoEn2+AIAAAC4nbvmrD8bZbv3+omTFvr49vgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEmKLwAAAACapPgCAAAAoEnbxw6wKKWUpyU5LcnRSQ5KcnmSNyY5p9a6Z8xsAAAAACxek3t8lVLOTvLmJMcmeW+SC5M8MMlZSc4tpTT5eQMAAABwm+YKoFLKKUmeleSqJEfXWk+qtZ6c5AFJ/jbJyUnCQLtZAAAY70lEQVSeM2JEAAAAAJagueIryUv65YtqrZ9cubLWenW6tz4myYvt9QUAAADQtqbKn1LKkUmOSXJzkresvr3WenGSK5PcJ8nxy00HAAAAwDI1VXwleXi//Hit9aZ11vngqnUBAAAAaNC2vXv3jp1hbkopz01yZpK39sf1WmudM5M8N8kra60/PcPmPp/kiBnuDwAAAMDarkxy5KwP0toeX4f0yxs3WOeGfnnonLYFAAAAwHzNpXfZPo8HuZ36VJL7pyvSrhg5CwAAAEALjkpXen1qHg/WWvG1sjfXXTZYZ6UxvH7GbTlGGAAAAMCEtfZWx0/3y/ttsM59V60LAAAAQINaK74u7ZcPKaUcvM46x61aFwAAAIAGNVV81Vo/l+SSJDuSnLr69lLKCenOCHBVkg8sNx0AAAAAy9RU8dU7vV++vJRy1MqVpZR7JXltf/GMWuuepScDAAAAYGm27d27d+wMc1dKeW2S05L8S5J3JPlykhOT3DXJW5M8qdZ663gJAQAAAFi0JouvJCmlPC3Js5M8NMlBSS5P8oYk59jbCwAAAKB9zRZfAAAAANy+tXiMLwAAAABQfAEAAADQJsUXAAAAAE1SfAEAAADQJMUXAAAAAE1SfAEAAADQJMUXAAAAAE3aPnaA26tSSknyuCTHJTk2yQOTbEtyaq313JGz3THJo5I8PskJfbY7Jdmd5ANJzqq1vnu0gElKKc9J8sgkD01yryR3TXJtksuS7Ery5lrr3tECrlJK+dUkL+kvvrDW+oqR8+xK8owNVqm11gctKc66SikHJ3lOklOTPCDJjiRXJ/lQklfXWt8/QqZHJ7lok6vfr9b62QXGWVMp5cgk
L0rynUm+Nt1ry+eSvDPJr9Va/8+yM+2rlHLfdPm+O8mRSa5P8uEkv1lrvWAJ2x/8+ltKeVqS05IcneSgJJcneWOSc2qte8bKt8yZstVtLXumDHz+ljJT5vV1WtRMGfjc7cqS5smMP7sLnycDfjYenSXOk6HP37Jmygz5Fj5TZn0dW/TsGJpvGbNjSLZlzo0ZnrtlzY25PReLmB0zPH+7soTZMYef3YXOjoE/H4/OkmbHLM/f1P4/ovgaz2lJnjd2iHWckOTC/uOrkrwnyY1JHpzklCSnlFJ+qdb68yPlS7ofonsl+ViSv+rz3S/Jtyc5McmTSinfP6//iM6ilHJckp9JsjfdD/yUvD/JFWtc/4VlB1mtlHL/JH+Z5Kh0eS5Kcku6r/P3pfvFYunFV7qfid/Z4PZvSvKNSf4u3Yv7UpVSHp7kXUnuluTzSd7e33Rskh9P8h9LKd9Va/2rZWfr8x2X5C+S3CPJZ5JckOQ+6X52v7OU8ou11pctOMag199SytlJnpXkX9IN7S+ne705K8mJpZQnzek1Z0i+Zc6UrW5r2TNlyHOxrJky89dpwTNllnzLmCdDf3aXNU+2mm/Z82TLz9+SZ8qQfMuaKYNfx5Y0O4bmW8bsGJJtmXNj6LaWNTfm8lwscHbMmm/Rs2OWn91lzI4h+ZY5OwY9f1P8/4jiazwfS/Lr6driDyd5fbpvrCnYk+S8JGfWWt+77w2llKckeXOSl5ZSLqq1brZtnrenJrm01nrjvleWUh6S7peKJ6b7K8IbR8i2b56vSvfCdHWS/53uRXJKXldr3TV2iNVKKXdJ9yL79UlenOQVtdZb97n9nknuOUa2WuvlSZ653u2llE/0H75hpL0Oz043ZH47ybNrrV/uc90xyW8l+eEk5yT598sOVkq5U7rXlnskeU2SF9Rab+lv+5Z0/2H5+VLK+2qtF67/SDPb8utvKeWUdP9xuSrJo2qtn+yvv3e6X4ROTvcXwTPHyDfwPsvKt+yZMuS5WNZMmenrtISZMku+ZcyTIT+7y5wnW8o3wjwZ8vVd5kzZUr4lz5RBr2NLnB1DX2eXMTuGZFvm3Bi6rWXNjZmfiwXPjlnzLXp2DP3ZXdbs2HK+Jc+OoV/fyf1/RPE1klrr6/a93O1pPA211nela2jXuu0PSimPTfIjSZ6eze9mOVe11vetc/3H+7+s/WKSx2bk4qvP8Y1JnpCuFWdzfi7JN6Tbffblq2+stX4xyReXnmo/Sin/Id3X+9Z0u7kve/t3SvIf+osvWxkySVJr/XIp5efSDZqjSyl3rrX+85Ijnpzkvun+AvVTK/9B6fP9VSnlV9L9Av7zue2vS3M38PV35a0BL1r5j0v/WFeXUk5L8u4kLy6lvGbWv+4OybfMmbLVbS17pgx8/pYyU+bwdVroTJny7ybJ4HxLmyfzfP4WMU+2mm/ZM2XA87e0mTLD69hSZsfQfMv4mR+SbZlzY4bnbllzYx7PxcJmx9T/3zhDvqXMjnk/f/OeHUPyTfX/Iw5uzxCX9ssjR02xvpVffP7vmCFKKd+c5KeS/H6t9U/HzHIgKaXsSPJj/cXfGDPLAD/cL/+i1vr3I2z/1tz2/b+RG5PctOAsazmuX1687xDcx1/2y28tpdxnSZn2q3THKDgmyc1J3rL69lrrxUmuTPf2muOXm64JZsommClbZ57MzEzZvH/zOjax2THl19kh2Zb5+QzZ1jLnxob5JjA7pvy9l6z9szul2bHV52/Zs2OtfJOcHfb4YogH9MvRjwO1Wv9e7P/cX/yTEXPcKd0uxV/KdI/lliSPKaUcneSQdLs/vy/JhbPusTKjY9LtOnxlrfVTpZRHpPur7r36jH+53l/ZxlRKuXOSp/QXXz9Ghv6vKO9M8l1JfqGUsnrX4l9ayTfS2zAP6Zf/sM7tK9dvS/KIJH++8ESb8/B++fFa63oD+oNJjujXHeX4aQcwM2X/OQ6EmWKezMkU5klipmzRWq9jU5odk32dzbBs
y/x8trStEebGuvkmMjv29/yNPTvWyjel2bHp77+RZse/yTfV2aH4Ykv6v5g9s7943ohRkiSllP+U7lgEd0zXNH9Luj0Zf7XWev6I0X4lSUny1Frrer+QTcEPrXHdJ0opT621fnTpaToP7ZdXllJeke6vVPt6aSnlrUmevvq4CiM7NcmhSa5J8mcj5nhWugP9/liS7y6lfKi//rgkd0/y6nQHNx3DNf3y69e5/Rv2+fj+C86yFStZPrPBOitnzJlS7skzUzbtQJgp5sn8TGWeJGbKfm3wOjaJ2TG119l9Dcm2zM9nM9sac25sIt+os2OTX6vRZscG+SYxOwZ8ry91duwn3+Rmh7c6smmllO1J3pTksCTvnMhbLb413YEjn5buVKtJ8tLc1iQvXekOqPr8JG+ttf7BWDn24yNJnpvujByHJDk8yUnpzk7y4CTvKKUcMVK2e/TLh6cbNK9OdzaVu6c7UOiV6Q7K+dpR0q1vZdfi313nLRdLUbtTA39Lkrel+wXs+/p/RyT5RJL3jphv5RgB39O/BWS10/b5+K5LyLNZK3sVbPTLzQ398tAFZ2mGmbI5B8BMMU/mbxLzJDFT9mc/r2Ojz46Jvs4mGZZtmZ/PFrY1ytzYX76xZ8cmnr9RZ8d+8o0+OwZ+ry9tduwv3xRnh+KLrfitdKfn/Vy6A9iNrtb6o7XWbUnunOQh6V6Y/muSvy6lHL7sPKWUg9MdSPCf0jXdk1RrfXWt9TW11r+ttd5Ya/1CrfWCdKe//et0u/K+ZONHWZiV16U7JnlTrfUna61/V2u9ttb6J+leNPcm+cFSyjes+yhLVEo5Krf9svOGkbN8S7qzNB2Vbjjv7P99X7qBfV4pZR6n/96y/gCZ70lycJILSyknllIOLaUcVUp5TZIfSHea96Q7iwxtM1P240CYKebJfE1pniRmyiZM7nVslSnnG5JtmZ/PprY14txYN99EZseGz98EZsdG+aYwO7b0vT7C7Ngw3xRnh+KLTSmlnJnujA1XJTmx1nrVyJH+lVrrTbXWT9RaX5juRfLfJzlrhCi/mu69zi+otU7xOAobqrXenOT0/uLjR4px/T4f//bqG2utK6fc3pb5n3J7qJW/sHyg1vq3Y4UopdwtyVvT/eX4cbXWP6m1/kP/74+TPC7dQSRfWkp5wEaPtUCnJnl/kgcleUe6X8o+meQn0v2yeFm/3pdGSbe2lb/I32WDdVb+sn/9BuvQM1M27YCdKebJYJOYJ4mZsj+beB0bdXZM+XV2SLZlfj5DtrXMubGJfKPOjlm+VsuYHZvIN+rsGPj8LW127C/fVGeHY3yxX6WUV6bbFXV3um/uT+7nLmPbleQVSb63lHLHJe9GeXK6vyo+o5TyjFW3PahfnlZKOSnJFbXWH11its26vF+O9daUT63z8ep1jk13JqRRlVIOym3HJxjtIMS970n315R39bsY/yu11itKKf8ryaP7f0v/Wa61XlNKeWSS70jy7ekOHnp1kj+utX6olLJyBpqxjgm0lk/3y/ttsM59V63LOsyULTnQZ4p5sgUTmyeJmbKuTb6OfbpfLn12TPl1dki2ZX4+c9rWrixobmwy32izY07P38JmxybzjTY7Bv58LG12bDLfJGeH4osNlVJ+LckLknwxyXfUWj8xcqTN+Md0p1Ddnu492lcveft3yMbt/9f3/+62nDhbds9+ecOGay3Opft8fM90u9Cu9tX9cqyM+/qudIP5hiRjH3/na/vldRusc22/vMcG6yxU7c7gcmH/7yv63cW/Jt3rzSUjRFvPyvfkQ0opB9e1z8513Kp1WYOZMsiBPFPMk62Z0jxJzJQ1beF1bJTZMeXX2SHZlvn5zHFbC5kbW8y39Nkxx+dvIbNjwM/uSpalzI4Znr+lzI4t5Jvk7FB8sa5SyhlJXpjuxfuxtda/GTnSZj0q3ff2tVn/FNcLUWv9uvVuK6XsSnfwyxfWWl+xrEwDPLlffnCMjddar+z/CvDN6d47
/pF9by+l3D3dacmT5EMZ34/0yz+stY79H6eVv2wfs9ZfGEt3CuFj+ovr/QVrTD/dL/97v6v7JNRaP1dKuSTd992pSX5339tLKSekO3DnVUk+sPyEBwYzZesamCnmydZMaZ4kZsq/sZXXsTFmx5RfZ4dkW+bnM+dtzX1ubPF77+s2eJxdWcDsmPPzN/fZscXnb+mzY8bnb+GzY4v5Jjk7HOOLNZVSfjnJi9K9YD+21jqZvRhKKd9WSjmpP5vE6tu+Nbft4vn6Wuuty003faWUh/XP30Grrt9eSvmpdLuvJsmrlp/uK36lX/6XUsqxK1eWUu6U5Jx0ZxD5cEYuGUopX53ke/uLU3hbytuS/HO6v7S8qpTyVSs39B//Zrq3VfxjkrePEbCU8tBSyl1WXbe9lPKzSX48yRW57es/JSvHm3h5fwDRJEkp5V657aw+Z9RaHZR/DWZKm8yT+ZngPEnMlNXbGvI6trTZMfHX2S1nW+bns9VtLXtuTPlrmwx6/pY6OwY+f0ubHbN8fZcxOwbkm+Ts2LZ3795lbYt9lFIekX99CtQHpzsA3CezzwE4a63HLzlaSilPSPLH/cUPJfn4OqteXms9YzmpblNKeWaSN6b74bsk3V/KDk3yDemexyS5IMmp6+xWPoqp/HW+lPJ9Sc5P9312SZJr0u3G+9B0pxLek+TFtdZfHytjkpRSXpHuFMJfTnd2ly+mO9PL4elOI/yYsY9ZUUr5ySS/ke5n4RvHzLKiP5bD65MclO4vLitv7zgm3Vs+/m+Sp9Za3zpSvl3p/vJ9Sbqv48FJjk939p5PJvnOWuunF5xh0Otv+X/t3XusHFUdwPFvU/oASk1rtIIIJmp+Wh+kWGuEIC8BiTyMIhqIseCDVFD8A9GYSMTGpFGDYGgtIFIeJmIUUaOAQLWCaMQSERL4SdRIIFaQl0UpFHr945yFZbn3endnO7dsvp+kmd6ZOXN+O9udX/d3z5yJWAOsALZQJlHeSvlN4HzKJJ7HDek/uH3H12ZO6bevtnPKAPEtp6WcMsz3aXvklAHOXav5pMFnt5V80uT9bSOfDHhtaS2nDBjfOlrIKU2uY23kjkHjayN3DBJbm3ljwPiW017eGOq5GHbuGPD8tZY7Gn52t3vuaPr+bu/c0eDassN9H/FWx+kznzJ8std0PRWnW/e9tkvrn/FsAFovfNV+VwIHUM7XfpSnamwCfkh57Oy0fKl/kbgdOI9y4V5MOY9jwH2UJL46MzdOX3hFZp4REbdQnsy0hPKY6HspF/dVmfngdMZXnVSX0/7I+Y7MvDQi7gA+Q3lvD6ub7qckoHOmeb6PqynzIuxDmdtkC5DAVyn/9ra0EMNA19/M/GRE3AycSpk3YyZlAtbvAN8a4mivQeJrM6f021fbOaXf+NrMKTty7of+42s7nwz62W0rnzR5f9vIJ33H13JOGeT8tZVTBr6OtZQ7Bo2vjWvSILG1mTcG6avNvLGjfy8bJL42c0eTz24buaPp+7u9c8dA8e2I30cc8SVJkiRJkqSR5BxfkiRJkiRJGkkWviRJkiRJkjSSLHxJkiRJkiRpJFn4kiRJkiRJ0kiy8CVJkiRJkqSRZOFLkiRJkiRJI8nClyRJkiRJkkaShS9JkiRJkiSNJAtfkiRJkiRJGkkWviRJkiRJkjSSLHxJkiRJkiRpJFn4kiRJkiRJ0kjaaboDkCRJGgURMRtYAXwQWAzsCjwMbAJuBq7IzN+O0+YTtc0ba5tNwI3A1zLzrnH6mQMcAxwN7AO8EpgH/LP2c05mbhxWjLXdIuBM4D3AXsBWIIErgfMz88lx2qwDPgKcDawEPgWcBLwO2ALcAnwpM/8wXqySJEnDMGNsbGy6Y5AkSXpRi4idgBuAA+uqMeAxYDdgZl13ZWZ+qKvN7sA1lOIVwDbgP7UNlOLQiZl5VU9fRwE/7ernUWBnYG5d9zRwcmZe3jTG2m5ZjXNhXbUZmNXV3+3A4Zn5QE+7dZTC11eApcAR
lILZk5RCXec1HjJesU2SJGkYvNVRkiSpuRMoBaX/Ah8GdsnMBcAcYG/gNEqBCICImAX8mFL0uhHYD5ibmfOBPYBzKYWlyyPiNT19PQ58E3gnMC8zF2bmzrWfcykj+i+MiL2axFjjXABcTSl63QEsqzHOAz4APFJfw3cnOTenAm+jjDKbl5m71TZ31td43iRtJUmSGnHElyRJUkMRsYZyC+HazFwxhf0/BlwE3AQcmplbx9lnLXAKsDozT+sjlouBkym3EZ49aIy1zReBL1NGlb0hMzf1bD8cuK7+eGhmru/ato4y4gvggMy8uaftW4HObY57Z+a9U3uFkiRJU+eIL0mSpOb+XZe7T3H/TkHovPGKXlVnFNVhfcbSuQ1y/571/cYIcFxdfru36AWQmb8AOrcpHj/BMW7qLXrVthuB++qPb+ojJkmSpClzcntJkqTmrgE+BxwbET8B1gEbMvOh3h3rXFvL6o8XRMTqCY7ZmXfrVeMcYyHlFsIjgQBe0rV/xx6Dxlj7mM1zBalfThAjwHrgHcC+E2y/dZK29wN7Agsm2UeSJGlgFr4kSZIayswNEXEWcBblaYtHA0TE3cDPgAsy8566+0Jgdv37S6dw+J27f4iIxZRi06Ku1ZuBJygT1s+mFJJ2bRBjJ87O3QH3TxJfZ9TWyybYvnmStlvqctYk+0iSJA3MwpckSdIQZObKiLiCMon7QZRRUK+vf06PiI9m5mU8f6qJJZn5xz67uoRS9LoN+ALwm8x8vLMxIg6lPL1xRoMYe80dZ50kSdIOzzm+JEmShiQz/5aZqzLz3ZQRUwcDv6b8snFNRLwceAh4pjbpffLipOqTGpfV9sdk5nXdRa9q0Qtb9h0jwMPAtinEuWddPjj1VyJJktQOC1+SJEnbQWY+k5m/Ao4CtlJuPVxaJ7PvPM3wyD4P+2yRKTMnuv3wXU1jrNueAu6sux48yWEOqcvbptqvJElSWyx8SZIkNVQngp/IUzw3wmtOXa6ry+URsc//OXb3xO+P1eWirpFZ3fu+GThhSDEC/KArzhc8DTIiDqfcLgnw/UmOL0mSNC0sfEmSJDV3WURcEhFHRMRunZUR8WrgUsocWU8AN9VNFwO/q+vXR8THI2J+V7tXRMSJEbEBOL2rn7sok8nPAK6MiNfW/WdFxPuA64HeWx8HjRHgfOAflAn2r42IpbXNzIh4P/C9ut8Nmbl+KidKkiSpTU5uL0mS1NxcyoTxy4GxiHiM8nTFXer2Z4BTMvNfAJm5NSKOBa4C9gcuBNZGxKOUEVfdT2R8tqCUmdsi4tOUkVgHAfdExObaZjZwL3AGcHnTGGt/j0TEe4FrgbcAt9b+ZvHchPd/Ak6c6omSJElqkyO+JEmSmvs8cCalQPRXSkFpJvAXylMY983M5xWjMvMB4EBK0ejnlMnhOyOx7gYuA44HVvW0+xFlXq3rgU4R6u/A14EllBFhQ4mx9vd7YDHwDeDPtb+nKfOUfRZ4e30tkiRJO5wZY2Nj0x2DJEmSJEmSNHSO+JIkSZIkSdJIsvAlSZIkSZKkkWThS5IkSZIkSSPJwpckSZIkSZJGkoUvSZIkSZIkjSQLX5IkSZIkSRpJFr4kSZIkSZI0kix8SZIkSZIkaSRZ+JIkSZIkSdJIsvAlSZIkSZKkkfQ/7o9GI4AztoEAAAAASUVORK5CYII=\n",
      "text/plain": [
       "<Figure size 1440x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(20,8))\n",
    "sns.countplot(x=\"season\", data=df);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "62b5f769410b48e96880ffb0c86564dc4934c2dc"
   },
   "source": [
    "#### 7.Wide form plot"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 199,
   "metadata": {
    "_uuid": "73fa21a03b7b91d7fc20813b242119d3c27b4701"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>country</th>\n",
       "      <th>beer_servings</th>\n",
       "      <th>spirit_servings</th>\n",
       "      <th>wine_servings</th>\n",
       "      <th>total_litres_of_pure_alcohol</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Afghanistan</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Albania</td>\n",
       "      <td>89</td>\n",
       "      <td>132</td>\n",
       "      <td>54</td>\n",
       "      <td>4.9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Algeria</td>\n",
       "      <td>25</td>\n",
       "      <td>0</td>\n",
       "      <td>14</td>\n",
       "      <td>0.7</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>Andorra</td>\n",
       "      <td>245</td>\n",
       "      <td>138</td>\n",
       "      <td>312</td>\n",
       "      <td>12.4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>Angola</td>\n",
       "      <td>217</td>\n",
       "      <td>57</td>\n",
       "      <td>45</td>\n",
       "      <td>5.9</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "       country              ...               total_litres_of_pure_alcohol\n",
       "0  Afghanistan              ...                                        0.0\n",
       "1      Albania              ...                                        4.9\n",
       "2      Algeria              ...                                        0.7\n",
       "3      Andorra              ...                                       12.4\n",
       "4       Angola              ...                                        5.9\n",
       "\n",
       "[5 rows x 5 columns]"
      ]
     },
     "execution_count": 199,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/data-alcohol.csv')\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 200,
   "metadata": {
    "_uuid": "7d173cd7675a262df02185c117878517cc0bd767"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 1440x576 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "plt.figure(figsize=(20,8))\n",
    "sns.boxplot(data=df, orient=\"h\");"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "8ffd9bea4f44ff1da5cd3690ebec622a3bb29937"
   },
   "source": [
    "# 24. Plotting with data-aware grids <a id=\"241\"></a>\n",
    "---\n",
    "[**Go To TOP**](#00)\n",
    "\n",
    "![](https://i.stack.imgur.com/YsSZc.png)\n",
    "\n",
    "### In this section, you will learn\n",
    "\n",
    "1. Plotting with FacetGrid()\n",
    "2. Plotting with PairGrid()\n",
    "\t1. MLB Players Height, Weight, Age and Positions dataset\n",
    "3. Plot it with PairGrid()\n",
    "4. Plotting with pairplot()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "09b33f3006fc87ea5a3627dd614ef9d01e23e87f"
   },
   "source": [
    "### 1. Plotting with FacetGrid()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 201,
   "metadata": {
    "_uuid": "85228153c49345cdc87a166505302b38e804efad"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>PassengerId</th>\n",
       "      <th>Survived</th>\n",
       "      <th>Pclass</th>\n",
       "      <th>Name</th>\n",
       "      <th>Sex</th>\n",
       "      <th>Age</th>\n",
       "      <th>SibSp</th>\n",
       "      <th>Parch</th>\n",
       "      <th>Ticket</th>\n",
       "      <th>Fare</th>\n",
       "      <th>Cabin</th>\n",
       "      <th>Embarked</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>Braund, Mr. Owen Harris</td>\n",
       "      <td>male</td>\n",
       "      <td>22.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>A/5 21171</td>\n",
       "      <td>7.2500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>2</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>Cumings, Mrs. John Bradley (Florence Briggs Th...</td>\n",
       "      <td>female</td>\n",
       "      <td>38.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>PC 17599</td>\n",
       "      <td>71.2833</td>\n",
       "      <td>C85</td>\n",
       "      <td>C</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>3</td>\n",
       "      <td>1</td>\n",
       "      <td>3</td>\n",
       "      <td>Heikkinen, Miss. Laina</td>\n",
       "      <td>female</td>\n",
       "      <td>26.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>STON/O2. 3101282</td>\n",
       "      <td>7.9250</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>4</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>Futrelle, Mrs. Jacques Heath (Lily May Peel)</td>\n",
       "      <td>female</td>\n",
       "      <td>35.0</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>113803</td>\n",
       "      <td>53.1000</td>\n",
       "      <td>C123</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>5</td>\n",
       "      <td>0</td>\n",
       "      <td>3</td>\n",
       "      <td>Allen, Mr. William Henry</td>\n",
       "      <td>male</td>\n",
       "      <td>35.0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>373450</td>\n",
       "      <td>8.0500</td>\n",
       "      <td>NaN</td>\n",
       "      <td>S</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "   PassengerId  Survived  Pclass    ...        Fare Cabin  Embarked\n",
       "0            1         0       3    ...      7.2500   NaN         S\n",
       "1            2         1       1    ...     71.2833   C85         C\n",
       "2            3         1       3    ...      7.9250   NaN         S\n",
       "3            4         1       1    ...     53.1000  C123         S\n",
       "4            5         0       3    ...      8.0500   NaN         S\n",
       "\n",
       "[5 rows x 12 columns]"
      ]
     },
     "execution_count": 201,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df = pd.read_csv('../input/datasetsdifferent-format/data-titanic.csv')\n",
    "df.head()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 202,
   "metadata": {
    "_uuid": "dd03387f1052a8ed71d0a55ca929ebed8ade473a"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 539.35x216 with 2 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "g = sns.FacetGrid(df, col=\"Sex\", hue='Survived')\n",
    "g.map(plt.hist, \"Age\");\n",
    "g.add_legend();"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "17a570fc4e7f187e4f75baaa8e01c403937c3581"
   },
   "source": [
    "### 2. Plotting with PairGrid()\n",
    "#### MLB Players Height, Weight, Age and Positions dataset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 203,
   "metadata": {
    "_uuid": "e8e6f1e37a916c9bda843de68fa734b2fe972352"
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Position</th>\n",
       "      <th>Height</th>\n",
       "      <th>Weight</th>\n",
       "      <th>Age</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>Catcher</td>\n",
       "      <td>74</td>\n",
       "      <td>180.0</td>\n",
       "      <td>22.99</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>Catcher</td>\n",
       "      <td>74</td>\n",
       "      <td>215.0</td>\n",
       "      <td>34.69</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>Catcher</td>\n",
       "      <td>72</td>\n",
       "      <td>210.0</td>\n",
       "      <td>30.78</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>First_Baseman</td>\n",
       "      <td>72</td>\n",
       "      <td>210.0</td>\n",
       "      <td>35.43</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>First_Baseman</td>\n",
       "      <td>73</td>\n",
       "      <td>188.0</td>\n",
       "      <td>35.71</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "        Position  Height  Weight    Age\n",
       "0        Catcher      74   180.0  22.99\n",
       "1        Catcher      74   215.0  34.69\n",
       "2        Catcher      72   210.0  30.78\n",
       "3  First_Baseman      72   210.0  35.43\n",
       "4  First_Baseman      73   188.0  35.71"
      ]
     },
     "execution_count": 203,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "mlb = pd.read_csv('../input/datasetsdifferent-format/data-mlb-players.csv')\n",
    "mlb.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "6e4311aa954b72a2f842847d9db7747ef15be449"
   },
   "source": [
    "### 3. Plot it with PairGrid()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 204,
   "metadata": {
    "_uuid": "bdb984348110b4e98ee7d5cd1209f0345e7a9389"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 638.45x360 with 4 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "g = sns.PairGrid(mlb, vars=[\"Height\", \"Weight\"], hue=\"Position\")\n",
    "g.map(plt.scatter);\n",
    "g.add_legend();"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "e0eb43eabe15ae3a3d8396209892c86338f5001a"
   },
   "source": [
    "### 4. Plotting with pairplot()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 205,
   "metadata": {
    "_uuid": "7805ab03e6e7961c2512338039270392d077dd6c"
   },
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 818.45x540 with 12 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "sns.pairplot(mlb, hue=\"Position\", height=2.5);"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "_uuid": "a955f33ac0871464e0f587912b2153b758a5bc9d"
   },
   "source": [
    "### <span style=\"color:orange\">Thanks for reading this notebook...🙏...🙏...🙏!!!</span>"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.6.6"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 1
}