{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# datatable_demo\n",
    "\n",
    "This notebook demonstrates the use of the DataTable object in the ukds package.\n",
    "\n",
    "This demonstration uses for an example the following dataset: Gershuny, J., Sullivan, O. (2017). United Kingdom Time Use Survey, 2014-2015. Centre for Time Use Research, University of Oxford. [data collection]. UK Data Service. SN: 8128, http://doi.org/10.5255/UKDA-SN-8128-1\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Import the ukds package\n",
    "\n",
    "This demonstration used the `ukds` package, which is available on PyPi."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "import ukds"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Set up a filepath to a .tab data table file\n",
    "\n",
    "The filepath to the data table under study is specified here. This can be changed as needed."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [],
   "source": [
    "fp_tab=r'C:\\Users\\cvskf\\OneDrive - Loughborough University\\_Data\\United_Kingdom_Time_Use_Survey_2014-2015'+\\\n",
    "       r'\\UKDA-8128-tab\\tab\\uktus15_household.tab'"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Set up a filepath to a UKDS .rtf data dictionary file\n",
    "\n",
    "The filepath to the associated data dictionary is specified here. This can be changed as needed."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [],
   "source": [
    "fp_dd=r'C:\\Users\\cvskf\\OneDrive - Loughborough University\\_Data\\United_Kingdom_Time_Use_Survey_2014-2015' + \\\n",
    "      r'\\UKDA-8128-tab\\mrdoc\\allissue\\uktus15_household_ukda_data_dictionary.rtf'"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Create a DataTable object \n",
    "\n",
    "A DataTable object is created. The filepaths are supplied as arguments and the files are read into the DataTable object."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "A class for reading a UK Data Service .tab data table file\n",
      "    \n",
      "<ukds.data_table.DataTable object at 0x000001DBFB2F92B0>\n"
     ]
    }
   ],
   "source": [
    "dt=ukds.DataTable(fp_tab,fp_dd)\n",
    "print(dt.__doc__)\n",
    "print(dt)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The data table .tab file is stored in the `tab` attribute as a pandas DataFrame:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>serial</th>\n",
       "      <th>strata</th>\n",
       "      <th>psu</th>\n",
       "      <th>HhOut</th>\n",
       "      <th>hh_wt</th>\n",
       "      <th>IMonth</th>\n",
       "      <th>IYear</th>\n",
       "      <th>DM014</th>\n",
       "      <th>DM016</th>\n",
       "      <th>DM510</th>\n",
       "      <th>...</th>\n",
       "      <th>Relate10_P1</th>\n",
       "      <th>Relate10_P2</th>\n",
       "      <th>Relate10_P3</th>\n",
       "      <th>Relate10_P4</th>\n",
       "      <th>Relate10_P5</th>\n",
       "      <th>Relate10_P6</th>\n",
       "      <th>Relate10_P7</th>\n",
       "      <th>Relate10_P8</th>\n",
       "      <th>Relate10_P9</th>\n",
       "      <th>Relate10_P10</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>11010903</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2</td>\n",
       "      <td>598</td>\n",
       "      <td>NaN</td>\n",
       "      <td>9</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>11010904</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2</td>\n",
       "      <td>598</td>\n",
       "      <td>NaN</td>\n",
       "      <td>9</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>11010906</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2</td>\n",
       "      <td>598</td>\n",
       "      <td>NaN</td>\n",
       "      <td>10</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2.0</td>\n",
       "      <td>-2.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>11010907</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2</td>\n",
       "      <td>598</td>\n",
       "      <td>NaN</td>\n",
       "      <td>9</td>\n",
       "      <td>2014</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2.0</td>\n",
       "      <td>-2.0</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>11010908</td>\n",
       "      <td>-2</td>\n",
       "      <td>-2</td>\n",
       "      <td>598</td>\n",
       "      <td>NaN</td>\n",
       "      <td>9</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>-2</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>5 rows × 335 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "     serial  strata  psu  HhOut  hh_wt  IMonth  IYear  DM014  DM016  DM510  \\\n",
       "0  11010903      -2   -2    598    NaN       9   2014      0      0      0   \n",
       "1  11010904      -2   -2    598    NaN       9   2014      0      0      0   \n",
       "2  11010906      -2   -2    598    NaN      10   2014      0      0      0   \n",
       "3  11010907      -2   -2    598    NaN       9   2014      1      1      0   \n",
       "4  11010908      -2   -2    598    NaN       9   2014      0      0      0   \n",
       "\n",
       "   ...  Relate10_P1  Relate10_P2  Relate10_P3  Relate10_P4  Relate10_P5  \\\n",
       "0  ...           -2         -2.0          NaN          NaN          NaN   \n",
       "1  ...           -2         -2.0          NaN          NaN          NaN   \n",
       "2  ...           -2         -2.0         -2.0          NaN          NaN   \n",
       "3  ...           -2         -2.0         -2.0          NaN          NaN   \n",
       "4  ...           -2          NaN          NaN          NaN          NaN   \n",
       "\n",
       "   Relate10_P6  Relate10_P7  Relate10_P8  Relate10_P9  Relate10_P10  \n",
       "0          NaN          NaN          NaN          NaN           NaN  \n",
       "1          NaN          NaN          NaN          NaN           NaN  \n",
       "2          NaN          NaN          NaN          NaN           NaN  \n",
       "3          NaN          NaN          NaN          NaN           NaN  \n",
       "4          NaN          NaN          NaN          NaN           NaN  \n",
       "\n",
       "[5 rows x 335 columns]"
      ]
     },
     "execution_count": 5,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dt.tab.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The data dictionary .rtf file is stored in the `datadictionary` attribute as a ukds.DataDictionary object:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "<ukds.data_dictionary.DataDictionary at 0x1dbfb2f9358>"
      ]
     },
     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dt.datadictionary"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Get dataframe\n",
    "\n",
    "The information in the `tab` and `datadictionary` attributes can be combined by the `get_dataframe` method.\n",
    "\n",
    "This method returns a new pandas Dataframe in which:\n",
    "- the columns are a multi-level index which hold the data dictionary information \n",
    "- the table values are converted from numerical values to the label values, where applicable\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead tr th {\n",
       "        text-align: left;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr>\n",
       "      <th>variable</th>\n",
       "      <th>serial</th>\n",
       "      <th>strata</th>\n",
       "      <th>psu</th>\n",
       "      <th>HhOut</th>\n",
       "      <th>hh_wt</th>\n",
       "      <th>IMonth</th>\n",
       "      <th>IYear</th>\n",
       "      <th>DM014</th>\n",
       "      <th>DM016</th>\n",
       "      <th>DM510</th>\n",
       "      <th>...</th>\n",
       "      <th>Relate10_P1</th>\n",
       "      <th>Relate10_P2</th>\n",
       "      <th>Relate10_P3</th>\n",
       "      <th>Relate10_P4</th>\n",
       "      <th>Relate10_P5</th>\n",
       "      <th>Relate10_P6</th>\n",
       "      <th>Relate10_P7</th>\n",
       "      <th>Relate10_P8</th>\n",
       "      <th>Relate10_P9</th>\n",
       "      <th>Relate10_P10</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>variable_label</th>\n",
       "      <th>Household number</th>\n",
       "      <th>Strata</th>\n",
       "      <th>Primary sampling unit</th>\n",
       "      <th>Final outcome - household</th>\n",
       "      <th>Household weight</th>\n",
       "      <th>Interview month</th>\n",
       "      <th>Interview Year</th>\n",
       "      <th>Number of children aged 0-14</th>\n",
       "      <th>Number of children aged 0-16</th>\n",
       "      <th>Number of children aged 5-10</th>\n",
       "      <th>...</th>\n",
       "      <th>Relate10_P1: How related to person 10</th>\n",
       "      <th>Relate10_P2: How related to person 10</th>\n",
       "      <th>Relate10_P3: How related to person 10</th>\n",
       "      <th>Relate10_P4: How related to person 10</th>\n",
       "      <th>Relate10_P5: How related to person 10</th>\n",
       "      <th>Relate10_P6: How related to person 10</th>\n",
       "      <th>Relate10_P7: How related to person 10</th>\n",
       "      <th>Relate10_P8: How related to person 10</th>\n",
       "      <th>Relate10_P9: How related to person 10</th>\n",
       "      <th>Relate10_P10: How related to person 10</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>variable_type</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>...</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "      <th>numeric</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>SPSS_measurement_level</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>NOMINAL</th>\n",
       "      <th>NOMINAL</th>\n",
       "      <th>NOMINAL</th>\n",
       "      <th>NOMINAL</th>\n",
       "      <th>NOMINAL</th>\n",
       "      <th>...</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>SCALE</th>\n",
       "      <th>NOMINAL</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>SPSS_user_missing_values</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th>...</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>pos</th>\n",
       "      <th>1</th>\n",
       "      <th>2</th>\n",
       "      <th>3</th>\n",
       "      <th>4</th>\n",
       "      <th>5</th>\n",
       "      <th>6</th>\n",
       "      <th>7</th>\n",
       "      <th>8</th>\n",
       "      <th>9</th>\n",
       "      <th>10</th>\n",
       "      <th>...</th>\n",
       "      <th>326</th>\n",
       "      <th>327</th>\n",
       "      <th>328</th>\n",
       "      <th>329</th>\n",
       "      <th>330</th>\n",
       "      <th>331</th>\n",
       "      <th>332</th>\n",
       "      <th>333</th>\n",
       "      <th>334</th>\n",
       "      <th>335</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>11010903</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Other reasons why unproductive</td>\n",
       "      <td>NaN</td>\n",
       "      <td>September</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>11010904</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Other reasons why unproductive</td>\n",
       "      <td>NaN</td>\n",
       "      <td>September</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>11010906</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Other reasons why unproductive</td>\n",
       "      <td>NaN</td>\n",
       "      <td>October</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>11010907</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Other reasons why unproductive</td>\n",
       "      <td>NaN</td>\n",
       "      <td>September</td>\n",
       "      <td>2014</td>\n",
       "      <td>1</td>\n",
       "      <td>1</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>11010908</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>Other reasons why unproductive</td>\n",
       "      <td>NaN</td>\n",
       "      <td>September</td>\n",
       "      <td>2014</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>0</td>\n",
       "      <td>...</td>\n",
       "      <td>Schedule not applicable</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "      <td>NaN</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>5 rows × 335 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "variable                           serial                   strata  \\\n",
       "variable_label           Household number                   Strata   \n",
       "variable_type                     numeric                  numeric   \n",
       "SPSS_measurement_level              SCALE                    SCALE   \n",
       "SPSS_user_missing_values                                             \n",
       "pos                                     1                        2   \n",
       "0                                11010903  Schedule not applicable   \n",
       "1                                11010904  Schedule not applicable   \n",
       "2                                11010906  Schedule not applicable   \n",
       "3                                11010907  Schedule not applicable   \n",
       "4                                11010908  Schedule not applicable   \n",
       "\n",
       "variable                                      psu  \\\n",
       "variable_label              Primary sampling unit   \n",
       "variable_type                             numeric   \n",
       "SPSS_measurement_level                      SCALE   \n",
       "SPSS_user_missing_values                            \n",
       "pos                                             3   \n",
       "0                         Schedule not applicable   \n",
       "1                         Schedule not applicable   \n",
       "2                         Schedule not applicable   \n",
       "3                         Schedule not applicable   \n",
       "4                         Schedule not applicable   \n",
       "\n",
       "variable                                           HhOut            hh_wt  \\\n",
       "variable_label                 Final outcome - household Household weight   \n",
       "variable_type                                    numeric          numeric   \n",
       "SPSS_measurement_level                             SCALE            SCALE   \n",
       "SPSS_user_missing_values                                                    \n",
       "pos                                                    4                5   \n",
       "0                         Other reasons why unproductive              NaN   \n",
       "1                         Other reasons why unproductive              NaN   \n",
       "2                         Other reasons why unproductive              NaN   \n",
       "3                         Other reasons why unproductive              NaN   \n",
       "4                         Other reasons why unproductive              NaN   \n",
       "\n",
       "variable                          IMonth          IYear  \\\n",
       "variable_label           Interview month Interview Year   \n",
       "variable_type                    numeric        numeric   \n",
       "SPSS_measurement_level           NOMINAL        NOMINAL   \n",
       "SPSS_user_missing_values                                  \n",
       "pos                                    6              7   \n",
       "0                              September           2014   \n",
       "1                              September           2014   \n",
       "2                                October           2014   \n",
       "3                              September           2014   \n",
       "4                              September           2014   \n",
       "\n",
       "variable                                        DM014  \\\n",
       "variable_label           Number of children aged 0-14   \n",
       "variable_type                                 numeric   \n",
       "SPSS_measurement_level                        NOMINAL   \n",
       "SPSS_user_missing_values                                \n",
       "pos                                                 8   \n",
       "0                                                   0   \n",
       "1                                                   0   \n",
       "2                                                   0   \n",
       "3                                                   1   \n",
       "4                                                   0   \n",
       "\n",
       "variable                                        DM016  \\\n",
       "variable_label           Number of children aged 0-16   \n",
       "variable_type                                 numeric   \n",
       "SPSS_measurement_level                        NOMINAL   \n",
       "SPSS_user_missing_values                                \n",
       "pos                                                 9   \n",
       "0                                                   0   \n",
       "1                                                   0   \n",
       "2                                                   0   \n",
       "3                                                   1   \n",
       "4                                                   0   \n",
       "\n",
       "variable                                        DM510  ...  \\\n",
       "variable_label           Number of children aged 5-10  ...   \n",
       "variable_type                                 numeric  ...   \n",
       "SPSS_measurement_level                        NOMINAL  ...   \n",
       "SPSS_user_missing_values                               ...   \n",
       "pos                                                10  ...   \n",
       "0                                                   0  ...   \n",
       "1                                                   0  ...   \n",
       "2                                                   0  ...   \n",
       "3                                                   0  ...   \n",
       "4                                                   0  ...   \n",
       "\n",
       "variable                                           Relate10_P1  \\\n",
       "variable_label           Relate10_P1: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        326   \n",
       "0                                      Schedule not applicable   \n",
       "1                                      Schedule not applicable   \n",
       "2                                      Schedule not applicable   \n",
       "3                                      Schedule not applicable   \n",
       "4                                      Schedule not applicable   \n",
       "\n",
       "variable                                           Relate10_P2  \\\n",
       "variable_label           Relate10_P2: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        327   \n",
       "0                                      Schedule not applicable   \n",
       "1                                      Schedule not applicable   \n",
       "2                                      Schedule not applicable   \n",
       "3                                      Schedule not applicable   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P3  \\\n",
       "variable_label           Relate10_P3: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        328   \n",
       "0                                                          NaN   \n",
       "1                                                          NaN   \n",
       "2                                      Schedule not applicable   \n",
       "3                                      Schedule not applicable   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P4  \\\n",
       "variable_label           Relate10_P4: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        329   \n",
       "0                                                          NaN   \n",
       "1                                                          NaN   \n",
       "2                                                          NaN   \n",
       "3                                                          NaN   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P5  \\\n",
       "variable_label           Relate10_P5: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        330   \n",
       "0                                                          NaN   \n",
       "1                                                          NaN   \n",
       "2                                                          NaN   \n",
       "3                                                          NaN   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P6  \\\n",
       "variable_label           Relate10_P6: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        331   \n",
       "0                                                          NaN   \n",
       "1                                                          NaN   \n",
       "2                                                          NaN   \n",
       "3                                                          NaN   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P7  \\\n",
       "variable_label           Relate10_P7: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        332   \n",
       "0                                                          NaN   \n",
       "1                                                          NaN   \n",
       "2                                                          NaN   \n",
       "3                                                          NaN   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P8  \\\n",
       "variable_label           Relate10_P8: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        333   \n",
       "0                                                          NaN   \n",
       "1                                                          NaN   \n",
       "2                                                          NaN   \n",
       "3                                                          NaN   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P9  \\\n",
       "variable_label           Relate10_P9: How related to person 10   \n",
       "variable_type                                          numeric   \n",
       "SPSS_measurement_level                                   SCALE   \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                        334   \n",
       "0                                                          NaN   \n",
       "1                                                          NaN   \n",
       "2                                                          NaN   \n",
       "3                                                          NaN   \n",
       "4                                                          NaN   \n",
       "\n",
       "variable                                           Relate10_P10  \n",
       "variable_label           Relate10_P10: How related to person 10  \n",
       "variable_type                                           numeric  \n",
       "SPSS_measurement_level                                  NOMINAL  \n",
       "SPSS_user_missing_values                                         \n",
       "pos                                                         335  \n",
       "0                                                           NaN  \n",
       "1                                                           NaN  \n",
       "2                                                           NaN  \n",
       "3                                                           NaN  \n",
       "4                                                           NaN  \n",
       "\n",
       "[5 rows x 335 columns]"
      ]
     },
     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Retrieve the data table as a DataFrame and preview the first five rows\n",
    "df = dt.get_dataframe()\n",
    "df.head()"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.3"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}