{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Missing column entries\n",
    "\n",
    "<div style=\"text-align: center\">\n",
    "    <table style=\"width:40%\">\n",
    "      <tr>\n",
    "        <th></th>\n",
    "        <th></th> \n",
    "        <th></th>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>Name </td>\n",
    "        <td>457894</td> \n",
    "        <td>**Missing 2**</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>Department</td>\n",
    "        <td>457677</td> \n",
    "        <td>**Missing 219**</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>OrganizationID</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>Institution</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>ProvinceEN</td>\n",
    "        <td>454115</td> \n",
    "        <td>**Missing 3781**</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>CountryEN</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>FiscalYear</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>AwardAmount</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>ProgramID</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>ProgramNameEN</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>Committee</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "      <tr>\n",
    "        <td>ResearchSubjectEN</td>\n",
    "        <td>457896</td> \n",
    "        <td>Missing 0</td>\n",
    "      </tr>\n",
    "    </table>\n",
    "    <br>\n",
    "    As you can see some of the data columns are missing values.<br>\n",
    "    <br>\n",
    "    The missing Name data is for two 'Head Office' departments, so it does not get assigned to a specific name/person.  One of these is for PIMS.<br>\n",
    "    <br>\n",
    "    The missing Department data seems unimportant but keep it in mind if you use any department selection or functions.<br>\n",
    "    <br>\n",
    "    The missing ProvinceEN data is mostly for non-canadian awards, and the canadian ones are for programs that     are in more than one province.<br>\n",
    "    \n",
    "</div>"
   ]
  },
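  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "The missing counts in the table above can be recomputed with a few lines of Python. The sketch below is only an illustration: it assumes the rows are available as a list of dicts (for example from `csv.DictReader`) and that a missing entry is `None` or a blank string. The helper `count_missing` and the sample rows are hypothetical, not taken from the dataset.\n",
    "\n",
    "```python\n",
    "# Count, per column, the rows whose value is None or blank.\n",
    "def count_missing(rows, columns):\n",
    "    missing = {col: 0 for col in columns}\n",
    "    for row in rows:\n",
    "        for col in columns:\n",
    "            value = row.get(col)\n",
    "            # Assumption: None, '' and whitespace-only strings count as missing.\n",
    "            if value is None or str(value).strip() == '':\n",
    "                missing[col] += 1\n",
    "    return missing\n",
    "\n",
    "\n",
    "# Hypothetical sample rows, for illustration only.\n",
    "sample_rows = [\n",
    "    {'Name': 'Person A', 'Department': 'Mathematics', 'ProvinceEN': 'Ontario'},\n",
    "    {'Name': '', 'Department': 'Head Office', 'ProvinceEN': None},\n",
    "    {'Name': 'Person B', 'Department': None, 'ProvinceEN': 'Ontario'},\n",
    "]\n",
    "missing_counts = count_missing(sample_rows, ['Name', 'Department', 'ProvinceEN'])\n",
    "print(missing_counts)  # {'Name': 1, 'Department': 1, 'ProvinceEN': 1}\n",
    "```\n",
    "\n",
    "If the data is already in a pandas DataFrame `df` (an assumption about the workflow), `df.isna().sum()` reports the per-column count of null values."
   ]
  },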
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Messy Data\n",
    "\n",
    "The 'ProgramID' column has leading whitespace that causes an issue if we do not remove it."
   ]
  },
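  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Stripping the whitespace once, right after loading, avoids having to remember it in every later comparison. A minimal sketch, assuming each row is a dict with a string 'ProgramID' value; the helper `strip_program_ids` and the sample IDs are hypothetical.\n",
    "\n",
    "```python\n",
    "# Return copies of the rows with surrounding whitespace removed from one column.\n",
    "def strip_program_ids(rows, column='ProgramID'):\n",
    "    cleaned = []\n",
    "    for row in rows:\n",
    "        new_row = dict(row)  # copy so the input rows are not modified\n",
    "        value = new_row.get(column)\n",
    "        if isinstance(value, str):\n",
    "            # strip() removes leading and trailing whitespace.\n",
    "            new_row[column] = value.strip()\n",
    "        cleaned.append(new_row)\n",
    "    return cleaned\n",
    "\n",
    "\n",
    "# Hypothetical IDs, for illustration only.\n",
    "raw_rows = [{'ProgramID': '  ABC'}, {'ProgramID': 'XYZ'}]\n",
    "clean_rows = strip_program_ids(raw_rows)\n",
    "\n",
    "# Without stripping, an exact match on the ID fails.\n",
    "print(raw_rows[0]['ProgramID'] == 'ABC')    # False\n",
    "print(clean_rows[0]['ProgramID'] == 'ABC')  # True\n",
    "```\n",
    "\n",
    "With a pandas DataFrame `df` (again an assumption), the equivalent would be `df['ProgramID'] = df['ProgramID'].str.strip()`."
   ]
  },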
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Search by Subject Issue\n",
    "Currently we are selecting funding based on subjects using the committee number. For example 1508 for mathematics,\n",
    "however this is problematic.  You can see an example by selecting for 'Adem, Alejandro' in the SelectionsTemplate.  They are clearly a mathematician in the math department however none of their committees are 1508.  We will need to implement some regex to search for ex. 'Math*' in a few of the columns, then implement this as subject selection. <br>\n",
    "Columns that need to be searched for the subject word are: Department, Institution. We would also include all of committee 1508."
   ]
  }
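  ,
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "One way the proposed subject selection could work is sketched below: a row is selected if its committee is 1508, or if the subject pattern matches its Department or Institution text. This is an illustration under assumptions, not the project's implementation: rows are dicts, the 'Committee' value may be a number or a string, the glob-like 'Math*' is interpreted as the case-insensitive regex `\\bmath`, and `matches_subject` and the sample rows are hypothetical.\n",
    "\n",
    "```python\n",
    "import re\n",
    "\n",
    "# Assumption: 'Math*' means any word starting with math, not the literal\n",
    "# regex 'Math*' (which would match 'Mat' followed by any number of 'h').\n",
    "SUBJECT_PATTERN = re.compile(r'\\bmath', re.IGNORECASE)\n",
    "SEARCH_COLUMNS = ('Department', 'Institution')\n",
    "\n",
    "\n",
    "# True if the row is in the given committee or a searched column matches.\n",
    "def matches_subject(row, pattern=SUBJECT_PATTERN, committee='1508'):\n",
    "    if str(row.get('Committee', '')).strip() == committee:\n",
    "        return True\n",
    "    # Missing values (None) are treated as empty text.\n",
    "    return any(pattern.search(str(row.get(col) or '')) for col in SEARCH_COLUMNS)\n",
    "\n",
    "\n",
    "# Hypothetical rows, for illustration only.\n",
    "rows = [\n",
    "    {'Name': 'Person A', 'Department': 'Mathematics',\n",
    "     'Institution': 'University X', 'Committee': 1234},\n",
    "    {'Name': 'Person B', 'Department': 'Chemistry',\n",
    "     'Institution': 'University X', 'Committee': 1508},\n",
    "    {'Name': 'Person C', 'Department': 'Chemistry',\n",
    "     'Institution': 'University Y', 'Committee': 1234},\n",
    "    {'Name': 'Person D', 'Department': None,\n",
    "     'Institution': 'Institute for Applied Math', 'Committee': 1234},\n",
    "]\n",
    "selected = [row['Name'] for row in rows if matches_subject(row)]\n",
    "print(selected)  # ['Person A', 'Person B', 'Person D']\n",
    "```\n",
    "\n",
    "Anchoring the pattern at a word boundary keeps unrelated words that merely contain 'math' (such as 'Aftermath') out of the selection."
   ]
  }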
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.6.4"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}