{ "metadata": { "name": "", "signature": "sha256:5822ff5a28d4b248ed0b4e19e85616950ff014eb8db7a8c356c0499462573616" }, "nbformat": 3, "nbformat_minor": 0, "worksheets": [ { "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Map External Values To Dataframe Values in Pandas\n", "\n", "- **Author:** [Chris Albon](http://www.chrisalbon.com/), [@ChrisAlbon](https://twitter.com/chrisalbon)\n", "- **Date:** -\n", "- **Repo:** [Python 3 code snippets for data science](https://github.com/chrisalbon/code_py)\n", "- **Note:**" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### import modules" ] }, { "cell_type": "code", "collapsed": false, "input": [ "import pandas as pd" ], "language": "python", "metadata": {}, "outputs": [], "prompt_number": 2 }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Create dataframe" ] }, { "cell_type": "code", "collapsed": false, "input": [ "raw_data = {'first_name': ['Jason', 'Molly', 'Tina', 'Jake', 'Amy'], \n", " 'last_name': ['Miller', 'Jacobson', 'Ali', 'Milner', 'Cooze'], \n", " 'age': [42, 52, 36, 24, 73], \n", " 'city': ['San Francisco', 'Baltimore', 'Miami', 'Douglas', 'Boston']}\n", "df = pd.DataFrame(raw_data, columns = ['first_name', 'last_name', 'age', 'city'])\n", "df" ], "language": "python", "metadata": {}, "outputs": [ { "html": [ "
\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
first_namelast_nameagecity
0 Jason Miller 42 San Francisco
1 Molly Jacobson 52 Baltimore
2 Tina Ali 36 Miami
3 Jake Milner 24 Douglas
4 Amy Cooze 73 Boston
\n", "

5 rows \u00d7 4 columns

\n", "
" ], "metadata": {}, "output_type": "pyout", "prompt_number": 15, "text": [ " first_name last_name age city\n", "0 Jason Miller 42 San Francisco\n", "1 Molly Jacobson 52 Baltimore\n", "2 Tina Ali 36 Miami\n", "3 Jake Milner 24 Douglas\n", "4 Amy Cooze 73 Boston\n", "\n", "[5 rows x 4 columns]" ] } ], "prompt_number": 15 }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Create a dictionary of values" ] }, { "cell_type": "code", "collapsed": false, "input": [ "city_to_state = { 'San Francisco' : 'California', \n", " 'Baltimore' : 'Maryland', \n", " 'Miami' : 'Florida', \n", " 'Douglas' : 'Arizona', \n", " 'Boston' : 'Massachusetts'}" ], "language": "python", "metadata": {}, "outputs": [], "prompt_number": 16 }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Map the values of the city_to_state dictionary to the values in the city variable" ] }, { "cell_type": "code", "collapsed": false, "input": [ "df['state'] = df['city'].map(city_to_state)\n", "df" ], "language": "python", "metadata": {}, "outputs": [ { "html": [ "
\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
first_namelast_nameagecitystate
0 Jason Miller 42 San Francisco California
1 Molly Jacobson 52 Baltimore Maryland
2 Tina Ali 36 Miami Florida
3 Jake Milner 24 Douglas Arizona
4 Amy Cooze 73 Boston Massachusetts
\n", "

5 rows \u00d7 5 columns

\n", "
" ], "metadata": {}, "output_type": "pyout", "prompt_number": 18, "text": [ " first_name last_name age city state\n", "0 Jason Miller 42 San Francisco California\n", "1 Molly Jacobson 52 Baltimore Maryland\n", "2 Tina Ali 36 Miami Florida\n", "3 Jake Milner 24 Douglas Arizona\n", "4 Amy Cooze 73 Boston Massachusetts\n", "\n", "[5 rows x 5 columns]" ] } ], "prompt_number": 18 } ], "metadata": {} } ] }