{ "metadata": { "name": "", "signature": "sha256:81dd4626ee0888046490b916ffd1545e784fff53b4a1e227a9fc9268ddebee06" }, "nbformat": 3, "nbformat_minor": 0, "worksheets": [ { "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Categorical Data In Pandas\n", "\n", "- **Author:** [Chris Albon](http://www.chrisalbon.com/), [@ChrisAlbon](https://twitter.com/chrisalbon)\n", "- **Date:** -\n", "- **Repo:** [Python 3 code snippets for data science](https://github.com/chrisalbon/code_py)\n", "- **Note:**" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Create a dataframe" ] }, { "cell_type": "code", "collapsed": false, "input": [ "data = {'county': ['Cochice', 'Pima', 'Santa Cruz', 'Maricopa', 'Yuma'], \n", " 'year': [2012, 2012, 2013, 2014, 2014], \n", " 'reports': [4, 24, 31, 2, 3]}\n", "df = pd.DataFrame(data)\n", "df" ], "language": "python", "metadata": {}, "outputs": [ { "html": [ "
\n", " | county | \n", "reports | \n", "year | \n", "
---|---|---|---|
0 | \n", "Cochice | \n", "4 | \n", "2012 | \n", "
1 | \n", "Pima | \n", "24 | \n", "2012 | \n", "
2 | \n", "Santa Cruz | \n", "31 | \n", "2013 | \n", "
3 | \n", "Maricopa | \n", "2 | \n", "2014 | \n", "
4 | \n", "Yuma | \n", "3 | \n", "2014 | \n", "
5 rows \u00d7 3 columns
\n", "