{ "metadata": { "name": "", "signature": "sha256:5c1510b1e2064dcb9aae50c0ffd6da4529234fccf2690257c3730da3893f97df" }, "nbformat": 3, "nbformat_minor": 0, "worksheets": [ { "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Pandas: Long To Wide Format\n", "\n", "- **Author:** [Chris Albon](http://www.chrisalbon.com/), [@ChrisAlbon](https://twitter.com/chrisalbon)\n", "- **Date:** -\n", "- **Repo:** [Python 3 code snippets for data science](https://github.com/chrisalbon/code_py)\n", "- **Note:**" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### import modules" ] }, { "cell_type": "code", "collapsed": false, "input": [ "import pandas as pd" ], "language": "python", "metadata": {}, "outputs": [], "prompt_number": 2 }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Create dataframe" ] }, { "cell_type": "code", "collapsed": false, "input": [ "raw_data = {'patient': [1, 1, 1, 2, 2], \n", " 'obs': [1, 2, 3, 1, 2], \n", " 'treatment': [0, 1, 0, 1, 0],\n", " 'score': [6252, 24243, 2345, 2342, 23525]} \n", "df = pd.DataFrame(raw_data, columns = ['patient', 'obs', 'treatment', 'score'])\n", "df" ], "language": "python", "metadata": {}, "outputs": [ { "html": [ "
\n", " | patient | \n", "obs | \n", "treatment | \n", "score | \n", "
---|---|---|---|---|
0 | \n", "1 | \n", "1 | \n", "0 | \n", "6252 | \n", "
1 | \n", "1 | \n", "2 | \n", "1 | \n", "24243 | \n", "
2 | \n", "1 | \n", "3 | \n", "0 | \n", "2345 | \n", "
3 | \n", "2 | \n", "1 | \n", "1 | \n", "2342 | \n", "
4 | \n", "2 | \n", "2 | \n", "0 | \n", "23525 | \n", "
5 rows \u00d7 4 columns
\n", "obs | \n", "1 | \n", "2 | \n", "3 | \n", "
---|---|---|---|
patient | \n", "\n", " | \n", " | \n", " |
1 | \n", "6252 | \n", "24243 | \n", "2345 | \n", "
2 | \n", "2342 | \n", "23525 | \n", "NaN | \n", "
2 rows \u00d7 3 columns
\n", "