{ "metadata": { "name": "", "signature": "sha256:b72347f172084cf17e3050cd20aba0b2911b5fbb555736e3739b483e79a837a3" }, "nbformat": 3, "nbformat_minor": 0, "worksheets": [ { "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Descriptive Statistics For pandas Dataframe\n", "\n", "- **Author:** [Chris Albon](http://www.chrisalbon.com/), [@ChrisAlbon](https://twitter.com/chrisalbon)\n", "- **Date:** -\n", "- **Repo:** [Python 3 code snippets for data science](https://github.com/chrisalbon/code_py)\n", "- **Note:**" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### import modules" ] }, { "cell_type": "code", "collapsed": false, "input": [ "import pandas as pd" ], "language": "python", "metadata": {}, "outputs": [], "prompt_number": 40 }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Create dataframe" ] }, { "cell_type": "code", "collapsed": false, "input": [ "data = {'name': ['Jason', 'Molly', 'Tina', 'Jake', 'Amy'], \n", " 'age': [42, 52, 36, 24, 73], \n", " 'preTestScore': [4, 24, 31, 2, 3],\n", " 'postTestScore': [25, 94, 57, 62, 70]}\n", "df = pd.DataFrame(data, columns = ['name', 'age', 'preTestScore', 'postTestScore'])\n", "df" ], "language": "python", "metadata": {}, "outputs": [ { "html": [ "
\n", " | name | \n", "age | \n", "preTestScore | \n", "postTestScore | \n", "
---|---|---|---|---|
0 | \n", "Jason | \n", "42 | \n", "4 | \n", "25 | \n", "
1 | \n", "Molly | \n", "52 | \n", "24 | \n", "94 | \n", "
2 | \n", "Tina | \n", "36 | \n", "31 | \n", "57 | \n", "
3 | \n", "Jake | \n", "24 | \n", "2 | \n", "62 | \n", "
4 | \n", "Amy | \n", "73 | \n", "3 | \n", "70 | \n", "
5 rows \u00d7 4 columns
\n", "\n", " | age | \n", "preTestScore | \n", "postTestScore | \n", "
---|---|---|---|
age | \n", "1.000000 | \n", "-0.105651 | \n", "0.328852 | \n", "
preTestScore | \n", "-0.105651 | \n", "1.000000 | \n", "0.378039 | \n", "
postTestScore | \n", "0.328852 | \n", "0.378039 | \n", "1.000000 | \n", "
3 rows \u00d7 3 columns
\n", "\n", " | age | \n", "preTestScore | \n", "postTestScore | \n", "
---|---|---|---|
age | \n", "340.80 | \n", "-26.65 | \n", "151.20 | \n", "
preTestScore | \n", "-26.65 | \n", "186.70 | \n", "128.65 | \n", "
postTestScore | \n", "151.20 | \n", "128.65 | \n", "620.30 | \n", "
3 rows \u00d7 3 columns
\n", "