{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# USA Births Data\n", "\n", "*By Jake VanderPlas. See http://github.com/jakevdp/data-CDCbirths/*\n", "\n", "This dataset records the number of births in the USA by date and gender. It was compiled from data on the [CDC website](http://www.cdc.gov/nchs/data_access/Vitalstatsonline.htm) and is aggregated to comply with the CDC's terms of use.\n", "\n", "*(This was inspired by Andrew Gelman's recent [NYC R Meetup](https://www.youtube.com/watch?v=ObS1hkOxyPA&feature=youtu.be) talk. Part of his write-up on the topic is available [on his blog](http://andrewgelman.com/2012/06/12/simple-graph-win-the-example-of-birthday-frequencies/).)*" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "collapsed": true }, "outputs": [], "source": [ "%matplotlib inline\n", "import matplotlib as mpl\n", "import matplotlib.pyplot as plt\n", "import seaborn; seaborn.set()\n", "\n", "import pandas as pd\n", "import numpy as np" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": false }, "outputs": [ { "data": { "text/html": [ "
| | year | month | day | gender | births |
|---|---|---|---|---|---|
| 0 | 1969 | 1 | 1 | F | 4046 |
| 1 | 1969 | 1 | 1 | M | 4440 |
| 2 | 1969 | 1 | 2 | F | 4454 |
| 3 | 1969 | 1 | 2 | M | 4548 |
| 4 | 1969 | 1 | 3 | F | 4548 |