{ "cells": [ { "cell_type": "markdown", "metadata": { "deletable": true, "editable": true }, "source": [ "# Analysis and visualization of a public OKCupid profile dataset using python and pandas\n", "\n", "Author: Alessandro Giusti ([web](http://www.idsia.ch/~giusti), [email](mailto://alessandrog@idsia.ch)), Dalle Molle Institute for Artificial Intelligence ([IDSIA](http://www.idsia.ch/)), [USI](http://www.usi.ch/)-[SUPSI](http://www.supsi.ch/).\n", "\n", "## Discussion\n", "\n", "After publication, this notebook received quite a lot of insightful comments on /r/python [(link to post)](https://www.reddit.com/r/Python/comments/5n0570/visualizations_of_word_usage_in_a_public_okcupid/) and /r/okcupid [(link to post)](https://www.reddit.com/r/OkCupid/comments/5n7nj4/analysis_of_a_public_okcupid_profile_dataset/).\n", "\n", "## Introduction\n", "\n", "This document is an analysis of a public dataset of almost 60000 online dating profiles.\n", "The dataset has been [published](http://ww2.amstat.org/publications/jse/v23n2/kim.pdf) in the [Journal of Statistics Education](http://ww2.amstat.org/publications/jse/), Volume 23, Number 2 (2015) by Albert Y. Kim et al., and its collection and distribution was explicitly allowed by OkCupid president and co-founder [Christian Rudder](http://blog.okcupid.com/). Using these data is therefore ethically and legally acceptable; this is in contrast to another recent release of a [different OkCupid profile dataset](http://www.vox.com/2016/5/12/11666116/70000-okcupid-users-data-release), which was collected without permission and without anonymizing the data (more on the ethical issues in [this Wired article](https://www.wired.com/2016/05/okcupid-study-reveals-perils-big-data-science/)).\n", "\n", "### Notebook setup" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "collapsed": false, "deletable": true, "editable": true }, "outputs": [], "source": [ "%matplotlib inline\n", "%config InlineBackend.figure_format='svg'\n", "from IPython.display import display,HTML\n", "import pandas as pd\n", "import seaborn as sns\n", "import numpy as np\n", "import matplotlib.pyplot as plt\n", "from prettypandas import PrettyPandas\n", "sns.set_style(\"ticks\")\n", "sns.set_context(context=\"notebook\",font_scale=1)" ] }, { "cell_type": "markdown", "metadata": { "deletable": true, "editable": true }, "source": [ "## Dataset details\n", "\n", "The data is available at [this link](https://github.com/rudeboybert/JSE_OkCupid/blob/master/profiles.csv.zip). The [codebook](https://github.com/rudeboybert/JSE_OkCupid/blob/master/okcupid_codebook.txt) includes many details about the available fields. The dataset was collected by web scraping the [OKCupid.com](http://www.okcupid.com) website on 2012/06/30, and includes almost 60k profiles of people within a 25 mile radius of San Francisco, who were online in the previous year (after 06/30/2011), with at least one profile picture.\n", "\n", "The CSV contains a row (observation) for each profile. Let's have a look at the first 10 profiles, excluding the columns whose name contains the string \"essay\", which contain a lot of text and are not practical at the moment." ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": false, "deletable": true, "editable": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "The dataset contains 59946 records\n", "35829 males (59.8%), 24117 females (40.2%)\n" ] }, { "data": { "text/html": [ "\n", " \n", "\n", "
\n", " \n", " | age\n", " \n", " | body_type\n", " \n", " | diet\n", " \n", " | drinks\n", " \n", " | drugs\n", " \n", " | education\n", " \n", " | ethnicity\n", " \n", " | height\n", " \n", " | income\n", " \n", " | job\n", " \n", " | last_online\n", " \n", " | location\n", " \n", " | offspring\n", " \n", " | orientation\n", " \n", " | pets\n", " \n", " | religion\n", " \n", " | sex\n", " \n", " | sign\n", " \n", " | smokes\n", " \n", " | speaks\n", " \n", " | status\n", " \n", " |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
\n", " 0\n", " \n", " | \n", " 22\n", " \n", " | \n", " a little extra\n", " \n", " | \n", " strictly anything\n", " \n", " | \n", " socially\n", " \n", " | \n", " never\n", " \n", " | \n", " working on college/university\n", " \n", " | \n", " asian, white\n", " \n", " | \n", " 75\n", " \n", " | \n", " -1\n", " \n", " | \n", " transportation\n", " \n", " | \n", " 2012-06-28-20-30\n", " \n", " | \n", " south san francisco, california\n", " \n", " | \n", " doesn’t have kids, but might want them\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes dogs and likes cats\n", " \n", " | \n", " agnosticism and very serious about it\n", " \n", " | \n", " m\n", " \n", " | \n", " gemini\n", " \n", " | \n", " sometimes\n", " \n", " | \n", " english\n", " \n", " | \n", " single\n", " \n", " |
\n", " 1\n", " \n", " | \n", " 35\n", " \n", " | \n", " average\n", " \n", " | \n", " mostly other\n", " \n", " | \n", " often\n", " \n", " | \n", " sometimes\n", " \n", " | \n", " working on space camp\n", " \n", " | \n", " white\n", " \n", " | \n", " 70\n", " \n", " | \n", " 80000\n", " \n", " | \n", " hospitality / travel\n", " \n", " | \n", " 2012-06-29-21-41\n", " \n", " | \n", " oakland, california\n", " \n", " | \n", " doesn’t have kids, but might want them\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes dogs and likes cats\n", " \n", " | \n", " agnosticism but not too serious about it\n", " \n", " | \n", " m\n", " \n", " | \n", " cancer\n", " \n", " | \n", " no\n", " \n", " | \n", " english (fluently), spanish (poorly), french (poorly)\n", " \n", " | \n", " single\n", " \n", " |
\n", " 2\n", " \n", " | \n", " 38\n", " \n", " | \n", " thin\n", " \n", " | \n", " anything\n", " \n", " | \n", " socially\n", " \n", " | \n", " nan\n", " \n", " | \n", " graduated from masters program\n", " \n", " | \n", " nan\n", " \n", " | \n", " 68\n", " \n", " | \n", " -1\n", " \n", " | \n", " nan\n", " \n", " | \n", " 2012-06-27-09-10\n", " \n", " | \n", " san francisco, california\n", " \n", " | \n", " nan\n", " \n", " | \n", " straight\n", " \n", " | \n", " has cats\n", " \n", " | \n", " nan\n", " \n", " | \n", " m\n", " \n", " | \n", " pisces but it doesn’t matter\n", " \n", " | \n", " no\n", " \n", " | \n", " english, french, c++\n", " \n", " | \n", " available\n", " \n", " |
\n", " 3\n", " \n", " | \n", " 23\n", " \n", " | \n", " thin\n", " \n", " | \n", " vegetarian\n", " \n", " | \n", " socially\n", " \n", " | \n", " nan\n", " \n", " | \n", " working on college/university\n", " \n", " | \n", " white\n", " \n", " | \n", " 71\n", " \n", " | \n", " 20000\n", " \n", " | \n", " student\n", " \n", " | \n", " 2012-06-28-14-22\n", " \n", " | \n", " berkeley, california\n", " \n", " | \n", " doesn’t want kids\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes cats\n", " \n", " | \n", " nan\n", " \n", " | \n", " m\n", " \n", " | \n", " pisces\n", " \n", " | \n", " no\n", " \n", " | \n", " english, german (poorly)\n", " \n", " | \n", " single\n", " \n", " |
\n", " 4\n", " \n", " | \n", " 29\n", " \n", " | \n", " athletic\n", " \n", " | \n", " nan\n", " \n", " | \n", " socially\n", " \n", " | \n", " never\n", " \n", " | \n", " graduated from college/university\n", " \n", " | \n", " asian, black, other\n", " \n", " | \n", " 66\n", " \n", " | \n", " -1\n", " \n", " | \n", " artistic / musical / writer\n", " \n", " | \n", " 2012-06-27-21-26\n", " \n", " | \n", " san francisco, california\n", " \n", " | \n", " nan\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes dogs and likes cats\n", " \n", " | \n", " nan\n", " \n", " | \n", " m\n", " \n", " | \n", " aquarius\n", " \n", " | \n", " no\n", " \n", " | \n", " english\n", " \n", " | \n", " single\n", " \n", " |
\n", " 5\n", " \n", " | \n", " 29\n", " \n", " | \n", " average\n", " \n", " | \n", " mostly anything\n", " \n", " | \n", " socially\n", " \n", " | \n", " nan\n", " \n", " | \n", " graduated from college/university\n", " \n", " | \n", " white\n", " \n", " | \n", " 67\n", " \n", " | \n", " -1\n", " \n", " | \n", " computer / hardware / software\n", " \n", " | \n", " 2012-06-29-19-18\n", " \n", " | \n", " san francisco, california\n", " \n", " | \n", " doesn’t have kids, but might want them\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes cats\n", " \n", " | \n", " atheism\n", " \n", " | \n", " m\n", " \n", " | \n", " taurus\n", " \n", " | \n", " no\n", " \n", " | \n", " english (fluently), chinese (okay)\n", " \n", " | \n", " single\n", " \n", " |
\n", " 6\n", " \n", " | \n", " 32\n", " \n", " | \n", " fit\n", " \n", " | \n", " strictly anything\n", " \n", " | \n", " socially\n", " \n", " | \n", " never\n", " \n", " | \n", " graduated from college/university\n", " \n", " | \n", " white, other\n", " \n", " | \n", " 65\n", " \n", " | \n", " -1\n", " \n", " | \n", " nan\n", " \n", " | \n", " 2012-06-25-20-45\n", " \n", " | \n", " san francisco, california\n", " \n", " | \n", " nan\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes dogs and likes cats\n", " \n", " | \n", " nan\n", " \n", " | \n", " f\n", " \n", " | \n", " virgo\n", " \n", " | \n", " nan\n", " \n", " | \n", " english\n", " \n", " | \n", " single\n", " \n", " |
\n", " 7\n", " \n", " | \n", " 31\n", " \n", " | \n", " average\n", " \n", " | \n", " mostly anything\n", " \n", " | \n", " socially\n", " \n", " | \n", " never\n", " \n", " | \n", " graduated from college/university\n", " \n", " | \n", " white\n", " \n", " | \n", " 65\n", " \n", " | \n", " -1\n", " \n", " | \n", " artistic / musical / writer\n", " \n", " | \n", " 2012-06-29-12-30\n", " \n", " | \n", " san francisco, california\n", " \n", " | \n", " doesn’t have kids, but wants them\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes dogs and likes cats\n", " \n", " | \n", " christianity\n", " \n", " | \n", " f\n", " \n", " | \n", " sagittarius\n", " \n", " | \n", " no\n", " \n", " | \n", " english, spanish (okay)\n", " \n", " | \n", " single\n", " \n", " |
\n", " 8\n", " \n", " | \n", " 24\n", " \n", " | \n", " nan\n", " \n", " | \n", " strictly anything\n", " \n", " | \n", " socially\n", " \n", " | \n", " nan\n", " \n", " | \n", " graduated from college/university\n", " \n", " | \n", " white\n", " \n", " | \n", " 67\n", " \n", " | \n", " -1\n", " \n", " | \n", " nan\n", " \n", " | \n", " 2012-06-29-23-39\n", " \n", " | \n", " belvedere tiburon, california\n", " \n", " | \n", " doesn’t have kids\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes dogs and likes cats\n", " \n", " | \n", " christianity but not too serious about it\n", " \n", " | \n", " f\n", " \n", " | \n", " gemini but it doesn’t matter\n", " \n", " | \n", " when drinking\n", " \n", " | \n", " english\n", " \n", " | \n", " single\n", " \n", " |
\n", " 9\n", " \n", " | \n", " 37\n", " \n", " | \n", " athletic\n", " \n", " | \n", " mostly anything\n", " \n", " | \n", " not at all\n", " \n", " | \n", " never\n", " \n", " | \n", " working on two-year college\n", " \n", " | \n", " white\n", " \n", " | \n", " 65\n", " \n", " | \n", " -1\n", " \n", " | \n", " student\n", " \n", " | \n", " 2012-06-28-21-08\n", " \n", " | \n", " san mateo, california\n", " \n", " | \n", " nan\n", " \n", " | \n", " straight\n", " \n", " | \n", " likes dogs and likes cats\n", " \n", " | \n", " atheism and laughing about it\n", " \n", " | \n", " m\n", " \n", " | \n", " cancer but it doesn’t matter\n", " \n", " | \n", " no\n", " \n", " | \n", " english (fluently)\n", " \n", " | \n", " single\n", " \n", " |
\n", " \n", " | age\n", " \n", " | body_type\n", " \n", " | diet\n", " \n", " | drinks\n", " \n", " | drugs\n", " \n", " | education\n", " \n", " | essay0\n", " \n", " | essay1\n", " \n", " | essay2\n", " \n", " | essay3\n", " \n", " | essay4\n", " \n", " | essay5\n", " \n", " | essay6\n", " \n", " | essay7\n", " \n", " | essay8\n", " \n", " | essay9\n", " \n", " | ethnicity\n", " \n", " | height\n", " \n", " | income\n", " \n", " | job\n", " \n", " | last_online\n", " \n", " | location\n", " \n", " | offspring\n", " \n", " | orientation\n", " \n", " | pets\n", " \n", " | religion\n", " \n", " | sex\n", " \n", " | sign\n", " \n", " | smokes\n", " \n", " | speaks\n", " \n", " | status\n", " \n", " |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
\n", " 2512\n", " \n", " | \n", " 110\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " 67\n", " \n", " | \n", " -1\n", " \n", " | \n", " nan\n", " \n", " | \n", " 2012-06-27-22-16\n", " \n", " | \n", " daly city, california\n", " \n", " | \n", " nan\n", " \n", " | \n", " straight\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " f\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " english\n", " \n", " | \n", " single\n", " \n", " |
\n", " 25324\n", " \n", " | \n", " 109\n", " \n", " | \n", " athletic\n", " \n", " | \n", " mostly other\n", " \n", " | \n", " nan\n", " \n", " | \n", " never\n", " \n", " | \n", " working on masters program\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nothing\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " nan\n", " \n", " | \n", " 95\n", " \n", " | \n", " -1\n", " \n", " | \n", " student\n", " \n", " | \n", " 2012-06-30-18-18\n", " \n", " | \n", " san francisco, california\n", " \n", " | \n", " might want kids\n", " \n", " | \n", " straight\n", " \n", " | \n", " nan\n", " \n", " | \n", " other and somewhat serious about it\n", " \n", " | \n", " m\n", " \n", " | \n", " aquarius but it doesn’t matter\n", " \n", " | \n", " when drinking\n", " \n", " | \n", " english (okay)\n", " \n", " | \n", " available\n", " \n", " |
\n", " \n", " | Sex\n", " \n", " | Agemos\n", " \n", " | L\n", " \n", " | M\n", " \n", " | S\n", " \n", " | P3\n", " \n", " | P5\n", " \n", " | P10\n", " \n", " | P25\n", " \n", " | P50\n", " \n", " | P75\n", " \n", " | P90\n", " \n", " | P95\n", " \n", " | P97\n", " \n", " |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
\n", " 0\n", " \n", " | \n", " 1\n", " \n", " | \n", " 24\n", " \n", " | \n", " 0.941524\n", " \n", " | \n", " 86.4522\n", " \n", " | \n", " 0.0403215\n", " \n", " | \n", " 79.9108\n", " \n", " | \n", " 80.7298\n", " \n", " | \n", " 81.9917\n", " \n", " | \n", " 84.1029\n", " \n", " | \n", " 86.4522\n", " \n", " | \n", " 88.8052\n", " \n", " | \n", " 90.9262\n", " \n", " | \n", " 92.1969\n", " \n", " | \n", " 93.0227\n", " \n", " |
\n", " 1\n", " \n", " | \n", " 1\n", " \n", " | \n", " 24.5\n", " \n", " | \n", " 1.00721\n", " \n", " | \n", " 86.8616\n", " \n", " | \n", " 0.0403956\n", " \n", " | \n", " 80.2604\n", " \n", " | \n", " 81.0887\n", " \n", " | \n", " 82.364\n", " \n", " | \n", " 84.4947\n", " \n", " | \n", " 86.8616\n", " \n", " | \n", " 89.228\n", " \n", " | \n", " 91.3575\n", " \n", " | \n", " 92.6318\n", " \n", " | \n", " 93.4592\n", " \n", " |
\n", " 2\n", " \n", " | \n", " 1\n", " \n", " | \n", " 25.5\n", " \n", " | \n", " 0.837251\n", " \n", " | \n", " 87.6525\n", " \n", " | \n", " 0.0405775\n", " \n", " | \n", " 81.0053\n", " \n", " | \n", " 81.8345\n", " \n", " | \n", " 83.1139\n", " \n", " | \n", " 85.2589\n", " \n", " | \n", " 87.6525\n", " \n", " | \n", " 90.0568\n", " \n", " | \n", " 92.2297\n", " \n", " | \n", " 93.5341\n", " \n", " | \n", " 94.3828\n", " \n", " |
\n", " 3\n", " \n", " | \n", " 1\n", " \n", " | \n", " 26.5\n", " \n", " | \n", " 0.681493\n", " \n", " | \n", " 88.4233\n", " \n", " | \n", " 0.0407231\n", " \n", " | \n", " 81.7342\n", " \n", " | \n", " 82.5641\n", " \n", " | \n", " 83.8472\n", " \n", " | \n", " 86.0052\n", " \n", " | \n", " 88.4233\n", " \n", " | \n", " 90.8626\n", " \n", " | \n", " 93.0761\n", " \n", " | \n", " 94.4088\n", " \n", " | \n", " 95.2776\n", " \n", " |
\n", " 4\n", " \n", " | \n", " 1\n", " \n", " | \n", " 27.5\n", " \n", " | \n", " 0.53878\n", " \n", " | \n", " 89.1755\n", " \n", " | \n", " 0.0408332\n", " \n", " | \n", " 82.4485\n", " \n", " | \n", " 83.279\n", " \n", " | \n", " 84.5653\n", " \n", " | \n", " 86.7351\n", " \n", " | \n", " 89.1755\n", " \n", " | \n", " 91.6471\n", " \n", " | \n", " 93.8983\n", " \n", " | \n", " 95.2575\n", " \n", " | \n", " 96.1451\n", " \n", " |
\n", " 5\n", " \n", " | \n", " 1\n", " \n", " | \n", " 28.5\n", " \n", " | \n", " 0.407697\n", " \n", " | \n", " 89.9104\n", " \n", " | \n", " 0.0409091\n", " \n", " | \n", " 83.1494\n", " \n", " | \n", " 83.9805\n", " \n", " | \n", " 85.2696\n", " \n", " | \n", " 87.4498\n", " \n", " | \n", " 89.9104\n", " \n", " | \n", " 92.4116\n", " \n", " | \n", " 94.6976\n", " \n", " | \n", " 96.0815\n", " \n", " | \n", " 96.9866\n", " \n", " |
\n", " 6\n", " \n", " | \n", " 1\n", " \n", " | \n", " 29.5\n", " \n", " | \n", " 0.286762\n", " \n", " | \n", " 90.6291\n", " \n", " | \n", " 0.0409524\n", " \n", " | \n", " 83.8382\n", " \n", " | \n", " 84.6695\n", " \n", " | \n", " 85.961\n", " \n", " | \n", " 88.1503\n", " \n", " | \n", " 90.6291\n", " \n", " | \n", " 93.1572\n", " \n", " | \n", " 95.4752\n", " \n", " | \n", " 96.882\n", " \n", " | \n", " 97.8035\n", " \n", " |
\n", " 7\n", " \n", " | \n", " 1\n", " \n", " | \n", " 30.5\n", " \n", " | \n", " 0.174489\n", " \n", " | \n", " 91.3324\n", " \n", " | \n", " 0.0409653\n", " \n", " | \n", " 84.5156\n", " \n", " | \n", " 85.3469\n", " \n", " | \n", " 86.6403\n", " \n", " | \n", " 88.8375\n", " \n", " | \n", " 91.3324\n", " \n", " | \n", " 93.885\n", " \n", " | \n", " 96.2324\n", " \n", " | \n", " 97.6603\n", " \n", " | \n", " 98.5969\n", " \n", " |
\n", " 8\n", " \n", " | \n", " 1\n", " \n", " | \n", " 31.5\n", " \n", " | \n", " 0.0694445\n", " \n", " | \n", " 92.0213\n", " \n", " | \n", " 0.04095\n", " \n", " | \n", " 85.1824\n", " \n", " | \n", " 86.0136\n", " \n", " | \n", " 87.3082\n", " \n", " | \n", " 89.512\n", " \n", " | \n", " 92.0213\n", " \n", " | \n", " 94.5959\n", " \n", " | \n", " 96.9702\n", " \n", " | \n", " 98.4176\n", " \n", " | \n", " 99.3683\n", " \n", " |
\n", " 9\n", " \n", " | \n", " 1\n", " \n", " | \n", " 32.5\n", " \n", " | \n", " -0.0297206\n", " \n", " | \n", " 92.6964\n", " \n", " | \n", " 0.0409087\n", " \n", " | \n", " 85.8393\n", " \n", " | \n", " 86.67\n", " \n", " | \n", " 87.9654\n", " \n", " | \n", " 90.1746\n", " \n", " | \n", " 92.6964\n", " \n", " | \n", " 95.2908\n", " \n", " | \n", " 97.6898\n", " \n", " | \n", " 99.1551\n", " \n", " | \n", " 100.119\n", " \n", " |
\n", " \n", " | P3\n", " \n", " | P5\n", " \n", " | P10\n", " \n", " | P25\n", " \n", " | P50\n", " \n", " | P75\n", " \n", " | P90\n", " \n", " | P95\n", " \n", " | P97\n", " \n", " |
---|---|---|---|---|---|---|---|---|---|
Sex\n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " |
\n", " m\n", " \n", " | \n", " 64.3\n", " \n", " | \n", " 64.98\n", " \n", " | \n", " 66.01\n", " \n", " | \n", " 67.73\n", " \n", " | \n", " 69.63\n", " \n", " | \n", " 71.52\n", " \n", " | \n", " 73.21\n", " \n", " | \n", " 74.22\n", " \n", " | \n", " 74.88\n", " \n", " |
\n", " f\n", " \n", " | \n", " 59.49\n", " \n", " | \n", " 60.1\n", " \n", " | \n", " 61.03\n", " \n", " | \n", " 62.58\n", " \n", " | \n", " 64.31\n", " \n", " | \n", " 66.02\n", " \n", " | \n", " 67.56\n", " \n", " | \n", " 68.48\n", " \n", " | \n", " 69.08\n", " \n", " |
\n", " \n", " | CDC\n", " \n", " | users\n", " \n", " | gap\n", " \n", " |
---|---|---|---|
percentile\n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " |
\n", " 3\n", " \n", " | \n", " 64.3\n", " \n", " | \n", " 64.89\n", " \n", " | \n", " 0.589\n", " \n", " |
\n", " 5\n", " \n", " | \n", " 64.98\n", " \n", " | \n", " 65.56\n", " \n", " | \n", " 0.5848\n", " \n", " |
\n", " 10\n", " \n", " | \n", " 66.01\n", " \n", " | \n", " 66.16\n", " \n", " | \n", " 0.154\n", " \n", " |
\n", " 25\n", " \n", " | \n", " 67.73\n", " \n", " | \n", " 67.88\n", " \n", " | \n", " 0.1547\n", " \n", " |
\n", " 50\n", " \n", " | \n", " 69.63\n", " \n", " | \n", " 70.19\n", " \n", " | \n", " 0.567\n", " \n", " |
\n", " 75\n", " \n", " | \n", " 71.52\n", " \n", " | \n", " 72.26\n", " \n", " | \n", " 0.7426\n", " \n", " |
\n", " 90\n", " \n", " | \n", " 73.21\n", " \n", " | \n", " 74.15\n", " \n", " | \n", " 0.9413\n", " \n", " |
\n", " 95\n", " \n", " | \n", " 74.22\n", " \n", " | \n", " 75.47\n", " \n", " | \n", " 1.241\n", " \n", " |
\n", " 97\n", " \n", " | \n", " 74.88\n", " \n", " | \n", " 76.09\n", " \n", " | \n", " 1.209\n", " \n", " |
\n", " \n", " | CDC\n", " \n", " | users\n", " \n", " | gap\n", " \n", " |
---|---|---|---|
percentile\n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " |
\n", " 3\n", " \n", " | \n", " 59.49\n", " \n", " | \n", " 59.69\n", " \n", " | \n", " 0.196\n", " \n", " |
\n", " 5\n", " \n", " | \n", " 60.1\n", " \n", " | \n", " 59.93\n", " \n", " | \n", " -0.1718\n", " \n", " |
\n", " 10\n", " \n", " | \n", " 61.03\n", " \n", " | \n", " 60.98\n", " \n", " | \n", " -0.04799\n", " \n", " |
\n", " 25\n", " \n", " | \n", " 62.58\n", " \n", " | \n", " 62.89\n", " \n", " | \n", " 0.3094\n", " \n", " |
\n", " 50\n", " \n", " | \n", " 64.31\n", " \n", " | \n", " 64.85\n", " \n", " | \n", " 0.5411\n", " \n", " |
\n", " 75\n", " \n", " | \n", " 66.02\n", " \n", " | \n", " 66.64\n", " \n", " | \n", " 0.6133\n", " \n", " |
\n", " 90\n", " \n", " | \n", " 67.56\n", " \n", " | \n", " 68.46\n", " \n", " | \n", " 0.896\n", " \n", " |
\n", " 95\n", " \n", " | \n", " 68.48\n", " \n", " | \n", " 69.46\n", " \n", " | \n", " 0.9735\n", " \n", " |
\n", " 97\n", " \n", " | \n", " 69.08\n", " \n", " | \n", " 70.23\n", " \n", " | \n", " 1.153\n", " \n", " |
\n", " \n", " | P3\n", " \n", " | P5\n", " \n", " | P10\n", " \n", " | P25\n", " \n", " | P50\n", " \n", " | P75\n", " \n", " | P90\n", " \n", " | P95\n", " \n", " | P97\n", " \n", " |
---|---|---|---|---|---|---|---|---|---|
Age\n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " | \n", " \n", " |
\n", " 2.0\n", " \n", " | \n", " 32.99\n", " \n", " | \n", " 33.31\n", " \n", " | \n", " 33.82\n", " \n", " | \n", " 34.68\n", " \n", " | \n", " 35.65\n", " \n", " | \n", " 36.64\n", " \n", " | \n", " 37.55\n", " \n", " | \n", " 38.1\n", " \n", " | \n", " 38.46\n", " \n", " |
\n", " 3.0\n", " \n", " | \n", " 35.94\n", " \n", " | \n", " 36.3\n", " \n", " | \n", " 36.85\n", " \n", " | \n", " 37.78\n", " \n", " | \n", " 38.84\n", " \n", " | \n", " 39.93\n", " \n", " | \n", " 40.92\n", " \n", " | \n", " 41.52\n", " \n", " | \n", " 41.92\n", " \n", " |
\n", " 4.0\n", " \n", " | \n", " 38.28\n", " \n", " | \n", " 38.7\n", " \n", " | \n", " 39.33\n", " \n", " | \n", " 40.39\n", " \n", " | \n", " 41.57\n", " \n", " | \n", " 42.74\n", " \n", " | \n", " 43.8\n", " \n", " | \n", " 44.43\n", " \n", " | \n", " 44.84\n", " \n", " |
\n", " 5.0\n", " \n", " | \n", " 40.54\n", " \n", " | \n", " 41\n", " \n", " | \n", " 41.7\n", " \n", " | \n", " 42.87\n", " \n", " | \n", " 44.16\n", " \n", " | \n", " 45.44\n", " \n", " | \n", " 46.58\n", " \n", " | \n", " 47.26\n", " \n", " | \n", " 47.7\n", " \n", " |
\n", " 6.0\n", " \n", " | \n", " 42.83\n", " \n", " | \n", " 43.31\n", " \n", " | \n", " 44.06\n", " \n", " | \n", " 45.3\n", " \n", " | \n", " 46.69\n", " \n", " | \n", " 48.08\n", " \n", " | \n", " 49.33\n", " \n", " | \n", " 50.08\n", " \n", " | \n", " 50.56\n", " \n", " |
\n", " 7.0\n", " \n", " | \n", " 45.09\n", " \n", " | \n", " 45.59\n", " \n", " | \n", " 46.37\n", " \n", " | \n", " 47.68\n", " \n", " | \n", " 49.16\n", " \n", " | \n", " 50.65\n", " \n", " | \n", " 52.01\n", " \n", " | \n", " 52.82\n", " \n", " | \n", " 53.36\n", " \n", " |
\n", " 8.0\n", " \n", " | \n", " 47.17\n", " \n", " | \n", " 47.7\n", " \n", " | \n", " 48.52\n", " \n", " | \n", " 49.9\n", " \n", " | \n", " 51.47\n", " \n", " | \n", " 53.07\n", " \n", " | \n", " 54.53\n", " \n", " | \n", " 55.42\n", " \n", " | \n", " 56\n", " \n", " |
\n", " 9.0\n", " \n", " | \n", " 48.98\n", " \n", " | \n", " 49.54\n", " \n", " | \n", " 50.42\n", " \n", " | \n", " 51.91\n", " \n", " | \n", " 53.58\n", " \n", " | \n", " 55.29\n", " \n", " | \n", " 56.85\n", " \n", " | \n", " 57.8\n", " \n", " | \n", " 58.42\n", " \n", " |
\n", " 10.0\n", " \n", " | \n", " 50.61\n", " \n", " | \n", " 51.21\n", " \n", " | \n", " 52.15\n", " \n", " | \n", " 53.74\n", " \n", " | \n", " 55.54\n", " \n", " | \n", " 57.36\n", " \n", " | \n", " 59.02\n", " \n", " | \n", " 60.03\n", " \n", " | \n", " 60.69\n", " \n", " |
\n", " 11.0\n", " \n", " | \n", " 52.34\n", " \n", " | \n", " 52.98\n", " \n", " | \n", " 53.97\n", " \n", " | \n", " 55.65\n", " \n", " | \n", " 57.55\n", " \n", " | \n", " 59.49\n", " \n", " | \n", " 61.26\n", " \n", " | \n", " 62.34\n", " \n", " | \n", " 63.04\n", " \n", " |
\n", " 12.0\n", " \n", " | \n", " 54.47\n", " \n", " | \n", " 55.15\n", " \n", " | \n", " 56.21\n", " \n", " | \n", " 58\n", " \n", " | \n", " 60.01\n", " \n", " | \n", " 62.06\n", " \n", " | \n", " 63.93\n", " \n", " | \n", " 65.06\n", " \n", " | \n", " 65.8\n", " \n", " |
\n", " 13.0\n", " \n", " | \n", " 57.02\n", " \n", " | \n", " 57.78\n", " \n", " | \n", " 58.94\n", " \n", " | \n", " 60.87\n", " \n", " | \n", " 62.99\n", " \n", " | \n", " 65.11\n", " \n", " | \n", " 67\n", " \n", " | \n", " 68.12\n", " \n", " | \n", " 68.85\n", " \n", " |
\n", " 14.0\n", " \n", " | \n", " 59.62\n", " \n", " | \n", " 60.43\n", " \n", " | \n", " 61.65\n", " \n", " | \n", " 63.64\n", " \n", " | \n", " 65.79\n", " \n", " | \n", " 67.87\n", " \n", " | \n", " 69.69\n", " \n", " | \n", " 70.76\n", " \n", " | \n", " 71.44\n", " \n", " |
\n", " 15.0\n", " \n", " | \n", " 61.7\n", " \n", " | \n", " 62.49\n", " \n", " | \n", " 63.68\n", " \n", " | \n", " 65.61\n", " \n", " | \n", " 67.68\n", " \n", " | \n", " 69.68\n", " \n", " | \n", " 71.42\n", " \n", " | \n", " 72.44\n", " \n", " | \n", " 73.09\n", " \n", " |
\n", " 16.0\n", " \n", " | \n", " 63.03\n", " \n", " | \n", " 63.76\n", " \n", " | \n", " 64.89\n", " \n", " | \n", " 66.72\n", " \n", " | \n", " 68.7\n", " \n", " | \n", " 70.63\n", " \n", " | \n", " 72.33\n", " \n", " | \n", " 73.33\n", " \n", " | \n", " 73.97\n", " \n", " |
\n", " 17.0\n", " \n", " | \n", " 63.74\n", " \n", " | \n", " 64.44\n", " \n", " | \n", " 65.51\n", " \n", " | \n", " 67.27\n", " \n", " | \n", " 69.2\n", " \n", " | \n", " 71.1\n", " \n", " | \n", " 72.79\n", " \n", " | \n", " 73.79\n", " \n", " | \n", " 74.43\n", " \n", " |
\n", " 18.0\n", " \n", " | \n", " 64.1\n", " \n", " | \n", " 64.77\n", " \n", " | \n", " 65.82\n", " \n", " | \n", " 67.55\n", " \n", " | \n", " 69.45\n", " \n", " | \n", " 71.34\n", " \n", " | \n", " 73.03\n", " \n", " | \n", " 74.03\n", " \n", " | \n", " 74.68\n", " \n", " |
\n", " 19.0\n", " \n", " | \n", " 64.26\n", " \n", " | \n", " 64.93\n", " \n", " | \n", " 65.96\n", " \n", " | \n", " 67.68\n", " \n", " | \n", " 69.58\n", " \n", " | \n", " 71.47\n", " \n", " | \n", " 73.16\n", " \n", " | \n", " 74.17\n", " \n", " | \n", " 74.83\n", " \n", " |
\n", " 20.0\n", " \n", " | \n", " 64.3\n", " \n", " | \n", " 64.98\n", " \n", " | \n", " 66.01\n", " \n", " | \n", " 67.73\n", " \n", " | \n", " 69.63\n", " \n", " | \n", " 71.52\n", " \n", " | \n", " 73.21\n", " \n", " | \n", " 74.22\n", " \n", " | \n", " 74.88\n", " \n", " |