{ "cells": [ { "cell_type": "code", "execution_count": 1, "id": "9eb0122b-88cf-4dff-82e6-a349bb2cdf8e", "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Updated 2025-11-30 / Aki Taanila\n" ] } ], "source": [ "from datetime import datetime\n", "print(f'Updated {datetime.now().date()} / Aki Taanila')" ] }, { "cell_type": "markdown", "id": "6f586c07-ae10-43bb-8238-64158716fe72", "metadata": {}, "source": [ "## Countplot - count and percentage summaries\n", "\n", "Seaborn's **countplot** is suited to visualizing count and percentage summaries of categorical variables. For more information, see https://seaborn.pydata.org/generated/seaborn.countplot.html.\n", "\n", "### Imports and setup\n", "\n", "- For the plot style, you can use **'white'**, **'dark'**, **'darkgrid'** or **'ticks'** instead of **'whitegrid'**. You can also omit the style setting altogether.\n", "- **PercentFormatter** helps format an axis as a percentage scale.\n", "- **MultipleLocator** helps change the tick spacing of an axis." ] }, { "cell_type": "code", "execution_count": 2, "id": "6778a3f2-6168-4664-a5b0-83a05963004a", "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "import matplotlib.pyplot as plt\n", "import seaborn as sns\n", "sns.set_style('whitegrid')\n", "from matplotlib.ticker import PercentFormatter, MultipleLocator\n", "ticks = PercentFormatter(xmax=100, decimals=0, symbol=' %')" ] }, { "cell_type": "markdown", "id": "e191cf9a-9b40-46cd-8e48-e3c1d84b9cab", "metadata": {}, "source": [ "### Opening the data\n", "\n", "The seaborn example dataset **tips** contains information about parties that dined at a restaurant." ] }, { "cell_type": "code", "execution_count": 3, "id": "320e8059-1cff-41d6-a3a6-80bd4f99208b", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
|  | total_bill | tip | sex | smoker | day | time | size |
|---|---|---|---|---|---|---|---|
| 0 | 16.99 | 1.01 | Female | No | Sun | Dinner | 2 |
| 1 | 10.34 | 1.66 | Male | No | Sun | Dinner | 3 |
| 2 | 21.01 | 3.50 | Male | No | Sun | Dinner | 3 |
| 3 | 23.68 | 3.31 | Male | No | Sun | Dinner | 2 |
| 4 | 24.59 | 3.61 | Female | No | Sun | Dinner | 4 |
| ... | ... | ... | ... | ... | ... | ... | ... |
| 239 | 29.03 | 5.92 | Male | No | Sat | Dinner | 3 |
| 240 | 27.18 | 2.00 | Female | Yes | Sat | Dinner | 2 |
| 241 | 22.67 | 2.00 | Male | Yes | Sat | Dinner | 2 |
| 242 | 17.82 | 1.75 | Male | No | Sat | Dinner | 2 |
| 243 | 18.78 | 3.00 | Female | No | Thur | Dinner | 2 |

244 rows × 7 columns
\n", "