{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Data-analyysi levyraadin tuloksista\n", "\n", "Esittelen tässä Jupyter-muistiossa Python-ohjelmointikielen data-analyysityökaluja. Muistion esimerkit pohjautuvat Facebook-ryhmässä järjestettyyn musiikkikilpailuun, joka on mukailtu televisio-ohjelmasta Levyraati.\n", "\n", "### Analyysissä käytetyt menetelmät:\n", "- laatikkodiagrammi, joka sisältää kvartiilit ja mediaanin\n", "- keskihajonta\n", "- aineiston normalisointi\n", "- euklidisen etäisyyden mittaaminen\n", "- pääkomponenttianalyysi\n", "\n", "### Aineisto\n", "Aineisto on kerätty Facebook-ryhmästä keväällä 2016. Mukana olleet raatilaiset ovat arvostelleet jokaisen kappaleen, kirjoittaneen sanallisen arvioinnin sekä antaneet numeerisen arvosanan. Tässä analyysissä hyödynnän vain arvosanoja.\n", "- 10 kappaletta\n", "- 7 arvostelijaa\n", "- arvosanat asteikolla 0,5 - 5" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Hae aineisto" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "collapsed": true }, "outputs": [], "source": [ "# Data-analyysin perustyökalut\n", "import numpy as np\n", "import pandas as pd" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": true }, "outputs": [], "source": [ "# Kuvaajat\n", "%matplotlib inline\n", "import seaborn as sns\n", "sns.set_style(\"whitegrid\")" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": false }, "outputs": [ { "data": { "text/html": [ "
\n", " | Sami | \n", "Joe | \n", "Mikko | \n", "Taneli | \n", "Arttu | \n", "Ville | \n", "Kirmo | \n", "
---|---|---|---|---|---|---|---|
Kappale | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
Tungevaag & Raaban – Parade | \n", "4.50 | \n", "0.50 | \n", "3 | \n", "2 | \n", "2.0 | \n", "2.00 | \n", "4.50 | \n", "
Asa – Mä haluun olla hippi | \n", "1.00 | \n", "2.25 | \n", "4 | \n", "1 | \n", "2.0 | \n", "2.00 | \n", "2.50 | \n", "
Chubby Wolf – You are the Description that brings me out of Myself | \n", "0.50 | \n", "1.00 | \n", "1 | \n", "3 | \n", "4.0 | \n", "1.00 | \n", "2.00 | \n", "
Supperheads – Easy | \n", "2.00 | \n", "2.75 | \n", "5 | \n", "3 | \n", "2.0 | \n", "5.00 | \n", "3.50 | \n", "
Dream Theater – The Silent Man | \n", "2.25 | \n", "2.25 | \n", "1 | \n", "3 | \n", "2.5 | \n", "4.50 | \n", "3.75 | \n", "
Tony Bennett & Lady Gaga – It Don't Mean a Thing | \n", "3.00 | \n", "1.25 | \n", "3 | \n", "3 | \n", "4.0 | \n", "1.50 | \n", "4.00 | \n", "
Elina Born – Miss Calculation | \n", "2.00 | \n", "2.50 | \n", "4 | \n", "4 | \n", "1.0 | \n", "3.00 | \n", "3.75 | \n", "
Caravan Palace – Lone Digger | \n", "0.50 | \n", "2.50 | \n", "5 | \n", "5 | \n", "4.0 | \n", "1.75 | \n", "4.75 | \n", "
Avril Lavigne – When You're Gone | \n", "4.50 | \n", "2.75 | \n", "2 | \n", "3 | \n", "1.0 | \n", "3.75 | \n", "4.25 | \n", "
Halestorm – Here's to US | \n", "3.50 | \n", "4.00 | \n", "4 | \n", "4 | \n", "0.5 | \n", "3.50 | \n", "4.00 | \n", "
\n", " | Keskihajonta | \n", "
---|---|
Kappale | \n", "\n", " |
Caravan Palace – Lone Digger | \n", "1.790351 | \n", "
Tungevaag & Raaban – Parade | \n", "1.463850 | \n", "
Chubby Wolf – You are the Description that brings me out of Myself | \n", "1.286375 | \n", "
Halestorm – Here's to US | \n", "1.281740 | \n", "
Supperheads – Easy | \n", "1.264205 | \n", "
Avril Lavigne – When You're Gone | \n", "1.253566 | \n", "
Dream Theater – The Silent Man | \n", "1.136515 | \n", "
Elina Born – Miss Calculation | \n", "1.135205 | \n", "
Tony Bennett & Lady Gaga – It Don't Mean a Thing | \n", "1.086990 | \n", "
Asa – Mä haluun olla hippi | \n", "1.019162 | \n", "
\n", " | Sami | \n", "Joe | \n", "Mikko | \n", "Taneli | \n", "Arttu | \n", "Ville | \n", "Kirmo | \n", "
---|---|---|---|---|---|---|---|
Kappale | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | \n", " |
Tungevaag & Raaban – Parade | \n", "1.0000 | \n", "0.000000 | \n", "0.50 | \n", "0.25 | \n", "0.428571 | \n", "0.2500 | \n", "0.909091 | \n", "
Asa – Mä haluun olla hippi | \n", "0.1250 | \n", "0.500000 | \n", "0.75 | \n", "0.00 | \n", "0.428571 | \n", "0.2500 | \n", "0.181818 | \n", "
Chubby Wolf – You are the Description that brings me out of Myself | \n", "0.0000 | \n", "0.142857 | \n", "0.00 | \n", "0.50 | \n", "1.000000 | \n", "0.0000 | \n", "0.000000 | \n", "
Supperheads – Easy | \n", "0.3750 | \n", "0.642857 | \n", "1.00 | \n", "0.50 | \n", "0.428571 | \n", "1.0000 | \n", "0.545455 | \n", "
Dream Theater – The Silent Man | \n", "0.4375 | \n", "0.500000 | \n", "0.00 | \n", "0.50 | \n", "0.571429 | \n", "0.8750 | \n", "0.636364 | \n", "
Tony Bennett & Lady Gaga – It Don't Mean a Thing | \n", "0.6250 | \n", "0.214286 | \n", "0.50 | \n", "0.50 | \n", "1.000000 | \n", "0.1250 | \n", "0.727273 | \n", "
Elina Born – Miss Calculation | \n", "0.3750 | \n", "0.571429 | \n", "0.75 | \n", "0.75 | \n", "0.142857 | \n", "0.5000 | \n", "0.636364 | \n", "
Caravan Palace – Lone Digger | \n", "0.0000 | \n", "0.571429 | \n", "1.00 | \n", "1.00 | \n", "1.000000 | \n", "0.1875 | \n", "1.000000 | \n", "
Avril Lavigne – When You're Gone | \n", "1.0000 | \n", "0.642857 | \n", "0.25 | \n", "0.50 | \n", "0.142857 | \n", "0.6875 | \n", "0.818182 | \n", "
Halestorm – Here's to US | \n", "0.7500 | \n", "1.000000 | \n", "0.75 | \n", "0.75 | \n", "0.000000 | \n", "0.6250 | \n", "0.727273 | \n", "
\n", " \n", " | Sami\n", " \n", " | Joe\n", " \n", " | Mikko\n", " \n", " | Taneli\n", " \n", " | Arttu\n", " \n", " | Ville\n", " \n", " | Kirmo\n", " \n", " |
---|---|---|---|---|---|---|---|
\n", " Sami\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1.40019\n", " \n", " | \n", " 1.71505\n", " \n", " | \n", " 1.5013\n", " \n", " | \n", " 1.98273\n", " \n", " | \n", " 1.25468\n", " \n", " | \n", " 1.09199\n", " \n", " |
\n", " Joe\n", " \n", " | \n", " 1.40019\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1.10657\n", " \n", " | \n", " 0.916821\n", " \n", " | \n", " 1.79142\n", " \n", " | \n", " 0.846568\n", " \n", " | \n", " 1.23788\n", " \n", " |
\n", " Mikko\n", " \n", " | \n", " 1.71505\n", " \n", " | \n", " 1.10657\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1.19896\n", " \n", " | \n", " 1.71874\n", " \n", " | \n", " 1.46575\n", " \n", " | \n", " 1.22052\n", " \n", " |
\n", " Taneli\n", " \n", " | \n", " 1.5013\n", " \n", " | \n", " 0.916821\n", " \n", " | \n", " 1.19896\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1.33583\n", " \n", " | \n", " 1.27169\n", " \n", " | \n", " 0.951022\n", " \n", " |
\n", " Arttu\n", " \n", " | \n", " 1.98273\n", " \n", " | \n", " 1.79142\n", " \n", " | \n", " 1.71874\n", " \n", " | \n", " 1.33583\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1.92952\n", " \n", " | \n", " 1.61635\n", " \n", " |
\n", " Ville\n", " \n", " | \n", " 1.25468\n", " \n", " | \n", " 0.846568\n", " \n", " | \n", " 1.46575\n", " \n", " | \n", " 1.27169\n", " \n", " | \n", " 1.92952\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1.33103\n", " \n", " |
\n", " Kirmo\n", " \n", " | \n", " 1.09199\n", " \n", " | \n", " 1.23788\n", " \n", " | \n", " 1.22052\n", " \n", " | \n", " 0.951022\n", " \n", " | \n", " 1.61635\n", " \n", " | \n", " 1.33103\n", " \n", " | \n", " nan\n", " \n", " |
\n", " \n", " | Sami\n", " \n", " | Joe\n", " \n", " | Mikko\n", " \n", " | Taneli\n", " \n", " | Arttu\n", " \n", " | Ville\n", " \n", " | Kirmo\n", " \n", " |
---|---|---|---|---|---|---|---|
\n", " Sami\n", " \n", " | \n", " nan\n", " \n", " | \n", " 5\n", " \n", " | \n", " 5\n", " \n", " | \n", " 6\n", " \n", " | \n", " 6\n", " \n", " | \n", " 2\n", " \n", " | \n", " 2\n", " \n", " |
\n", " Joe\n", " \n", " | \n", " 3\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1\n", " \n", " | \n", " 1\n", " \n", " | \n", " 4\n", " \n", " | \n", " 1\n", " \n", " | \n", " 4\n", " \n", " |
\n", " Mikko\n", " \n", " | \n", " 5\n", " \n", " | \n", " 3\n", " \n", " | \n", " nan\n", " \n", " | \n", " 3\n", " \n", " | \n", " 3\n", " \n", " | \n", " 5\n", " \n", " | \n", " 3\n", " \n", " |
\n", " Taneli\n", " \n", " | \n", " 4\n", " \n", " | \n", " 2\n", " \n", " | \n", " 2\n", " \n", " | \n", " nan\n", " \n", " | \n", " 1\n", " \n", " | \n", " 3\n", " \n", " | \n", " 1\n", " \n", " |
\n", " Arttu\n", " \n", " | \n", " 6\n", " \n", " | \n", " 6\n", " \n", " | \n", " 6\n", " \n", " | \n", " 5\n", " \n", " | \n", " nan\n", " \n", " | \n", " 6\n", " \n", " | \n", " 6\n", " \n", " |
\n", " Ville\n", " \n", " | \n", " 2\n", " \n", " | \n", " 1\n", " \n", " | \n", " 4\n", " \n", " | \n", " 4\n", " \n", " | \n", " 5\n", " \n", " | \n", " nan\n", " \n", " | \n", " 5\n", " \n", " |
\n", " Kirmo\n", " \n", " | \n", " 1\n", " \n", " | \n", " 4\n", " \n", " | \n", " 3\n", " \n", " | \n", " 2\n", " \n", " | \n", " 2\n", " \n", " | \n", " 4\n", " \n", " | \n", " nan\n", " \n", " |