{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Learning scikit-learn " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## An Introduction to Machine Learning in Python" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### at PyData Chicago 2016" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": false }, "outputs": [], "source": [ "%load_ext watermark\n", "%watermark -a \"Sebastian Raschka\" -u -d -p numpy,scipy,matplotlib,sklearn,pandas,mlxtend" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Table of Contents\n", "\n", "* [1 Introduction to Machine Learning](#1-Introduction-to-Machine-Learning)\n", "* [2 Linear Regression](#2-Linear-Regression)\n", " * [Loading the dataset](#Loading-the-dataset)\n", " * [Preparing the dataset](#Preparing-the-dataset)\n", " * [Fitting the model](#Fitting-the-model)\n", " * [Evaluating the model](#Evaluating-the-model)\n", "* [3 Introduction to Classification](#3-Introduction-to-Classification)\n", " * [The Iris dataset](#The-Iris-dataset)\n", " * [Class label encoding](#Class-label-encoding)\n", " * [Scikit-learn's built-in datasets](#Scikit-learn's-in-build-datasets)\n", " * [Test/train splits](#Test/train-splits)\n", " * [Logistic Regression](#Logistic-Regression)\n", " * [K-Nearest Neighbors](#K-Nearest-Neighbors)\n", " * [3 - Exercises](#3---Exercises)\n", "* [4 - Feature Preprocessing & scikit-learn Pipelines](#4---Feature-Preprocessing-&-scikit-learn-Pipelines)\n", " * [Categorical features: nominal vs ordinal](#Categorical-features:-nominal-vs-ordinal)\n", " * [Normalization](#Normalization)\n", " * [Pipelines](#Pipelines)\n", " * [4 - Exercises](#4---Exercises)\n", "* [5 - Dimensionality Reduction: Feature Selection & Extraction](#5---Dimensionality-Reduction:-Feature-Selection-&-Extraction)\n", " * [Recursive Feature Elimination](#Recursive-Feature-Elimination)\n", " * [Sequential Feature 
Selection](#Sequential-Feature-Selection)\n", " * [Principal Component Analysis](#Principal-Component-Analysis)\n", "* [6 - Model Evaluation & Hyperparameter Tuning](#6---Model-Evaluation-&-Hyperparameter-Tuning)\n", " * [Wine Dataset](#Wine-Dataset)\n", " * [Stratified K-Fold](#Stratified-K-Fold)\n", " * [Grid Search](#Grid-Search)" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [ "%matplotlib inline\n", "import matplotlib.pyplot as plt\n", "import numpy as np\n", "import pandas as pd" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
