{ "cells": [ { "cell_type": "code", "execution_count": 1, "id": "acceptable-netherlands", "metadata": {}, "outputs": [], "source": [ "\"\"\"\n", "Today we will look at ensemble learning, which is an often undervalued part of machine learning. The way it works is\n", "by taking multiple different classifiers, training them, and then averaging their predictions into one \"ensemble\" prediction. \n", "You often want to have a diverse set of classifiers so you can aggregate predictions. The reason this works is because even if you have", "subpar classifiers with a 60% chance of predicting correctly on their own - as a group together it will increase more \n", "and more as long as these classifiers all have different learnt parameters. This can be suprisingly effective. \n", "The two algorithms we will look at are Random Forests and Ensemble Classifiers. \n", "\"\"\"\n", "\n", "#let's get started\n", "import sealion as sl \n", "from sealion.ensemble_learning import RandomForest, EnsembleClassifier" ] }, { "cell_type": "code", "execution_count": 2, "id": "aquatic-great", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", " | PassengerId | \n", "Survived | \n", "Pclass | \n", "Name | \n", "Sex | \n", "Age | \n", "SibSp | \n", "Parch | \n", "Ticket | \n", "Fare | \n", "Cabin | \n", "Embarked | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "1 | \n", "0 | \n", "3 | \n", "Braund, Mr. Owen Harris | \n", "male | \n", "22.0 | \n", "1 | \n", "0 | \n", "A/5 21171 | \n", "7.2500 | \n", "NaN | \n", "S | \n", "
1 | \n", "2 | \n", "1 | \n", "1 | \n", "Cumings, Mrs. John Bradley (Florence Briggs Th... | \n", "female | \n", "38.0 | \n", "1 | \n", "0 | \n", "PC 17599 | \n", "71.2833 | \n", "C85 | \n", "C | \n", "
2 | \n", "3 | \n", "1 | \n", "3 | \n", "Heikkinen, Miss. Laina | \n", "female | \n", "26.0 | \n", "0 | \n", "0 | \n", "STON/O2. 3101282 | \n", "7.9250 | \n", "NaN | \n", "S | \n", "
3 | \n", "4 | \n", "1 | \n", "1 | \n", "Futrelle, Mrs. Jacques Heath (Lily May Peel) | \n", "female | \n", "35.0 | \n", "1 | \n", "0 | \n", "113803 | \n", "53.1000 | \n", "C123 | \n", "S | \n", "
4 | \n", "5 | \n", "0 | \n", "3 | \n", "Allen, Mr. William Henry | \n", "male | \n", "35.0 | \n", "0 | \n", "0 | \n", "373450 | \n", "8.0500 | \n", "NaN | \n", "S | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
886 | \n", "887 | \n", "0 | \n", "2 | \n", "Montvila, Rev. Juozas | \n", "male | \n", "27.0 | \n", "0 | \n", "0 | \n", "211536 | \n", "13.0000 | \n", "NaN | \n", "S | \n", "
887 | \n", "888 | \n", "1 | \n", "1 | \n", "Graham, Miss. Margaret Edith | \n", "female | \n", "19.0 | \n", "0 | \n", "0 | \n", "112053 | \n", "30.0000 | \n", "B42 | \n", "S | \n", "
888 | \n", "889 | \n", "0 | \n", "3 | \n", "Johnston, Miss. Catherine Helen \"Carrie\" | \n", "female | \n", "NaN | \n", "1 | \n", "2 | \n", "W./C. 6607 | \n", "23.4500 | \n", "NaN | \n", "S | \n", "
889 | \n", "890 | \n", "1 | \n", "1 | \n", "Behr, Mr. Karl Howell | \n", "male | \n", "26.0 | \n", "0 | \n", "0 | \n", "111369 | \n", "30.0000 | \n", "C148 | \n", "C | \n", "
890 | \n", "891 | \n", "0 | \n", "3 | \n", "Dooley, Mr. Patrick | \n", "male | \n", "32.0 | \n", "0 | \n", "0 | \n", "370376 | \n", "7.7500 | \n", "NaN | \n", "Q | \n", "
891 rows × 12 columns
\n", "\n", " | PassengerId | \n", "Survived | \n", "Pclass | \n", "Sex | \n", "Age | \n", "SibSp | \n", "Parch | \n", "Fare | \n", "Embarked | \n", "
---|---|---|---|---|---|---|---|---|---|
0 | \n", "1 | \n", "0 | \n", "3 | \n", "0 | \n", "22.0 | \n", "1 | \n", "0 | \n", "7.2500 | \n", "0 | \n", "
1 | \n", "2 | \n", "1 | \n", "1 | \n", "1 | \n", "38.0 | \n", "1 | \n", "0 | \n", "71.2833 | \n", "1 | \n", "
2 | \n", "3 | \n", "1 | \n", "3 | \n", "1 | \n", "26.0 | \n", "0 | \n", "0 | \n", "7.9250 | \n", "0 | \n", "
3 | \n", "4 | \n", "1 | \n", "1 | \n", "1 | \n", "35.0 | \n", "1 | \n", "0 | \n", "53.1000 | \n", "0 | \n", "
4 | \n", "5 | \n", "0 | \n", "3 | \n", "0 | \n", "35.0 | \n", "0 | \n", "0 | \n", "8.0500 | \n", "0 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
886 | \n", "887 | \n", "0 | \n", "2 | \n", "0 | \n", "27.0 | \n", "0 | \n", "0 | \n", "13.0000 | \n", "0 | \n", "
887 | \n", "888 | \n", "1 | \n", "1 | \n", "1 | \n", "19.0 | \n", "0 | \n", "0 | \n", "30.0000 | \n", "0 | \n", "
888 | \n", "889 | \n", "0 | \n", "3 | \n", "1 | \n", "0.0 | \n", "1 | \n", "2 | \n", "23.4500 | \n", "0 | \n", "
889 | \n", "890 | \n", "1 | \n", "1 | \n", "0 | \n", "26.0 | \n", "0 | \n", "0 | \n", "30.0000 | \n", "1 | \n", "
890 | \n", "891 | \n", "0 | \n", "3 | \n", "0 | \n", "32.0 | \n", "0 | \n", "0 | \n", "7.7500 | \n", "2 | \n", "
891 rows × 9 columns
\n", "