{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Jupyter Notebook Showcase\n", "\n", "**by Ian McLoughlin (ian.mcloughlin@gmit.ie)**" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "This notebook demonstrates the analysis of a data set using the [Python programming language](https://www.python.org/about/gettingstarted/).\n", "\n", "The notebook is hosted on [GitHub](https://github.com/ianmcloughlin/jupyter-teaching-notebooks/blob/master/showcase.ipynb).\n", "\n", "You can take this interactive notebook and play around with it: [**https://goo.gl/SbYMqr**](https://goo.gl/SbYMqr).\n", "\n", "*Note: unfortunately, Google haven't updated their packages yet, so you'll have to uncomment and run the following code if you're running this notebook on Colaboratory.*" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "#!pip install --upgrade seaborn " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "***\n", "\n", "## A bit of context\n", "\n", "#### My objectives in this talk are to:\n", "\n", "1. Demonstrate that programming is accessible to non-computing students.\n", "2. Discuss the pros and cons of Graphical User Interfaces in data analytics.\n", "3. Widen understanding of the formal sciences.\n", "\n", "#### A few talking points\n", "\n", "- Notebooks are useful for students of all disciplines (formal sciences, natural sciences, social sciences, engineering, etc.).\n", "- Notebooks are documents that blend text, mathematical notation and runnable code, and can be run from a browser.\n", "- We'll soon have our first wave of incoming students who have taken Computer Science at second level.\n", "- That might foster a discussion about what programming is - maybe it's just a skill." 
] }, { "cell_type": "markdown", "metadata": {}, "source": [ "***\n", "\n", "## About the data set\n", "\n", "\n", "![Ronald Fisher](https://upload.wikimedia.org/wikipedia/commons/a/aa/Youngronaldfisher2.JPG)\n", "\n", "We'll look at the well-known [Iris data set](https://archive.ics.uci.edu/ml/datasets/iris).\n", "\n", "It was made famous by [Ronald Fisher (above)](http://www-history.mcs.st-andrews.ac.uk/Biographies/Fisher.html), who analysed it in a 1936 paper; the measurements themselves were collected by the botanist Edgar Anderson.\n", "\n", "Fisher is famous for, amongst other things, [The Design of Experiments](https://en.wikipedia.org/wiki/The_Design_of_Experiments), which includes the [Lady Tasting Tea problem](https://en.wikipedia.org/wiki/Lady_tasting_tea), and for [ANOVA](https://en.wikipedia.org/wiki/Analysis_of_variance#History)." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "***\n", "\n", "## Load a data set\n", "\n", "![Iris](http://www.oranchak.com/three-irises.jpg)\n", "\n", "Using Python, we can easily load a comma-separated values (CSV) file to analyse it." ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
<table>\n",
"  <thead>\n",
"    <tr><th></th><th>sepal_length</th><th>sepal_width</th><th>petal_length</th><th>petal_width</th><th>class</th></tr>\n",
"  </thead>\n",
"  <tbody>\n",
"    <tr><th>0</th><td>5.1</td><td>3.5</td><td>1.4</td><td>0.2</td><td>setosa</td></tr>\n",
"    <tr><th>1</th><td>4.9</td><td>3.0</td><td>1.4</td><td>0.2</td><td>setosa</td></tr>\n",
"    <tr><th>2</th><td>4.7</td><td>3.2</td><td>1.3</td><td>0.2</td><td>setosa</td></tr>\n",
"    <tr><th>3</th><td>4.6</td><td>3.1</td><td>1.5</td><td>0.2</td><td>setosa</td></tr>\n",
"    <tr><th>4</th><td>5.0</td><td>3.6</td><td>1.4</td><td>0.2</td><td>setosa</td></tr>\n",
"  </tbody>\n",
"</table>\n", "