{ "cells": [ { "cell_type": "code", "execution_count": null, "id": "09fe0040-1786-42c3-a866-447e283fa1ab", "metadata": { "tags": [ "hide-cell" ] }, "outputs": [], "source": [ "# Install the necessary dependencies\n", "\n", "import sys\n", "import numpy as np\n", "import pandas as pd\n", "import matplotlib.pyplot as plt\n", "!{sys.executable} -m pip install --quiet pandas numpy matplotlib jupyterlab_myst pygments" ] }, { "cell_type": "markdown", "id": "103f46e6", "metadata": { "tags": [ "remove-cell" ] }, "source": [ "---\n", "license:\n", " code: MIT\n", " content: CC-BY-4.0\n", "github: https://github.com/ocademy-ai/machine-learning\n", "venue: By Ocademy\n", "open_access: true\n", "bibliography:\n", " - https://raw.githubusercontent.com/ocademy-ai/machine-learning/main/open-machine-learning-jupyter-book/references.bib\n", "---" ] }, { "cell_type": "markdown", "id": "e8feb576", "metadata": { "tags": [] }, "source": [ "# Introduction to statistics and probability\n", "\n", "Statistics and Probability Theory are two highly related areas of Mathematics that are highly relevant to Data Science. It is possible to operate with data without deep knowledge of mathematics, but it is still better to know at least some basic concepts. Here we will present a short introduction that will help you get started." ] }, { "cell_type": "code", "execution_count": null, "id": "574d8e32-a00e-4a44-bc3f-d5691b3c9364", "metadata": { "attributes": { "classes": [ "seealso" ], "id": "" }, "tags": [ "hide-input" ] }, "outputs": [], "source": [ "from IPython.display import HTML\n", "\n", "display(\n", " HTML(\n", " \"\"\"\n", "