{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Walkthrough for S3-helper function\n", "
\n",
" Forget where you put some sensitive information on your cloud database?\n",
"
Tired of downloading files from S3 only to preview them in Excel or TextPad?\n",
"
Want to touch and reshape data but feel it's caged off from you?\n",
"
Look no further: your S3 problems are solved!\n",
"
\n",
" s3.ls lists all the files and directories under a bucket/key, akin to os.listdir().\n",
"
see the code\n",
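The listing behaviour described above can be sketched without touching the network. This is only an illustration of the os.listdir()-style grouping, not the actual s3.ls implementation: the helper name `ls_from_keys` is hypothetical, and it assumes the full object keys have already been fetched from the bucket.

```python
def ls_from_keys(keys, prefix=""):
    """Return the immediate children of `prefix`, like os.listdir().

    `keys` is any iterable of full object keys. This helper is
    hypothetical and only illustrates the idea behind s3.ls.
    """
    if prefix and not prefix.endswith("/"):
        prefix += "/"
    children = set()
    for key in keys:
        if not key.startswith(prefix):
            continue
        rest = key[len(prefix):]
        if not rest:
            continue  # skip a placeholder key equal to the prefix itself
        head, sep, _ = rest.partition("/")
        # "Directories" keep their trailing slash so they stand out.
        children.add(head + sep)
    return sorted(children)


# Made-up keys, purely for illustration.
keys = [
    "data/wine/red.csv",
    "data/wine/white.csv",
    "data/notes.json",
    "readme.txt",
]
print(ls_from_keys(keys, "data"))
```

Because S3 keys are flat strings, "directories" exist only as shared prefixes, which is why the sketch groups keys by the first path segment after the prefix.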
"
\n",
" s3.read_csv and s3.read_json behave like the pandas functions they are built on.\n",
"
\n",
" Using these functions, your data is displayed in a tidy tabular format:
\n",
" see the code\n",
"

| | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 7.4 | 0.70 | 0.00 | 1.9 | 0.076 | 11.0 | 34.0 | 0.9978 | 3.51 | 0.56 | 9.4 | 5 |
| 1 | 7.8 | 0.88 | 0.00 | 2.6 | 0.098 | 25.0 | 67.0 | 0.9968 | 3.20 | 0.68 | 9.8 | 5 |
| 2 | 7.8 | 0.76 | 0.04 | 2.3 | 0.092 | 15.0 | 54.0 | 0.9970 | 3.26 | 0.65 | 9.8 | 5 |

| | fixed acidity | volatile acidity | citric acid | residual sugar | chlorides | free sulfur dioxide | total sulfur dioxide | density | pH | sulphates | alcohol | quality |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1596 | 6.3 | 0.510 | 0.13 | 2.3 | 0.076 | 29.0 | 40.0 | 0.99574 | 3.42 | 0.75 | 11.0 | 6 |
| 1597 | 5.9 | 0.645 | 0.12 | 2.0 | 0.075 | 32.0 | 44.0 | 0.99547 | 3.57 | 0.71 | 10.2 | 5 |
| 1598 | 6.0 | 0.310 | 0.47 | 3.6 | 0.067 | 18.0 | 42.0 | 0.99549 | 3.39 | 0.66 | 11.0 | 6 |

| | alcohol | chlorides | citric acid | density | fixed acidity | free sulfur dioxide | pH | quality | residual sugar | sulphates | total sulfur dioxide | volatile acidity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1289 | 10.2 | 0.068 | 0.30 | 0.99914 | 7.0 | 20.0 | 3.30 | 5 | 4.5 | 1.17 | 110.0 | 0.60 |
| 607 | 10.5 | 0.092 | 0.41 | 0.99820 | 8.8 | 26.0 | 3.31 | 6 | 3.3 | 0.53 | 52.0 | 0.48 |
| 675 | 10.2 | 0.064 | 0.39 | 0.99840 | 9.3 | 12.0 | 3.26 | 5 | 2.2 | 0.65 | 31.0 | 0.41 |

| | count | mean | std | min | 25% | 50% | 75% | max |
|---|---|---|---|---|---|---|---|---|
| alcohol | 1599.0 | 10.422983 | 1.065668 | 8.40000 | 9.5000 | 10.20000 | 11.100000 | 14.90000 |
| chlorides | 1599.0 | 0.087467 | 0.047065 | 0.01200 | 0.0700 | 0.07900 | 0.090000 | 0.61100 |
| citric acid | 1599.0 | 0.270976 | 0.194801 | 0.00000 | 0.0900 | 0.26000 | 0.420000 | 1.00000 |
| density | 1599.0 | 0.996747 | 0.001887 | 0.99007 | 0.9956 | 0.99675 | 0.997835 | 1.00369 |
| fixed acidity | 1599.0 | 8.319637 | 1.741096 | 4.60000 | 7.1000 | 7.90000 | 9.200000 | 15.90000 |
| free sulfur dioxide | 1599.0 | 15.874922 | 10.460157 | 1.00000 | 7.0000 | 14.00000 | 21.000000 | 72.00000 |
| pH | 1599.0 | 3.311113 | 0.154386 | 2.74000 | 3.2100 | 3.31000 | 3.400000 | 4.01000 |
| quality | 1599.0 | 5.636023 | 0.807569 | 3.00000 | 5.0000 | 6.00000 | 6.000000 | 8.00000 |
| residual sugar | 1599.0 | 2.538806 | 1.409928 | 0.90000 | 1.9000 | 2.20000 | 2.600000 | 15.50000 |
| sulphates | 1599.0 | 0.658149 | 0.169507 | 0.33000 | 0.5500 | 0.62000 | 0.730000 | 2.00000 |
| total sulfur dioxide | 1599.0 | 46.467792 | 32.895324 | 6.00000 | 22.0000 | 38.00000 | 62.000000 | 289.00000 |
| volatile acidity | 1599.0 | 0.527821 | 0.179060 | 0.12000 | 0.3900 | 0.52000 | 0.640000 | 1.58000 |

| | alcohol | chlorides | citric acid | density | fixed acidity | free sulfur dioxide | pH | quality | residual sugar | sulphates | total sulfur dioxide | volatile acidity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45 | 13.1 | 0.054 | 0.15 | 0.9934 | 4.6 | 8.0 | 3.90 | 4 | 2.1 | 0.56 | 65.0 | 0.52 |
| 95 | 12.9 | 0.058 | 0.17 | 0.9932 | 4.7 | 17.0 | 3.85 | 6 | 2.3 | 0.60 | 106.0 | 0.60 |
| 131 | 13.0 | 0.049 | 0.09 | 0.9937 | 5.6 | 17.0 | 3.63 | 5 | 2.3 | 0.63 | 99.0 | 0.50 |
| 132 | 13.0 | 0.049 | 0.09 | 0.9937 | 5.6 | 17.0 | 3.63 | 5 | 2.3 | 0.63 | 99.0 | 0.50 |
| 142 | 14.0 | 0.050 | 0.00 | 0.9916 | 5.2 | 27.0 | 3.68 | 6 | 1.8 | 0.79 | 63.0 | 0.34 |
\n",
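Tables like the ones above are what pandas renders for a DataFrame. As a rough sketch of how a pandas-backed reader can work, the example below parses CSV bytes held in memory, which is how the contents of a remote object would typically arrive. The helper name `read_csv_from_bytes` is hypothetical and is not the s3.read_csv implementation; the sample values are taken from the first two rows shown above.

```python
import io

import pandas as pd


def read_csv_from_bytes(raw, **kwargs):
    """Parse CSV bytes into a DataFrame.

    Hypothetical helper: extra keyword arguments are forwarded to
    pandas.read_csv unchanged, so the call feels like the pandas one.
    """
    return pd.read_csv(io.BytesIO(raw), **kwargs)


# A few columns from the first two rows of the table above.
raw = (
    b"fixed acidity;volatile acidity;quality\n"
    b"7.4;0.70;5\n"
    b"7.8;0.88;5\n"
)
df = read_csv_from_bytes(raw, sep=";")
print(df.head())
```

Forwarding `**kwargs` is one way a wrapper can stay close to the pandas function it builds on: options such as `sep` pass straight through.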
" s3.read_csv and read_json are almost identical to their Pandas ancestor and backbone.\n",
"
The difference is that s3.to_csv takes the DataFrame as an argument, rather than being a method of the DataFrame.\n",
"
see the code\n",
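The calling convention described above, with the DataFrame passed as an argument instead of calling a method on it, might look like the following sketch. The helper name `to_csv_text` is hypothetical, and it writes to an in-memory buffer where an upload step would otherwise go; it is not the s3.to_csv implementation.

```python
import io

import pandas as pd


def to_csv_text(df, **kwargs):
    """Serialize `df` to CSV text, taking the DataFrame as an argument.

    Hypothetical helper: keyword arguments are forwarded to
    DataFrame.to_csv, and the result is returned as a string.
    """
    buffer = io.StringIO()
    df.to_csv(buffer, **kwargs)
    return buffer.getvalue()


# pH and quality values from the first two rows shown earlier.
df = pd.DataFrame({"pH": [3.51, 3.20], "quality": [5, 5]})
csv_text = to_csv_text(df, index=False)
print(csv_text)
```

Compare `to_csv_text(df, index=False)` with the pandas form `df.to_csv(buffer, index=False)`: the options are the same, only the position of the DataFrame changes.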
"