{ "cells": [ { "cell_type": "markdown", "id": "4c16c877-0135-4ee4-86e2-db21dca5eb5f", "metadata": {}, "source": [ "# Exploring tabular data\n", "When working with data in tables, the ability of quickly getting an overview about the data is key." ] }, { "cell_type": "code", "execution_count": 1, "id": "05824a7f", "metadata": {}, "outputs": [], "source": [ "import pandas as pd " ] }, { "cell_type": "markdown", "id": "5673e04d", "metadata": {}, "source": [ "## Loading CSV files from disk\n", "To ensure compatility beween different software for processing tabular data the [CSV file format](https://en.wikipedia.org/wiki/Comma-separated_values) is commonly used. We can open those files using [pandas.read_csv](https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html)." ] }, { "cell_type": "code", "execution_count": 2, "id": "f46f4002", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
AreaMeanStdDevMinMaxXYXMYMMajorMinorAngle%AreaType
118.0730.389103.354592.0948.0435.0004.722434.9624.6975.9873.828168.425100A
2126.0718.33390.367556.01046.0388.0878.683388.1838.68716.5599.688175.471100A
3NaNNaNNaN608.0964.0NaNNaNNaN7.6657.359NaN101.121100A
468.0686.98561.169571.0880.0126.1478.809126.1928.81115.1365.720168.133100A
5NaNNaN69.438566.0792.0348.5007.500NaN7.508NaN3.088NaN100A
.............................................
387152.0801.599111.328582.01263.0348.487497.632348.451497.67517.77310.88911.829100A
38817.0742.70669.624620.0884.0420.500496.382420.513NaNNaN3.66349.457100A
38960.0758.03377.309601.0947.0259.000499.300258.990499.2899.4768.06290.000100A
39012.0714.83367.294551.0785.0240.167498.167240.179498.1484.6063.317168.690100A
39123.0695.04367.356611.0846.049.891503.02249.882502.9796.4544.53773.243100A
\n", "

391 rows × 14 columns

\n", "
" ], "text/plain": [ " Area Mean StdDev Min Max X Y XM \\\n", " \n", "1 18.0 730.389 103.354 592.0 948.0 435.000 4.722 434.962 \n", "2 126.0 718.333 90.367 556.0 1046.0 388.087 8.683 388.183 \n", "3 NaN NaN NaN 608.0 964.0 NaN NaN NaN \n", "4 68.0 686.985 61.169 571.0 880.0 126.147 8.809 126.192 \n", "5 NaN NaN 69.438 566.0 792.0 348.500 7.500 NaN \n", ".. ... ... ... ... ... ... ... ... \n", "387 152.0 801.599 111.328 582.0 1263.0 348.487 497.632 348.451 \n", "388 17.0 742.706 69.624 620.0 884.0 420.500 496.382 420.513 \n", "389 60.0 758.033 77.309 601.0 947.0 259.000 499.300 258.990 \n", "390 12.0 714.833 67.294 551.0 785.0 240.167 498.167 240.179 \n", "391 23.0 695.043 67.356 611.0 846.0 49.891 503.022 49.882 \n", "\n", " YM Major Minor Angle %Area Type \n", " \n", "1 4.697 5.987 3.828 168.425 100 A \n", "2 8.687 16.559 9.688 175.471 100 A \n", "3 7.665 7.359 NaN 101.121 100 A \n", "4 8.811 15.136 5.720 168.133 100 A \n", "5 7.508 NaN 3.088 NaN 100 A \n", ".. ... ... ... ... ... ... \n", "387 497.675 17.773 10.889 11.829 100 A \n", "388 NaN NaN 3.663 49.457 100 A \n", "389 499.289 9.476 8.062 90.000 100 A \n", "390 498.148 4.606 3.317 168.690 100 A \n", "391 502.979 6.454 4.537 73.243 100 A \n", "\n", "[391 rows x 14 columns]" ] }, "execution_count": 2, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data = pd.read_csv('../../data/Results.csv', index_col=0, delimiter=';')\n", "data" ] }, { "cell_type": "markdown", "id": "7beb37a0", "metadata": {}, "source": [ "## Viewing the data\n", "Viewing data can be tricky, especially when working with large tables." ] }, { "cell_type": "code", "execution_count": 3, "id": "0a79b9c3", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
AreaMeanStdDevMinMaxXYXMYMMajorMinorAngle%AreaType
118.0730.389103.354592.0948.0435.0004.722434.9624.6975.9873.828168.425100A
2126.0718.33390.367556.01046.0388.0878.683388.1838.68716.5599.688175.471100A
3NaNNaNNaN608.0964.0NaNNaNNaN7.6657.359NaN101.121100A
468.0686.98561.169571.0880.0126.1478.809126.1928.81115.1365.720168.133100A
5NaNNaN69.438566.0792.0348.5007.500NaN7.508NaN3.088NaN100A
6669.0697.16472.863539.0957.0471.69626.253471.69426.19736.65623.237124.340100A
75.0658.60049.161607.0710.028.3008.10028.2848.1033.1442.025161.565100A
87.0677.57149.899596.0768.0415.3578.786415.3608.8044.1102.168112.500100A
914.0691.07163.873586.0808.0493.2869.000493.2959.0165.1203.48138.802100C
1039.0763.61588.786623.01016.0157.52612.731157.59212.7578.8155.63346.437100C
\n", "
" ], "text/plain": [ " Area Mean StdDev Min Max X Y XM YM \\\n", " \n", "1 18.0 730.389 103.354 592.0 948.0 435.000 4.722 434.962 4.697 \n", "2 126.0 718.333 90.367 556.0 1046.0 388.087 8.683 388.183 8.687 \n", "3 NaN NaN NaN 608.0 964.0 NaN NaN NaN 7.665 \n", "4 68.0 686.985 61.169 571.0 880.0 126.147 8.809 126.192 8.811 \n", "5 NaN NaN 69.438 566.0 792.0 348.500 7.500 NaN 7.508 \n", "6 669.0 697.164 72.863 539.0 957.0 471.696 26.253 471.694 26.197 \n", "7 5.0 658.600 49.161 607.0 710.0 28.300 8.100 28.284 8.103 \n", "8 7.0 677.571 49.899 596.0 768.0 415.357 8.786 415.360 8.804 \n", "9 14.0 691.071 63.873 586.0 808.0 493.286 9.000 493.295 9.016 \n", "10 39.0 763.615 88.786 623.0 1016.0 157.526 12.731 157.592 12.757 \n", "\n", " Major Minor Angle %Area Type \n", " \n", "1 5.987 3.828 168.425 100 A \n", "2 16.559 9.688 175.471 100 A \n", "3 7.359 NaN 101.121 100 A \n", "4 15.136 5.720 168.133 100 A \n", "5 NaN 3.088 NaN 100 A \n", "6 36.656 23.237 124.340 100 A \n", "7 3.144 2.025 161.565 100 A \n", "8 4.110 2.168 112.500 100 A \n", "9 5.120 3.481 38.802 100 C \n", "10 8.815 5.633 46.437 100 C " ] }, "execution_count": 3, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data.head(10) # top 10 rows" ] }, { "cell_type": "code", "execution_count": 4, "id": "ef55a071", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
AreaMeanStdDevMinMaxXYXMYMMajorMinorAngle%AreaType
38245.0734.35668.637575.0867.0171.500494.789171.492494.73914.6303.91695.698100B
38394.0746.61785.198550.01021.0194.032498.223194.014498.23917.2956.92052.720100B
38435.0776.25774.746611.0961.0268.957493.586268.977NaNNaN5.990111.193100A
38535.0739.286NaN593.0928.0291.871493.843291.871493.806NaN5.35279.368100A
38614.0736.14381.533646.0902.0315.000493.000314.989493.003NaN3.67645.000100A
387152.0801.599111.328582.01263.0348.487497.632348.451497.67517.77310.88911.829100A
38817.0742.70669.624620.0884.0420.500496.382420.513NaNNaN3.66349.457100A
38960.0758.03377.309601.0947.0259.000499.300258.990499.2899.4768.06290.000100A
39012.0714.83367.294551.0785.0240.167498.167240.179498.1484.6063.317168.690100A
39123.0695.04367.356611.0846.049.891503.02249.882502.9796.4544.53773.243100A
\n", "
" ], "text/plain": [ " Area Mean StdDev Min Max X Y XM \\\n", " \n", "382 45.0 734.356 68.637 575.0 867.0 171.500 494.789 171.492 \n", "383 94.0 746.617 85.198 550.0 1021.0 194.032 498.223 194.014 \n", "384 35.0 776.257 74.746 611.0 961.0 268.957 493.586 268.977 \n", "385 35.0 739.286 NaN 593.0 928.0 291.871 493.843 291.871 \n", "386 14.0 736.143 81.533 646.0 902.0 315.000 493.000 314.989 \n", "387 152.0 801.599 111.328 582.0 1263.0 348.487 497.632 348.451 \n", "388 17.0 742.706 69.624 620.0 884.0 420.500 496.382 420.513 \n", "389 60.0 758.033 77.309 601.0 947.0 259.000 499.300 258.990 \n", "390 12.0 714.833 67.294 551.0 785.0 240.167 498.167 240.179 \n", "391 23.0 695.043 67.356 611.0 846.0 49.891 503.022 49.882 \n", "\n", " YM Major Minor Angle %Area Type \n", " \n", "382 494.739 14.630 3.916 95.698 100 B \n", "383 498.239 17.295 6.920 52.720 100 B \n", "384 NaN NaN 5.990 111.193 100 A \n", "385 493.806 NaN 5.352 79.368 100 A \n", "386 493.003 NaN 3.676 45.000 100 A \n", "387 497.675 17.773 10.889 11.829 100 A \n", "388 NaN NaN 3.663 49.457 100 A \n", "389 499.289 9.476 8.062 90.000 100 A \n", "390 498.148 4.606 3.317 168.690 100 A \n", "391 502.979 6.454 4.537 73.243 100 A " ] }, "execution_count": 4, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data.tail(10) # bottom 10 rows" ] }, { "cell_type": "markdown", "id": "a66c128b-0f4d-45e3-9918-0306614c6e31", "metadata": {}, "source": [ "## Overview descriptive statistics\n", "To get a glimpse of the range of values which exist in the given table, we can ask the DateFrame to _describe_ itself using [`DataFrame.describe()`](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.describe.html). It will display count, mean, standard deviation and other descriptive statistics for each column in our table." ] }, { "cell_type": "code", "execution_count": 5, "id": "c8c7b3af", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
AreaMeanStdDevMinMaxXYXMYMMajorMinorAngle%Area
count389.000000386.000000388.000000388.000000388.000000389.000000388.000000388.000000386.000000383.000000388.000000390.000000391.0
mean107.164524743.45556576.575309610.414948962.922680256.419859254.384088256.183338253.35300512.4810169.50066286.598441100.0
std241.03708242.25214031.84486457.156709244.897224152.261694155.080074152.380388154.42625011.97917649.71428060.5936860.0
min1.000000587.0000000.000000516.000000587.0000003.9780004.7220004.0120004.6970001.1280001.1280000.000000100.0
25%15.000000717.06075063.861000570.750000847.750000127.142000102.875250126.923250103.8137505.0980003.63725034.517250100.0
50%44.000000741.07750074.727000599.000000917.500000243.300000271.490000242.288000271.2720009.3740005.88600089.703500100.0
75%116.000000767.26075086.826500633.2500001014.500000400.167000395.058250400.363500393.80075016.2830009.017250134.617250100.0
max2755.000000912.938000377.767000877.0000003880.000000508.214000503.022000508.169000502.979000144.475000981.000000568.000000100.0
\n", "
" ], "text/plain": [ " Area Mean StdDev Min Max \\\n", "count 389.000000 386.000000 388.000000 388.000000 388.000000 \n", "mean 107.164524 743.455565 76.575309 610.414948 962.922680 \n", "std 241.037082 42.252140 31.844864 57.156709 244.897224 \n", "min 1.000000 587.000000 0.000000 516.000000 587.000000 \n", "25% 15.000000 717.060750 63.861000 570.750000 847.750000 \n", "50% 44.000000 741.077500 74.727000 599.000000 917.500000 \n", "75% 116.000000 767.260750 86.826500 633.250000 1014.500000 \n", "max 2755.000000 912.938000 377.767000 877.000000 3880.000000 \n", "\n", " X Y XM YM Major Minor \\\n", "count 389.000000 388.000000 388.000000 386.000000 383.000000 388.000000 \n", "mean 256.419859 254.384088 256.183338 253.353005 12.481016 9.500662 \n", "std 152.261694 155.080074 152.380388 154.426250 11.979176 49.714280 \n", "min 3.978000 4.722000 4.012000 4.697000 1.128000 1.128000 \n", "25% 127.142000 102.875250 126.923250 103.813750 5.098000 3.637250 \n", "50% 243.300000 271.490000 242.288000 271.272000 9.374000 5.886000 \n", "75% 400.167000 395.058250 400.363500 393.800750 16.283000 9.017250 \n", "max 508.214000 503.022000 508.169000 502.979000 144.475000 981.000000 \n", "\n", " Angle %Area \n", "count 390.000000 391.0 \n", "mean 86.598441 100.0 \n", "std 60.593686 0.0 \n", "min 0.000000 100.0 \n", "25% 34.517250 100.0 \n", "50% 89.703500 100.0 \n", "75% 134.617250 100.0 \n", "max 568.000000 100.0 " ] }, "execution_count": 5, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data.describe()" ] }, { "cell_type": "markdown", "id": "a4e9e343-bee1-49be-bfaa-c1fdcb3d8190", "metadata": {}, "source": [ "## Sorting in tables\n", "In many cases, we are interested in table rows that contain the maximum value, e.g. in the `area` column we can find the largest object:" ] }, { "cell_type": "code", "execution_count": 6, "id": "9a20c5ec", "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
AreaMeanStdDevMinMaxXYXMYMMajorMinorAngle%AreaType
1902755.0859.928235.458539.03880.0108.710302.158110.999300.247144.47524.28039.318100C
812295.0765.23996.545558.01431.0375.003134.888374.982135.35965.76944.429127.247100B
2091821.0847.761122.074600.01510.0287.795321.115288.074321.82455.87941.492112.124100A
2521528.0763.77783.183572.01172.0191.969385.944192.487385.69763.15030.80834.424100B
2651252.0793.371117.139579.01668.0262.071394.497262.268394.32660.15426.50050.147100A
.............................................
1131.0587.0000.000587.0587.0399.500117.500399.500117.5001.1281.1280.000100A
3101.0866.0000.000866.0866.0343.500408.500343.500408.5001.1281.1280.000100A
2191.0763.0000.000763.0763.0411.500296.500411.500296.5001.1281.1280.000100A
3NaNNaNNaN608.0964.0NaNNaNNaN7.6657.359NaN101.121100A
5NaNNaN69.438566.0792.0348.5007.500NaN7.508NaN3.088NaN100A
\n", "

391 rows × 14 columns

\n", "
" ], "text/plain": [ " Area Mean StdDev Min Max X Y XM \\\n", " \n", "190 2755.0 859.928 235.458 539.0 3880.0 108.710 302.158 110.999 \n", "81 2295.0 765.239 96.545 558.0 1431.0 375.003 134.888 374.982 \n", "209 1821.0 847.761 122.074 600.0 1510.0 287.795 321.115 288.074 \n", "252 1528.0 763.777 83.183 572.0 1172.0 191.969 385.944 192.487 \n", "265 1252.0 793.371 117.139 579.0 1668.0 262.071 394.497 262.268 \n", ".. ... ... ... ... ... ... ... ... \n", "113 1.0 587.000 0.000 587.0 587.0 399.500 117.500 399.500 \n", "310 1.0 866.000 0.000 866.0 866.0 343.500 408.500 343.500 \n", "219 1.0 763.000 0.000 763.0 763.0 411.500 296.500 411.500 \n", "3 NaN NaN NaN 608.0 964.0 NaN NaN NaN \n", "5 NaN NaN 69.438 566.0 792.0 348.500 7.500 NaN \n", "\n", " YM Major Minor Angle %Area Type \n", " \n", "190 300.247 144.475 24.280 39.318 100 C \n", "81 135.359 65.769 44.429 127.247 100 B \n", "209 321.824 55.879 41.492 112.124 100 A \n", "252 385.697 63.150 30.808 34.424 100 B \n", "265 394.326 60.154 26.500 50.147 100 A \n", ".. ... ... ... ... ... ... \n", "113 117.500 1.128 1.128 0.000 100 A \n", "310 408.500 1.128 1.128 0.000 100 A \n", "219 296.500 1.128 1.128 0.000 100 A \n", "3 7.665 7.359 NaN 101.121 100 A \n", "5 7.508 NaN 3.088 NaN 100 A \n", "\n", "[391 rows x 14 columns]" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "data.sort_values(by = \"Area\", ascending=False)" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3 (ipykernel)", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.9.9" } }, "nbformat": 4, "nbformat_minor": 5 }