{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# spark-df-profiling Meteorites example" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Source of data: https://data.nasa.gov/Space-Science/Meteorite-Landings/gh4g-9sfh\n", "\n", "I have previously transformed the downloaded csv to a [Parquet](https://parquet.apache.org/) table, but that doesn't matter. As long as you have your Spark Dataframe loaded, you are good to go." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Import library" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "collapsed": false }, "outputs": [], "source": [ "import spark_df_profiling" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Create the DataFrame" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": false }, "outputs": [ { "data": { "text/plain": [ "DataFrame[name: string, id: bigint, nametype: string, recclass: string, mass_g: double, fall: string, reclat: double, reclong: double, GeoLocation: string, source: string, reclat_city: double, year: date]" ] }, "execution_count": 2, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df = sqlContext.read.parquet(\"/Users/Julio/Downloads/Meteorite_Landings.parquet\").cache()\n", "\n", "df" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Spark Dataframes have the built-in method `.describe()`. Let's see what it shows:" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "+-------+------------------+------+---------+----------+-------------------+\n", "|summary| id|mass_g| reclat| reclong| reclat_city|\n", "+-------+------------------+------+---------+----------+-------------------+\n", "| count| 45726| 45726| 45726| 45726| 45726|\n", "| mean|26883.906202160695| NaN| NaN| NaN| NaN|\n", "| stddev| 16863.44556599258| NaN| NaN| NaN| NaN|\n", "| min| 1| 0.0|-87.36667|-165.43333|-103.79172917787167|\n", "| max| 57458| NaN| NaN| NaN| NaN|\n", "+-------+------------------+------+---------+----------+-------------------+\n", "\n" ] } ], "source": [ "df.describe().show()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Generate the report" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now let's use `spark_df_profiling`:" ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "collapsed": false }, "outputs": [], "source": [ "report = spark_df_profiling.ProfileReport(df)" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "collapsed": false }, "outputs": [ { "data": { "text/html": [ "\n", " \n", "\n", " \n", "\n", "
\n", "
\n", "

Overview

\n", "
\n", "\n", " \n", "
\n", "
\n", "

Dataset info

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Number of variables12
Number of observations45726
Total Missing (%)4.1%
Total size in memory0.0 B
Average record size in memory0.0 B
\n", "
\n", "
\n", "

Variables types

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Numeric4
Categorical4
Date1
Text (Unique)1
Rejected2
\n", "
\n", "
\n", "

Warnings

\n", "
  • GeoLocation has 7315 / 19.0% missing values Missing
  • GeoLocation has a high cardinality: 17100 distinct values Warning
  • mass_g is highly skewed (γ1 = 76.916)
  • recclass has a high cardinality: 466 distinct values Warning
  • reclat has 7315 / 19.0% missing values Missing
  • reclat has 6438 / 14.1% zeros
  • reclat_city is highly correlated with reclat (ρ = 0.99423) Rejected
  • reclong has 7315 / 19.0% missing values Missing
  • reclong has 6214 / 13.6% zeros
  • source has constant value NASA Rejected
\n", "
\n", "
\n", "\n", "\n", "
\n", "

Variables

\n", "
\n", "\n", "
\n", "
\n", "

GeoLocation
Categorical

\n", "
\n", "\n", "
\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count17100
Unique (%)44.5%
Missing (%)19.0%
Missing (n)7315
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "\n", "\n", "
\n", " \n", "
\n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", "
(0.000000, 0.000000)\n", "
\n", "6214\n", "
\n", "
(-71.500000, 35.666670)\n", "
\n", " \n", "
4761\n", "
(-84.000000, 168.000000)\n", "
\n", " \n", "
3040\n", "
Other values (17097)\n", "
\n", "24396\n", "
\n", "
(Missing)\n", "
\n", "7315\n", "
\n", "
\n", "
\n", "\n", "\n", " \n", " \n", "\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "
ValueCountFrequency (%) 
(0.000000, 0.000000)621413.6%\n", "
 
\n", "
(-71.500000, 35.666670)476110.4%\n", "
 
\n", "
(-84.000000, 168.000000)30406.6%\n", "
 
\n", "
(-72.000000, 26.000000)15053.3%\n", "
 
\n", "
(-79.683330, 159.750000)6571.4%\n", "
 
\n", "
(-76.716670, 159.666670)6371.4%\n", "
 
\n", "
(-76.183330, 157.166670)5391.2%\n", "
 
\n", "
(-79.683330, 155.750000)4731.0%\n", "
 
\n", "
(-84.216670, 160.500000)2630.6%\n", "
 
\n", "
(-86.366670, -70.000000)2260.5%\n", "
 
\n", "
(0.000000, 35.666670)2230.5%\n", "
 
\n", "
(-86.716670, -141.500000)2170.5%\n", "
 
\n", "
(-85.666670, 175.000000)1850.4%\n", "
 
\n", "
(-24.850000, -70.533330)1780.4%\n", "
 
\n", "
(-85.633330, -68.700000)1050.2%\n", "
 
\n", "
(-72.954880, 160.473280)740.2%\n", "
 
\n", "
(58.583330, 13.433330)640.1%\n", "
 
\n", "
(-76.716670, 159.333330)420.1%\n", "
 
\n", "
(-72.778890, 75.313610)390.1%\n", "
 
\n", "
(-72.983890, 75.246390)380.1%\n", "
 
\n", "
Other values (17080)1893141.4%\n", "
 
\n", "
(Missing)731516.0%\n", "
 
\n", "
\n", "\n", "
\n", "\n", "\n", "
\n", "
\n", "

fall
Categorical

\n", "
\n", "\n", "
\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count2
Unique (%)0.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "\n", "\n", "
\n", " \n", "
\n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", "
Found\n", "
\n", "44609\n", "
\n", "
Fell\n", "
\n", " \n", "
1117\n", "
\n", "
\n", "\n", "\n", " \n", " \n", "\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "
ValueCountFrequency (%) 
Found4460997.6%\n", "
 
\n", "
Fell11172.4%\n", "
 
\n", "
\n", "\n", "
\n", "\n", "\n", "
\n", "
\n", "

id
Numeric

\n", "
\n", "\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count45716
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "
\n", "
\n", " \n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Mean26884
Minimum1
Maximum57458
Zeros (%)0.0%
\n", "
\n", "
\n", "
\n", "
\n", " \n", "\n", "
\n", " \n", "
\n", "
\n", "

Quantile statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Minimum1
5-th percentile2388.8
Q112681
Median24256
Q340654
95-th percentile54891
Maximum57458
Range57457
Interquartile range27972
\n", "

Descriptive statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Standard deviation16863
Coef of variation0.62727
Kurtosis-1.1601
Mean26884
MAD14490
Skewness0.26652
Sum1229300000
Variance284380000
Memory size0.0 B
\n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", "

mass_g
Numeric

\n", "
\n", "\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count12577
Unique (%)27.6%
Missing (%)0.3%
Missing (n)131
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "
\n", "
\n", " \n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Mean13278
Minimum0
Maximum60000000
Zeros (%)0.0%
\n", "
\n", "
\n", "
\n", "
\n", " \n", "\n", "
\n", " \n", "
\n", "
\n", "

Quantile statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Minimum0
5-th percentile1.0978
Q17.1907
Median32.598
Q3202.86
95-th percentile3999.9
Maximum60000000
Range60000000
Interquartile range195.67
\n", "

Descriptive statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Standard deviation574930
Coef of variation43.298
Kurtosis6797.7
Mean13278
MAD25113
Skewness76.916
Sum605430000
Variance330540000000
Memory size0.0 B
\n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", "

name
Categorical, Unique

\n", "
\n", "\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
First 3 values
Abee
Asco
Aleppo
\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Last 3 values
Allende
Alessandria
Akaba
\n", " \n", "
\n", "

First 20 values

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
1Abee
2Asco
3Aleppo
4Al Rais
5Arbol Solo
6Ash Creek
7Northwest Africa 5815
8Anlong
9Aomori
10Aldsworth
11Akyumak
12Aachen
13Ambapur Nagla
14Alta'ameem
15Aarhus
16Archie
17Almahata Sitta
18Andhara
19Adzhi-Bogdo (stone)
20Aïr
\n", "

Last 20 values

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
45707Alais
45708Arroyo Aguiar
45709Aguada
45710Angra dos Reis (stone)
45711Alexandrovsky
45712Akwanga
45713Alfianello
45714Appley Bridge
45715Achiras
45716Adhi Kot
45717Akbarpur
45718Andover
45719Acapulco
45720Albareto
45721Apt
45722Agen
45723Andura
45724Allende
45725Alessandria
45726Akaba
\n", "
\n", "\n", "
\n", "
\n", "

nametype
Categorical

\n", "
\n", "\n", "
\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count2
Unique (%)0.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "\n", "\n", "
\n", " \n", "
\n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", "
Valid\n", "
\n", "45651\n", "
\n", "
Relict\n", "
\n", " \n", "
75\n", "
\n", "
\n", "\n", "\n", " \n", " \n", "\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "
ValueCountFrequency (%) 
Valid4565199.8%\n", "
 
\n", "
Relict750.2%\n", "
 
\n", "
\n", "\n", "
\n", "\n", "\n", "
\n", "
\n", "

recclass
Categorical

\n", "
\n", "\n", "
\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count466
Unique (%)1.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "\n", "\n", "
\n", " \n", "
\n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", " \n", " \n", " \n", "\n", "
L6\n", "
\n", "8287\n", "
\n", "
H5\n", "
\n", "7143\n", "
\n", "
L5\n", "
\n", " \n", "
4797\n", "
Other values (463)\n", "
\n", "25499\n", "
\n", "
\n", "
\n", "\n", "\n", " \n", " \n", "\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "\n", " \n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "\n", " \n", " \n", " \n", " \n", "\n", "\n", "
ValueCountFrequency (%) 
L6828718.1%\n", "
 
\n", "
H5714315.6%\n", "
 
\n", "
L5479710.5%\n", "
 
\n", "
H645299.9%\n", "
 
\n", "
H442119.2%\n", "
 
\n", "
LL527666.0%\n", "
 
\n", "
LL620434.5%\n", "
 
\n", "
L412532.7%\n", "
 
\n", "
H4/54280.9%\n", "
 
\n", "
CM24160.9%\n", "
 
\n", "
H33860.8%\n", "
 
\n", "
L33650.8%\n", "
 
\n", "
CO33350.7%\n", "
 
\n", "
Ureilite3000.7%\n", "
 
\n", "
Iron, IIIAB2850.6%\n", "
 
\n", "
LL42680.6%\n", "
 
\n", "
CV32560.6%\n", "
 
\n", "
Diogenite2410.5%\n", "
 
\n", "
Howardite2400.5%\n", "
 
\n", "
LL2250.5%\n", "
 
\n", "
Other values (446)695215.2%\n", "
 
\n", "
\n", "\n", "
\n", "\n", "\n", "
\n", "
\n", "

reclat
Numeric

\n", "
\n", "\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count12739
Unique (%)33.2%
Missing (%)19.0%
Missing (n)7315
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "
\n", "
\n", " \n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Mean-39.107
Minimum-87.367
Maximum81.167
Zeros (%)14.1%
\n", "
\n", "
\n", "
\n", "
\n", " \n", "\n", "
\n", " \n", "
\n", "
\n", "

Quantile statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Minimum-87.367
5-th percentile-84.355
Q1-76.714
Median-71.529
Q3-0.16289
95-th percentile34.494
Maximum81.167
Range168.53
Interquartile range76.551
\n", "

Descriptive statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Standard deviation46.386
Coef of variation-1.1861
Kurtosis-1.4768
Mean-39.107
MAD43.937
Skewness0.4913
Sum-1502100
Variance2151.7
Memory size0.0 B
\n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", "

reclat_city
Highly correlated

\n", "
\n", "\n", "
\n", "

This variable is highly correlated with reclat and should be ignored for analysis

\n", "\n", "
\n", "
\n", " \n", " \n", " \n", "
Correlation0.99423
\n", "
\n", "
\n", "
\n", "

reclong
Numeric

\n", "
\n", "\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count14641
Unique (%)38.1%
Missing (%)19.0%
Missing (n)7315
Infinite (%)0.0%
Infinite (n)0
\n", "\n", "
\n", "
\n", " \n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Mean61.053
Minimum-165.43
Maximum354.47
Zeros (%)13.6%
\n", "
\n", "
\n", "
\n", "
\n", " \n", "\n", "
\n", " \n", "
\n", "
\n", "

Quantile statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Minimum-165.43
5-th percentile-90.466
Q1-0.0024196
Median35.666
Q3157.17
95-th percentile167.72
Maximum354.47
Range519.91
Interquartile range157.17
\n", "

Descriptive statistics

\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Standard deviation80.655
Coef of variation1.3211
Kurtosis-0.73145
Mean61.053
MAD67.606
Skewness-0.17437
Sum2345100
Variance6505.3
Memory size0.0 B
\n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", "

source
Constant

\n", "
\n", "\n", "
\n", "

This variable is constant and should be ignored for analysis

\n", "\n", "
\n", "
\n", " \n", " \n", " \n", "
Constant valueNASA
\n", "
\n", "
\n", "
\n", "

year
Date

\n", "
\n", "\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Distinct count244
Unique (%)0.5%
Missing (%)0.7%
Missing (n)312
Infinite (%)0.0%
Infinite (n)0
\n", "
\n", "
\n", " \n", " \n", " \n", " \n", " \n", "
Minimum1688-01-01
Maximum2101-01-01
\n", "
\n", "\n", "\n", "
\n", "\n", "
\n", "

Sample

\n", "
\n", "\n", " \n", "
\n", "
\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameidnametyperecclassmass_gfallreclatreclongGeoLocationsourcereclat_cityyear
0Aachen1ValidL521.0Fell50.775006.08333(50.775000, 6.083330)NASA44.9177281880-01-01
1Aarhus2ValidH6720.0Fell56.1833310.23333(56.183330, 10.233330)NASA58.4892771951-01-01
2Abee6ValidEH4107000.0Fell54.21667-113.00000(54.216670, -113.000000)NASA53.7539951952-01-01
3Acapulco10ValidAcapulcoite1914.0Fell16.88333-99.90000(16.883330, -99.900000)NASA17.3111361976-01-01
4Achiras370ValidL6780.0Fell-33.16667-64.95000(-33.166670, -64.950000)NASA-29.3508441902-01-01
\n", "
\n", "
\n", "\n", "
\n", "\n" ], "text/plain": [ "" ] }, "execution_count": 5, "metadata": {}, "output_type": "execute_result" } ], "source": [ "report" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Save report to file" ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "collapsed": true }, "outputs": [], "source": [ "report.to_file(\"/tmp/example.html\")" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.5.1" } }, "nbformat": 4, "nbformat_minor": 0 }