{"cells":[{"cell_type":"markdown","source":["## Schema Evolution in Merge Operations\nThis notebook provides an example of how to perform schema evolution in merge operations for the [2019 Novel Coronavirus COVID-19 (2019-nCoV) Data Repository by Johns Hopkins CSSE](https://github.com/CSSEGISandData/COVID-19) dataset. This primer example allows you to create a Delta Lake table (instead of native Parquet) to track the changes of the this dataset to support the [Johns Hopkins COVID-19 Data Analysis Sample Notebook](https://github.com/databricks/tech-talks/blob/master/samples/JHU%20COVID-19%20Analysis.html).\n\nThe data is updated in the `/databricks-datasets/COVID/CSSEGISandData/` location regularly so you can access the data directly."],"metadata":{}},{"cell_type":"code","source":["# PySpark\nfrom pyspark.sql.functions import input_file_name, lit, col\nfrom pyspark.sql.types import IntegerType, StringType"],"metadata":{},"outputs":[{"metadata":{},"output_type":"display_data","data":{"text/html":["\n
"]}}],"execution_count":2},{"cell_type":"markdown","source":["## Create Different DataFrames for Different Schemas\nAs noted in the [Johns Hopkins COVID-19 Analysis](https://github.com/databricks/tech-talks/blob/master/samples/JHU%20COVID-19%20Analysis.html) notebook, as of this writing, there are three different schemas for this dataset; for this example, we will focus on the last two schema changes.\n\n| id | Schema String List | Date Range | \n| -- | ------------------ | ---------- |\n| 1 | `Province/State Country/Region Last Update Confirmed Deaths Recovered Latitude Longitude` | 03-01-2020 to 03-21-2020 |\n| 2 | `FIPS Admin2 Province_State Country_Region Last_Update Lat Long_ Confirmed Deaths Recovered Active Combined_Key` | 03-22-2020 to current |\n\nThe following is an example of:\n* How to run merge operations with schema evolution for representative files of the different schemas\n* We will focus on only Washington State data for this example"],"metadata":{}},{"cell_type":"code","source":["# File paths\n# Two representative files of the two different schemas\nfile_1 = '/databricks-datasets/COVID/CSSEGISandData/csse_covid_19_data/csse_covid_19_daily_reports/03-21-2020.csv'\nfile_2 = '/databricks-datasets/COVID/CSSEGISandData/csse_covid_19_data/csse_covid_19_daily_reports/04-11-2020.csv'\n\n# Schema 1: Representing datasets between 2020-03-01 and 2020-03-21 (8 columns originally)\n# Renaming some of the columns for better standardization\nold_data = (spark.read\n .option(\"inferSchema\", True)\n .option(\"header\", True)\n .csv(file_1)\n .withColumnRenamed(\"Last Update\", \"Last_Update\")\n .withColumnRenamed(\"Province/State\", \"Province_State\")\n .withColumnRenamed(\"Country/Region\", \"Country_Region\")\n .withColumn(\"process_date\", lit('2020-03-21')) # Date determined by the filename, manually entered in this example\n .withColumn(\"level\", lit(2)) # Specify the level: 1 - Country, 2 - Province/State, 3 - County\n .where(\"Province_State == 'Washington'\")) # Filter by only Washington State (expected output: 1 row)\n\n# Schema 2: Latest schema representing data from 2020-03-22 onwards (12 columns originally)\n# Renaming some of the columns for better standardization\nnew_data = (spark.read\n .option(\"inferSchema\", True)\n .option(\"header\", True)\n .csv(file_2)\n .withColumnRenamed(\"Lat\", \"Latitude\")\n .withColumnRenamed(\"Long_\", \"Longitude\")\n .withColumn(\"process_date\", lit('2020-04-11')) # Date determined by the filename, manually entered in this example\n .withColumn(\"level\", lit(3)) # Specify the level: 1 - Country, 2 - Province/State, 3 - County\n .where(\"Province_State == 'Washington'\")) # Filter by only Washington State (expected output: 39 rows)\n\n# Notes: Expand each DataFrame below to review the schema"],"metadata":{},"outputs":[{"metadata":{},"output_type":"display_data","data":{"text/html":["\n"]}}],"execution_count":4},{"cell_type":"code","source":["# Old Data Schema\nold_data.printSchema()"],"metadata":{},"outputs":[{"metadata":{},"output_type":"display_data","data":{"text/html":["\n| Province_State | Country_Region | Last_Update | Confirmed | Deaths | Recovered | Latitude | Longitude | process_date | level |
|---|---|---|---|---|---|---|---|---|---|
| Washington | US | 2020-03-21T22:43:04.000+0000 | 1793 | 94 | 0 | 47.4009 | -121.4905 | 2020-03-21 | 2 |
| FIPS | Admin2 | Province_State | Country_Region | Last_Update | Latitude | Longitude | Confirmed | Deaths | Recovered | Active | Combined_Key | process_date | level |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 53 | Washington | US | 2020-04-27T19:00:00 | 47.4009 | -121.4905 | 1793 | 94 | 0 | 2020-03-21 | 2 | |||
| 53001 | Adams | Washington | US | 2020-04-11 22:45:33 | 46.98299757 | -118.56017340000001 | 30 | 0 | 0 | 0 | Adams, Washington, US | 2020-04-11 | 3 |
| 53003 | Asotin | Washington | US | 2020-04-11 22:45:33 | 46.18894415 | -117.2022851 | 4 | 0 | 0 | 0 | Asotin, Washington, US | 2020-04-11 | 3 |
| 53005 | Benton | Washington | US | 2020-04-11 22:45:33 | 46.23946995 | -119.51208340000001 | 244 | 25 | 0 | 0 | Benton, Washington, US | 2020-04-11 | 3 |
| 53007 | Chelan | Washington | US | 2020-04-11 22:45:33 | 47.87046092 | -120.6173956 | 53 | 5 | 0 | 0 | Chelan, Washington, US | 2020-04-11 | 3 |
| 53009 | Clallam | Washington | US | 2020-04-11 22:45:33 | 48.04754642 | -123.92263190000001 | 11 | 0 | 0 | 0 | Clallam, Washington, US | 2020-04-11 | 3 |
| 53011 | Clark | Washington | US | 2020-04-11 22:45:33 | 45.77568046 | -122.48292040000001 | 220 | 14 | 0 | 0 | Clark, Washington, US | 2020-04-11 | 3 |
| 53013 | Columbia | Washington | US | 2020-04-11 22:45:33 | 46.29442881 | -117.9051983 | 1 | 0 | 0 | 0 | Columbia, Washington, US | 2020-04-11 | 3 |
| 53015 | Cowlitz | Washington | US | 2020-04-11 22:45:33 | 46.19074721 | -122.6782231 | 23 | 0 | 0 | 0 | Cowlitz, Washington, US | 2020-04-11 | 3 |
| 53017 | Douglas | Washington | US | 2020-04-11 22:45:33 | 47.7361335 | -119.69293700000001 | 16 | 0 | 0 | 0 | Douglas, Washington, US | 2020-04-11 | 3 |
| 53019 | Ferry | Washington | US | 2020-04-11 22:45:33 | 48.471143100000006 | -118.5154041 | 1 | 0 | 0 | 0 | Ferry, Washington, US | 2020-04-11 | 3 |
| 53021 | Franklin | Washington | US | 2020-04-11 22:45:33 | 46.53351425 | -118.9018212 | 107 | 2 | 0 | 0 | Franklin, Washington, US | 2020-04-11 | 3 |
| 53025 | Grant | Washington | US | 2020-04-11 22:45:33 | 47.20753737 | -119.4500749 | 110 | 2 | 0 | 0 | Grant, Washington, US | 2020-04-11 | 3 |
| 53027 | Grays Harbor | Washington | US | 2020-04-11 22:45:33 | 47.14003705 | -123.7820567 | 8 | 0 | 0 | 0 | Grays Harbor, Washington, US | 2020-04-11 | 3 |
| 53029 | Island | Washington | US | 2020-04-11 22:45:33 | 48.14713305 | -122.5216407 | 157 | 7 | 0 | 0 | Island, Washington, US | 2020-04-11 | 3 |
| 53031 | Jefferson | Washington | US | 2020-04-11 22:45:33 | 47.75003077 | -123.56097040000002 | 28 | 0 | 0 | 0 | Jefferson, Washington, US | 2020-04-11 | 3 |
| 53033 | King | Washington | US | 2020-04-11 22:45:33 | 47.49137892 | -121.8346131 | 4262 | 284 | 0 | 0 | King, Washington, US | 2020-04-11 | 3 |
| 53035 | Kitsap | Washington | US | 2020-04-11 22:45:33 | 47.63479026 | -122.6436061 | 128 | 1 | 0 | 0 | Kitsap, Washington, US | 2020-04-11 | 3 |
| 53037 | Kittitas | Washington | US | 2020-04-11 22:45:33 | 47.12521214 | -120.67943899999999 | 15 | 1 | 0 | 0 | Kittitas, Washington, US | 2020-04-11 | 3 |
| 53039 | Klickitat | Washington | US | 2020-04-11 22:45:33 | 45.87325839 | -120.7913598 | 12 | 1 | 0 | 0 | Klickitat, Washington, US | 2020-04-11 | 3 |
| 53041 | Lewis | Washington | US | 2020-04-11 22:45:33 | 46.57756476 | -122.39257690000001 | 18 | 2 | 0 | 0 | Lewis, Washington, US | 2020-04-11 | 3 |
| 53043 | Lincoln | Washington | US | 2020-04-11 22:45:33 | 47.57705241 | -118.41943880000001 | 2 | 0 | 0 | 0 | Lincoln, Washington, US | 2020-04-11 | 3 |
| 53045 | Mason | Washington | US | 2020-04-11 22:45:33 | 47.35007586 | -123.1863685 | 18 | 0 | 0 | 0 | Mason, Washington, US | 2020-04-11 | 3 |
| 53047 | Okanogan | Washington | US | 2020-04-11 22:45:33 | 48.54855019 | -119.73872240000001 | 14 | 0 | 0 | 0 | Okanogan, Washington, US | 2020-04-11 | 3 |
| 53049 | Pacific | Washington | US | 2020-04-11 22:45:33 | 46.55418369 | -123.7285716 | 1 | 0 | 0 | 0 | Pacific, Washington, US | 2020-04-11 | 3 |
| 53051 | Pend Oreille | Washington | US | 2020-04-11 22:45:33 | 48.53234021 | -117.2741804 | 1 | 0 | 0 | 0 | Pend Oreille, Washington, US | 2020-04-11 | 3 |
| 53053 | Pierce | Washington | US | 2020-04-11 22:45:33 | 47.03892768 | -122.14059579999999 | 922 | 19 | 0 | 0 | Pierce, Washington, US | 2020-04-11 | 3 |
| 53055 | San Juan | Washington | US | 2020-04-11 22:45:33 | 48.60182783 | -122.96745 | 13 | 0 | 0 | 0 | San Juan, Washington, US | 2020-04-11 | 3 |
| 53057 | Skagit | Washington | US | 2020-04-11 22:45:33 | 48.48171488 | -121.766131 | 185 | 6 | 0 | 0 | Skagit, Washington, US | 2020-04-11 | 3 |
| 53059 | Skamania | Washington | US | 2020-04-11 22:45:33 | 46.02408726 | -121.91644029999999 | 3 | 0 | 0 | 0 | Skamania, Washington, US | 2020-04-11 | 3 |
| 53061 | Snohomish | Washington | US | 2020-04-11 22:45:33 | 48.04615983 | -121.7170703 | 1835 | 68 | 0 | 0 | Snohomish, Washington, US | 2020-04-11 | 3 |
| 53063 | Spokane | Washington | US | 2020-04-11 22:45:33 | 47.62113146 | -117.4046494 | 251 | 14 | 0 | 0 | Spokane, Washington, US | 2020-04-11 | 3 |
| 53065 | Stevens | Washington | US | 2020-04-11 22:45:33 | 48.40035475 | -117.8542701 | 6 | 0 | 0 | 0 | Stevens, Washington, US | 2020-04-11 | 3 |
| 53067 | Thurston | Washington | US | 2020-04-11 22:45:33 | 46.9291895 | -122.82906559999999 | 81 | 1 | 0 | 0 | Thurston, Washington, US | 2020-04-11 | 3 |
| 53069 | Wahkiakum | Washington | US | 2020-04-11 22:45:33 | 46.29180039999999 | -123.4250831 | 2 | 0 | 0 | 0 | Wahkiakum, Washington, US | 2020-04-11 | 3 |
| 53071 | Walla Walla | Washington | US | 2020-04-11 22:45:33 | 46.23040051 | -118.4775539 | 20 | 0 | 0 | 0 | Walla Walla, Washington, US | 2020-04-11 | 3 |
| 53073 | Whatcom | Washington | US | 2020-04-11 22:45:33 | 48.82227976 | -121.74900179999999 | 261 | 23 | 0 | 0 | Whatcom, Washington, US | 2020-04-11 | 3 |
| 53075 | Whitman | Washington | US | 2020-04-11 22:45:33 | 46.90022523 | -117.52417630000001 | 12 | 0 | 0 | 0 | Whitman, Washington, US | 2020-04-11 | 3 |
| 53077 | Yakima | Washington | US | 2020-04-11 22:45:33 | 46.45738486 | -120.73801259999999 | 511 | 20 | 0 | 0 | Yakima, Washington, US | 2020-04-11 | 3 |
| 90053 | Unassigned | Washington | US | 2020-04-11 22:45:33 | null | null | 848 | 0 | 0 | 0 | Unassigned, Washington, US | 2020-04-11 | 3 |
| Province_State | Country_Region | Last_Update | Confirmed | Deaths | Recovered | Latitude | Longitude | process_date | level | FIPS | Admin2 | Active | Combined_Key |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Washington | US | 2020-04-27T19:00:00.000+0000 | 1793 | 94 | 0 | 47.4009 | -121.4905 | 2020-03-21 | 2 | 53 | |||
| Washington | US | 2020-04-11T22:45:33.000+0000 | 30 | 0 | 0 | 46.98299757 | -118.56017340000001 | 2020-04-11 | 3 | 53001 | Adams | 0 | Adams, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 4 | 0 | 0 | 46.18894415 | -117.2022851 | 2020-04-11 | 3 | 53003 | Asotin | 0 | Asotin, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 244 | 25 | 0 | 46.23946995 | -119.51208340000001 | 2020-04-11 | 3 | 53005 | Benton | 0 | Benton, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 53 | 5 | 0 | 47.87046092 | -120.6173956 | 2020-04-11 | 3 | 53007 | Chelan | 0 | Chelan, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 11 | 0 | 0 | 48.04754642 | -123.92263190000001 | 2020-04-11 | 3 | 53009 | Clallam | 0 | Clallam, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 220 | 14 | 0 | 45.77568046 | -122.48292040000001 | 2020-04-11 | 3 | 53011 | Clark | 0 | Clark, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 1 | 0 | 0 | 46.29442881 | -117.9051983 | 2020-04-11 | 3 | 53013 | Columbia | 0 | Columbia, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 23 | 0 | 0 | 46.19074721 | -122.6782231 | 2020-04-11 | 3 | 53015 | Cowlitz | 0 | Cowlitz, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 16 | 0 | 0 | 47.7361335 | -119.69293700000001 | 2020-04-11 | 3 | 53017 | Douglas | 0 | Douglas, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 1 | 0 | 0 | 48.471143100000006 | -118.5154041 | 2020-04-11 | 3 | 53019 | Ferry | 0 | Ferry, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 107 | 2 | 0 | 46.53351425 | -118.9018212 | 2020-04-11 | 3 | 53021 | Franklin | 0 | Franklin, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 110 | 2 | 0 | 47.20753737 | -119.4500749 | 2020-04-11 | 3 | 53025 | Grant | 0 | Grant, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 8 | 0 | 0 | 47.14003705 | -123.7820567 | 2020-04-11 | 3 | 53027 | Grays Harbor | 0 | Grays Harbor, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 157 | 7 | 0 | 48.14713305 | -122.5216407 | 2020-04-11 | 3 | 53029 | Island | 0 | Island, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 28 | 0 | 0 | 47.75003077 | -123.56097040000002 | 2020-04-11 | 3 | 53031 | Jefferson | 0 | Jefferson, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 4262 | 284 | 0 | 47.49137892 | -121.8346131 | 2020-04-11 | 3 | 53033 | King | 0 | King, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 128 | 1 | 0 | 47.63479026 | -122.6436061 | 2020-04-11 | 3 | 53035 | Kitsap | 0 | Kitsap, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 15 | 1 | 0 | 47.12521214 | -120.67943899999999 | 2020-04-11 | 3 | 53037 | Kittitas | 0 | Kittitas, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 12 | 1 | 0 | 45.87325839 | -120.7913598 | 2020-04-11 | 3 | 53039 | Klickitat | 0 | Klickitat, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 18 | 2 | 0 | 46.57756476 | -122.39257690000001 | 2020-04-11 | 3 | 53041 | Lewis | 0 | Lewis, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 2 | 0 | 0 | 47.57705241 | -118.41943880000001 | 2020-04-11 | 3 | 53043 | Lincoln | 0 | Lincoln, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 18 | 0 | 0 | 47.35007586 | -123.1863685 | 2020-04-11 | 3 | 53045 | Mason | 0 | Mason, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 14 | 0 | 0 | 48.54855019 | -119.73872240000001 | 2020-04-11 | 3 | 53047 | Okanogan | 0 | Okanogan, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 1 | 0 | 0 | 46.55418369 | -123.7285716 | 2020-04-11 | 3 | 53049 | Pacific | 0 | Pacific, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 1 | 0 | 0 | 48.53234021 | -117.2741804 | 2020-04-11 | 3 | 53051 | Pend Oreille | 0 | Pend Oreille, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 922 | 19 | 0 | 47.03892768 | -122.14059579999999 | 2020-04-11 | 3 | 53053 | Pierce | 0 | Pierce, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 13 | 0 | 0 | 48.60182783 | -122.96745 | 2020-04-11 | 3 | 53055 | San Juan | 0 | San Juan, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 185 | 6 | 0 | 48.48171488 | -121.766131 | 2020-04-11 | 3 | 53057 | Skagit | 0 | Skagit, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 3 | 0 | 0 | 46.02408726 | -121.91644029999999 | 2020-04-11 | 3 | 53059 | Skamania | 0 | Skamania, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 1835 | 68 | 0 | 48.04615983 | -121.7170703 | 2020-04-11 | 3 | 53061 | Snohomish | 0 | Snohomish, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 251 | 14 | 0 | 47.62113146 | -117.4046494 | 2020-04-11 | 3 | 53063 | Spokane | 0 | Spokane, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 6 | 0 | 0 | 48.40035475 | -117.8542701 | 2020-04-11 | 3 | 53065 | Stevens | 0 | Stevens, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 81 | 1 | 0 | 46.9291895 | -122.82906559999999 | 2020-04-11 | 3 | 53067 | Thurston | 0 | Thurston, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 2 | 0 | 0 | 46.29180039999999 | -123.4250831 | 2020-04-11 | 3 | 53069 | Wahkiakum | 0 | Wahkiakum, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 20 | 0 | 0 | 46.23040051 | -118.4775539 | 2020-04-11 | 3 | 53071 | Walla Walla | 0 | Walla Walla, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 261 | 23 | 0 | 48.82227976 | -121.74900179999999 | 2020-04-11 | 3 | 53073 | Whatcom | 0 | Whatcom, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 12 | 0 | 0 | 46.90022523 | -117.52417630000001 | 2020-04-11 | 3 | 53075 | Whitman | 0 | Whitman, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 511 | 20 | 0 | 46.45738486 | -120.73801259999999 | 2020-04-11 | 3 | 53077 | Yakima | 0 | Yakima, Washington, US |
| Washington | US | 2020-04-11T22:45:33.000+0000 | 848 | 0 | 0 | null | null | 2020-04-11 | 3 | 90053 | Unassigned | 0 | Unassigned, Washington, US |
| version | timestamp | userId | userName | operation | operationParameters | job | notebook | clusterId | readVersion | isolationLevel | isBlindAppend | operationMetrics |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2020-04-30T04:53:22.000+0000 | 100802 | denny.lee@databricks.com | MERGE | Map(predicate -> (((s.`process_date` = t.`process_date`) AND (s.`province_state` = t.`province_state`)) AND ((s.`country_region` = t.`country_region`) AND (s.`level` = CAST(t.`level` AS BIGINT))))) | null | List(25384224) | 0430-041257-sided264 | 0 | WriteSerializable | false | Map(numTargetRowsCopied -> 0, numTargetRowsDeleted -> 0, numTargetFilesAdded -> 3, numTargetRowsInserted -> 39, numTargetRowsUpdated -> 1, numOutputRows -> 40, numSourceRows -> 40, numTargetFilesRemoved -> 1) |
| 0 | 2020-04-30T04:45:00.000+0000 | 100802 | denny.lee@databricks.com | WRITE | Map(mode -> ErrorIfExists, partitionBy -> []) | null | List(25384224) | 0430-041257-sided264 | null | WriteSerializable | true | Map(numFiles -> 1, numOutputBytes -> 2723, numOutputRows -> 1) |
| commitInfo |
|---|
| List(0430-041257-sided264, false, WriteSerializable, List(25384224), MERGE, List(40, 40, 3, 1, 0, 0, 39, 1), List((((s.`process_date` = t.`process_date`) AND (s.`province_state` = t.`province_state`)) AND ((s.`country_region` = t.`country_region`) AND (s.`level` = CAST(t.`level` AS BIGINT))))), 0, 1588222401301, 100802, denny.lee@databricks.com) |
| add |
|---|
| List(true, 1588222398000, part-00000-74632cb5-6b7e-4f2f-81b1-7238403c31d9-c000.snappy.parquet, 1463, {\"numRecords\":0,\"minValues\":{},\"maxValues\":{},\"nullCount\":{}}) |
| List(true, 1588222401000, part-00147-b6f679c9-8f22-4673-8bd8-b0442c34fdf1-c000.snappy.parquet, 5746, {\"numRecords\":39,\"minValues\":{\"Province_State\":\"Washington\",\"Country_Region\":\"US\",\"Last_Update\":\"2020-04-11T22:45:33.000Z\",\"Confirmed\":1,\"Deaths\":0,\"Recovered\":0,\"Latitude\":45.77568046,\"Longitude\":-123.92263190000001,\"process_date\":\"2020-04-11\",\"level\":3,\"FIPS\":53001,\"Admin2\":\"Adams\",\"Active\":\"0\",\"Combined_Key\":\"Adams, Washington, US\"},\"maxValues\":{\"Province_State\":\"Washington\",\"Country_Region\":\"US\",\"Last_Update\":\"2020-04-11T22:45:33.000Z\",\"Confirmed\":4262,\"Deaths\":284,\"Recovered\":0,\"Latitude\":48.82227976,\"Longitude\":-117.2022851,\"process_date\":\"2020-04-11\",\"level\":3,\"FIPS\":90053,\"Admin2\":\"Yakima\",\"Active\":\"0\",\"Combined_Key\":\"Yakima, Washington, US\"},\"nullCount\":{\"Province_State\":0,\"Country_Region\":0,\"Last_Update\":0,\"Confirmed\":0,\"Deaths\":0,\"Recovered\":0,\"Latitude\":1,\"Longitude\":1,\"process_date\":0,\"level\":0,\"FIPS\":0,\"Admin2\":0,\"Active\":0,\"Combined_Key\":0}}) |
| List(true, 1588222401000, part-00183-46079500-157e-4369-9256-1a72d4b32d04-c000.snappy.parquet, 3534, {\"numRecords\":1,\"minValues\":{\"Province_State\":\"Washington\",\"Country_Region\":\"US\",\"Last_Update\":\"2020-04-27T19:00:00.000Z\",\"Confirmed\":1793,\"Deaths\":94,\"Recovered\":0,\"Latitude\":47.4009,\"Longitude\":-121.4905,\"process_date\":\"2020-03-21\",\"level\":2,\"FIPS\":53,\"Admin2\":\"\",\"Active\":\"\",\"Combined_Key\":\"\"},\"maxValues\":{\"Province_State\":\"Washington\",\"Country_Region\":\"US\",\"Last_Update\":\"2020-04-27T19:00:00.000Z\",\"Confirmed\":1793,\"Deaths\":94,\"Recovered\":0,\"Latitude\":47.4009,\"Longitude\":-121.4905,\"process_date\":\"2020-03-21\",\"level\":2,\"FIPS\":53,\"Admin2\":\"\",\"Active\":\"\",\"Combined_Key\":\"\"},\"nullCount\":{\"Province_State\":0,\"Country_Region\":0,\"Last_Update\":0,\"Confirmed\":0,\"Deaths\":0,\"Recovered\":0,\"Latitude\":0,\"Longitude\":0,\"process_date\":0,\"level\":0,\"FIPS\":0,\"Admin2\":0,\"Active\":0,\"Combined_Key\":0}}) |