{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "[En français](../tutorial_gdal_fr/) \n", "\n", "![ECCC logo](https://eccc-msc.github.io/open-data/img_eccc-logo.png) \n", "\n", "[TOC](https://eccc-msc.github.io/open-data/readme_en/) > [Usage overview](https://eccc-msc.github.io/open-data/usage/readme_en/) > GDAL command line tutorial\n", "\n", "# GDAL command line tutorial with weather data\n", "\n", "## Introduction\n", "\n", "[MSC GeoMet](https://eccc-msc.github.io/open-data/msc-geomet/readme_en/) and [MSC Datamart](https://eccc-msc.github.io/open-data/msc-datamart/readme_en/) data can be manipulated via the command line using [GDAL](https://gdal.org/), a widely-known software library used to read and write raster and vector geospatial data. In the following examples, you'll use a GeoTIFF file retrieved using via a Web Coverage Service (WCS) request to MSC GeoMet. This tutorial will show you how to:\n", "* Display the GDAL version installed on your system\n", "* Save a WCS request output to your computer\n", "* List information/metadata related to the raster file\n", "* Reproject a raster file\n", "* Convert a GeoTIFF file to the NetCDF file format\n", "* Get the value for a specific point based on a location in longitude/latitude\n", "\n", "There are various ways to install GDAL, please refer to https://gdal.org/ for more information.\n", "\n", "To run the following GDAL command line examples, you need to have a basic knowledge of using the terminal command line. These examples work within a bash terminal. \n", "\n", "The [interactive version of this Jupyter Notebook is available](https://mybinder.org/v2/gh/ECCC-MSC/open-data/master?filepath=docs%2Fusage%2Ftutorial_gdal%2ftutorial_gdal_en.ipynb).\n", "\n", "[![badge](https://img.shields.io/badge/Interactive%20version-binder-F5A252.svg?logo=)](https://mybinder.org/v2/gh/ECCC-MSC/open-data/master?filepath=docs%2Fusage%2Ftutorial_gdal%2ftutorial_gdal_en.ipynb)\n", "\n", "## Show GDAL version\n", "\n", "GDAL is a suite of several command line tools. When you install GDAL you get all the different command line tools. The most basic tool is `gdalinfo` which can be used to retrieve information pertaining to your GDAL installation and display information about a raster file." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "GDAL 3.1.1, released 2020/06/22\n" ] } ], "source": [ "%%bash\n", "\n", "gdalinfo --version" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Save a WCS request output to disk\n", "\n", "The OGC Web Coverage Service requests enable a client to retrieve coverage information from a raster file for a given area of interest. WCS requests are made over the internet (HTTPS) and give the user more flexibility when requesting information about the coverage of a layer compared with the more traditional way of downloading flat files. The Web Coverage Service allows for several different types of requests, each of which are described in further detail below. For more information on the WCS request parameters, please refer to the [MSC GeoMet WCS GetCoverage page](https://eccc-msc.github.io/open-data/msc-geomet/web-services_en/#wcs-getcoverage).\n", "\n", "We are going to use a `curl` command to save the WCS request result on disk, the file will be named `CMC_glb_TMP.tif`. The result is a GeoTIFF file, showing temperature (°C) from a subset of MSC's Global Deterministic Prediction System (GDPS)." ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ " % Total % Received % Xferd Average Speed Time Time Time Current\n", " Dload Upload Total Spent Left Speed\n", "\r", " 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0\r", "100 44266 100 44266 0 0 78208 0 --:--:-- --:--:-- --:--:-- 78208\n" ] } ], "source": [ "%%bash\n", "\n", "curl \"https://geo.weather.gc.ca/geomet?SERVICE=WCS&VERSION=2.0.1&REQUEST=GetCoverage&COVERAGEID=GDPS.ETA_TT&SUBSETTINGCRS=EPSG:4326&SUBSET=x(-120,-85)&SUBSET=y(48,66)&RESOLUTION=x(0.24)&RESOLUTION=y(0.24)&FORMAT=image/tiff\" > CMC_glb_TMP.tif " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Lists information about a raster file\n", "\n", "The `gdalinfo` tool can be used to retrieve the downloaded raster file's metadata. The command's output will list some information on the file, such as:\n", "* File format\n", "* File size\n", "* Coordinate system\n", "* Metadata\n", "* Band information" ] }, { "cell_type": "code", "execution_count": 11, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Driver: GTiff/GeoTIFF\n", "Files: CMC_glb_TMP.tif\n", "Size is 146, 75\n", "Coordinate System is:\n", "GEOGCRS[\"WGS 84\",\n", " DATUM[\"World Geodetic System 1984\",\n", " ELLIPSOID[\"WGS 84\",6378137,298.257223563,\n", " LENGTHUNIT[\"metre\",1]]],\n", " PRIMEM[\"Greenwich\",0,\n", " ANGLEUNIT[\"degree\",0.0174532925199433]],\n", " CS[ellipsoidal,2],\n", " AXIS[\"geodetic latitude (Lat)\",north,\n", " ORDER[1],\n", " ANGLEUNIT[\"degree\",0.0174532925199433]],\n", " AXIS[\"geodetic longitude (Lon)\",east,\n", " ORDER[2],\n", " ANGLEUNIT[\"degree\",0.0174532925199433]],\n", " ID[\"EPSG\",4326]]\n", "Data axis to CRS axis mapping: 2,1\n", "Origin = (-119.999862068965513,66.000000000000000)\n", "Pixel Size = (0.239724137931034,-0.240000000000000)\n", "Metadata:\n", " AREA_OR_POINT=Area\n", " TIFFTAG_RESOLUTIONUNIT=2 (pixels/inch)\n", " TIFFTAG_XRESOLUTION=72\n", " TIFFTAG_YRESOLUTION=72\n", "Image Structure Metadata:\n", " INTERLEAVE=BAND\n", "Corner Coordinates:\n", "Upper Left (-119.9998621, 66.0000000) (119d59'59.50\"W, 66d 0' 0.00\"N)\n", "Lower Left (-119.9998621, 48.0000000) (119d59'59.50\"W, 48d 0' 0.00\"N)\n", "Upper Right ( -85.0001379, 66.0000000) ( 85d 0' 0.50\"W, 66d 0' 0.00\"N)\n", "Lower Right ( -85.0001379, 48.0000000) ( 85d 0' 0.50\"W, 48d 0' 0.00\"N)\n", "Center (-102.5000000, 57.0000000) (102d30' 0.00\"W, 57d 0' 0.00\"N)\n", "Band 1 Block=146x14 Type=Float32, ColorInterp=Gray\n" ] } ], "source": [ "%%bash\n", "\n", "gdalinfo CMC_glb_TMP.tif" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "It is also possible to use `gdalinfo` to retrieve some basic statistics on the raster file, such as minimum and maximum value by adding the `-mm` option. Note that the resulting values are in °C." ] }, { "cell_type": "code", "execution_count": 21, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " Computed Min/Max=0.728,27.828\n" ] } ], "source": [ "%%bash\n", "\n", "gdalinfo -mm CMC_glb_TMP.tif | grep Min/Max" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Adding the `-proj4` option to `gdalinfo` will output the projection definition as a proj4 string:" ] }, { "cell_type": "code", "execution_count": 24, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "PROJ.4 string is:\n", "'+proj=longlat +datum=WGS84 +no_defs'\n" ] } ], "source": [ "%%bash\n", "\n", "gdalinfo -proj4 CMC_glb_TMP.tif | grep PROJ.4 -A 1" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Reproject a raster file\n", "\n", "Using the `gdalwarp` command, we can reproject a raster file. Using MSC GeoMet and MSC Datamart data, you only need to provide an output projection definition corresponding to a EPSG code, or you can use a proj4 string.\n", "\n", "The following example reprojects the GeoTIFF file into an EPSG:3857 projection. The output file is named `CMC_glb_TMP_epsg3857.tif`" ] }, { "cell_type": "code", "execution_count": 25, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Creating output file that is 118P x 114L.\n", "Processing CMC_glb_TMP.tif [1/1] : 0...10...20...30...40...50...60...70...80...90...100 - done.\n" ] } ], "source": [ "%%bash\n", "\n", "gdalwarp -t_srs EPSG:3857 CMC_glb_TMP.tif CMC_glb_TMP_epsg3857.tif" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Then we can use `gdalinfo` to look at the coordinates and the proj4 string to ensure that the projection of `CMC_glb_TMP_epsg3857.tif` really is different from the original file." ] }, { "cell_type": "code", "execution_count": 26, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "PROJ.4 string is:\n", "'+proj=merc +a=6378137 +b=6378137 +lat_ts=0 +lon_0=0 +x_0=0 +y_0=0 +k=1 +units=m +nadgrids=@null +wktext +no_defs'\n", "Origin = (-20037500.189506221562624,17439592.350557137280703)\n", "Pixel Size = (19796.259162699658191,-19796.259162699658191)\n", "Metadata:\n", " AREA_OR_POINT=Area\n", "--\n", "Corner Coordinates:\n", "Upper Left (-20037500.190,17439592.351) (179d59'59.74\"W, 82d34' 7.50\"N)\n", "Lower Left (-20037500.190,-17441416.294) (179d59'59.74\"W, 82d34'15.13\"S)\n", "Upper Right (20030128.356,17439592.351) (179d56' 1.34\"E, 82d34' 7.50\"N)\n", "Lower Right (20030128.356,-17441416.294) (179d56' 1.34\"E, 82d34'15.13\"S)\n", "Center ( -3685.917, -911.972) ( 0d 1'59.20\"W, 0d 0'29.49\"S)\n" ] } ], "source": [ "%%bash\n", "\n", "gdalinfo -proj4 epsg3857.tif | grep -E '(PROJ.4|Corner Coordinates:)' -A 5" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Convert a GeoTIFF file to the NetCDF file format\n", "\n", "Using `gdal_translate` command, we can convert a raster file from any supported format (`gdalinfo --formats`) into another raster file format.\n", "\n", "In this example, we convert our GeoTIFF file to a NetCDF file. The `-of NetCDF` parameter tells gdal_translate in which format to do the projection. The output file will be named `CMC_glb_TMP.nc`" ] }, { "cell_type": "code", "execution_count": 27, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Input file size is 146, 75\n", "0...10...20...30...40...50...60...70...80...90...100 - done.\n" ] } ], "source": [ "%%bash\n", "\n", "gdal_translate -of NetCDF CMC_glb_TMP.tif CMC_glb_TMP.nc" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Then using `gdalinfo` we can make sure the output NetCDF file is a valid raster file." ] }, { "cell_type": "code", "execution_count": 29, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Driver: netCDF/Network Common Data Format\n", "Files: CMC_glb_TMP.nc\n", "Size is 146, 75\n", "Coordinate System is:\n", "GEOGCRS[\"WGS 84\",\n", " DATUM[\"World Geodetic System 1984\",\n", " ELLIPSOID[\"WGS 84\",6378137,298.257223563,\n", " LENGTHUNIT[\"metre\",1]]],\n", " PRIMEM[\"Greenwich\",0,\n", " ANGLEUNIT[\"degree\",0.0174532925199433]],\n", " CS[ellipsoidal,2],\n", " AXIS[\"geodetic latitude (Lat)\",north,\n", " ORDER[1],\n", " ANGLEUNIT[\"degree\",0.0174532925199433]],\n", " AXIS[\"geodetic longitude (Lon)\",east,\n", " ORDER[2],\n", " ANGLEUNIT[\"degree\",0.0174532925199433]],\n", " ID[\"EPSG\",4326]]\n", "Data axis to CRS axis mapping: 2,1\n", "Origin = (-119.999862068965513,66.000000000000000)\n", "Pixel Size = (0.239724137931034,-0.240000000000000)\n", "Metadata:\n", " Band1#grid_mapping=crs\n", " Band1#long_name=GDAL Band Number 1\n", " Band1#_FillValue=9.96921e+36\n", " crs#GeoTransform=-119.9998620689655 0.2397241379310344 0 66 0 -0.24 \n", " crs#grid_mapping_name=latitude_longitude\n", " crs#inverse_flattening=298.257223563\n", " crs#longitude_of_prime_meridian=0\n", " crs#long_name=CRS definition\n", " crs#semi_major_axis=6378137\n", " crs#spatial_ref=GEOGCS[\"WGS 84\",DATUM[\"WGS_1984\",SPHEROID[\"WGS 84\",6378137,298.257223563,AUTHORITY[\"EPSG\",\"7030\"]],AUTHORITY[\"EPSG\",\"6326\"]],PRIMEM[\"Greenwich\",0],UNIT[\"degree\",0.0174532925199433,AUTHORITY[\"EPSG\",\"9122\"]],AXIS[\"Latitude\",NORTH],AXIS[\"Longitude\",EAST],AUTHORITY[\"EPSG\",\"4326\"]]\n", " lat#long_name=latitude\n", " lat#standard_name=latitude\n", " lat#units=degrees_north\n", " lon#long_name=longitude\n", " lon#standard_name=longitude\n", " lon#units=degrees_east\n", " NC_GLOBAL#Conventions=CF-1.5\n", " NC_GLOBAL#GDAL=GDAL 3.1.1, released 2020/06/22\n", " NC_GLOBAL#GDAL_AREA_OR_POINT=Area\n", " NC_GLOBAL#GDAL_TIFFTAG_RESOLUTIONUNIT=2 (pixels/inch)\n", " NC_GLOBAL#GDAL_TIFFTAG_XRESOLUTION=72\n", " NC_GLOBAL#GDAL_TIFFTAG_YRESOLUTION=72\n", " NC_GLOBAL#history=Tue Jul 21 18:51:21 2020: GDAL CreateCopy( CMC_glb_TMP.nc, ... )\n", "Corner Coordinates:\n", "Upper Left (-119.9998621, 66.0000000) (119d59'59.50\"W, 66d 0' 0.00\"N)\n", "Lower Left (-119.9998621, 48.0000000) (119d59'59.50\"W, 48d 0' 0.00\"N)\n", "Upper Right ( -85.0001379, 66.0000000) ( 85d 0' 0.50\"W, 66d 0' 0.00\"N)\n", "Lower Right ( -85.0001379, 48.0000000) ( 85d 0' 0.50\"W, 48d 0' 0.00\"N)\n", "Center (-102.5000000, 57.0000000) (102d30' 0.00\"W, 57d 0' 0.00\"N)\n", "Band 1 Block=146x1 Type=Float32, ColorInterp=Undefined\n", " NoData Value=9.96920996838686905e+36\n", " Metadata:\n", " grid_mapping=crs\n", " long_name=GDAL Band Number 1\n", " NETCDF_VARNAME=Band1\n", " _FillValue=9.96921e+36\n" ] } ], "source": [ "%%bash\n", "\n", "gdalinfo CMC_glb_TMP.nc" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Get the value for a specific point based on a location in longitude/latitude\n", "\n", "Using `gdallocationinfo` command we can get the raw value of a pixel by specifying a location in either longitude/latitude or by specifying a pixel position. \n", "\n", "In the following example, we use longitude/latitude. The resulting value is in °C." ] }, { "cell_type": "code", "execution_count": 32, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Report:\n", " Location: (83P,66L)\n", " Band 1:\n", " Value: 16.4780216217041\n" ] } ], "source": [ "%%bash\n", "\n", "gdallocationinfo -wgs84 CMC_glb_TMP.tif -100 50" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.6.10" } }, "nbformat": 4, "nbformat_minor": 4 }