{ "nbformat": 4, "nbformat_minor": 0, "metadata": { "colab": { "provenance": [], "mount_file_id": "10P5xS5zuQTNNxygr7JEus-dbrUFcLNdR", "authorship_tag": "ABX9TyPPyv0AYFk13Gtm+WZ56/lb", "include_colab_link": true }, "kernelspec": { "name": "python3", "display_name": "Python 3" }, "language_info": { "name": "python" } }, "cells": [ { "cell_type": "markdown", "metadata": { "id": "view-in-github", "colab_type": "text" }, "source": [ "\"Open" ] }, { "cell_type": "markdown", "source": [ "# Cracking Transit Data — Calgary 2025\n", "## How to Decode and Leverage GTFS for Real-Time Transit Insights\n", "\n", "![weston-m-phyfofYbDp8-unsplash (1).jpg]()\n", "\n", "\n", "\n", " " ], "metadata": { "id": "9HGnYbLacYhV" } }, { "cell_type": "markdown", "source": [ "# Introduction\n", "In the digital age, public transportation systems increasingly leverage technology to provide passengers with real-time data, enabling a more seamless travel experience. Transit systems share this data primarily through the General Transit Feed Specification (GTFS), a standard that provides data on schedules, routes, and real-time updates. However, while this data can be incredibly valuable, accessing and interpreting it can be challenging, especially when combining static and real-time feeds.\n", "\n", "This article shows you how to use Calgary Transit’s data about bus locations and routes to get useful information. If you work with data, understanding these datasets can help improve transit planning and make it better for users. We’ll cover the essential steps for fetching and processing Calgary Transit’s static and real-time data, including troubleshooting common issues you might encounter. By the end of this guide, you’ll be well-equipped to tap into Calgary’s transit data to solve real-world problems.\n", "\n", "---" ], "metadata": { "id": "AgUWBNxpeVOB" } }, { "cell_type": "markdown", "source": [ "# GTFS\n", "The General Transit Feed Specification (GTFS) is an open standard that formats public transport schedules and geographic data. GTFS allows public transit agencies to publish their data in a format that various software applications can consume, such as trip planners and API developers. This allows users easy access to travel information on smartphones and other devices.\n", "\n", "![gtfs.PNG]()" ], "metadata": { "id": "7ztDPf_wePoF" } }, { "cell_type": "markdown", "source": [ "GTFS includes information such as\n", "\n", "* Trip Name\n", "* Stops\n", "* Bus Routes\n", "* Fares\n", "* Time and more.\n", "\n", "When working with GTFS data, it’s important to understand the source and format of the data you are using. To fully explore and use the transit’s GTFS feeds, including the static and real-time data, visit their official website.\n", "\n", "[What is GTFS - General Transit Feed Specification](https://gtfs.org/getting-started/what-is-GTFS/?source=post_page-----49baea30833e--------------------------------)\n", "\n", "---" ], "metadata": { "id": "4uEO0li5e_D5" } }, { "cell_type": "markdown", "source": [ "# Data Portal\n", "We can use Calgary’s Open Data Portal to get real-time transit vehicle locations. Calgary Transit provides real-time data through the General Transit Feed Specification Realtime (GTFS-RT) format. These feeds offer live information on vehicle positions, trip updates, and service alerts.\n", "\n", "## Available GTFS-RT Feeds\n", "* Vehicle Positions: Provides real-time locations of transit vehicles.\n", "* Trip Updates: Offers real-time updates on scheduled trips, including delays and cancellations.\n", "* Service Alerts: Contains information on disruptions or changes in service.\n", "\n", "## Accessing the Feeds\n", "These feeds are accessible via Calgary’s Open Data Portal:\n", "\n", "* Vehicle Positions Feed — [Calgary Open Data](https://data.calgary.ca/Transportation-Transit/Calgary-Transit-Realtime-Vehicle-Positions-GTFS-RT/am7c-qe3u?utm_source=chatgpt.com)\n", "* Trip Updates Feed — [Calgary Open Data](https://data.calgary.ca/Transportation-Transit/Calgary-Transit-Realtime-Trip-Updates-GTFS-RT/gs4m-mdc2?utm_source=chatgpt.com)\n", "* Service Alerts — [Calgary Open Data](https://data.calgary.ca/Transportation-Transit/Calgary-Transit-Realtime-Service-Alerts-GTFS-RT/jhgn-ynqj)\n", "\n", "---" ], "metadata": { "id": "DogTaGHIfZny" } }, { "cell_type": "markdown", "source": [ "# Vehicle Positions Feed\n", "## Handling GTFS-RT Feeds with Python\n", "Install Required Libraries" ], "metadata": { "id": "KQlP0WNOgFlk" } }, { "cell_type": "code", "source": [ "pip install requests protobuf" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "E2QVMut1gLbH", "outputId": "2b84bd07-adc0-49e9-e907-64769e02144d" }, "execution_count": 1, "outputs": [ { "output_type": "stream", "name": "stdout", "text": [ "Requirement already satisfied: requests in /usr/local/lib/python3.11/dist-packages (2.32.3)\n", "Requirement already satisfied: protobuf in /usr/local/lib/python3.11/dist-packages (4.25.5)\n", "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.11/dist-packages (from requests) (3.4.1)\n", "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.11/dist-packages (from requests) (3.10)\n", "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.11/dist-packages (from requests) (2.3.0)\n", "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.11/dist-packages (from requests) (2024.12.14)\n" ] } ] }, { "cell_type": "markdown", "source": [ "The requests library is used to send HTTP requests in Python. Whereas the protobuf is a library for working with Protocol Buffers (Protobuf), a method developed by Google for serializing structured data. Long story short, if you are handling GTFS-RT feed, then requests is used to fetch and protobuf is used to parse the data from an API.\n", "\n", "## Generating Python code from a Protocol Buffer\n", "We need to use the Protocol buffer compiler protoc1, a tool provided by Google for working with .proto files. We shall use the gtfs-realtime.proto Protobuf definition file. It describes the structure of GTFS-RT messages, including FeedMessage, FeedHeader, FeedEntity, etc. The technical documentation follows below.\n", "\n", "[Protobuf - General Transit Feed Specification](https://gtfs.org/documentation/realtime/proto/?source=post_page-----49baea30833e--------------------------------)" ], "metadata": { "id": "VNGi1KNQgQoI" } }, { "cell_type": "code", "source": [ "!protoc --python_out=. gtfs-realtime.proto" ], "metadata": { "id": "VBlxVP1RgdUA" }, "execution_count": null, "outputs": [] }, { "cell_type": "markdown", "source": [ "Running this command creates a Python module, which isgtfs_realtime_pb2.py. If you don’t want the hassle, you can skip this step by just manually uploading the gtfs_realtime_pb2.py to your drive. Click [here](https://github.com/MobilityData/gtfs-realtime-bindings/blob/master/python/google/transit/gtfs_realtime_pb2.py) to access the file. To upload a file on Google Colab, you can write as:" ], "metadata": { "id": "K8aOcJWegfcG" } }, { "cell_type": "code", "source": [ "from google.colab import files\n", "files.upload()" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 1000 }, "id": "FqZbVs1Tgncg", "outputId": "aec00674-33c9-4ad5-86aa-96b768be5993" }, "execution_count": 3, "outputs": [ { "output_type": "display_data", "data": { "text/plain": [ "" ], "text/html": [ "\n", " \n", " \n", " Upload widget is only available when the cell has been executed in the\n", " current browser session. Please rerun this cell to enable.\n", " \n", " " ] }, "metadata": {} }, { "output_type": "stream", "name": "stdout", "text": [ "Saving gtfs_realtime_pb2.py to gtfs_realtime_pb2.py\n" ] }, { "output_type": "execute_result", "data": { "text/plain": [ "{'gtfs_realtime_pb2.py': b'#! /usr/bin/python\\n#\\n# Copyright 2016-2019 Google Inc., MobilityData\\n#\\n# Licensed under the Apache License, Version 2.0 (the \"License\");\\n# you may not use this file except in compliance with the License.\\n# You may obtain a copy of the License at\\n#\\n# http://www.apache.org/licenses/LICENSE-2.0\\n#\\n# Unless required by applicable law or agreed to in writing, software\\n# distributed under the License is distributed on an \"AS IS\" BASIS,\\n# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\\n# See the License for the specific language governing permissions and\\n# limitations under the License.\\n\\n# -*- coding: utf-8 -*-\\n# Generated by the protocol buffer compiler. DO NOT EDIT!\\n# source: gtfs-realtime.proto\\n\"\"\"Generated protocol buffer code.\"\"\"\\nfrom google.protobuf.internal import builder as _builder\\nfrom google.protobuf import descriptor as _descriptor\\nfrom google.protobuf import descriptor_pool as _descriptor_pool\\nfrom google.protobuf import symbol_database as _symbol_database\\n# @@protoc_insertion_point(imports)\\n\\n_sym_db = _symbol_database.Default()\\n\\n\\n\\n\\nDESCRIPTOR = _descriptor_pool.Default().AddSerializedFile(b\\'\\\\n\\\\x13gtfs-realtime.proto\\\\x12\\\\x10transit_realtime\\\\\"y\\\\n\\\\x0b\\\\x46\\\\x65\\\\x65\\\\x64Message\\\\x12,\\\\n\\\\x06header\\\\x18\\\\x01 \\\\x02(\\\\x0b\\\\x32\\\\x1c.transit_realtime.FeedHeader\\\\x12,\\\\n\\\\x06\\\\x65ntity\\\\x18\\\\x02 \\\\x03(\\\\x0b\\\\x32\\\\x1c.transit_realtime.FeedEntity*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\xd7\\\\x01\\\\n\\\\nFeedHeader\\\\x12\\\\x1d\\\\n\\\\x15gtfs_realtime_version\\\\x18\\\\x01 \\\\x02(\\\\t\\\\x12Q\\\\n\\\\x0eincrementality\\\\x18\\\\x02 \\\\x01(\\\\x0e\\\\x32+.transit_realtime.FeedHeader.Incrementality:\\\\x0c\\\\x46ULL_DATASET\\\\x12\\\\x11\\\\n\\\\ttimestamp\\\\x18\\\\x03 \\\\x01(\\\\x04\\\\\"4\\\\n\\\\x0eIncrementality\\\\x12\\\\x10\\\\n\\\\x0c\\\\x46ULL_DATASET\\\\x10\\\\x00\\\\x12\\\\x10\\\\n\\\\x0c\\\\x44IFFERENTIAL\\\\x10\\\\x01*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\xd2\\\\x01\\\\n\\\\nFeedEntity\\\\x12\\\\n\\\\n\\\\x02id\\\\x18\\\\x01 \\\\x02(\\\\t\\\\x12\\\\x19\\\\n\\\\nis_deleted\\\\x18\\\\x02 \\\\x01(\\\\x08:\\\\x05\\\\x66\\\\x61lse\\\\x12\\\\x31\\\\n\\\\x0btrip_update\\\\x18\\\\x03 \\\\x01(\\\\x0b\\\\x32\\\\x1c.transit_realtime.TripUpdate\\\\x12\\\\x32\\\\n\\\\x07vehicle\\\\x18\\\\x04 \\\\x01(\\\\x0b\\\\x32!.transit_realtime.VehiclePosition\\\\x12&\\\\n\\\\x05\\\\x61lert\\\\x18\\\\x05 \\\\x01(\\\\x0b\\\\x32\\\\x17.transit_realtime.Alert*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\x82\\\\x08\\\\n\\\\nTripUpdate\\\\x12.\\\\n\\\\x04trip\\\\x18\\\\x01 \\\\x02(\\\\x0b\\\\x32 .transit_realtime.TripDescriptor\\\\x12\\\\x34\\\\n\\\\x07vehicle\\\\x18\\\\x03 \\\\x01(\\\\x0b\\\\x32#.transit_realtime.VehicleDescriptor\\\\x12\\\\x45\\\\n\\\\x10stop_time_update\\\\x18\\\\x02 \\\\x03(\\\\x0b\\\\x32+.transit_realtime.TripUpdate.StopTimeUpdate\\\\x12\\\\x11\\\\n\\\\ttimestamp\\\\x18\\\\x04 \\\\x01(\\\\x04\\\\x12\\\\r\\\\n\\\\x05\\\\x64\\\\x65lay\\\\x18\\\\x05 \\\\x01(\\\\x05\\\\x12\\\\x44\\\\n\\\\x0ftrip_properties\\\\x18\\\\x06 \\\\x01(\\\\x0b\\\\x32+.transit_realtime.TripUpdate.TripProperties\\\\x1aQ\\\\n\\\\rStopTimeEvent\\\\x12\\\\r\\\\n\\\\x05\\\\x64\\\\x65lay\\\\x18\\\\x01 \\\\x01(\\\\x05\\\\x12\\\\x0c\\\\n\\\\x04time\\\\x18\\\\x02 \\\\x01(\\\\x03\\\\x12\\\\x13\\\\n\\\\x0buncertainty\\\\x18\\\\x03 \\\\x01(\\\\x05*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\x1a\\\\xa0\\\\x04\\\\n\\\\x0eStopTimeUpdate\\\\x12\\\\x15\\\\n\\\\rstop_sequence\\\\x18\\\\x01 \\\\x01(\\\\r\\\\x12\\\\x0f\\\\n\\\\x07stop_id\\\\x18\\\\x04 \\\\x01(\\\\t\\\\x12;\\\\n\\\\x07\\\\x61rrival\\\\x18\\\\x02 \\\\x01(\\\\x0b\\\\x32*.transit_realtime.TripUpdate.StopTimeEvent\\\\x12=\\\\n\\\\tdeparture\\\\x18\\\\x03 \\\\x01(\\\\x0b\\\\x32*.transit_realtime.TripUpdate.StopTimeEvent\\\\x12j\\\\n\\\\x15schedule_relationship\\\\x18\\\\x05 \\\\x01(\\\\x0e\\\\x32@.transit_realtime.TripUpdate.StopTimeUpdate.ScheduleRelationship:\\\\tSCHEDULED\\\\x12\\\\\\\\\\\\n\\\\x14stop_time_properties\\\\x18\\\\x06 \\\\x01(\\\\x0b\\\\x32>.transit_realtime.TripUpdate.StopTimeUpdate.StopTimeProperties\\\\x1a>\\\\n\\\\x12StopTimeProperties\\\\x12\\\\x18\\\\n\\\\x10\\\\x61ssigned_stop_id\\\\x18\\\\x01 \\\\x01(\\\\t*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"P\\\\n\\\\x14ScheduleRelationship\\\\x12\\\\r\\\\n\\\\tSCHEDULED\\\\x10\\\\x00\\\\x12\\\\x0b\\\\n\\\\x07SKIPPED\\\\x10\\\\x01\\\\x12\\\\x0b\\\\n\\\\x07NO_DATA\\\\x10\\\\x02\\\\x12\\\\x0f\\\\n\\\\x0bUNSCHEDULED\\\\x10\\\\x03*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\x1aY\\\\n\\\\x0eTripProperties\\\\x12\\\\x0f\\\\n\\\\x07trip_id\\\\x18\\\\x01 \\\\x01(\\\\t\\\\x12\\\\x12\\\\n\\\\nstart_date\\\\x18\\\\x02 \\\\x01(\\\\t\\\\x12\\\\x12\\\\n\\\\nstart_time\\\\x18\\\\x03 \\\\x01(\\\\t*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\xdf\\\\t\\\\n\\\\x0fVehiclePosition\\\\x12.\\\\n\\\\x04trip\\\\x18\\\\x01 \\\\x01(\\\\x0b\\\\x32 .transit_realtime.TripDescriptor\\\\x12\\\\x34\\\\n\\\\x07vehicle\\\\x18\\\\x08 \\\\x01(\\\\x0b\\\\x32#.transit_realtime.VehicleDescriptor\\\\x12,\\\\n\\\\x08position\\\\x18\\\\x02 \\\\x01(\\\\x0b\\\\x32\\\\x1a.transit_realtime.Position\\\\x12\\\\x1d\\\\n\\\\x15\\\\x63urrent_stop_sequence\\\\x18\\\\x03 \\\\x01(\\\\r\\\\x12\\\\x0f\\\\n\\\\x07stop_id\\\\x18\\\\x07 \\\\x01(\\\\t\\\\x12Z\\\\n\\\\x0e\\\\x63urrent_status\\\\x18\\\\x04 \\\\x01(\\\\x0e\\\\x32\\\\x33.transit_realtime.VehiclePosition.VehicleStopStatus:\\\\rIN_TRANSIT_TO\\\\x12\\\\x11\\\\n\\\\ttimestamp\\\\x18\\\\x05 \\\\x01(\\\\x04\\\\x12K\\\\n\\\\x10\\\\x63ongestion_level\\\\x18\\\\x06 \\\\x01(\\\\x0e\\\\x32\\\\x31.transit_realtime.VehiclePosition.CongestionLevel\\\\x12K\\\\n\\\\x10occupancy_status\\\\x18\\\\t \\\\x01(\\\\x0e\\\\x32\\\\x31.transit_realtime.VehiclePosition.OccupancyStatus\\\\x12\\\\x1c\\\\n\\\\x14occupancy_percentage\\\\x18\\\\n \\\\x01(\\\\r\\\\x12Q\\\\n\\\\x16multi_carriage_details\\\\x18\\\\x0b \\\\x03(\\\\x0b\\\\x32\\\\x31.transit_realtime.VehiclePosition.CarriageDetails\\\\x1a\\\\xd9\\\\x01\\\\n\\\\x0f\\\\x43\\\\x61rriageDetails\\\\x12\\\\n\\\\n\\\\x02id\\\\x18\\\\x01 \\\\x01(\\\\t\\\\x12\\\\r\\\\n\\\\x05label\\\\x18\\\\x02 \\\\x01(\\\\t\\\\x12^\\\\n\\\\x10occupancy_status\\\\x18\\\\x03 \\\\x01(\\\\x0e\\\\x32\\\\x31.transit_realtime.VehiclePosition.OccupancyStatus:\\\\x11NO_DATA_AVAILABLE\\\\x12 \\\\n\\\\x14occupancy_percentage\\\\x18\\\\x04 \\\\x01(\\\\x05:\\\\x02-1\\\\x12\\\\x19\\\\n\\\\x11\\\\x63\\\\x61rriage_sequence\\\\x18\\\\x05 \\\\x01(\\\\r*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"G\\\\n\\\\x11VehicleStopStatus\\\\x12\\\\x0f\\\\n\\\\x0bINCOMING_AT\\\\x10\\\\x00\\\\x12\\\\x0e\\\\n\\\\nSTOPPED_AT\\\\x10\\\\x01\\\\x12\\\\x11\\\\n\\\\rIN_TRANSIT_TO\\\\x10\\\\x02\\\\\"}\\\\n\\\\x0f\\\\x43ongestionLevel\\\\x12\\\\x1c\\\\n\\\\x18UNKNOWN_CONGESTION_LEVEL\\\\x10\\\\x00\\\\x12\\\\x14\\\\n\\\\x10RUNNING_SMOOTHLY\\\\x10\\\\x01\\\\x12\\\\x0f\\\\n\\\\x0bSTOP_AND_GO\\\\x10\\\\x02\\\\x12\\\\x0e\\\\n\\\\nCONGESTION\\\\x10\\\\x03\\\\x12\\\\x15\\\\n\\\\x11SEVERE_CONGESTION\\\\x10\\\\x04\\\\\"\\\\xd9\\\\x01\\\\n\\\\x0fOccupancyStatus\\\\x12\\\\t\\\\n\\\\x05\\\\x45MPTY\\\\x10\\\\x00\\\\x12\\\\x18\\\\n\\\\x14MANY_SEATS_AVAILABLE\\\\x10\\\\x01\\\\x12\\\\x17\\\\n\\\\x13\\\\x46\\\\x45W_SEATS_AVAILABLE\\\\x10\\\\x02\\\\x12\\\\x16\\\\n\\\\x12STANDING_ROOM_ONLY\\\\x10\\\\x03\\\\x12\\\\x1e\\\\n\\\\x1a\\\\x43RUSHED_STANDING_ROOM_ONLY\\\\x10\\\\x04\\\\x12\\\\x08\\\\n\\\\x04\\\\x46ULL\\\\x10\\\\x05\\\\x12\\\\x1c\\\\n\\\\x18NOT_ACCEPTING_PASSENGERS\\\\x10\\\\x06\\\\x12\\\\x15\\\\n\\\\x11NO_DATA_AVAILABLE\\\\x10\\\\x07\\\\x12\\\\x11\\\\n\\\\rNOT_BOARDABLE\\\\x10\\\\x08*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\x80\\\\t\\\\n\\\\x05\\\\x41lert\\\\x12\\\\x32\\\\n\\\\ractive_period\\\\x18\\\\x01 \\\\x03(\\\\x0b\\\\x32\\\\x1b.transit_realtime.TimeRange\\\\x12\\\\x39\\\\n\\\\x0finformed_entity\\\\x18\\\\x05 \\\\x03(\\\\x0b\\\\x32 .transit_realtime.EntitySelector\\\\x12;\\\\n\\\\x05\\\\x63\\\\x61use\\\\x18\\\\x06 \\\\x01(\\\\x0e\\\\x32\\\\x1d.transit_realtime.Alert.Cause:\\\\rUNKNOWN_CAUSE\\\\x12>\\\\n\\\\x06\\\\x65\\\\x66\\\\x66\\\\x65\\\\x63t\\\\x18\\\\x07 \\\\x01(\\\\x0e\\\\x32\\\\x1e.transit_realtime.Alert.Effect:\\\\x0eUNKNOWN_EFFECT\\\\x12/\\\\n\\\\x03url\\\\x18\\\\x08 \\\\x01(\\\\x0b\\\\x32\\\\\".transit_realtime.TranslatedString\\\\x12\\\\x37\\\\n\\\\x0bheader_text\\\\x18\\\\n \\\\x01(\\\\x0b\\\\x32\\\\\".transit_realtime.TranslatedString\\\\x12<\\\\n\\\\x10\\\\x64\\\\x65scription_text\\\\x18\\\\x0b \\\\x01(\\\\x0b\\\\x32\\\\\".transit_realtime.TranslatedString\\\\x12;\\\\n\\\\x0ftts_header_text\\\\x18\\\\x0c \\\\x01(\\\\x0b\\\\x32\\\\\".transit_realtime.TranslatedString\\\\x12@\\\\n\\\\x14tts_description_text\\\\x18\\\\r \\\\x01(\\\\x0b\\\\x32\\\\\".transit_realtime.TranslatedString\\\\x12O\\\\n\\\\x0eseverity_level\\\\x18\\\\x0e \\\\x01(\\\\x0e\\\\x32%.transit_realtime.Alert.SeverityLevel:\\\\x10UNKNOWN_SEVERITY\\\\\"\\\\xd8\\\\x01\\\\n\\\\x05\\\\x43\\\\x61use\\\\x12\\\\x11\\\\n\\\\rUNKNOWN_CAUSE\\\\x10\\\\x01\\\\x12\\\\x0f\\\\n\\\\x0bOTHER_CAUSE\\\\x10\\\\x02\\\\x12\\\\x15\\\\n\\\\x11TECHNICAL_PROBLEM\\\\x10\\\\x03\\\\x12\\\\n\\\\n\\\\x06STRIKE\\\\x10\\\\x04\\\\x12\\\\x11\\\\n\\\\rDEMONSTRATION\\\\x10\\\\x05\\\\x12\\\\x0c\\\\n\\\\x08\\\\x41\\\\x43\\\\x43IDENT\\\\x10\\\\x06\\\\x12\\\\x0b\\\\n\\\\x07HOLIDAY\\\\x10\\\\x07\\\\x12\\\\x0b\\\\n\\\\x07WEATHER\\\\x10\\\\x08\\\\x12\\\\x0f\\\\n\\\\x0bMAINTENANCE\\\\x10\\\\t\\\\x12\\\\x10\\\\n\\\\x0c\\\\x43ONSTRUCTION\\\\x10\\\\n\\\\x12\\\\x13\\\\n\\\\x0fPOLICE_ACTIVITY\\\\x10\\\\x0b\\\\x12\\\\x15\\\\n\\\\x11MEDICAL_EMERGENCY\\\\x10\\\\x0c\\\\\"\\\\xdd\\\\x01\\\\n\\\\x06\\\\x45\\\\x66\\\\x66\\\\x65\\\\x63t\\\\x12\\\\x0e\\\\n\\\\nNO_SERVICE\\\\x10\\\\x01\\\\x12\\\\x13\\\\n\\\\x0fREDUCED_SERVICE\\\\x10\\\\x02\\\\x12\\\\x16\\\\n\\\\x12SIGNIFICANT_DELAYS\\\\x10\\\\x03\\\\x12\\\\n\\\\n\\\\x06\\\\x44\\\\x45TOUR\\\\x10\\\\x04\\\\x12\\\\x16\\\\n\\\\x12\\\\x41\\\\x44\\\\x44ITIONAL_SERVICE\\\\x10\\\\x05\\\\x12\\\\x14\\\\n\\\\x10MODIFIED_SERVICE\\\\x10\\\\x06\\\\x12\\\\x10\\\\n\\\\x0cOTHER_EFFECT\\\\x10\\\\x07\\\\x12\\\\x12\\\\n\\\\x0eUNKNOWN_EFFECT\\\\x10\\\\x08\\\\x12\\\\x0e\\\\n\\\\nSTOP_MOVED\\\\x10\\\\t\\\\x12\\\\r\\\\n\\\\tNO_EFFECT\\\\x10\\\\n\\\\x12\\\\x17\\\\n\\\\x13\\\\x41\\\\x43\\\\x43\\\\x45SSIBILITY_ISSUE\\\\x10\\\\x0b\\\\\"H\\\\n\\\\rSeverityLevel\\\\x12\\\\x14\\\\n\\\\x10UNKNOWN_SEVERITY\\\\x10\\\\x01\\\\x12\\\\x08\\\\n\\\\x04INFO\\\\x10\\\\x02\\\\x12\\\\x0b\\\\n\\\\x07WARNING\\\\x10\\\\x03\\\\x12\\\\n\\\\n\\\\x06SEVERE\\\\x10\\\\x04*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"7\\\\n\\\\tTimeRange\\\\x12\\\\r\\\\n\\\\x05start\\\\x18\\\\x01 \\\\x01(\\\\x04\\\\x12\\\\x0b\\\\n\\\\x03\\\\x65nd\\\\x18\\\\x02 \\\\x01(\\\\x04*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"q\\\\n\\\\x08Position\\\\x12\\\\x10\\\\n\\\\x08latitude\\\\x18\\\\x01 \\\\x02(\\\\x02\\\\x12\\\\x11\\\\n\\\\tlongitude\\\\x18\\\\x02 \\\\x02(\\\\x02\\\\x12\\\\x0f\\\\n\\\\x07\\\\x62\\\\x65\\\\x61ring\\\\x18\\\\x03 \\\\x01(\\\\x02\\\\x12\\\\x10\\\\n\\\\x08odometer\\\\x18\\\\x04 \\\\x01(\\\\x01\\\\x12\\\\r\\\\n\\\\x05speed\\\\x18\\\\x05 \\\\x01(\\\\x02*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\xcd\\\\x02\\\\n\\\\x0eTripDescriptor\\\\x12\\\\x0f\\\\n\\\\x07trip_id\\\\x18\\\\x01 \\\\x01(\\\\t\\\\x12\\\\x10\\\\n\\\\x08route_id\\\\x18\\\\x05 \\\\x01(\\\\t\\\\x12\\\\x14\\\\n\\\\x0c\\\\x64irection_id\\\\x18\\\\x06 \\\\x01(\\\\r\\\\x12\\\\x12\\\\n\\\\nstart_time\\\\x18\\\\x02 \\\\x01(\\\\t\\\\x12\\\\x12\\\\n\\\\nstart_date\\\\x18\\\\x03 \\\\x01(\\\\t\\\\x12T\\\\n\\\\x15schedule_relationship\\\\x18\\\\x04 \\\\x01(\\\\x0e\\\\x32\\\\x35.transit_realtime.TripDescriptor.ScheduleRelationship\\\\\"t\\\\n\\\\x14ScheduleRelationship\\\\x12\\\\r\\\\n\\\\tSCHEDULED\\\\x10\\\\x00\\\\x12\\\\t\\\\n\\\\x05\\\\x41\\\\x44\\\\x44\\\\x45\\\\x44\\\\x10\\\\x01\\\\x12\\\\x0f\\\\n\\\\x0bUNSCHEDULED\\\\x10\\\\x02\\\\x12\\\\x0c\\\\n\\\\x08\\\\x43\\\\x41NCELED\\\\x10\\\\x03\\\\x12\\\\x13\\\\n\\\\x0bREPLACEMENT\\\\x10\\\\x05\\\\x1a\\\\x02\\\\x08\\\\x01\\\\x12\\\\x0e\\\\n\\\\nDUPLICATED\\\\x10\\\\x06*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"U\\\\n\\\\x11VehicleDescriptor\\\\x12\\\\n\\\\n\\\\x02id\\\\x18\\\\x01 \\\\x01(\\\\t\\\\x12\\\\r\\\\n\\\\x05label\\\\x18\\\\x02 \\\\x01(\\\\t\\\\x12\\\\x15\\\\n\\\\rlicense_plate\\\\x18\\\\x03 \\\\x01(\\\\t*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\xb0\\\\x01\\\\n\\\\x0e\\\\x45ntitySelector\\\\x12\\\\x11\\\\n\\\\tagency_id\\\\x18\\\\x01 \\\\x01(\\\\t\\\\x12\\\\x10\\\\n\\\\x08route_id\\\\x18\\\\x02 \\\\x01(\\\\t\\\\x12\\\\x12\\\\n\\\\nroute_type\\\\x18\\\\x03 \\\\x01(\\\\x05\\\\x12.\\\\n\\\\x04trip\\\\x18\\\\x04 \\\\x01(\\\\x0b\\\\x32 .transit_realtime.TripDescriptor\\\\x12\\\\x0f\\\\n\\\\x07stop_id\\\\x18\\\\x05 \\\\x01(\\\\t\\\\x12\\\\x14\\\\n\\\\x0c\\\\x64irection_id\\\\x18\\\\x06 \\\\x01(\\\\r*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N\\\\\"\\\\xa6\\\\x01\\\\n\\\\x10TranslatedString\\\\x12\\\\x43\\\\n\\\\x0btranslation\\\\x18\\\\x01 \\\\x03(\\\\x0b\\\\x32..transit_realtime.TranslatedString.Translation\\\\x1a=\\\\n\\\\x0bTranslation\\\\x12\\\\x0c\\\\n\\\\x04text\\\\x18\\\\x01 \\\\x02(\\\\t\\\\x12\\\\x10\\\\n\\\\x08language\\\\x18\\\\x02 \\\\x01(\\\\t*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90N*\\\\x06\\\\x08\\\\xe8\\\\x07\\\\x10\\\\xd0\\\\x0f*\\\\x06\\\\x08\\\\xa8\\\\x46\\\\x10\\\\x90NB\\\\x1d\\\\n\\\\x1b\\\\x63om.google.transit.realtime\\')\\n\\n_builder.BuildMessageAndEnumDescriptors(DESCRIPTOR, globals())\\n_builder.BuildTopDescriptorsAndMessages(DESCRIPTOR, \\'gtfs_realtime_pb2\\', globals())\\nif _descriptor._USE_C_DESCRIPTORS == False:\\n\\n DESCRIPTOR._options = None\\n DESCRIPTOR._serialized_options = b\\'\\\\n\\\\033com.google.transit.realtime\\'\\n _TRIPDESCRIPTOR_SCHEDULERELATIONSHIP.values_by_name[\"REPLACEMENT\"]._options = None\\n _TRIPDESCRIPTOR_SCHEDULERELATIONSHIP.values_by_name[\"REPLACEMENT\"]._serialized_options = b\\'\\\\010\\\\001\\'\\n _FEEDMESSAGE._serialized_start=41\\n _FEEDMESSAGE._serialized_end=162\\n _FEEDHEADER._serialized_start=165\\n _FEEDHEADER._serialized_end=380\\n _FEEDHEADER_INCREMENTALITY._serialized_start=312\\n _FEEDHEADER_INCREMENTALITY._serialized_end=364\\n _FEEDENTITY._serialized_start=383\\n _FEEDENTITY._serialized_end=593\\n _TRIPUPDATE._serialized_start=596\\n _TRIPUPDATE._serialized_end=1622\\n _TRIPUPDATE_STOPTIMEEVENT._serialized_start=887\\n _TRIPUPDATE_STOPTIMEEVENT._serialized_end=968\\n _TRIPUPDATE_STOPTIMEUPDATE._serialized_start=971\\n _TRIPUPDATE_STOPTIMEUPDATE._serialized_end=1515\\n _TRIPUPDATE_STOPTIMEUPDATE_STOPTIMEPROPERTIES._serialized_start=1355\\n _TRIPUPDATE_STOPTIMEUPDATE_STOPTIMEPROPERTIES._serialized_end=1417\\n _TRIPUPDATE_STOPTIMEUPDATE_SCHEDULERELATIONSHIP._serialized_start=1419\\n _TRIPUPDATE_STOPTIMEUPDATE_SCHEDULERELATIONSHIP._serialized_end=1499\\n _TRIPUPDATE_TRIPPROPERTIES._serialized_start=1517\\n _TRIPUPDATE_TRIPPROPERTIES._serialized_end=1606\\n _VEHICLEPOSITION._serialized_start=1625\\n _VEHICLEPOSITION._serialized_end=2872\\n _VEHICLEPOSITION_CARRIAGEDETAILS._serialized_start=2219\\n _VEHICLEPOSITION_CARRIAGEDETAILS._serialized_end=2436\\n _VEHICLEPOSITION_VEHICLESTOPSTATUS._serialized_start=2438\\n _VEHICLEPOSITION_VEHICLESTOPSTATUS._serialized_end=2509\\n _VEHICLEPOSITION_CONGESTIONLEVEL._serialized_start=2511\\n _VEHICLEPOSITION_CONGESTIONLEVEL._serialized_end=2636\\n _VEHICLEPOSITION_OCCUPANCYSTATUS._serialized_start=2639\\n _VEHICLEPOSITION_OCCUPANCYSTATUS._serialized_end=2856\\n _ALERT._serialized_start=2875\\n _ALERT._serialized_end=4027\\n _ALERT_CAUSE._serialized_start=3497\\n _ALERT_CAUSE._serialized_end=3713\\n _ALERT_EFFECT._serialized_start=3716\\n _ALERT_EFFECT._serialized_end=3937\\n _ALERT_SEVERITYLEVEL._serialized_start=3939\\n _ALERT_SEVERITYLEVEL._serialized_end=4011\\n _TIMERANGE._serialized_start=4029\\n _TIMERANGE._serialized_end=4084\\n _POSITION._serialized_start=4086\\n _POSITION._serialized_end=4199\\n _TRIPDESCRIPTOR._serialized_start=4202\\n _TRIPDESCRIPTOR._serialized_end=4535\\n _TRIPDESCRIPTOR_SCHEDULERELATIONSHIP._serialized_start=4403\\n _TRIPDESCRIPTOR_SCHEDULERELATIONSHIP._serialized_end=4519\\n _VEHICLEDESCRIPTOR._serialized_start=4537\\n _VEHICLEDESCRIPTOR._serialized_end=4622\\n _ENTITYSELECTOR._serialized_start=4625\\n _ENTITYSELECTOR._serialized_end=4801\\n _TRANSLATEDSTRING._serialized_start=4804\\n _TRANSLATEDSTRING._serialized_end=4970\\n _TRANSLATEDSTRING_TRANSLATION._serialized_start=4893\\n _TRANSLATEDSTRING_TRANSLATION._serialized_end=4954\\n# @@protoc_insertion_point(module_scope)\\n'}" ] }, "metadata": {}, "execution_count": 3 } ] }, { "cell_type": "markdown", "source": [ "Your directory should look something like this. Use the !ls command.\n", "\n" ], "metadata": { "id": "vnD48OhWg4UO" } }, { "cell_type": "code", "source": [ "!ls" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "g3fJU2HggtWT", "outputId": "f25a1326-655f-4f91-96f0-788725735656" }, "execution_count": 4, "outputs": [ { "output_type": "stream", "name": "stdout", "text": [ "drive gtfs_realtime_pb2.py sample_data\n" ] } ] }, { "cell_type": "markdown", "source": [ "The whole point of using gtfs_realtime_pb2.py is to provide Python classes and methods that make it easy to work with GTFS-RT Protobuf (Protocol Buffers) encoded data. Without it, manual parsing and interpretation of the binary data would be necessary, which is both error-prone and impractical.\n", "\n", "## Fetch Calgary Transit’s GTFS-RT Feed\n", "Now let’s fetch and parse the data from the feed. Try uploading the gtfs_realtime_pb2.pymanually because if there is a mismatch between versions of the .proto file, then that will lead to a TypeError." ], "metadata": { "id": "DHCw3gIOg7s9" } }, { "cell_type": "code", "source": [ "# Import the requests library to handle HTTP requests\n", "import requests\n", "import pandas as pd\n", "import gtfs_realtime_pb2\n", "\n", "def fetch_gtfs_rt_feed(url):\n", " \"\"\"\n", " Fetches GTFS real-time data from the given URL.\n", " Args:\n", " url (str): The URL to fetch the GTFS real-time data from.\n", " Returns:\n", " feed: A GTFS-RT FeedMessage object containing the parsed data, or None if an error occurs.\n", " \"\"\"\n", " try:\n", " # Send a GET request to the provided URL\n", " response = requests.get(url)\n", "\n", " # Check if the response status code is 200 (OK)\n", " if response.status_code == 200:\n", " # Initialize a FeedMessage object from gtfs_realtime_pb2\n", " feed = gtfs_realtime_pb2.FeedMessage()\n", "\n", " # Parse the response content into the FeedMessage object\n", " feed.ParseFromString(response.content)\n", " return feed\n", " else:\n", " # Print an error message if the status code is not 200\n", " print(f\"Error fetching data: {response.status_code} - {response.reason}\")\n", " return None\n", " except Exception as e:\n", " # Print an error message if an exception occurs\n", " print(f\"An error occurred: {e}\")\n", " return None\n", "# URL for GTFS real-time vehicle positions data\n", "vehicle_positions_url = \"https://data.calgary.ca/download/am7c-qe3u/application%2Foctet-stream\"\n", "# Fetch the GTFS real-time data from the specified URL\n", "feed = fetch_gtfs_rt_feed(vehicle_positions_url)\n", "# Check if the feed was fetched successfully\n", "if feed:\n", " vehicle_data = [] # Initialize an empty list to store vehicle information\n", "\n", " # Loop through the first 5 entities in the feed\n", " for entity in feed.entity[:5]: # [:5] ensures we only process the first 5 entities\n", " if entity.HasField('vehicle'): # Check if the entity contains vehicle data\n", " vehicle = entity.vehicle # Extract the vehicle field\n", "\n", " # Create a dictionary with relevant vehicle information\n", " vehicle_info = {\n", " \"Vehicle ID\": vehicle.vehicle.id, # Vehicle identifier\n", " \"Latitude\": vehicle.position.latitude, # Latitude position of the vehicle\n", " \"Longitude\": vehicle.position.longitude # Longitude position of the vehicle\n", " }\n", "\n", " # Append the vehicle information dictionary to the list\n", " vehicle_data.append(vehicle_info)\n", "\n", " # Create a DataFrame from the list of dictionaries\n", " df = pd.DataFrame(vehicle_data)\n", "\n", " # Print the DataFrame to display the data in a tabular format\n", " print(df)" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "npB3t6C8g_qN", "outputId": "6c14e8b5-ba0c-4f94-94d8-b4bc76bc1619" }, "execution_count": 6, "outputs": [ { "output_type": "stream", "name": "stdout", "text": [ " Vehicle ID Latitude Longitude\n", "0 8241 51.071911 -113.981918\n", "1 1200 51.057251 -114.020523\n", "2 8312 51.121761 -114.071648\n", "3 8274 51.123436 -114.071243\n", "4 8254 51.092236 -114.033279\n" ] } ] }, { "cell_type": "markdown", "source": [ "## Explanation\n", "First, you are using the requests library to fetch the data from the URL. Second, you execute the function fetch_gtfs_rt_feed that retrieves and parses the GTFS real-time feed. Third, you check the response to ensure successful data retrieval. Fourth, the system implements error handling to catch and print any errors. Finally, we then process the feed, which extracts and prints vehicle information like ID and position if available. To neaten the results, I used the Pandas library to display the data.\n", "\n", "## Output" ], "metadata": { "id": "BEbxNsuchSLQ" } }, { "cell_type": "code", "source": [ "df" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 206 }, "id": "frJUgObkhQCy", "outputId": "2ce6e1f7-f008-445c-e098-e7ff74d3b1bf" }, "execution_count": 7, "outputs": [ { "output_type": "execute_result", "data": { "text/plain": [ " Vehicle ID Latitude Longitude\n", "0 8241 51.071911 -113.981918\n", "1 1200 51.057251 -114.020523\n", "2 8312 51.121761 -114.071648\n", "3 8274 51.123436 -114.071243\n", "4 8254 51.092236 -114.033279" ], "text/html": [ "\n", "
\n", "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Vehicle IDLatitudeLongitude
0824151.071911-113.981918
1120051.057251-114.020523
2831251.121761-114.071648
3827451.123436-114.071243
4825451.092236-114.033279
\n", "
\n", "
\n", "\n", "
\n", " \n", "\n", " \n", "\n", " \n", "
\n", "\n", "\n", "
\n", " \n", "\n", "\n", "\n", " \n", "
\n", "\n", "
\n", " \n", " \n", " \n", "
\n", "\n", "
\n", "
\n" ], "application/vnd.google.colaboratory.intrinsic+json": { "type": "dataframe", "variable_name": "df", "summary": "{\n \"name\": \"df\",\n \"rows\": 5,\n \"fields\": [\n {\n \"column\": \"Vehicle ID\",\n \"properties\": {\n \"dtype\": \"string\",\n \"num_unique_values\": 5,\n \"samples\": [\n \"1200\",\n \"8254\",\n \"8312\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Latitude\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.02948042780870442,\n \"min\": 51.0572509765625,\n \"max\": 51.123435974121094,\n \"num_unique_values\": 5,\n \"samples\": [\n 51.0572509765625,\n 51.09223556518555,\n 51.121761322021484\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Longitude\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.03769688871650909,\n \"min\": -114.07164764404297,\n \"max\": -113.98191833496094,\n \"num_unique_values\": 5,\n \"samples\": [\n -114.02052307128906,\n -114.03327941894531,\n -114.07164764404297\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}" } }, "metadata": {}, "execution_count": 7 } ] }, { "cell_type": "markdown", "source": [ "The data that you are seeing is raw real-time information that comprises vehicle ID and its position (Latitude and Longitude). In some cases, the vehicle ID might directly relate to the Bus number (For example Vehicle ID 1280 will be Bus 128). It all depends on the City’s style of encoding the data. Contact the City Commission to get more accurate results.\n", "\n", "## Mapping the Location\n", "Let’s use folium library to map the data. Folium easily visualizes data manipulated in Python. Visit the documentation below.\n", "\n", "[Folium Documentation](https://python-visualization.github.io/folium/latest/?source=post_page-----49baea30833e--------------------------------)\n", "\n", "You need to install the folium library\n", "\n" ], "metadata": { "id": "nuUHduIbhbyV" } }, { "cell_type": "code", "source": [ "!pip install folium" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "oXyJfkKGhmkt", "outputId": "8433f6b7-7521-417e-eb79-b87994af888d" }, "execution_count": 8, "outputs": [ { "output_type": "stream", "name": "stdout", "text": [ "Requirement already satisfied: folium in /usr/local/lib/python3.11/dist-packages (0.19.4)\n", "Requirement already satisfied: branca>=0.6.0 in /usr/local/lib/python3.11/dist-packages (from folium) (0.8.1)\n", "Requirement already satisfied: jinja2>=2.9 in /usr/local/lib/python3.11/dist-packages (from folium) (3.1.5)\n", "Requirement already satisfied: numpy in /usr/local/lib/python3.11/dist-packages (from folium) (1.26.4)\n", "Requirement already satisfied: requests in /usr/local/lib/python3.11/dist-packages (from folium) (2.32.3)\n", "Requirement already satisfied: xyzservices in /usr/local/lib/python3.11/dist-packages (from folium) (2024.9.0)\n", "Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.11/dist-packages (from jinja2>=2.9->folium) (3.0.2)\n", "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.11/dist-packages (from requests->folium) (3.4.1)\n", "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.11/dist-packages (from requests->folium) (3.10)\n", "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.11/dist-packages (from requests->folium) (2.3.0)\n", "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.11/dist-packages (from requests->folium) (2024.12.14)\n" ] } ] }, { "cell_type": "markdown", "source": [ "Folium is used to create interactive maps. The reason I am using it is that it’s simple and easy to understand. Let me know if you come across any other similar libraries that get the job done.\n", "\n" ], "metadata": { "id": "5ppAUlRXhrCh" } }, { "cell_type": "code", "source": [ "import folium # Importing the folium library to work with interactive maps\n", "\n", "def plot_vehicle_on_map(latitude, longitude, vehicle_id):\n", " \"\"\"\n", " Plots the vehicle's location on a map using its latitude, longitude, and ID.\n", "\n", " Parameters:\n", " latitude (float): The latitude of the vehicle's location.\n", " longitude (float): The longitude of the vehicle's location.\n", " vehicle_id (str): The unique identifier for the vehicle.\n", "\n", " Returns:\n", " folium.Map: A Folium map centered on the vehicle's location with a marker.\n", " \"\"\"\n", " # Create a map centered at the vehicle's location with a zoom level of 14\n", " vehicle_map = folium.Map(location=[latitude, longitude], zoom_start=14)\n", "\n", " # Add a marker to the map at the vehicle's location\n", " folium.Marker(\n", " location=[latitude, longitude], # The latitude and longitude of the marker\n", " popup=f\"Vehicle ID: {vehicle_id}\", # Popup text to display when the marker is clicked\n", " icon=folium.Icon(color=\"blue\", icon=\"bus\", prefix=\"fa\"), # Custom icon for the marker\n", " ).add_to(vehicle_map) # Add the marker to the map\n", "\n", " return vehicle_map # Return the map object\n", "# Example data for a vehicle's location and ID\n", "latitude = 51.071911 # Example latitude value\n", "longitude = -113.981918 # Example longitude value\n", "vehicle_id = \"8241\" # Example vehicle ID\n", "# Generate the map with the example vehicle data\n", "map_output = plot_vehicle_on_map(latitude, longitude, vehicle_id)" ], "metadata": { "id": "E9X90mbJhrs-" }, "execution_count": 10, "outputs": [] }, { "cell_type": "markdown", "source": [ "## Output\n", "Upon execution of the above code, you will get the exact location of the Bus with the ID of 8080. Now keep in mind this data is dynamic. According to the Calgary Data Portal, this data changes every 30 minutes or even sooner." ], "metadata": { "id": "U2AC7q5_h_kK" } }, { "cell_type": "code", "source": [ "map_output # Display the map" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 711 }, "id": "o6s8dShxiGXR", "outputId": "9821c1c2-ca0f-4d90-f589-0f991637733b" }, "execution_count": 11, "outputs": [ { "output_type": "execute_result", "data": { "text/plain": [ "" ], "text/html": [ "
Make this Notebook Trusted to load map: File -> Trust Notebook
" ] }, "metadata": {}, "execution_count": 11 } ] }, { "cell_type": "markdown", "source": [ "# Trip Updates Feed\n", "The GTFS Realtime Trip Updates feed contains information about real-time updates to scheduled trips, such as delays, changes in stop times, and other dynamic data. But in this case, the Calgary real-time feed only provides Trip ID, Start Time, Start Date, Stop ID, Arrival Time, and Departure Time. The feed design reflects trip delays; for example, a trip scheduled for 8:00 AM with a 10-minute delay will show this updated information. The update will also include this ID if bus 8080 is running the trip." ], "metadata": { "id": "ae-UQFeGiKP0" } }, { "cell_type": "code", "source": [ "import requests\n", "import gtfs_realtime_pb2 # Import the compiled GTFS Realtime protocol buffer\n", "import pandas as pd\n", "from datetime import datetime\n", "\n", "def fetch_gtfs_rt_trip_updates(url):\n", " \"\"\"Fetches and parses the GTFS Realtime Trip Updates feed from the given URL.\"\"\"\n", " try:\n", " # Make a GET request to fetch the data from the specified URL\n", " response = requests.get(url)\n", " if response.status_code == 200:\n", " # Parse the response content into a FeedMessage object\n", " feed = gtfs_realtime_pb2.FeedMessage()\n", " feed.ParseFromString(response.content)\n", " return feed\n", " else:\n", " # Print an error message if the response status is not OK\n", " print(f\"Error fetching data: {response.status_code} - {response.reason}\")\n", " return None\n", " except Exception as e:\n", " # Catch and print any exceptions that occur during the request\n", " print(f\"An error occurred: {e}\")\n", " return None\n", "def extract_trip_updates(feed):\n", " \"\"\"Extracts trip update information from the GTFS Realtime feed.\"\"\"\n", " trip_updates = []\n", "\n", " # Loop through each entity in the feed\n", " for entity in feed.entity:\n", " if entity.HasField('trip_update'):\n", " # Extract the trip update data\n", " trip_update = entity.trip_update\n", " trip_id = trip_update.trip.trip_id\n", " start_time = trip_update.trip.start_time\n", " start_date = trip_update.trip.start_date\n", "\n", " # Loop through each stop time update in the trip update\n", " for stop_time_update in trip_update.stop_time_update:\n", " stop_id = stop_time_update.stop_id\n", " # Extract arrival and departure times, if available\n", " arrival_time = stop_time_update.arrival.time if stop_time_update.HasField('arrival') else None\n", " departure_time = stop_time_update.departure.time if stop_time_update.HasField('departure') else None\n", "\n", " # Convert timestamps to human-readable format\n", " arrival_time = datetime.utcfromtimestamp(arrival_time).strftime('%Y-%m-%d %H:%M:%S') if arrival_time else None\n", " departure_time = datetime.utcfromtimestamp(departure_time).strftime('%Y-%m-%d %H:%M:%S') if departure_time else None\n", "\n", " # Add the extracted information to the trip updates list\n", " trip_updates.append({\n", " \"Trip ID\": trip_id,\n", " \"Start Time\": start_time,\n", " \"Start Date\": start_date,\n", " \"Stop ID\": stop_id,\n", " \"Arrival Time\": arrival_time,\n", " \"Departure Time\": departure_time\n", " })\n", " return trip_updates\n", "# URL for GTFS Realtime Trip Updates\n", "trip_updates_url = \"https://data.calgary.ca/download/gs4m-mdc2/application%2Foctet-stream\" # Replace with the actual Trip Updates URL\n", "# Fetch the trip updates feed\n", "feed = fetch_gtfs_rt_trip_updates(trip_updates_url)\n", "if feed:\n", " # Extract the trip updates from the feed\n", " trip_updates = extract_trip_updates(feed)\n", "\n", " # Convert the trip updates into a DataFrame for easy manipulation and display\n", " df_trip_updates = pd.DataFrame(trip_updates)\n", "\n", " # Display the first 10 rows of the DataFrame\n", " print(df_trip_updates.head(10))" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "em0SgaSQiOLY", "outputId": "b8ade906-5939-4982-9997-3d931b5aabc5" }, "execution_count": 12, "outputs": [ { "output_type": "stream", "name": "stdout", "text": [ " Trip ID Start Time Start Date Stop ID Arrival Time \\\n", "0 69082338 6212 2025-01-17 04:33:00 \n", "1 69082338 4343 2025-01-17 04:34:00 \n", "2 69082338 6674 2025-01-17 04:35:00 \n", "3 69082338 6213 2025-01-17 04:36:00 \n", "4 69082338 4342 2025-01-17 04:37:00 \n", "5 69082338 6214 2025-01-17 04:38:00 \n", "6 69082338 6962 2025-01-17 04:39:00 \n", "7 69082338 6705 2025-01-17 04:41:00 \n", "8 69082338 6963 2025-01-17 04:41:00 \n", "9 69082338 6964 2025-01-17 04:42:00 \n", "\n", " Departure Time \n", "0 2025-01-17 04:33:00 \n", "1 2025-01-17 04:34:00 \n", "2 2025-01-17 04:35:00 \n", "3 2025-01-17 04:36:00 \n", "4 2025-01-17 04:37:00 \n", "5 2025-01-17 04:38:00 \n", "6 2025-01-17 04:39:00 \n", "7 2025-01-17 04:41:00 \n", "8 2025-01-17 04:41:00 \n", "9 2025-01-17 04:42:00 \n" ] } ] }, { "cell_type": "markdown", "source": [ "## Output" ], "metadata": { "id": "DC-W9Dguh6a9" } }, { "cell_type": "code", "source": [ "df_trip_updates.head(10)" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 363 }, "id": "h-C5IRotiTAv", "outputId": "63cd07f1-19a3-40b0-945e-a7130dd927e5" }, "execution_count": 15, "outputs": [ { "output_type": "execute_result", "data": { "text/plain": [ " Trip ID Start Time Start Date Stop ID Arrival Time \\\n", "0 69082338 6212 2025-01-17 04:33:00 \n", "1 69082338 4343 2025-01-17 04:34:00 \n", "2 69082338 6674 2025-01-17 04:35:00 \n", "3 69082338 6213 2025-01-17 04:36:00 \n", "4 69082338 4342 2025-01-17 04:37:00 \n", "5 69082338 6214 2025-01-17 04:38:00 \n", "6 69082338 6962 2025-01-17 04:39:00 \n", "7 69082338 6705 2025-01-17 04:41:00 \n", "8 69082338 6963 2025-01-17 04:41:00 \n", "9 69082338 6964 2025-01-17 04:42:00 \n", "\n", " Departure Time \n", "0 2025-01-17 04:33:00 \n", "1 2025-01-17 04:34:00 \n", "2 2025-01-17 04:35:00 \n", "3 2025-01-17 04:36:00 \n", "4 2025-01-17 04:37:00 \n", "5 2025-01-17 04:38:00 \n", "6 2025-01-17 04:39:00 \n", "7 2025-01-17 04:41:00 \n", "8 2025-01-17 04:41:00 \n", "9 2025-01-17 04:42:00 " ], "text/html": [ "\n", "
\n", "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Trip IDStart TimeStart DateStop IDArrival TimeDeparture Time
06908233862122025-01-17 04:33:002025-01-17 04:33:00
16908233843432025-01-17 04:34:002025-01-17 04:34:00
26908233866742025-01-17 04:35:002025-01-17 04:35:00
36908233862132025-01-17 04:36:002025-01-17 04:36:00
46908233843422025-01-17 04:37:002025-01-17 04:37:00
56908233862142025-01-17 04:38:002025-01-17 04:38:00
66908233869622025-01-17 04:39:002025-01-17 04:39:00
76908233867052025-01-17 04:41:002025-01-17 04:41:00
86908233869632025-01-17 04:41:002025-01-17 04:41:00
96908233869642025-01-17 04:42:002025-01-17 04:42:00
\n", "
\n", "
\n", "\n", "
\n", " \n", "\n", " \n", "\n", " \n", "
\n", "\n", "\n", "
\n", " \n", "\n", "\n", "\n", " \n", "
\n", "\n", "
\n", "
\n" ], "application/vnd.google.colaboratory.intrinsic+json": { "type": "dataframe", "variable_name": "df_trip_updates", "summary": "{\n \"name\": \"df_trip_updates\",\n \"rows\": 10999,\n \"fields\": [\n {\n \"column\": \"Trip ID\",\n \"properties\": {\n \"dtype\": \"category\",\n \"num_unique_values\": 554,\n \"samples\": [\n \"69083977\",\n \"69083929\",\n \"69088118\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Start Time\",\n \"properties\": {\n \"dtype\": \"object\",\n \"num_unique_values\": 1,\n \"samples\": [\n \"\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Start Date\",\n \"properties\": {\n \"dtype\": \"object\",\n \"num_unique_values\": 3,\n \"samples\": [\n \"\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Stop ID\",\n \"properties\": {\n \"dtype\": \"category\",\n \"num_unique_values\": 5198,\n \"samples\": [\n \"4652\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Arrival Time\",\n \"properties\": {\n \"dtype\": \"object\",\n \"num_unique_values\": 105,\n \"samples\": [\n \"2025-01-17 05:14:00\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Departure Time\",\n \"properties\": {\n \"dtype\": \"object\",\n \"num_unique_values\": 102,\n \"samples\": [\n \"2025-01-17 05:14:00\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}" } }, "metadata": {}, "execution_count": 15 } ] }, { "cell_type": "markdown", "source": [ "If you’re not receiving start_time and start_date from the GTFS Realtime Trip Updates, it might be because those fields are optional and not always provided in the feed. This might be for security reasons as well. Upon contacting the City’s Transit service, you can get this vital information. I hope you understand the point.\n", "\n", "[Calgary Transit Realtime Trip Updates GTFS-RT](https://data.calgary.ca/Transportation-Transit/Calgary-Transit-Realtime-Trip-Updates-GTFS-RT/gs4m-mdc2/about_data?utm_source=chatgpt.com&source=post_page-----49baea30833e--------------------------------)\n", "\n", "---\n", "\n", "## Side Note\n", "\n", "You see how there are different IDs involved, such as Trip and the Stop. Now, if all the data is available, you can easily find the corresponding bus that has been running in those routes. Let me know if you can connect those dots. I would be happy to Colab with you and make this a working project.\n", "\n", "---\n", "\n", "# Service Alerts\n", "\n", "Calgary Transit refreshes its real-time data every half minute. To learn more about the [GTFS-RT specification](https://developers.google.com/transit/gtfs-realtime/) and its [components](https://developers.google.com/transit/gtfs-realtime/guides/feed-entities) ([Trip Updates](https://data.calgary.ca/dataset/GTFS-RT-Trip-Updates/gs4m-mdc2), [Service Alerts](https://data.calgary.ca/Transportation-Transit/Calgary-Transit-Realtime-Service-Alerts-GTFS-RT/jhgn-ynqj/about_data), and [Vehicle Positions](https://data.calgary.ca/dataset/GTFS-RT-Vehicle-Positions/am7c-qe3u)), check out the Google Transit API page. Also, see [Service Updates](http://www.calgarytransit.com/service-updates?nid=170214). Let’s see what the service alerts look like." ], "metadata": { "id": "efffCccxidRn" } }, { "cell_type": "code", "source": [ "import requests # Library to make HTTP requests\n", "import gtfs_realtime_pb2 # Ensure this proto file is compiled as Python\n", "import pandas as pd # For handling and displaying data in DataFrame format\n", "\n", "# Function to fetch the GTFS Realtime Alerts feed\n", "def fetch_gtfs_rt_alerts(url):\n", " \"\"\"Fetches and parses the GTFS Realtime Alerts feed from the given URL.\"\"\"\n", " try:\n", " # Send a request to the URL to get the feed\n", " response = requests.get(url)\n", "\n", " # Check if the response is successful (status code 200)\n", " if response.status_code == 200:\n", " # Parse the feed using GTFS Realtime protocol\n", " feed = gtfs_realtime_pb2.FeedMessage()\n", " feed.ParseFromString(response.content)\n", " return feed # Return the parsed feed\n", " else:\n", " # Print error if the response status is not 200\n", " print(f\"Error fetching data: {response.status_code} - {response.reason}\")\n", " return None\n", " except Exception as e:\n", " # Catch and print any exception that occurs during the request\n", " print(f\"An error occurred: {e}\")\n", " return None\n", "\n", "# Function to extract alerts from the GTFS Realtime feed\n", "def extract_alerts(feed):\n", " \"\"\"Extracts alert information from the GTFS Realtime feed.\"\"\"\n", " alerts = [] # Initialize an empty list to store alert information\n", "\n", " # Loop through each entity in the feed\n", " for entity in feed.entity:\n", " # Check if the entity contains an alert\n", " if entity.HasField('alert'):\n", " alert = entity.alert\n", " # Extract relevant fields from the alert\n", " alert_id = entity.id # Unique ID for the alert\n", " # Extract header text from the alert (if available)\n", " header_text = alert.header_text.translation[0].text if alert.header_text.translation else \"No header\"\n", " # Extract description text from the alert (if available)\n", " description_text = alert.description_text.translation[0].text if alert.description_text.translation else \"No description\"\n", " severity_level = alert.severity_level # Severity level of the alert (e.g., low, medium, high)\n", "\n", " # Append the extracted alert information to the alerts list\n", " alerts.append({\n", " \"Alert ID\": alert_id,\n", " \"Header\": header_text,\n", " \"Description\": description_text,\n", " \"Severity Level\": severity_level\n", " })\n", "\n", " # Return the list of alerts\n", " return alerts\n", "\n", "# URL for GTFS Realtime Alerts (replace with the actual URL)\n", "alerts_url = \"https://data.calgary.ca/download/jhgn-ynqj/application%2Foctet-stream\" # Example placeholder URL\n", "\n", "# Fetch the alerts feed\n", "feed = fetch_gtfs_rt_alerts(alerts_url)\n", "\n", "# If feed is fetched successfully, extract the alerts\n", "if feed:\n", " alerts = extract_alerts(feed)\n", "\n", " # Convert the list of alerts into a DataFrame for easier viewing\n", " df_alerts = pd.DataFrame(alerts)\n", "\n", " # Display the first 5 rows of the alerts DataFrame\n", " print(df_alerts.head(5))" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "BmeyPyCEi-lt", "outputId": "a80e6f35-8225-4e1c-9147-5eed6162b294" }, "execution_count": 17, "outputs": [ { "output_type": "stream", "name": "stdout", "text": [ " Alert ID Header Description \\\n", "0 166173
\\n

Starting ... \n", "1 166174

Starting Th... \n", "2 166175

\\n

Start... \n", "3 166177

\\n

Starting ... \n", "4 166178

\\n

Starting ... \n", "\n", " Severity Level \n", "0 1 \n", "1 1 \n", "2 1 \n", "3 1 \n", "4 1 \n" ] } ] }, { "cell_type": "markdown", "source": [ "You can read the documentation of the Service Alerts below.\n", "\n", "[Calgary Transit Realtime Service Alerts GTFS - RT](https://data.calgary.ca/Transportation-Transit/Calgary-Transit-Realtime-Service-Alerts-GTFS-RT/jhgn-ynqj/about_data?source=post_page-----49baea30833e--------------------------------)\n", "\n", "Upon executing the above code snippet, I noticed that the data was not in the readable format. The descriptions were not fully displayed and contained HTML tags. So that’s why I have used BeautifulSoup library to clean this up, you can trim leading and trailing spaces or newlines from the alert text.\n", "\n" ], "metadata": { "id": "u8RJ_PxxjOop" } }, { "cell_type": "code", "source": [ "from bs4 import BeautifulSoup # Import the BeautifulSoup library to parse and clean HTML\n", "\n", "def clean_html(raw_html):\n", " \"\"\"Removes HTML tags and returns plain text.\"\"\"\n", " # Create a BeautifulSoup object to parse the HTML content\n", " soup = BeautifulSoup(raw_html, 'html.parser')\n", " # Use the get_text() method to extract and return the plain text from the HTML\n", " return soup.get_text()\n", "# Loop through each alert and clean the \"Header\" and \"Description\" fields by removing HTML tags\n", "for alert in alerts:\n", " # Apply the clean_html function to the \"Header\" field\n", " alert[\"Header\"] = clean_html(alert[\"Header\"])\n", " # Apply the clean_html function to the \"Description\" field\n", " alert[\"Description\"] = clean_html(alert[\"Description\"])\n", "# Convert the cleaned alerts into a pandas DataFrame for easy viewing and manipulation\n", "df_alerts = pd.DataFrame(alerts)\n", "# Display the first 5 rows of the DataFrame to verify the cleaned alerts\n", "print(df_alerts.head(5))\n" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "VIEXmsrUjf6c", "outputId": "bad67a5e-265d-4c24-b644-1e2d2c455c23" }, "execution_count": 18, "outputs": [ { "output_type": "stream", "name": "stdout", "text": [ " Alert ID Header Description \\\n", "0 166173 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "1 166174 Starting Thursday, January 16 at 9 a.m. throug... \n", "2 166175 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "3 166177 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "4 166178 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "\n", " Severity Level \n", "0 1 \n", "1 1 \n", "2 1 \n", "3 1 \n", "4 1 \n" ] } ] }, { "cell_type": "markdown", "source": [ "# Output\n", "You might be asking yourself, “Can I associate this with the trip_id and vehicle_id”. The answer is Yes, by incorporating that information into your data processing, you can link the alerts to particular trip_id and vehicle_id. Every alert should be connected to the appropriate trip and vehicle. Once again, it’s a moving piece of the puzzle. Once the data is completely available for the public without encapsulation, this can be possible." ], "metadata": { "id": "i6yh1MbCjkL_" } }, { "cell_type": "code", "source": [ "df_alerts.head(5)" ], "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 507 }, "id": "W_5wtfRCjlxa", "outputId": "4ff0413f-534f-4151-8920-4526d5fdac16" }, "execution_count": 19, "outputs": [ { "output_type": "execute_result", "data": { "text/plain": [ " Alert ID Header Description \\\n", "0 166173 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "1 166174 Starting Thursday, January 16 at 9 a.m. throug... \n", "2 166175 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "3 166177 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "4 166178 \\nStarting Thursday, January 16 at 9 a.m. thro... \n", "\n", " Severity Level \n", "0 1 \n", "1 1 \n", "2 1 \n", "3 1 \n", "4 1 " ], "text/html": [ "\n", "

\n", "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
Alert IDHeaderDescriptionSeverity Level
0166173\\nStarting Thursday, January 16 at 9 a.m. thro...1
1166174Starting Thursday, January 16 at 9 a.m. throug...1
2166175\\nStarting Thursday, January 16 at 9 a.m. thro...1
3166177\\nStarting Thursday, January 16 at 9 a.m. thro...1
4166178\\nStarting Thursday, January 16 at 9 a.m. thro...1
\n", "
\n", "
\n", "\n", "
\n", " \n", "\n", " \n", "\n", " \n", "
\n", "\n", "\n", "
\n", " \n", "\n", "\n", "\n", " \n", "
\n", "\n", "
\n", "
\n" ], "application/vnd.google.colaboratory.intrinsic+json": { "type": "dataframe", "variable_name": "df_alerts", "summary": "{\n \"name\": \"df_alerts\",\n \"rows\": 22,\n \"fields\": [\n {\n \"column\": \"Alert ID\",\n \"properties\": {\n \"dtype\": \"string\",\n \"num_unique_values\": 22,\n \"samples\": [\n \"166173\",\n \"165820\",\n \"166075\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Header\",\n \"properties\": {\n \"dtype\": \"object\",\n \"num_unique_values\": 1,\n \"samples\": [\n \"\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Description\",\n \"properties\": {\n \"dtype\": \"string\",\n \"num_unique_values\": 22,\n \"samples\": [\n \"\\nStarting Thursday, January 16 at 9 a.m. through Friday, January 17 at 3 p.m., 1 Street will be closed between 7 - 9 Avenue S.W. for construction.\\u00a0 Route 6 will be detoured.\\nRoute 6 will travel:\\n\\nNorth on 1 Street S.W.\\u00a0\\nEast on 9 Avenue S.W.\\u00a0\\nNorth on Centre Street S.\\nWest on 6 Avenue S.W. to regular route\\n\\n\\tThe following stop will be temporarily closed:\\n\\tSB 1 ST @ 7 AV SW (#5303)\\n\\n\\tPlease use the following stop:\\n \\nNB Centre ST @ 7 AV S (farside) - detour stop\\n\\t \\n\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"Severity Level\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0,\n \"min\": 1,\n \"max\": 1,\n \"num_unique_values\": 1,\n \"samples\": [\n 1\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}" } }, "metadata": {}, "execution_count": 19 } ] }, { "cell_type": "markdown", "source": [ "# Conclusion\n", "You’ve made it to the end! This topic is vast and offers significant opportunities for further exploration and development. With the right approach, you could create a new application to help Calgary’s residents avoid inconveniences during bus travel, especially in extreme weather conditions. If you encounter any issues while executing the code, feel free to reach out. Suggestions are always welcome! I hope you enjoyed reading this article, and I look forward to seeing you next time. Happy coding!" ], "metadata": { "id": "Wt0wpn1Pjwop" } } ] }