{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "---\n", "title: \"Hive PySpark Example\"\n", "date: 2021-02-24\n", "type: technical_note\n", "draft: false\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# PySpark With Hive\n", "\n", "This notebook shows how to read from and write to Hive using Spark SQL. It assumes that the \"Hive\" service is enabled in your project." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Create a SparkSession with Hive Enabled\n", "\n", "sparkmagic automatically creates a Spark session on the cluster for us, with Hive support enabled." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Starting Spark application\n" ] }, { "data": { "text/html": [ "
ID | YARN Application ID | Kind | State | Spark UI | Driver log | Current session? |
---|---|---|---|---|---|---|
0 | application_1540813611542_0002 | pyspark | idle | Link | Link | ✔ |