{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# LL parsing (optional)\n",
    "\n",
    "Deterministic PDAs are appealing because they are much easier to implement. However, the CFG to PDA conversion in the book outputs a deterministic PDA only for the most uninteresting CFGs.\n",
    "\n",
    "For example, the following grammar generates the language $\\{\\texttt{a}^i \\texttt{b}^j \\texttt{c}^i \\mid i, j \\geq 0\\}$, which you could easily write a deterministic PDA for:\n",
    "\n",
    "\\begin{align*}\n",
    "S &\\rightarrow \\texttt{a} S \\texttt{c} \\\\\n",
    "S &\\rightarrow T \\\\\n",
    "T &\\rightarrow \\texttt{b} T \\\\\n",
    "T &\\rightarrow \\varepsilon\n",
    "\\end{align*}"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "image/svg+xml": [
       "<svg height=\"182pt\" viewBox=\"0.00 0.00 568.00 182.39\" width=\"568pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n",
       "<g class=\"graph\" id=\"graph0\" transform=\"scale(1 1) rotate(0) translate(4 178.3904)\">\n",
       "<title>%3</title>\n",
       "<polygon fill=\"#ffffff\" points=\"-4,4 -4,-178.3904 564,-178.3904 564,4 -4,4\" stroke=\"transparent\"/>\n",
       "<!-- _START -->\n",
       "<g class=\"node\" id=\"node1\">\n",
       "<title>_START</title>\n",
       "</g>\n",
       "<!-- 1 -->\n",
       "<g class=\"node\" id=\"node3\">\n",
       "<title>1</title>\n",
       "<path d=\"M71.3333,-84.3904C71.3333,-84.3904 43.6667,-84.3904 43.6667,-84.3904 40.8333,-84.3904 38,-81.5571 38,-78.7238 38,-78.7238 38,-73.0571 38,-73.0571 38,-70.2238 40.8333,-67.3904 43.6667,-67.3904 43.6667,-67.3904 71.3333,-67.3904 71.3333,-67.3904 74.1667,-67.3904 77,-70.2238 77,-73.0571 77,-73.0571 77,-78.7238 77,-78.7238 77,-81.5571 74.1667,-84.3904 71.3333,-84.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"42\" y=\"-73.3904\">start</text>\n",
       "</g>\n",
       "<!-- _START&#45;&gt;1 -->\n",
       "<g class=\"edge\" id=\"edge1\">\n",
       "<title>_START-&gt;1</title>\n",
       "<path d=\"M1.1401,-75.8904C4.3362,-75.8904 18.9507,-75.8904 32.4957,-75.8904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"37.804,-75.8904 32.8041,-78.1405 35.304,-75.8905 32.804,-75.8905 32.804,-75.8905 32.804,-75.8905 35.304,-75.8905 32.804,-73.6405 37.804,-75.8904 37.804,-75.8904\" stroke=\"#000000\"/>\n",
       "</g>\n",
       "<!-- 0 -->\n",
       "<g class=\"node\" id=\"node2\">\n",
       "<title>0</title>\n",
       "<path d=\"M173.3333,-84.3904C173.3333,-84.3904 157.6667,-84.3904 157.6667,-84.3904 154.8333,-84.3904 152,-81.5571 152,-78.7238 152,-78.7238 152,-73.0571 152,-73.0571 152,-70.2238 154.8333,-67.3904 157.6667,-67.3904 157.6667,-67.3904 173.3333,-67.3904 173.3333,-67.3904 176.1667,-67.3904 179,-70.2238 179,-73.0571 179,-73.0571 179,-78.7238 179,-78.7238 179,-81.5571 176.1667,-84.3904 173.3333,-84.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"156\" y=\"-73.3904\">0.1</text>\n",
       "</g>\n",
       "<!-- 2 -->\n",
       "<g class=\"node\" id=\"node4\">\n",
       "<title>2</title>\n",
       "<path d=\"M300.8333,-84.3904C300.8333,-84.3904 279.1667,-84.3904 279.1667,-84.3904 276.3333,-84.3904 273.5,-81.5571 273.5,-78.7238 273.5,-78.7238 273.5,-73.0571 273.5,-73.0571 273.5,-70.2238 276.3333,-67.3904 279.1667,-67.3904 279.1667,-67.3904 300.8333,-67.3904 300.8333,-67.3904 303.6667,-67.3904 306.5,-70.2238 306.5,-73.0571 306.5,-73.0571 306.5,-78.7238 306.5,-78.7238 306.5,-81.5571 303.6667,-84.3904 300.8333,-84.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"277.5\" y=\"-73.3904\">loop</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge3\">\n",
       "<title>0-&gt;2</title>\n",
       "<path d=\"M179.2994,-75.8904C200.8141,-75.8904 242.5364,-75.8904 268.1636,-75.8904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"273.3748,-75.8904 268.3749,-78.1405 270.8748,-75.8905 268.3748,-75.8905 268.3748,-75.8905 268.3748,-75.8905 270.8748,-75.8905 268.3748,-73.6405 273.3748,-75.8904 273.3748,-75.8904\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"200.5\" y=\"-81.6904\">ε,ε → S</text>\n",
       "</g>\n",
       "<!-- 1&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge2\">\n",
       "<title>1-&gt;0</title>\n",
       "<path d=\"M77.058,-75.8904C96.8383,-75.8904 127.2003,-75.8904 146.7035,-75.8904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"151.8987,-75.8904 146.8987,-78.1405 149.3987,-75.8905 146.8987,-75.8905 146.8987,-75.8905 146.8987,-75.8905 149.3987,-75.8905 146.8986,-73.6405 151.8987,-75.8904 151.8987,-75.8904\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"98.5\" y=\"-81.6904\">ε,ε → $</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge5\">\n",
       "<title>2-&gt;2</title>\n",
       "<path d=\"M279.2719,-84.5761C273.3458,-93.0643 276.9219,-102.3904 290,-102.3904 300.626,-102.3904 304.9791,-96.2337 303.0594,-89.3721\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"300.7281,-84.5761 304.9376,-88.0894 301.8211,-86.8246 302.914,-89.073 302.914,-89.073 302.914,-89.073 301.8211,-86.8246 300.8904,-90.0567 300.7281,-84.5761 300.7281,-84.5761\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"272\" y=\"-164.1904\">ε,S → T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"272.5\" y=\"-150.1904\">ε,T → ε</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"272\" y=\"-136.1904\">a,a → ε</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"272\" y=\"-122.1904\">b,b → ε</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"272\" y=\"-108.1904\">c,c → ε</text>\n",
       "</g>\n",
       "<!-- 3 -->\n",
       "<g class=\"node\" id=\"node5\">\n",
       "<title>3</title>\n",
       "<path d=\"M439.3333,-152.3904C439.3333,-152.3904 423.6667,-152.3904 423.6667,-152.3904 420.8333,-152.3904 418,-149.5571 418,-146.7238 418,-146.7238 418,-141.0571 418,-141.0571 418,-138.2238 420.8333,-135.3904 423.6667,-135.3904 423.6667,-135.3904 439.3333,-135.3904 439.3333,-135.3904 442.1667,-135.3904 445,-138.2238 445,-141.0571 445,-141.0571 445,-146.7238 445,-146.7238 445,-149.5571 442.1667,-152.3904 439.3333,-152.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"422\" y=\"-141.3904\">1.2</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;3 -->\n",
       "<g class=\"edge\" id=\"edge4\">\n",
       "<title>2-&gt;3</title>\n",
       "<path d=\"M299.256,-84.5537C309.4952,-93.7264 326.8269,-108.0729 344,-116.8904 366.3009,-128.3409 394.3274,-135.9293 412.5646,-140.0691\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"417.893,-141.2435 412.5259,-142.3645 415.4516,-140.7054 413.0102,-140.1672 413.0102,-140.1672 413.0102,-140.1672 415.4516,-140.7054 413.4946,-137.9699 417.893,-141.2435 417.893,-141.2435\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"347.5\" y=\"-138.6904\">ε,S → c</text>\n",
       "</g>\n",
       "<!-- 5 -->\n",
       "<g class=\"node\" id=\"node7\">\n",
       "<title>5</title>\n",
       "<path d=\"M439.3333,-107.3904C439.3333,-107.3904 423.6667,-107.3904 423.6667,-107.3904 420.8333,-107.3904 418,-104.5571 418,-101.7238 418,-101.7238 418,-96.0571 418,-96.0571 418,-93.2238 420.8333,-90.3904 423.6667,-90.3904 423.6667,-90.3904 439.3333,-90.3904 439.3333,-90.3904 442.1667,-90.3904 445,-93.2238 445,-96.0571 445,-96.0571 445,-101.7238 445,-101.7238 445,-104.5571 442.1667,-107.3904 439.3333,-107.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"422\" y=\"-96.3904\">3.1</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;5 -->\n",
       "<g class=\"edge\" id=\"edge6\">\n",
       "<title>2-&gt;5</title>\n",
       "<path d=\"M306.7619,-82.2969C317.2932,-86.0374 331.2518,-90.4819 344,-92.8904 367.322,-97.2966 394.6831,-98.5615 412.5181,-98.877\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"417.7323,-98.9468 412.7025,-101.1295 415.2325,-98.9133 412.7327,-98.8797 412.7327,-98.8797 412.7327,-98.8797 415.2325,-98.9133 412.7629,-96.63 417.7323,-98.9468 417.7323,-98.9468\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"347.5\" y=\"-102.6904\">ε,T → T</text>\n",
       "</g>\n",
       "<!-- 6 -->\n",
       "<g class=\"node\" id=\"node8\">\n",
       "<title>6</title>\n",
       "<path d=\"M448.3333,-52.3904C448.3333,-52.3904 414.6667,-52.3904 414.6667,-52.3904 411.8333,-52.3904 409,-49.5571 409,-46.7238 409,-46.7238 409,-41.0571 409,-41.0571 409,-38.2238 411.8333,-35.3904 414.6667,-35.3904 414.6667,-35.3904 448.3333,-35.3904 448.3333,-35.3904 451.1667,-35.3904 454,-38.2238 454,-41.0571 454,-41.0571 454,-46.7238 454,-46.7238 454,-49.5571 451.1667,-52.3904 448.3333,-52.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<path d=\"M449.6667,-56.3904C449.6667,-56.3904 413.3333,-56.3904 413.3333,-56.3904 409.1667,-56.3904 405,-52.2238 405,-48.0571 405,-48.0571 405,-39.7238 405,-39.7238 405,-35.5571 409.1667,-31.3904 413.3333,-31.3904 413.3333,-31.3904 449.6667,-31.3904 449.6667,-31.3904 453.8333,-31.3904 458,-35.5571 458,-39.7238 458,-39.7238 458,-48.0571 458,-48.0571 458,-52.2238 453.8333,-56.3904 449.6667,-56.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"413\" y=\"-41.3904\">accept</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;6 -->\n",
       "<g class=\"edge\" id=\"edge7\">\n",
       "<title>2-&gt;6</title>\n",
       "<path d=\"M303.589,-67.1397C314.1524,-60.8333 329.4152,-52.776 344,-48.8904 361.9881,-44.0982 382.7674,-42.7445 399.5617,-42.6372\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"404.6383,-42.6415 399.6363,-44.8872 402.1383,-42.6393 399.6383,-42.6372 399.6383,-42.6372 399.6383,-42.6372 402.1383,-42.6393 399.6402,-40.3872 404.6383,-42.6415 404.6383,-42.6415\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"347.5\" y=\"-54.6904\">ε,$ → ε</text>\n",
       "</g>\n",
       "<!-- 4 -->\n",
       "<g class=\"node\" id=\"node6\">\n",
       "<title>4</title>\n",
       "<path d=\"M554.3333,-21.3904C554.3333,-21.3904 538.6667,-21.3904 538.6667,-21.3904 535.8333,-21.3904 533,-18.5571 533,-15.7238 533,-15.7238 533,-10.0571 533,-10.0571 533,-7.2238 535.8333,-4.3904 538.6667,-4.3904 538.6667,-4.3904 554.3333,-4.3904 554.3333,-4.3904 557.1667,-4.3904 560,-7.2238 560,-10.0571 560,-10.0571 560,-15.7238 560,-15.7238 560,-18.5571 557.1667,-21.3904 554.3333,-21.3904\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"537\" y=\"-10.3904\">1.1</text>\n",
       "</g>\n",
       "<!-- 3&#45;&gt;4 -->\n",
       "<g class=\"edge\" id=\"edge8\">\n",
       "<title>3-&gt;4</title>\n",
       "<path d=\"M440.4958,-135.0444C445.7028,-129.8457 452.3278,-123.0957 458,-116.8904 487.4191,-84.7069 520.2456,-45.0879 536.3991,-25.3284\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"539.5858,-21.4232 538.1679,-26.7197 538.0052,-23.3602 536.4246,-25.2971 536.4246,-25.2971 536.4246,-25.2971 538.0052,-23.3602 534.6813,-23.8746 539.5858,-21.4232 539.5858,-21.4232\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"479.5\" y=\"-99.6904\">ε,ε → S</text>\n",
       "</g>\n",
       "<!-- 4&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge9\">\n",
       "<title>4-&gt;2</title>\n",
       "<path d=\"M532.9605,-9.5696C507.4469,-3.8891 450.7063,5.9872 405,-5.8904 364.2393,-16.4828 323.8865,-46.8421 303.56,-63.8952\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"299.5336,-67.3203 301.8842,-62.3668 301.4378,-65.7005 303.3421,-64.0806 303.3421,-64.0806 303.3421,-64.0806 301.4378,-65.7005 304.8,-65.7944 299.5336,-67.3203 299.5336,-67.3203\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"415.5\" y=\"-11.6904\">ε,ε → a</text>\n",
       "</g>\n",
       "<!-- 5&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge10\">\n",
       "<title>5-&gt;2</title>\n",
       "<path d=\"M420.5417,-90.2486C411.9625,-84.0631 399.437,-76.2574 387,-72.8904 361.9196,-66.1006 331.855,-68.5801 311.9132,-71.625\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"306.8204,-72.4486 311.397,-69.4292 309.2883,-72.0495 311.7562,-71.6503 311.7562,-71.6503 311.7562,-71.6503 309.2883,-72.0495 312.1155,-73.8714 306.8204,-72.4486 306.8204,-72.4486\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"349.5\" y=\"-78.6904\">ε,ε → b</text>\n",
       "</g>\n",
       "</g>\n",
       "</svg>"
      ],
      "text/plain": [
       "<IPython.core.display.SVG object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "from tock import *\n",
    "\n",
    "g = Grammar.from_lines([\"S -> a S c\",\n",
    "                        \"S -> T\",\n",
    "                        \"T -> b T\",\n",
    "                        \"T -> &\"])\n",
    "p1 = from_grammar(g)\n",
    "to_graph(p1)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "False"
      ]
     },
     "execution_count": 2,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "p1.is_deterministic()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "There is nondeterminism in state `loop`, when the top stack symbol is either $S$ or $T$. Each of these nonterminals has two rules it can be rewritten with, and the PDA doesn't know which one to use. You can figure it out intuitively:\n",
    "\n",
    "- If the top stack symbol is $S$:\n",
    "    - If the next input symbol is $\\texttt{a}$, use the first rule.\n",
    "    - Else, use the second rule.\n",
    "- If the top stack symbol is $T$:\n",
    "    - If the next symbol is $\\texttt{b}$, use the third rule.\n",
    "    - Else, use the fourth rule.\n",
    "    \n",
    "Below, we'll show how to modify the CFG to PDA conversion to output a deterministic PDA that is able to _look ahead_ one symbol to capture the above intuition."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## The endmarker\n",
    "\n",
    "When the PDA is at the end of the string, we want it to be able to look ahead and see that there are no more input symbols. So let's append an _endmarker_ $\\dashv$ to end of the input string. We continue to write $\\Sigma$ for the original alphabet that does not contain $\\dashv$.\n",
    "\n",
    "Accordingly, we modify our grammar by creating a new start nonterminal, $S'$, and adding a rule\n",
    "\n",
    "$$ S' \\rightarrow S\\dashv $$"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Basic idea\n",
    "\n",
    "Given a top stack symbol $A$ and a look-ahead input symbol $c$, we want to automatically figure out which rule $A \\rightarrow \\beta$ to use. The logic will go as follows:\n",
    "\n",
    "- If $\\beta$ can be rewritten to a string that starts with $c$, then $A \\rightarrow \\beta$ is possible.\n",
    "- If $\\beta$ can be rewritten to $\\varepsilon$ and it's possible for $c$ to come after $A$, then $A \\rightarrow \\beta$ is possible.\n",
    "\n",
    "And we only want one rule to be possible at a time.\n",
    "\n",
    "We implement this logic by precomputing three tables, defined below."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## The $\\text{Nullable}$ table\n",
    "\n",
    "Define a _rhs suffix_ of $G$ to be a suffix of the right-hand side of a rule of $G$. The rhs suffixes of our example grammar are: $\\varepsilon, \\texttt{c}, S\\texttt{c}, \\texttt{a}S\\texttt{c}, T, \\texttt{b}T$.\n",
    "\n",
    "Define a table $\\text{Nullable}(\\alpha)$, where $\\alpha$ is a terminal or nonterminal symbol or a rhs suffix, that says whether it's possible to rewrite $\\alpha$ to the empty string (that is, $\\alpha \\Rightarrow^\\ast \\varepsilon$).\n",
    "\n",
    "1. For all $\\alpha$, $\\text{Nullable}(\\alpha) \\leftarrow \\text{False}$.\n",
    "2. $\\text{Nullable}(\\epsilon) \\leftarrow \\text{True}$.\n",
    "3. Repeat until $\\text{Nullable}$ does not change:\n",
    "    1. For each rule $A \\rightarrow \\beta$:\n",
    "        1. $n \\leftarrow |\\beta|$\n",
    "        2. For $i \\leftarrow n, \\ldots, 1$:\n",
    "            1. If $\\text{Nullable}(\\beta_{i+1} \\cdots \\beta_n)$ and $\\text{Nullable}(\\beta_i)$, then $\\text{Nullable}(\\beta_i \\cdots \\beta_n) \\leftarrow \\text{True}$\n",
    "        3. If $\\text{Nullable}(\\beta)$, then $\\text{Nullable}(A) \\leftarrow \\text{True}$.\n",
    "\n",
    "In our example grammar, $S$ and $T$ are both nullable, but $S'$ is not. The nullable rhs suffixes are: $\\varepsilon, T$."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## The $\\text{First}$ table\n",
    "\n",
    "Define a table $\\text{First}(\\alpha)$, where $\\alpha$ is a terminal or nonterminal symbol\n",
    "or a rhs suffix, that says what terminals $\\alpha$ can start with (after rewriting). That is, \n",
    "$$\\text{First}(\\alpha) = \\{b \\mid \\text{$\\alpha \\Rightarrow^\\ast b\\gamma$ for some $\\gamma$}\\}$$\n",
    "\n",
    "1. For all $\\alpha$, $\\text{First}(\\alpha) = \\emptyset$.\n",
    "1. For all terminals $a$, $\\text{First}(a) = \\{ a \\}$.\n",
    "2. Repeat until $\\text{First}$ does not change:\n",
    "    1. For each rule $A \\rightarrow \\beta$:\n",
    "        1. $n \\leftarrow |\\beta|$\n",
    "        2. For $i \\leftarrow n, \\ldots, 1$:\n",
    "            1. $\\text{First}(\\beta_i \\cdots \\beta_n) \\leftarrow \\text{First}(\\beta_i \\cdots \\beta_n) \\cup \\text{First}(\\beta_{i})$.\n",
    "            1. If $\\text{Nullable}(\\beta_i)$, then $\\text{First}(\\beta_i \\cdots \\beta_n) \\leftarrow \\text{First}(\\beta_i \\cdots \\beta_n) \\cup \\text{First}(\\beta_{i+1} \\cdots \\beta_n)$.\n",
    "        3. $\\text{First}(A) \\leftarrow \\text{First}(A) \\cup \\text{First}(\\beta)$.\n",
    "\n",
    "In our example grammar, we have\n",
    "\n",
    "| $\\alpha$ | $\\text{First}(\\alpha)$ |\n",
    "|----------|------------------------|\n",
    "| $S$      | $\\{\\texttt{a}, \\texttt{b}\\}$ |\n",
    "| $T$      | $\\{\\texttt{b}\\}$ |\n",
    "| $\\texttt{a}$ | $\\{\\texttt{a}\\}$ |\n",
    "| $\\texttt{b}$ | $\\{\\texttt{b}\\}$ |\n",
    "| $\\texttt{c}$ | $\\{\\texttt{c}\\}$ |\n",
    "| $\\varepsilon$ | $\\emptyset$ |\n",
    "| $S\\texttt{c}$ | $\\{\\texttt{a}, \\texttt{b}, \\texttt{c}\\}$ |\n",
    "| $\\texttt{a}S\\texttt{c}$ | $\\{\\texttt{a}\\}$ |\n",
    "| $\\texttt{b}T$ | $\\{\\texttt{b}\\}$ |\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## The $\\text{Follow}$ function\n",
    "\n",
    "Define a table $\\text{Follow}(A)$ that says, for each nonterminal $A$, what terminals can come after it. That is, $$\\text{Follow}(A) = \\{ b \\mid \\text{$S \\Rightarrow^\\ast \\gamma A b \\delta$ for some $\\gamma, \\delta$} \\}$$\n",
    "\n",
    "1. For all $A$, $\\text{Follow}(A) = \\emptyset$.\n",
    "2. Repeat until $\\text{Follow}$ does not change:\n",
    "    1. For each rule $A \\rightarrow \\beta$:\n",
    "        1. $n \\leftarrow |\\beta|$\n",
    "        1. For $i \\leftarrow 1, \\ldots, n$ such that $\\beta_i$ is a nonterminal $B$:\n",
    "            1. $\\text{Follow}(B) \\leftarrow \\text{Follow}(B) \\cup \\text{First}(\\beta_{i+1}\\cdots \\beta_n)$\n",
    "            2. If $\\text{Nullable}(\\beta_{i+1} \\cdots \\beta_n)$, then $\\text{Follow}(B) \\leftarrow \\text{Follow}(B) \\cup \\text{Follow}(A)$.\n",
    "            \n",
    "In our example grammar, we have $\\text{Follow}(S) = \\text{Follow}(T) = \\{\\texttt{c}, \\dashv\\}$."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Recursive-descent parsing\n",
    "\n",
    "We can now use these tables to implement a _recursive-descent_ parser, which has a function for each nonterminal symbol. The function for nonterminal $A$ is called on an input string $w$ and a position $i$ and has to return $j$ such that $A \\Rightarrow^\\ast w_i \\cdots w_{j-1}$. To do this, the function must decide which rule to use, using the Nullable, First, and Follow tables. For our example grammar, the parser would look like:\n",
    "\n",
    "```\n",
    "function parse(w)\n",
    "    i <- parseS(w, 0)\n",
    "    if i = |w| then\n",
    "        return True\n",
    "    else\n",
    "        error\n",
    "        \n",
    "function parseS(w, i)\n",
    "    # S -> aSc\n",
    "    if i < |w| and w[i] in {\"a\"} # First(aSc)\n",
    "        i = i + 1\n",
    "        i = parseS(w, i)\n",
    "        if w[i] != \"c\" return False\n",
    "        i = i + 1\n",
    "        return i\n",
    "    # S -> T\n",
    "    else if (w[i] in {\"b\"} # First(T)\n",
    "             or\n",
    "             i = |w| or w[i] in {\"c\"}) # Follow(S)\n",
    "        return parseT(w, i)\n",
    "    else\n",
    "        error\n",
    "        \n",
    "function parseT(w, i)\n",
    "    # T -> bT\n",
    "    if i < |w| and w[i] in {\"b\"} # First(bT)\n",
    "        i = i + 1\n",
    "        return parseT(w, i)\n",
    "    # T -> ε\n",
    "    else if i = |w| or w[i] in {\"c\"} # Follow(T)\n",
    "        return i\n",
    "    else\n",
    "        error        \n",
    "```"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Conversion to DPDA\n",
    "\n",
    "We can also use the Nullable, First, and Follow tables to build a DPDA. To do this, we first modify the conversion from a CFG to a PDA to use a one-symbol lookahead. The states of the PDA are the start state $s$, $q$, a state $q_a$ for each $a \\in \\Sigma$, and an accept state $f$.\n",
    "\n",
    "1. A transition from $s$ to $q$ that pushes $S\\$$.\n",
    "2. A transition from $q$ to $q_a$ that reads $a$ for all $a \\in \\Sigma \\cup \\{\\dashv\\}$.\n",
    "3. A transition from $q_a$ to $q$ that pops $a$, for all $a \\in \\Sigma$.\n",
    "4. For each rule $A \\rightarrow \\beta$ and each $c \\in \\Sigma$, a transition from $q_c$ to itself that pops $A$ and pushes $\\beta$.\n",
    "5. A transition from $q$ to $f$ that pops $\\$$.\n",
    "\n",
    "The PDA looks like this (using the shorthand on page 119):\n",
    "\n",
    "![Schema for PDA with lookahead](llpda.pdf)\n",
    "\n",
    "where state $q_a$ is replicated for every possible terminal symbol, and the self-loop on state $q_a$ is replicated for all rules. \n",
    "\n",
    "For our example grammar, this PDA looks like this:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "image/svg+xml": [
       "<svg height=\"477pt\" viewBox=\"0.00 0.00 321.50 477.00\" width=\"322pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n",
       "<g class=\"graph\" id=\"graph0\" transform=\"scale(1 1) rotate(0) translate(4 473)\">\n",
       "<title>%3</title>\n",
       "<polygon fill=\"#ffffff\" points=\"-4,4 -4,-473 317.5,-473 317.5,4 -4,4\" stroke=\"transparent\"/>\n",
       "<!-- _START -->\n",
       "<g class=\"node\" id=\"node1\">\n",
       "<title>_START</title>\n",
       "</g>\n",
       "<!-- 6 -->\n",
       "<g class=\"node\" id=\"node8\">\n",
       "<title>6</title>\n",
       "<path d=\"M48,-204C48,-204 43,-204 43,-204 40.5,-204 38,-201.5 38,-199 38,-199 38,-192 38,-192 38,-189.5 40.5,-187 43,-187 43,-187 48,-187 48,-187 50.5,-187 53,-189.5 53,-192 53,-192 53,-199 53,-199 53,-201.5 50.5,-204 48,-204\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"42\" y=\"-193\">s</text>\n",
       "</g>\n",
       "<!-- _START&#45;&gt;6 -->\n",
       "<g class=\"edge\" id=\"edge1\">\n",
       "<title>_START-&gt;6</title>\n",
       "<path d=\"M1.0054,-195.5C4.0868,-195.5 20.6128,-195.5 32.6265,-195.5\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"37.7642,-195.5 32.7643,-197.7501 35.2642,-195.5 32.7642,-195.5001 32.7642,-195.5001 32.7642,-195.5001 35.2642,-195.5 32.7642,-193.2501 37.7642,-195.5 37.7642,-195.5\" stroke=\"#000000\"/>\n",
       "</g>\n",
       "<!-- 0 -->\n",
       "<g class=\"node\" id=\"node2\">\n",
       "<title>0</title>\n",
       "<path d=\"M160,-204C160,-204 155,-204 155,-204 152.5,-204 150,-201.5 150,-199 150,-199 150,-192 150,-192 150,-189.5 152.5,-187 155,-187 155,-187 160,-187 160,-187 162.5,-187 165,-189.5 165,-192 165,-192 165,-199 165,-199 165,-201.5 162.5,-204 160,-204\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"154\" y=\"-193\">q</text>\n",
       "</g>\n",
       "<!-- 1 -->\n",
       "<g class=\"node\" id=\"node3\">\n",
       "<title>1</title>\n",
       "<path d=\"M281.8333,-393C281.8333,-393 272.1667,-393 272.1667,-393 269.3333,-393 266.5,-390.1667 266.5,-387.3333 266.5,-387.3333 266.5,-381.6667 266.5,-381.6667 266.5,-378.8333 269.3333,-376 272.1667,-376 272.1667,-376 281.8333,-376 281.8333,-376 284.6667,-376 287.5,-378.8333 287.5,-381.6667 287.5,-381.6667 287.5,-387.3333 287.5,-387.3333 287.5,-390.1667 284.6667,-393 281.8333,-393\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"270.5\" y=\"-382\">qa</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;1 -->\n",
       "<g class=\"edge\" id=\"edge4\">\n",
       "<title>0-&gt;1</title>\n",
       "<path d=\"M157.8432,-204.2914C159.1467,-233.4031 164.6828,-326.0835 183,-348.5 202.5718,-372.4519 240.0155,-380.4923 261.2614,-383.1739\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"266.2932,-383.7466 261.0708,-385.4166 263.8092,-383.4638 261.3253,-383.181 261.3253,-383.181 261.3253,-383.181 263.8092,-383.4638 261.5798,-380.9455 266.2932,-383.7466 266.2932,-383.7466\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"190.5\" y=\"-381.3\">a,ε → ε</text>\n",
       "</g>\n",
       "<!-- 2 -->\n",
       "<g class=\"node\" id=\"node4\">\n",
       "<title>2</title>\n",
       "<path d=\"M281.8333,-282C281.8333,-282 272.1667,-282 272.1667,-282 269.3333,-282 266.5,-279.1667 266.5,-276.3333 266.5,-276.3333 266.5,-270.6667 266.5,-270.6667 266.5,-267.8333 269.3333,-265 272.1667,-265 272.1667,-265 281.8333,-265 281.8333,-265 284.6667,-265 287.5,-267.8333 287.5,-270.6667 287.5,-270.6667 287.5,-276.3333 287.5,-276.3333 287.5,-279.1667 284.6667,-282 281.8333,-282\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"270.5\" y=\"-271\">qb</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge5\">\n",
       "<title>0-&gt;2</title>\n",
       "<path d=\"M158.9761,-204.1528C161.5638,-216.5761 168.1295,-239.3091 183,-251.5 205.6068,-270.0331 241.0419,-273.5247 261.3318,-273.8679\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"266.3885,-273.8929 261.3773,-276.118 263.8885,-273.8805 261.3885,-273.8681 261.3885,-273.8681 261.3885,-273.8681 263.8885,-273.8805 261.3997,-271.6181 266.3885,-273.8929 266.3885,-273.8929\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"190.5\" y=\"-276.3\">b,ε → ε</text>\n",
       "</g>\n",
       "<!-- 3 -->\n",
       "<g class=\"node\" id=\"node5\">\n",
       "<title>3</title>\n",
       "<path d=\"M281.8333,-171C281.8333,-171 272.1667,-171 272.1667,-171 269.3333,-171 266.5,-168.1667 266.5,-165.3333 266.5,-165.3333 266.5,-159.6667 266.5,-159.6667 266.5,-156.8333 269.3333,-154 272.1667,-154 272.1667,-154 281.8333,-154 281.8333,-154 284.6667,-154 287.5,-156.8333 287.5,-159.6667 287.5,-159.6667 287.5,-165.3333 287.5,-165.3333 287.5,-168.1667 284.6667,-171 281.8333,-171\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"270.5\" y=\"-160\">qc</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;3 -->\n",
       "<g class=\"edge\" id=\"edge6\">\n",
       "<title>0-&gt;3</title>\n",
       "<path d=\"M165.3407,-193.3348C184.7314,-187.98 235.2367,-174.033 261.153,-166.8762\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"266.29,-165.4576 262.0693,-168.9574 263.8802,-166.1231 261.4704,-166.7886 261.4704,-166.7886 261.4704,-166.7886 263.8802,-166.1231 260.8714,-164.6198 266.29,-165.4576 266.29,-165.4576\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"190.5\" y=\"-193.3\">c,ε → ε</text>\n",
       "</g>\n",
       "<!-- 4 -->\n",
       "<g class=\"node\" id=\"node6\">\n",
       "<title>4</title>\n",
       "<path d=\"M287.8333,-60C287.8333,-60 266.1667,-60 266.1667,-60 263.3333,-60 260.5,-57.1667 260.5,-54.3333 260.5,-54.3333 260.5,-48.6667 260.5,-48.6667 260.5,-45.8333 263.3333,-43 266.1667,-43 266.1667,-43 287.8333,-43 287.8333,-43 290.6667,-43 293.5,-45.8333 293.5,-48.6667 293.5,-48.6667 293.5,-54.3333 293.5,-54.3333 293.5,-57.1667 290.6667,-60 287.8333,-60\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"264.5\" y=\"-49\">qend</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;4 -->\n",
       "<g class=\"edge\" id=\"edge7\">\n",
       "<title>0-&gt;4</title>\n",
       "<path d=\"M159.1963,-186.8541C162.1932,-173.213 169.4293,-146.3161 183,-127.5 203.2856,-99.3733 236.5489,-75.8916 257.5837,-62.7775\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"261.8776,-60.1385 258.7959,-64.6735 259.7477,-61.4475 257.6178,-62.7566 257.6178,-62.7566 257.6178,-62.7566 259.7477,-61.4475 256.4396,-60.8396 261.8776,-60.1385 261.8776,-60.1385\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"186.5\" y=\"-133.3\">ε,-| → ε</text>\n",
       "</g>\n",
       "<!-- 5 -->\n",
       "<g class=\"node\" id=\"node7\">\n",
       "<title>5</title>\n",
       "<path d=\"M279.5,-21C279.5,-21 274.5,-21 274.5,-21 272,-21 269.5,-18.5 269.5,-16 269.5,-16 269.5,-9 269.5,-9 269.5,-6.5 272,-4 274.5,-4 274.5,-4 279.5,-4 279.5,-4 282,-4 284.5,-6.5 284.5,-9 284.5,-9 284.5,-16 284.5,-16 284.5,-18.5 282,-21 279.5,-21\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<path d=\"M280.8333,-25C280.8333,-25 273.1667,-25 273.1667,-25 269.3333,-25 265.5,-21.1667 265.5,-17.3333 265.5,-17.3333 265.5,-7.6667 265.5,-7.6667 265.5,-3.8333 269.3333,0 273.1667,0 273.1667,0 280.8333,0 280.8333,0 284.6667,0 288.5,-3.8333 288.5,-7.6667 288.5,-7.6667 288.5,-17.3333 288.5,-17.3333 288.5,-21.1667 284.6667,-25 280.8333,-25\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"273.5\" y=\"-10\">f</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;5 -->\n",
       "<g class=\"edge\" id=\"edge3\">\n",
       "<title>0-&gt;5</title>\n",
       "<path d=\"M158.2159,-186.5929C160.9795,-153.2311 171.3801,-36.9874 183,-25.5 203.3053,-5.4262 239.2003,-6.5741 260.27,-9.4099\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"265.2887,-10.1553 260.0124,-11.6462 262.8158,-9.788 260.343,-9.4206 260.343,-9.4206 260.343,-9.4206 262.8158,-9.788 260.6736,-7.195 265.2887,-10.1553 265.2887,-10.1553\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-31.3\">ε,$ → ε</text>\n",
       "</g>\n",
       "<!-- 1&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge8\">\n",
       "<title>1-&gt;0</title>\n",
       "<path d=\"M266.2632,-380.1505C260.2676,-377.2841 253.0176,-373.0341 248,-367.5 235.1771,-353.3573 241.7295,-343.562 230,-328.5 213.4954,-307.3062 197.6439,-313.0197 183,-290.5 166.3216,-264.8515 160.4984,-228.5749 158.5052,-209.1075\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"158.0387,-204.0852 160.7416,-208.8557 158.27,-206.5745 158.5012,-209.0638 158.5012,-209.0638 158.5012,-209.0638 158.27,-206.5745 156.2609,-209.2719 158.0387,-204.0852 158.0387,-204.0852\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-334.3\">ε,a → ε</text>\n",
       "</g>\n",
       "<!-- 1&#45;&gt;1 -->\n",
       "<g class=\"edge\" id=\"edge9\">\n",
       "<title>1-&gt;1</title>\n",
       "<path d=\"M266.5699,-393.1857C260.8084,-401.6739 264.2852,-411 277,-411 287.3308,-411 291.563,-404.8433 289.6966,-397.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"287.4301,-393.1857 291.6008,-396.7449 288.4983,-395.446 289.5665,-397.7063 289.5665,-397.7063 289.5665,-397.7063 288.4983,-395.446 287.5322,-398.6677 287.4301,-393.1857 287.4301,-393.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"260\" y=\"-458.8\">S,ε → T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"244\" y=\"-444.8\">S,ε → [a] S c</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"261\" y=\"-430.8\">T,ε → ε</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"249.5\" y=\"-416.8\">T,ε → [b] T</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge10\">\n",
       "<title>2-&gt;0</title>\n",
       "<path d=\"M266.3669,-268.6303C260.5479,-265.6457 253.4377,-261.4501 248,-256.5 237.8754,-247.2832 240.438,-240.3603 230,-231.5 227.4938,-229.3726 189.9087,-211.1176 169.8706,-201.451\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"165.1862,-199.1934 170.6673,-199.3373 167.4383,-200.2788 169.6904,-201.3642 169.6904,-201.3642 169.6904,-201.3642 167.4383,-200.2788 168.7136,-203.3911 165.1862,-199.1934 165.1862,-199.1934\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-237.3\">ε,b → ε</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge11\">\n",
       "<title>2-&gt;2</title>\n",
       "<path d=\"M266.5699,-282.1857C260.8084,-290.6739 264.2852,-300 277,-300 287.3308,-300 291.563,-293.8433 289.6966,-286.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"287.4301,-282.1857 291.6008,-285.7449 288.4983,-284.446 289.5665,-286.7063 289.5665,-286.7063 289.5665,-286.7063 288.4983,-284.446 287.5322,-287.6677 287.4301,-282.1857 287.4301,-282.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"260\" y=\"-347.8\">S,ε → T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"244\" y=\"-333.8\">S,ε → [a] S c</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"261\" y=\"-319.8\">T,ε → ε</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"249.5\" y=\"-305.8\">T,ε → [b] T</text>\n",
       "</g>\n",
       "<!-- 3&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge12\">\n",
       "<title>3-&gt;0</title>\n",
       "<path d=\"M266.3363,-158.1493C248.0254,-151.416 209.9752,-140.7063 183,-155.5 172.7643,-161.1135 166.0457,-172.7743 162.0874,-182.0644\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"160.1676,-186.9283 159.9105,-181.4513 161.0855,-184.6028 162.0034,-182.2774 162.0034,-182.2774 162.0034,-182.2774 161.0855,-184.6028 164.0963,-183.1036 160.1676,-186.9283 160.1676,-186.9283\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-161.3\">ε,c → ε</text>\n",
       "</g>\n",
       "<!-- 3&#45;&gt;3 -->\n",
       "<g class=\"edge\" id=\"edge13\">\n",
       "<title>3-&gt;3</title>\n",
       "<path d=\"M266.5699,-171.1857C260.8084,-179.6739 264.2852,-189 277,-189 287.3308,-189 291.563,-182.8433 289.6966,-175.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"287.4301,-171.1857 291.6008,-174.7449 288.4983,-173.446 289.5665,-175.7063 289.5665,-175.7063 289.5665,-175.7063 288.4983,-173.446 287.5322,-176.6677 287.4301,-171.1857 287.4301,-171.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"244\" y=\"-236.8\">S,ε → [a] S c</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"260\" y=\"-222.8\">S,ε → T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"249.5\" y=\"-208.8\">T,ε → [b] T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"261\" y=\"-194.8\">T,ε → ε</text>\n",
       "</g>\n",
       "<!-- 4&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge14\">\n",
       "<title>4-&gt;0</title>\n",
       "<path d=\"M260.3495,-47.9059C239.4691,-44.4601 203.8828,-42.2643 183,-61.5 165.5367,-77.5859 159.7857,-150.6849 158.1133,-181.6134\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"157.8465,-186.8745 155.8527,-181.7669 157.9732,-184.3777 158.0998,-181.8809 158.0998,-181.8809 158.0998,-181.8809 157.9732,-184.3777 160.3469,-181.9949 157.8465,-186.8745 157.8465,-186.8745\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"187.5\" y=\"-67.3\">-|,ε → ε</text>\n",
       "</g>\n",
       "<!-- 4&#45;&gt;4 -->\n",
       "<g class=\"edge\" id=\"edge15\">\n",
       "<title>4-&gt;4</title>\n",
       "<path d=\"M266.5699,-60.1857C260.8084,-68.6739 264.2852,-78 277,-78 287.3308,-78 291.563,-71.8433 289.6966,-64.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"287.4301,-60.1857 291.6008,-63.7449 288.4983,-62.446 289.5665,-64.7063 289.5665,-64.7063 289.5665,-64.7063 288.4983,-62.446 287.5322,-65.6677 287.4301,-60.1857 287.4301,-60.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"260\" y=\"-125.8\">S,ε → T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"244\" y=\"-111.8\">S,ε → [a] S c</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"249.5\" y=\"-97.8\">T,ε → [b] T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"261\" y=\"-83.8\">T,ε → ε</text>\n",
       "</g>\n",
       "<!-- 6&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge2\">\n",
       "<title>6-&gt;0</title>\n",
       "<path d=\"M53.1982,-195.5C72.1403,-195.5 121.171,-195.5 144.6281,-195.5\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"149.7479,-195.5 144.748,-197.7501 147.2479,-195.5 144.7479,-195.5001 144.7479,-195.5001 144.7479,-195.5001 147.2479,-195.5 144.7479,-193.2501 149.7479,-195.5 149.7479,-195.5\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"74.5\" y=\"-201.3\">ε,ε → [S] $</text>\n",
       "</g>\n",
       "</g>\n",
       "</svg>"
      ],
      "text/plain": [
       "<IPython.core.display.SVG object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "p2 = read_csv(\"llpda.csv\")\n",
    "to_graph(p2)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "False"
      ]
     },
     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "p2.is_deterministic()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "This PDA is similar to the one in the proof of Lemma 2.21, but it has states $q_a$ that can apply rules with the knowledge that the next input symbol is $a$.\n",
    "\n",
    "So, we can restrict the application of rules only to those that are allowed by Nullable, First, and Follow:\n",
    "\n",
    "4. For each rule $A \\rightarrow \\beta$ and for each $c \\in \\Sigma$ such that $c \\in \\text{First}(\\beta)$ or ($\\text{Nullable}(\\beta)$ and $c \\in \\text{Follow}(A)$), a transition from $q_c$ to itself that pops $A$ and pushes $\\beta$."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "image/svg+xml": [
       "<svg height=\"368pt\" viewBox=\"0.00 0.00 309.00 368.00\" width=\"309pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n",
       "<g class=\"graph\" id=\"graph0\" transform=\"scale(1 1) rotate(0) translate(4 364)\">\n",
       "<title>%3</title>\n",
       "<polygon fill=\"#ffffff\" points=\"-4,4 -4,-364 305,-364 305,4 -4,4\" stroke=\"transparent\"/>\n",
       "<!-- _START -->\n",
       "<g class=\"node\" id=\"node1\">\n",
       "<title>_START</title>\n",
       "</g>\n",
       "<!-- 5 -->\n",
       "<g class=\"node\" id=\"node7\">\n",
       "<title>5</title>\n",
       "<path d=\"M48,-177C48,-177 43,-177 43,-177 40.5,-177 38,-174.5 38,-172 38,-172 38,-165 38,-165 38,-162.5 40.5,-160 43,-160 43,-160 48,-160 48,-160 50.5,-160 53,-162.5 53,-165 53,-165 53,-172 53,-172 53,-174.5 50.5,-177 48,-177\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"42\" y=\"-166\">s</text>\n",
       "</g>\n",
       "<!-- _START&#45;&gt;5 -->\n",
       "<g class=\"edge\" id=\"edge1\">\n",
       "<title>_START-&gt;5</title>\n",
       "<path d=\"M1.0054,-168.5C4.0868,-168.5 20.6128,-168.5 32.6265,-168.5\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"37.7642,-168.5 32.7643,-170.7501 35.2642,-168.5 32.7642,-168.5001 32.7642,-168.5001 32.7642,-168.5001 35.2642,-168.5 32.7642,-166.2501 37.7642,-168.5 37.7642,-168.5\" stroke=\"#000000\"/>\n",
       "</g>\n",
       "<!-- 0 -->\n",
       "<g class=\"node\" id=\"node2\">\n",
       "<title>0</title>\n",
       "<path d=\"M160,-177C160,-177 155,-177 155,-177 152.5,-177 150,-174.5 150,-172 150,-172 150,-165 150,-165 150,-162.5 152.5,-160 155,-160 155,-160 160,-160 160,-160 162.5,-160 165,-162.5 165,-165 165,-165 165,-172 165,-172 165,-174.5 162.5,-177 160,-177\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"154\" y=\"-166\">q</text>\n",
       "</g>\n",
       "<!-- 1 -->\n",
       "<g class=\"node\" id=\"node3\">\n",
       "<title>1</title>\n",
       "<path d=\"M269.3333,-326C269.3333,-326 259.6667,-326 259.6667,-326 256.8333,-326 254,-323.1667 254,-320.3333 254,-320.3333 254,-314.6667 254,-314.6667 254,-311.8333 256.8333,-309 259.6667,-309 259.6667,-309 269.3333,-309 269.3333,-309 272.1667,-309 275,-311.8333 275,-314.6667 275,-314.6667 275,-320.3333 275,-320.3333 275,-323.1667 272.1667,-326 269.3333,-326\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"258\" y=\"-315\">qa</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;1 -->\n",
       "<g class=\"edge\" id=\"edge4\">\n",
       "<title>0-&gt;1</title>\n",
       "<path d=\"M157.9026,-177.0669C159.3379,-203.7621 165.1125,-284.7202 183,-302.5 200.1997,-319.5962 230.0838,-320.6818 248.4913,-319.429\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"253.8087,-318.9802 249.0157,-321.6428 251.3176,-319.1905 248.8264,-319.4008 248.8264,-319.4008 248.8264,-319.4008 251.3176,-319.1905 248.6372,-317.1587 253.8087,-318.9802 253.8087,-318.9802\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"190.5\" y=\"-325.3\">a,ε → ε</text>\n",
       "</g>\n",
       "<!-- 2 -->\n",
       "<g class=\"node\" id=\"node4\">\n",
       "<title>2</title>\n",
       "<path d=\"M269.3333,-251C269.3333,-251 259.6667,-251 259.6667,-251 256.8333,-251 254,-248.1667 254,-245.3333 254,-245.3333 254,-239.6667 254,-239.6667 254,-236.8333 256.8333,-234 259.6667,-234 259.6667,-234 269.3333,-234 269.3333,-234 272.1667,-234 275,-236.8333 275,-239.6667 275,-239.6667 275,-245.3333 275,-245.3333 275,-248.1667 272.1667,-251 269.3333,-251\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"258\" y=\"-240\">qb</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge5\">\n",
       "<title>0-&gt;2</title>\n",
       "<path d=\"M158.9559,-177.1775C161.5186,-189.6316 168.0534,-212.4025 183,-224.5 201.8836,-239.784 230.9888,-242.7691 248.828,-243.0091\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"253.9788,-243.0061 248.9802,-245.2592 251.4788,-243.0076 248.9788,-243.0092 248.9788,-243.0092 248.9788,-243.0092 251.4788,-243.0076 248.9774,-240.7592 253.9788,-243.0061 253.9788,-243.0061\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"190.5\" y=\"-247.3\">b,ε → ε</text>\n",
       "</g>\n",
       "<!-- 3 -->\n",
       "<g class=\"node\" id=\"node5\">\n",
       "<title>3</title>\n",
       "<path d=\"M269.3333,-164C269.3333,-164 259.6667,-164 259.6667,-164 256.8333,-164 254,-161.1667 254,-158.3333 254,-158.3333 254,-152.6667 254,-152.6667 254,-149.8333 256.8333,-147 259.6667,-147 259.6667,-147 269.3333,-147 269.3333,-147 272.1667,-147 275,-149.8333 275,-152.6667 275,-152.6667 275,-158.3333 275,-158.3333 275,-161.1667 272.1667,-164 269.3333,-164\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"258\" y=\"-153\">qc</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;3 -->\n",
       "<g class=\"edge\" id=\"edge6\">\n",
       "<title>0-&gt;3</title>\n",
       "<path d=\"M165.1952,-167.5651C182.585,-165.4523 225.2136,-160.2731 248.5658,-157.4359\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"253.7876,-156.8015 249.0955,-159.6382 251.3058,-157.1031 248.8241,-157.4046 248.8241,-157.4046 248.8241,-157.4046 251.3058,-157.1031 248.5527,-155.1711 253.7876,-156.8015 253.7876,-156.8015\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"190.5\" y=\"-170.3\">c,ε → ε</text>\n",
       "</g>\n",
       "<!-- 4 -->\n",
       "<g class=\"node\" id=\"node6\">\n",
       "<title>4</title>\n",
       "<path d=\"M275.3333,-74C275.3333,-74 253.6667,-74 253.6667,-74 250.8333,-74 248,-71.1667 248,-68.3333 248,-68.3333 248,-62.6667 248,-62.6667 248,-59.8333 250.8333,-57 253.6667,-57 253.6667,-57 275.3333,-57 275.3333,-57 278.1667,-57 281,-59.8333 281,-62.6667 281,-62.6667 281,-68.3333 281,-68.3333 281,-71.1667 278.1667,-74 275.3333,-74\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"252\" y=\"-63\">qend</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;4 -->\n",
       "<g class=\"edge\" id=\"edge7\">\n",
       "<title>0-&gt;4</title>\n",
       "<path d=\"M159.7697,-159.7647C163.1721,-148.0742 170.5597,-127.225 183,-113.5 199.6324,-95.1499 224.9548,-81.7612 242.955,-73.8571\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"247.8245,-71.7736 244.1127,-75.8091 245.526,-72.757 243.2276,-73.7405 243.2276,-73.7405 243.2276,-73.7405 245.526,-72.757 242.3425,-71.6719 247.8245,-71.7736 247.8245,-71.7736\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"186.5\" y=\"-119.3\">ε,-| → ε</text>\n",
       "</g>\n",
       "<!-- 6 -->\n",
       "<g class=\"node\" id=\"node8\">\n",
       "<title>6</title>\n",
       "<path d=\"M267,-21C267,-21 262,-21 262,-21 259.5,-21 257,-18.5 257,-16 257,-16 257,-9 257,-9 257,-6.5 259.5,-4 262,-4 262,-4 267,-4 267,-4 269.5,-4 272,-6.5 272,-9 272,-9 272,-16 272,-16 272,-18.5 269.5,-21 267,-21\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<path d=\"M268.3333,-25C268.3333,-25 260.6667,-25 260.6667,-25 256.8333,-25 253,-21.1667 253,-17.3333 253,-17.3333 253,-7.6667 253,-7.6667 253,-3.8333 256.8333,0 260.6667,0 260.6667,0 268.3333,0 268.3333,0 272.1667,0 276,-3.8333 276,-7.6667 276,-7.6667 276,-17.3333 276,-17.3333 276,-21.1667 272.1667,-25 268.3333,-25\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"10.00\" text-anchor=\"start\" x=\"261\" y=\"-10\">f</text>\n",
       "</g>\n",
       "<!-- 0&#45;&gt;6 -->\n",
       "<g class=\"edge\" id=\"edge3\">\n",
       "<title>0-&gt;6</title>\n",
       "<path d=\"M157.9097,-159.7791C159.3658,-132.5985 165.1955,-50.1246 183,-31.5 199.65,-14.0832 228.9313,-11.2586 247.4904,-11.4293\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"252.88,-11.5656 247.8247,-13.6884 250.3808,-11.5023 247.8816,-11.4391 247.8816,-11.4391 247.8816,-11.4391 250.3808,-11.5023 247.9385,-9.1898 252.88,-11.5656 252.88,-11.5656\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-37.3\">ε,$ → ε</text>\n",
       "</g>\n",
       "<!-- 1&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge8\">\n",
       "<title>1-&gt;0</title>\n",
       "<path d=\"M257.5757,-308.8251C251.0885,-301.133 240.7968,-289.9883 230,-282.5 211.2,-269.461 197.8979,-278.864 183,-261.5 172.7562,-249.5605 163.8101,-205.0309 159.777,-182.1954\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"158.9036,-177.1496 161.9735,-181.6926 159.33,-179.613 159.7564,-182.0764 159.7564,-182.0764 159.7564,-182.0764 159.33,-179.613 157.5394,-182.4602 158.9036,-177.1496 158.9036,-177.1496\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-288.3\">ε,a → ε</text>\n",
       "</g>\n",
       "<!-- 1&#45;&gt;1 -->\n",
       "<g class=\"edge\" id=\"edge9\">\n",
       "<title>1-&gt;1</title>\n",
       "<path d=\"M255.8579,-326.1857C251.0841,-334.6739 253.9648,-344 264.5,-344 273.0598,-344 276.5665,-337.8433 275.02,-330.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"273.1421,-326.1857 277.0603,-330.0211 274.0537,-328.5136 274.9652,-330.8415 274.9652,-330.8415 274.9652,-330.8415 274.0537,-328.5136 272.8701,-331.6619 273.1421,-326.1857 273.1421,-326.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"231.5\" y=\"-349.8\">S,ε → [a] S c</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge10\">\n",
       "<title>2-&gt;0</title>\n",
       "<path d=\"M257.2353,-233.6536C249.052,-223.8467 236.1006,-208.8367 230,-204.5 211.4971,-191.347 203.122,-195.0101 183,-184.5 178.5093,-182.1545 173.7632,-179.2911 169.5991,-176.6384\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"165.3485,-173.877 170.7672,-174.7141 167.4449,-175.239 169.5414,-176.601 169.5414,-176.601 169.5414,-176.601 167.4449,-175.239 168.3157,-178.4878 165.3485,-173.877 165.3485,-173.877\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-210.3\">ε,b → ε</text>\n",
       "</g>\n",
       "<!-- 2&#45;&gt;2 -->\n",
       "<g class=\"edge\" id=\"edge11\">\n",
       "<title>2-&gt;2</title>\n",
       "<path d=\"M255.8579,-251.1857C251.0841,-259.6739 253.9648,-269 264.5,-269 273.0598,-269 276.5665,-262.8433 275.02,-255.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"273.1421,-251.1857 277.0603,-255.0211 274.0537,-253.5136 274.9652,-255.8415 274.9652,-255.8415 274.9652,-255.8415 274.0537,-253.5136 272.8701,-256.6619 273.1421,-251.1857 273.1421,-251.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"236.5\" y=\"-274.8\">T,ε → [b] T</text>\n",
       "</g>\n",
       "<!-- 3&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge12\">\n",
       "<title>3-&gt;0</title>\n",
       "<path d=\"M253.7455,-149.0312C247.1472,-145.4461 238.3815,-141.349 230,-139.5 209.6016,-135.0001 202.0326,-130.8915 183,-139.5 175.6984,-142.8025 169.656,-149.3434 165.2887,-155.3785\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"162.2824,-159.8331 163.2145,-154.4299 163.681,-157.7608 165.0795,-155.6886 165.0795,-155.6886 165.0795,-155.6886 163.681,-157.7608 166.9445,-156.9473 162.2824,-159.8331 162.2824,-159.8331\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"188.5\" y=\"-145.3\">ε,c → ε</text>\n",
       "</g>\n",
       "<!-- 3&#45;&gt;3 -->\n",
       "<g class=\"edge\" id=\"edge13\">\n",
       "<title>3-&gt;3</title>\n",
       "<path d=\"M255.8579,-164.1857C251.0841,-172.6739 253.9648,-182 264.5,-182 273.0598,-182 276.5665,-175.8433 275.02,-168.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"273.1421,-164.1857 277.0603,-168.0211 274.0537,-166.5136 274.9652,-168.8415 274.9652,-168.8415 274.9652,-168.8415 274.0537,-166.5136 272.8701,-169.6619 273.1421,-164.1857 273.1421,-164.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"247.5\" y=\"-201.8\">S,ε → T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"248.5\" y=\"-187.8\">T,ε → ε</text>\n",
       "</g>\n",
       "<!-- 4&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge14\">\n",
       "<title>4-&gt;0</title>\n",
       "<path d=\"M247.8796,-58.5514C229.8965,-52.2577 201.3415,-45.9848 183,-60.5 168.3319,-72.1081 161.2442,-128.2478 158.6766,-154.6815\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"158.1962,-159.8446 156.4192,-154.6576 158.4279,-157.3553 158.6595,-154.8661 158.6595,-154.8661 158.6595,-154.8661 158.4279,-157.3553 160.8998,-155.0746 158.1962,-159.8446 158.1962,-159.8446\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"187.5\" y=\"-66.3\">-|,ε → ε</text>\n",
       "</g>\n",
       "<!-- 4&#45;&gt;4 -->\n",
       "<g class=\"edge\" id=\"edge15\">\n",
       "<title>4-&gt;4</title>\n",
       "<path d=\"M255.8579,-74.1857C251.0841,-82.6739 253.9648,-92 264.5,-92 273.0598,-92 276.5665,-85.8433 275.02,-78.9817\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"273.1421,-74.1857 277.0603,-78.0211 274.0537,-76.5136 274.9652,-78.8415 274.9652,-78.8415 274.9652,-78.8415 274.0537,-76.5136 272.8701,-79.6619 273.1421,-74.1857 273.1421,-74.1857\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"247.5\" y=\"-111.8\">S,ε → T</text>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"248.5\" y=\"-97.8\">T,ε → ε</text>\n",
       "</g>\n",
       "<!-- 5&#45;&gt;0 -->\n",
       "<g class=\"edge\" id=\"edge2\">\n",
       "<title>5-&gt;0</title>\n",
       "<path d=\"M53.1982,-168.5C72.1403,-168.5 121.171,-168.5 144.6281,-168.5\" fill=\"none\" stroke=\"#000000\"/>\n",
       "<polygon fill=\"#000000\" points=\"149.7479,-168.5 144.748,-170.7501 147.2479,-168.5 144.7479,-168.5001 144.7479,-168.5001 144.7479,-168.5001 147.2479,-168.5 144.7479,-166.2501 149.7479,-168.5 149.7479,-168.5\" stroke=\"#000000\"/>\n",
       "<text fill=\"#000000\" font-family=\"Courier,monospace\" font-size=\"9.00\" text-anchor=\"start\" x=\"74.5\" y=\"-174.3\">ε,ε → [S] $</text>\n",
       "</g>\n",
       "</g>\n",
       "</svg>"
      ],
      "text/plain": [
       "<IPython.core.display.SVG object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "p3 = read_csv(\"lldpda.csv\")\n",
    "to_graph(p3)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "False"
      ]
     },
     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "p3.is_deterministic()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Now the PDA is deterministic, which is what we wanted. Note that it was _not_ guaranteed to be deterministic; we just got lucky. If it is deterministic, we say that the original grammar is $LL(1)$."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Making grammars $LL(1)$\n",
    "\n",
    "So, if grammars are not guaranteed to be $LL(1)$, how do we design our grammars so that they are? We show two strategies that can sometimes (but not always) help.\n",
    "\n",
    "**Merge common prefixes.** Consider this very simple grammar:\n",
    "\n",
    "\\begin{align*}\n",
    "S &\\rightarrow \\texttt{a b} \\\\\n",
    "S &\\rightarrow \\texttt{a c}\n",
    "\\end{align*}\n",
    "\n",
    "It's not $LL(1)$ because $\\texttt{a}$ belongs to both $\\text{First}(\\texttt{a b})$ and $\\text{First}(\\texttt{a c})$. The solution is to create a new nonterminal for the non-shared part, like this:\n",
    "\n",
    "\\begin{align*}\n",
    "S &\\rightarrow \\texttt{a} S' \\\\\n",
    "S' &\\rightarrow \\texttt{b} \\\\\n",
    "S' &\\rightarrow \\texttt{c}\n",
    "\\end{align*}\n",
    "\n",
    "In general, if we have two rules \n",
    "\\begin{align*}\n",
    "A &\\rightarrow \\beta\\gamma \\\\\n",
    "A &\\rightarrow \\beta\\delta\n",
    "\\end{align*}\n",
    "then $\\text{First}(\\beta\\gamma)$ and $\\text{First}(\\beta\\delta)$ wlil overlap, and the solution is to change this to\n",
    "\\begin{align*}\n",
    "A &\\rightarrow \\beta A' \\\\\n",
    "A' &\\rightarrow \\gamma \\\\\n",
    "A' &\\rightarrow \\delta\n",
    "\\end{align*}\n",
    "\n",
    "**Eliminate left recursion.** Consider this grammar:\n",
    "\n",
    "\\begin{align*}\n",
    "S &\\rightarrow S~\\texttt{-}~T \\\\\n",
    "S &\\rightarrow T \\\\\n",
    "T &\\rightarrow \\texttt{1}\n",
    "\\end{align*}\n",
    "\n",
    "The first rule is called _left-recursive_ because the first symbol on the right-hand side is the same as the left-hand side. In such cases there will always be an overlap between the left-recursive rule's and the \"base case\" rule's right-hand side. The usual fix is:\n",
    "\n",
    "\\begin{align*}\n",
    "S &\\rightarrow T S' \\\\\n",
    "S' &\\rightarrow \\texttt{-}~T S' \\\\\n",
    "S' &\\rightarrow \\varepsilon \\\\\n",
    "T &\\rightarrow \\texttt{1}\n",
    "\\end{align*}\n",
    "\n",
    "But be careful, because it seems we just changed - (minus) from left-associative to right-associative. In a recursive-descent parser, we can get the associativity correct like this:\n",
    "\n",
    "```\n",
    "function parse(w)\n",
    "    val, i <- parseS(w, 0)\n",
    "    if i = |w| then\n",
    "        return val\n",
    "    else\n",
    "        error\n",
    "        \n",
    "function parseS(w, i)\n",
    "    # S -> T S'\n",
    "    val, i = parseT(w, i)\n",
    "    val, i = parseS'(w, i, val)\n",
    "    return val, i\n",
    "    \n",
    "function parseS'(w, i, val)\n",
    "    # S' -> + T S'\n",
    "    if i < |w| and w[i] in {\"+\"} # First(+ T S')\n",
    "        i <- i + 1\n",
    "        val2, i = parseT(w, i)\n",
    "        val, i = parseS'(w, i, val-val2)\n",
    "        return val, i\n",
    "    else if i = |w| # Follow(S')\n",
    "        return val, i\n",
    "    else\n",
    "        error\n",
    "        \n",
    "function parseT(w, i)\n",
    "    # T -> a\n",
    "    if i < |w| and w[i] in {\"1\"} # First(1)\n",
    "        i <- i + 1\n",
    "        return 1, i\n",
    "    else\n",
    "        error\n",
    "```"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.5"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}