{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Constituent Parsing Exercises\n", "\n", "\n", "\n", "In the lecture we took a look at a simple tokenizer and sentence segmenter. In this exercise we will expand our understanding of the problem by asking a few important questions, and looking at the problem from a different perspectives." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## <font color='green'>Setup 1</font>: Load Libraries" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "%%capture\n", "%load_ext autoreload\n", "%autoreload 2\n", "%matplotlib inline\n", "# %cd .. \n", "import sys\n", "sys.path.append(\"..\")\n", "import math \n", "import statnlpbook.util as util\n", "import statnlpbook.parsing as parsing" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## <font color='blue'>Task 1</font>: Understanding parsing\n", "\n", "Be sure you understand [grammatical categories and structures](http://webdelprofesor.ula.ve/humanidades/azapata/materias/english_4/grammatical_categories_structures_and_syntactical_functions.pdf) and brush up on your [grammar skils](http://www.ucl.ac.uk/internet-grammar/intro/intro.htm).\n", "\n", "Then re-visit the [Enju online parser](http://www.nactem.ac.uk/enju/demo.html), and parse the following sentences...\n", "\n", "What is wrong with the parses of the following sentences? Are they correct?\n", "- Fat people eat accumulates.\n", "- The fat that people eat accumulates in their bodies.\n", "- The fat that people eat is accumulating in their bodies.\n", "\n", "What about these, is the problem in the parser or in the sentence?\n", " - The old man the boat.\n", " - The old people man the boat. \n", "\n", "These were examples of garden path sentences, find out what that means.\n", "\n", "What about these sentences? Are their parses correct?\n", " - Time flies like an arrow; fruit flies like a banana.\n", " - We saw her duck." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## <font color='blue'>Task 2</font>: Parent Annotation\n", "\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Reminisce the lecture notes in parsing, and the mentioned parent annotation. (grand)*parents, matter - knowing who the parent is in a tree gives a bit of context information which can later help us with smoothing probabilities, and approaching context-dependent parsing.\n", "\n", "in that case, each non-terminal node should know it's parent. We'll do this exercise on a single tree, just to play around a bit with trees and their labeling.\n", "\n", "\n", "Given the following tree:" ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "data": { "image/svg+xml": [ "<?xml version=\"1.0\" encoding=\"UTF-8\" standalone=\"no\"?>\n", "<!DOCTYPE svg PUBLIC \"-//W3C//DTD SVG 1.1//EN\"\n", " \"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd\">\n", "<!-- Generated by graphviz version 2.38.0 (20140413.2041)\n", " -->\n", "<!-- Title: %3 Pages: 1 -->\n", "<svg width=\"506pt\" height=\"260pt\"\n", " viewBox=\"0.00 0.00 506.30 260.00\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n", "<g id=\"graph0\" class=\"graph\" transform=\"scale(1 1) rotate(0) translate(4 256)\">\n", "<title>%3</title>\n", "<polygon fill=\"white\" stroke=\"none\" points=\"-4,4 -4,-256 502.296,-256 502.296,4 -4,4\"/>\n", "<!-- 0 -->\n", "<g id=\"node1\" class=\"node\"><title>0</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"27\" cy=\"-90\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"27\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">He</text>\n", "</g>\n", "<!-- 1 -->\n", "<g id=\"node2\" class=\"node\"><title>1</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"65\" cy=\"-162\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"65\" y=\"-158.3\" font-family=\"Times,serif\" font-size=\"14.00\">Subj</text>\n", "</g>\n", "<!-- 1->0 -->\n", "<g id=\"edge1\" class=\"edge\"><title>1->0</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M56.1865,-144.765C51.5823,-136.283 45.845,-125.714 40.6786,-116.197\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"43.6126,-114.266 35.7657,-107.147 37.4607,-117.606 43.6126,-114.266\"/>\n", "</g>\n", "<!-- 2 -->\n", "<g id=\"node3\" class=\"node\"><title>2</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"59\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"59\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">shot</text>\n", "</g>\n", "<!-- 3 -->\n", "<g id=\"node4\" class=\"node\"><title>3</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"101\" cy=\"-90\" rx=\"28.6953\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"101\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">Verb</text>\n", "</g>\n", "<!-- 3->2 -->\n", "<g id=\"edge2\" class=\"edge\"><title>3->2</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M91.2588,-72.7646C86.1169,-64.1948 79.6964,-53.494 73.9402,-43.9004\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"76.8346,-41.9215 68.6884,-35.1473 70.8322,-45.523 76.8346,-41.9215\"/>\n", "</g>\n", "<!-- 4 -->\n", "<g id=\"node5\" class=\"node\"><title>4</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"131\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"131\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">the</text>\n", "</g>\n", "<!-- 5 -->\n", "<g id=\"node6\" class=\"node\"><title>5</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"216\" cy=\"-18\" rx=\"40.0939\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"216\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">elephant</text>\n", "</g>\n", "<!-- 6 -->\n", "<g id=\"node7\" class=\"node\"><title>6</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"175\" cy=\"-90\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"175\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">Obj</text>\n", "</g>\n", "<!-- 6->4 -->\n", "<g id=\"edge3\" class=\"edge\"><title>6->4</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M165.016,-73.1159C159.503,-64.345 152.537,-53.2637 146.347,-43.4162\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"149.23,-41.425 140.945,-34.8212 143.303,-45.1502 149.23,-41.425\"/>\n", "</g>\n", "<!-- 6->5 -->\n", "<g id=\"edge4\" class=\"edge\"><title>6->5</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M184.509,-72.7646C189.457,-64.3172 195.618,-53.7993 201.174,-44.3122\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"204.224,-46.0312 206.258,-35.6334 198.183,-42.4933 204.224,-46.0312\"/>\n", "</g>\n", "<!-- 7 -->\n", "<g id=\"node8\" class=\"node\"><title>7</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"301\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"301\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">in</text>\n", "</g>\n", "<!-- 8 -->\n", "<g id=\"node9\" class=\"node\"><title>8</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"373\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"373\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">his</text>\n", "</g>\n", "<!-- 9 -->\n", "<g id=\"node10\" class=\"node\"><title>9</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"458\" cy=\"-18\" rx=\"40.0939\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"458\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">pyjamas</text>\n", "</g>\n", "<!-- 10 -->\n", "<g id=\"node11\" class=\"node\"><title>10</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"337\" cy=\"-90\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"337\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">PP</text>\n", "</g>\n", "<!-- 10->7 -->\n", "<g id=\"edge5\" class=\"edge\"><title>10->7</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M328.65,-72.7646C324.288,-64.2831 318.853,-53.7144 313.959,-44.1974\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"316.99,-42.4395 309.304,-35.1473 310.765,-45.6409 316.99,-42.4395\"/>\n", "</g>\n", "<!-- 10->8 -->\n", "<g id=\"edge6\" class=\"edge\"><title>10->8</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M345.35,-72.7646C349.712,-64.2831 355.147,-53.7144 360.041,-44.1974\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"363.235,-45.6409 364.696,-35.1473 357.01,-42.4395 363.235,-45.6409\"/>\n", "</g>\n", "<!-- 10->9 -->\n", "<g id=\"edge7\" class=\"edge\"><title>10->9</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M356.71,-77.5975C375.309,-66.8378 403.535,-50.5089 425.47,-37.8188\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"427.234,-40.8421 434.137,-32.8049 423.729,-34.783 427.234,-40.8421\"/>\n", "</g>\n", "<!-- 11 -->\n", "<g id=\"node12\" class=\"node\"><title>11</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"138\" cy=\"-162\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"138\" y=\"-158.3\" font-family=\"Times,serif\" font-size=\"14.00\">VP</text>\n", "</g>\n", "<!-- 11->3 -->\n", "<g id=\"edge8\" class=\"edge\"><title>11->3</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M129.418,-144.765C124.999,-136.404 119.508,-126.016 114.535,-116.606\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"117.559,-114.839 109.792,-107.633 111.371,-118.11 117.559,-114.839\"/>\n", "</g>\n", "<!-- 11->6 -->\n", "<g id=\"edge9\" class=\"edge\"><title>11->6</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M146.582,-144.765C151.065,-136.283 156.651,-125.714 161.681,-116.197\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"164.886,-117.624 166.465,-107.147 158.698,-114.353 164.886,-117.624\"/>\n", "</g>\n", "<!-- 11->10 -->\n", "<g id=\"edge10\" class=\"edge\"><title>11->10</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M161.609,-152.695C197.011,-140.242 263.971,-116.689 304.132,-102.562\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"305.457,-105.806 313.729,-99.1857 303.134,-99.2023 305.457,-105.806\"/>\n", "</g>\n", "<!-- 12 -->\n", "<g id=\"node13\" class=\"node\"><title>12</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"101\" cy=\"-234\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"101\" y=\"-230.3\" font-family=\"Times,serif\" font-size=\"14.00\">S</text>\n", "</g>\n", "<!-- 12->1 -->\n", "<g id=\"edge11\" class=\"edge\"><title>12->1</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M92.6504,-216.765C88.2885,-208.283 82.8531,-197.714 77.9587,-188.197\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"80.9904,-186.439 73.3043,-179.147 74.7654,-189.641 80.9904,-186.439\"/>\n", "</g>\n", "<!-- 12->11 -->\n", "<g id=\"edge12\" class=\"edge\"><title>12->11</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M109.582,-216.765C114.065,-208.283 119.651,-197.714 124.681,-188.197\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"127.886,-189.624 129.465,-179.147 121.698,-186.353 127.886,-189.624\"/>\n", "</g>\n", "</g>\n", "</svg>\n" ], "text/plain": [ "<graphviz.dot.Digraph at 0x7efee80a1d30>" ] }, "execution_count": 2, "metadata": {}, "output_type": "execute_result" } ], "source": [ "x = ('S', [('Subj', ['He']), ('VP', [('Verb', ['shot']), ('Obj', ['the', 'elephant']), ('PP', ['in', 'his', 'pyjamas'])])])\n", "parsing.render_tree(x)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Construct an `annotate_parents` function which will take that tree, and annotate its parents. The final annotation result should look like this:" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "data": { "image/svg+xml": [ "<?xml version=\"1.0\" encoding=\"UTF-8\" standalone=\"no\"?>\n", "<!DOCTYPE svg PUBLIC \"-//W3C//DTD SVG 1.1//EN\"\n", " \"http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd\">\n", "<!-- Generated by graphviz version 2.38.0 (20140413.2041)\n", " -->\n", "<!-- Title: %3 Pages: 1 -->\n", "<svg width=\"535pt\" height=\"260pt\"\n", " viewBox=\"0.00 0.00 535.30 260.00\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\">\n", "<g id=\"graph0\" class=\"graph\" transform=\"scale(1 1) rotate(0) translate(4 256)\">\n", "<title>%3</title>\n", "<polygon fill=\"white\" stroke=\"none\" points=\"-4,4 -4,-256 531.296,-256 531.296,4 -4,4\"/>\n", "<!-- 0 -->\n", "<g id=\"node1\" class=\"node\"><title>0</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"27\" cy=\"-90\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"27\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">He</text>\n", "</g>\n", "<!-- 1 -->\n", "<g id=\"node2\" class=\"node\"><title>1</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"80\" cy=\"-162\" rx=\"36.2938\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"80\" y=\"-158.3\" font-family=\"Times,serif\" font-size=\"14.00\">Subj^S</text>\n", "</g>\n", "<!-- 1->0 -->\n", "<g id=\"edge1\" class=\"edge\"><title>1->0</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M67.7075,-144.765C60.9905,-135.893 52.5445,-124.738 45.0893,-114.892\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"47.6881,-112.526 38.8612,-106.666 42.1072,-116.751 47.6881,-112.526\"/>\n", "</g>\n", "<!-- 2 -->\n", "<g id=\"node3\" class=\"node\"><title>2</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"88\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"88\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">shot</text>\n", "</g>\n", "<!-- 3 -->\n", "<g id=\"node4\" class=\"node\"><title>3</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"116\" cy=\"-90\" rx=\"44.393\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"116\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">Verb^VP</text>\n", "</g>\n", "<!-- 3->2 -->\n", "<g id=\"edge2\" class=\"edge\"><title>3->2</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M109.222,-72.055C105.989,-63.9726 102.049,-54.1214 98.4447,-45.1117\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"101.679,-43.7739 94.7156,-35.789 95.1799,-46.3737 101.679,-43.7739\"/>\n", "</g>\n", "<!-- 4 -->\n", "<g id=\"node5\" class=\"node\"><title>4</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"160\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"160\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">the</text>\n", "</g>\n", "<!-- 5 -->\n", "<g id=\"node6\" class=\"node\"><title>5</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"245\" cy=\"-18\" rx=\"40.0939\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"245\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">elephant</text>\n", "</g>\n", "<!-- 6 -->\n", "<g id=\"node7\" class=\"node\"><title>6</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"218\" cy=\"-90\" rx=\"39.7935\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"218\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">Obj^VP</text>\n", "</g>\n", "<!-- 6->4 -->\n", "<g id=\"edge3\" class=\"edge\"><title>6->4</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M204.548,-72.7646C197.018,-63.6773 187.504,-52.1939 179.202,-42.1753\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"181.66,-39.6561 172.585,-34.189 176.27,-44.1223 181.66,-39.6561\"/>\n", "</g>\n", "<!-- 6->5 -->\n", "<g id=\"edge4\" class=\"edge\"><title>6->5</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M224.536,-72.055C227.62,-64.0586 231.372,-54.3308 234.817,-45.3995\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"238.191,-46.3786 238.524,-35.789 231.66,-43.8594 238.191,-46.3786\"/>\n", "</g>\n", "<!-- 7 -->\n", "<g id=\"node8\" class=\"node\"><title>7</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"330\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"330\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">in</text>\n", "</g>\n", "<!-- 8 -->\n", "<g id=\"node9\" class=\"node\"><title>8</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"402\" cy=\"-18\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"402\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">his</text>\n", "</g>\n", "<!-- 9 -->\n", "<g id=\"node10\" class=\"node\"><title>9</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"487\" cy=\"-18\" rx=\"40.0939\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"487\" y=\"-14.3\" font-family=\"Times,serif\" font-size=\"14.00\">pyjamas</text>\n", "</g>\n", "<!-- 10 -->\n", "<g id=\"node11\" class=\"node\"><title>10</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"366\" cy=\"-90\" rx=\"36.2938\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"366\" y=\"-86.3\" font-family=\"Times,serif\" font-size=\"14.00\">PP^VP</text>\n", "</g>\n", "<!-- 10->7 -->\n", "<g id=\"edge5\" class=\"edge\"><title>10->7</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M357.469,-72.411C353.164,-64.0421 347.851,-53.7098 343.047,-44.3698\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"346.156,-42.7621 338.47,-35.4699 339.931,-45.9636 346.156,-42.7621\"/>\n", "</g>\n", "<!-- 10->8 -->\n", "<g id=\"edge6\" class=\"edge\"><title>10->8</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M374.531,-72.411C378.836,-64.0421 384.149,-53.7098 388.953,-44.3698\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"392.069,-45.9636 393.53,-35.4699 385.844,-42.7621 392.069,-45.9636\"/>\n", "</g>\n", "<!-- 10->9 -->\n", "<g id=\"edge7\" class=\"edge\"><title>10->9</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M388.759,-75.8335C407.246,-65.1389 433.56,-49.9159 454.302,-37.9162\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"456.198,-40.863 463.101,-32.8258 452.693,-34.8039 456.198,-40.863\"/>\n", "</g>\n", "<!-- 11 -->\n", "<g id=\"node12\" class=\"node\"><title>11</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"167\" cy=\"-162\" rx=\"31.3957\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"167\" y=\"-158.3\" font-family=\"Times,serif\" font-size=\"14.00\">VP^S</text>\n", "</g>\n", "<!-- 11->3 -->\n", "<g id=\"edge8\" class=\"edge\"><title>11->3</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M155.427,-145.116C149.154,-136.505 141.258,-125.667 134.183,-115.957\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"136.956,-113.819 128.238,-107.797 131.298,-117.941 136.956,-113.819\"/>\n", "</g>\n", "<!-- 11->6 -->\n", "<g id=\"edge9\" class=\"edge\"><title>11->6</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M178.573,-145.116C184.937,-136.38 192.972,-125.353 200.124,-115.535\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"203.059,-117.45 206.119,-107.307 197.402,-113.328 203.059,-117.45\"/>\n", "</g>\n", "<!-- 11->10 -->\n", "<g id=\"edge10\" class=\"edge\"><title>11->10</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M193.016,-151.849C227.3,-139.789 287.884,-118.478 327.486,-104.548\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"328.802,-107.795 337.074,-101.175 326.479,-101.192 328.802,-107.795\"/>\n", "</g>\n", "<!-- 12 -->\n", "<g id=\"node13\" class=\"node\"><title>12</title>\n", "<ellipse fill=\"none\" stroke=\"black\" cx=\"123\" cy=\"-234\" rx=\"27\" ry=\"18\"/>\n", "<text text-anchor=\"middle\" x=\"123\" y=\"-230.3\" font-family=\"Times,serif\" font-size=\"14.00\">S^?</text>\n", "</g>\n", "<!-- 12->1 -->\n", "<g id=\"edge11\" class=\"edge\"><title>12->1</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M113.243,-217.116C107.931,-208.469 101.239,-197.575 95.2546,-187.833\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"98.2337,-185.996 90.0172,-179.307 92.2691,-189.66 98.2337,-185.996\"/>\n", "</g>\n", "<!-- 12->11 -->\n", "<g id=\"edge12\" class=\"edge\"><title>12->11</title>\n", "<path fill=\"none\" stroke=\"black\" d=\"M132.984,-217.116C138.42,-208.469 145.267,-197.575 151.391,-187.833\"/>\n", "<polygon fill=\"black\" stroke=\"black\" points=\"154.391,-189.636 156.75,-179.307 148.465,-185.911 154.391,-189.636\"/>\n", "</g>\n", "</g>\n", "</svg>\n" ], "text/plain": [ "<graphviz.dot.Digraph at 0x7efebdc806d8>" ] }, "execution_count": 3, "metadata": {}, "output_type": "execute_result" } ], "source": [ "y = ('S^?', [('Subj^S', ['He']), ('VP^S', [('Verb^VP', ['shot']), ('Obj^VP', ['the', 'elephant']), ('PP^VP', ['in', 'his', 'pyjamas'])])])\n", "parsing.render_tree(y)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Solutions\n", "\n", "You can find the solutions to this exercises [here](parsing_solutions.ipynb)" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.6.2" } }, "nbformat": 4, "nbformat_minor": 1 }