{ "cells": [ { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "slide" }, "tags": [] }, "source": [ "# Testing Graphical User Interfaces\n", "\n", "In this chapter, we explore how to generate tests for Graphical User Interfaces (GUIs), abstracting from our [previous examples on Web testing](WebFuzzer.ipynb). Building on general means to extract user interface elements and activate them, our techniques generalize to arbitrary graphical user interfaces, from rich Web applications to mobile apps, and systematically explore user interfaces through forms and navigation elements." ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:19.762277Z", "iopub.status.busy": "2023-01-07T14:53:19.760331Z", "iopub.status.idle": "2023-01-07T14:53:19.817987Z", "shell.execute_reply": "2023-01-07T14:53:19.818230Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [ { "data": { "text/html": [ "\n", " \n", " " ], "text/plain": [ "" ] }, "execution_count": 1, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from bookutils import YouTubeVideo\n", "YouTubeVideo('79-HRgFot4k')" ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "source": [ "**Prerequisites**\n", "\n", "* We build on the Web server introduced in the [chapter on Web testing](WebFuzzer.ipynb)." 
] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "skip" } }, "source": [ "## Synopsis\n", "\n", "\n", "To [use the code provided in this chapter](Importing.ipynb), write\n", "\n", "```python\n", ">>> from fuzzingbook.GUIFuzzer import \n", "```\n", "\n", "and then make use of the following features.\n", "\n", "\n", "This chapter demonstrates how to programmatically interact with user interfaces, using Selenium on Web browsers. It provides an experimental `GUICoverageFuzzer` class that automatically explores a user interface by systematically interacting with all available user interface elements.\n", "\n", "The function `start_webdriver()` starts a headless Web browser in the background and returns a _GUI driver_ as handle for further communication.\n", "\n", "```python\n", ">>> gui_driver = start_webdriver()\n", "```\n", "We let the browser open the URL of the server we want to investigate (in this case, the vulnerable server from [the chapter on Web fuzzing](WebFuzzer.ipynb)) and obtain a screenshot.\n", "\n", "```python\n", ">>> gui_driver.get(httpd_url)\n", ">>> Image(gui_driver.get_screenshot_as_png())\n", "```\n", "![](PICS/GUIFuzzer-synopsis-1.png)\n", "\n", "The `GUICoverageFuzzer` class explores the user interface and builds a _grammar_ that encodes all states as well as the user interactions required to move from one state to the next. 
It is paired with a `GUIRunner`, which interacts with the GUI driver.\n", "\n", "```python\n", ">>> gui_fuzzer = GUICoverageFuzzer(gui_driver)\n", ">>> gui_runner = GUIRunner(gui_driver)\n", "```\n", "The `explore_all()` method extracts all states and all transitions from a Web user interface.\n", "\n", "```python\n", ">>> gui_fuzzer.explore_all(gui_runner)\n", "```\n", "The grammar embeds a finite state automaton and is best visualized as such.\n", "\n", "```python\n", ">>> fsm_diagram(gui_fuzzer.grammar)\n", "```\n", "![](PICS/GUIFuzzer-synopsis-2.svg)\n", "\n", "The GUI fuzzer's `fuzz()` method produces sequences of interactions that follow paths through the finite state machine. Since `GUICoverageFuzzer` is derived from `CoverageFuzzer` (see the [chapter on coverage-based grammar fuzzing](GrammarCoverageFuzzer.ipynb)), it automatically covers (a) as many transitions between states and (b) as many form elements as possible. In our case, the first set of actions explores the transition via the \"order form\" link; the second set then goes until the \"\" state.\n", "\n", "```python\n", ">>> gui_driver.get(httpd_url)\n", ">>> actions = gui_fuzzer.fuzz()\n", ">>> print(actions)\n", "fill('zip', '1')\n", "check('terms', False)\n", "fill('name', 'Q')\n", "fill('email', 'K@i')\n", "fill('city', 'lGd')\n", "submit('submit')\n", "click('order form')\n", "click('terms and conditions')\n", "click('order form')\n", "fill('zip', '6')\n", "check('terms', True)\n", "fill('name', 'w')\n", "fill('email', 'S@q')\n", "fill('city', 'h')\n", "submit('submit')\n", "\n", "\n", "```\n", "These actions can be fed into the GUI runner, which will execute them on the given GUI driver.\n", "\n", "```python\n", ">>> gui_driver.get(httpd_url)\n", ">>> result, outcome = gui_runner.run(actions)\n", ">>> Image(gui_driver.get_screenshot_as_png())\n", "```\n", "![](PICS/GUIFuzzer-synopsis-3.png)\n", "\n", "Further invocations of `fuzz()` will cover more of the model – for instance, exploring 
the terms and conditions.\n", "\n", "Internally, `GUIFuzzer` and `GUICoverageFuzzer` use a subclass `GUIGrammarMiner`, which implements the analysis of the GUI and all its states. Subclassing `GUIGrammarMiner` allows one to extend the interpretation of GUIs; the `GUIFuzzer` constructor accepts a miner via the `miner` keyword parameter.\n", "\n", "A tool like `GUICoverageFuzzer` will provide \"deep\" exploration of user interfaces, even filling out forms to explore what is behind them. Keep in mind, though, that `GUICoverageFuzzer` is experimental: It only supports a subset of HTML form and link features, and does not take JavaScript into account.\n", "\n", "![](PICS/GUIFuzzer-synopsis-4.svg)\n", "\n" ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": true, "run_control": { "read_only": false }, "slideshow": { "slide_type": "slide" } }, "source": [ "## Automated GUI Interaction\n", "\n", "In the [chapter on Web testing](WebFuzzer.ipynb), we have shown how to test Web-based interfaces by directly interacting with a Web server using the HTTP protocol, and processing the retrieved HTML pages to identify user interface elements. While these techniques work well for user interfaces that are based on HTML only, they fail as soon as there are interactive elements that use JavaScript to execute code within the browser, and generate and change the user interface without having to interact with the server." ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": true, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "source": [ "In this chapter, we therefore take a different approach to user interface testing. 
Rather than using HTTP and HTML as the mechanisms for interaction, we leverage a dedicated _UI testing framework_, which allows us to\n", "\n", "* query the program under test for available user interface elements, and\n", "* query the UI elements for how they can be interacted with.\n", "\n", "Although we will again illustrate our approach using a Web server, the approach easily generalizes to _arbitrary user interfaces_. In fact, the UI testing framework we use, *Selenium*, also comes in variants that work with Android apps." ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": true, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "source": [ "### Our Web Server, Again\n", "\n", "As in the [chapter on Web testing](WebFuzzer.ipynb), we run a Web server that allows us to order products." ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "button": false, "execution": { "iopub.execute_input": "2023-01-07T14:53:19.820593Z", "iopub.status.busy": "2023-01-07T14:53:19.820280Z", "iopub.status.idle": "2023-01-07T14:53:19.821387Z", "shell.execute_reply": "2023-01-07T14:53:19.821639Z" }, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "import bookutils" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:19.823351Z", "iopub.status.busy": "2023-01-07T14:53:19.823035Z", "iopub.status.idle": "2023-01-07T14:53:19.824580Z", "shell.execute_reply": "2023-01-07T14:53:19.824332Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from typing import Set, FrozenSet, List, Optional, Tuple, Any" ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:19.826097Z", "iopub.status.busy": "2023-01-07T14:53:19.825810Z", "iopub.status.idle": "2023-01-07T14:53:19.827207Z", "shell.execute_reply": 
"2023-01-07T14:53:19.827403Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "import os\n", "import sys" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:19.829029Z", "iopub.status.busy": "2023-01-07T14:53:19.828731Z", "iopub.status.idle": "2023-01-07T14:53:19.830010Z", "shell.execute_reply": "2023-01-07T14:53:19.830219Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "# ignore\n", "if 'CI' in os.environ:\n", " # Can't run this in our continuous environment,\n", " # since it can't run a headless Web browser\n", " sys.exit(0)" ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:19.831923Z", "iopub.status.busy": "2023-01-07T14:53:19.831581Z", "iopub.status.idle": "2023-01-07T14:53:20.436054Z", "shell.execute_reply": "2023-01-07T14:53:20.436279Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from WebFuzzer import init_db, start_httpd, webbrowser, print_httpd_messages\n", "from WebFuzzer import print_url, ORDERS_DB" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.438202Z", "iopub.status.busy": "2023-01-07T14:53:20.437900Z", "iopub.status.idle": "2023-01-07T14:53:20.439116Z", "shell.execute_reply": "2023-01-07T14:53:20.439321Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "import html" ] }, { "cell_type": "code", "execution_count": 8, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.441107Z", "iopub.status.busy": "2023-01-07T14:53:20.440737Z", "iopub.status.idle": "2023-01-07T14:53:20.443300Z", "shell.execute_reply": "2023-01-07T14:53:20.443513Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "db = init_db()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, 
"source": [ "This is the address of our web server:" ] }, { "cell_type": "code", "execution_count": 9, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.445358Z", "iopub.status.busy": "2023-01-07T14:53:20.445037Z", "iopub.status.idle": "2023-01-07T14:53:20.454578Z", "shell.execute_reply": "2023-01-07T14:53:20.454827Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
http://127.0.0.1:8800
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "httpd_process, httpd_url = start_httpd()\n", "print_url(httpd_url)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Using `webbrowser()`, we can retrieve the HTML of the home page, and use `HTML()` to render it." ] }, { "cell_type": "code", "execution_count": 10, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.457111Z", "iopub.status.busy": "2023-01-07T14:53:20.456807Z", "iopub.status.idle": "2023-01-07T14:53:20.458065Z", "shell.execute_reply": "2023-01-07T14:53:20.458346Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from IPython.display import display, Image" ] }, { "cell_type": "code", "execution_count": 11, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.459962Z", "iopub.status.busy": "2023-01-07T14:53:20.459618Z", "iopub.status.idle": "2023-01-07T14:53:20.461388Z", "shell.execute_reply": "2023-01-07T14:53:20.461591Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from bookutils import HTML, rich_output" ] }, { "cell_type": "code", "execution_count": 12, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.463903Z", "iopub.status.busy": "2023-01-07T14:53:20.463596Z", "iopub.status.idle": "2023-01-07T14:53:20.473017Z", "shell.execute_reply": "2023-01-07T14:53:20.473338Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
127.0.0.1 - - [07/Jan/2023 15:53:20] \"GET / HTTP/1.1\" 200 -\n",
       "
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "\n", "\n", "
\n", " Fuzzingbook Swag Order Form\n", "

\n", " Yes! Please send me at your earliest convenience\n", " \n", "
\n", " \n", " \n", " \n", "
\n", " \n", " \n", "
\n", "
\n", " \n", " \n", " \n", "
\n", " .
\n", " \n", "

\n", "
\n", "\n" ], "text/plain": [ "" ] }, "execution_count": 12, "metadata": {}, "output_type": "execute_result" } ], "source": [ "HTML(webbrowser(httpd_url))" ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "source": [ "### Remote Control with Selenium\n", "\n", "Let us take a look at the GUI above. In contrast to the [chapter on Web testing](WebFuzzer.ipynb), we do not assume we can access the HTML source of the current page. All we assume is that there is a set of *user interface elements* we can interact with." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "[Selenium](https://www.seleniumhq.org) is a framework for testing Web applications by _automating interaction in the browser_. Selenium provides an API that allows one to launch a Web browser, query the state of the user interface, and interact with individual user interface elements. The Selenium API is available in a number of languages; we use the [Selenium API for Python](https://selenium-python.readthedocs.io/index.html)." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "A Selenium *web driver* is the interface between a program and a browser controlled by the program." ] }, { "cell_type": "code", "execution_count": 13, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.475438Z", "iopub.status.busy": "2023-01-07T14:53:20.475077Z", "iopub.status.idle": "2023-01-07T14:53:20.491046Z", "shell.execute_reply": "2023-01-07T14:53:20.491373Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from selenium import webdriver" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The following code starts a Firefox browser in the background, which we then control through the web driver." 
] }, { "cell_type": "code", "execution_count": 14, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.493657Z", "iopub.status.busy": "2023-01-07T14:53:20.493330Z", "iopub.status.idle": "2023-01-07T14:53:20.494664Z", "shell.execute_reply": "2023-01-07T14:53:20.494859Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "BROWSER = 'firefox'" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "**Note:** If you don't have Firefox installed, you can also set `BROWSER` to `'chrome'` to use Google Chrome instead." ] }, { "cell_type": "code", "execution_count": 15, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.496551Z", "iopub.status.busy": "2023-01-07T14:53:20.496254Z", "iopub.status.idle": "2023-01-07T14:53:20.497459Z", "shell.execute_reply": "2023-01-07T14:53:20.497663Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "# BROWSER = 'chrome'" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "**Note:** For Firefox, you may have to make sure the [geckodriver program](https://github.com/mozilla/geckodriver/releases) is in your path." 
] }, { "cell_type": "code", "execution_count": 16, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.499274Z", "iopub.status.busy": "2023-01-07T14:53:20.498965Z", "iopub.status.idle": "2023-01-07T14:53:20.500249Z", "shell.execute_reply": "2023-01-07T14:53:20.500454Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "import shutil" ] }, { "cell_type": "code", "execution_count": 17, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.502548Z", "iopub.status.busy": "2023-01-07T14:53:20.502167Z", "iopub.status.idle": "2023-01-07T14:53:20.503392Z", "shell.execute_reply": "2023-01-07T14:53:20.503582Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "if BROWSER == 'firefox':\n", "    assert shutil.which('geckodriver') is not None, \\\n", "        \"Please install 'geckodriver' executable \" \\\n", "        \"from https://github.com/mozilla/geckodriver/releases\"" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The browser is _headless_, meaning that it does not show on the screen." ] }, { "cell_type": "code", "execution_count": 18, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.505306Z", "iopub.status.busy": "2023-01-07T14:53:20.505000Z", "iopub.status.idle": "2023-01-07T14:53:20.506609Z", "shell.execute_reply": "2023-01-07T14:53:20.506364Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "HEADLESS = True" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "**Note**: If the notebook server runs locally (i.e. on the same machine on which you are seeing this), you can also set `HEADLESS` to `False` and see what happens right on the screen as you execute the notebook cells. This is very much recommended for interactive sessions." 
] }, { "cell_type": "code", "execution_count": 19, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.509545Z", "iopub.status.busy": "2023-01-07T14:53:20.509162Z", "iopub.status.idle": "2023-01-07T14:53:20.510504Z", "shell.execute_reply": "2023-01-07T14:53:20.510694Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "def start_webdriver(browser=BROWSER, headless=HEADLESS, zoom=1.4):\n", "    if browser == 'firefox':\n", "        options = webdriver.FirefoxOptions()\n", "    elif browser == 'chrome':\n", "        options = webdriver.ChromeOptions()\n", "\n", "    if headless:\n", "        # Pass headless mode as a command-line argument; the `options.headless`\n", "        # setter is deprecated since Selenium 4.8 and absent from later releases.\n", "        options.add_argument('-headless' if browser == 'firefox' else 'headless')\n", "\n", "    # Start the browser, and obtain a _web driver_ object such that we can interact with it.\n", "    if browser == 'firefox':\n", "        # For Firefox, set a higher resolution for our screenshots\n", "        options.set_preference(\"layout.css.devPixelsPerPx\", repr(zoom))\n", "        gui_driver = webdriver.Firefox(options=options)\n", "\n", "        # We set the window size such that it fits our order form exactly;\n", "        # this is useful for not wasting too much space when taking screenshots.\n", "        gui_driver.set_window_size(700, 300)\n", "\n", "    elif browser == 'chrome':\n", "        gui_driver = webdriver.Chrome(options=options)\n", "        gui_driver.set_window_size(700, 210 if headless else 340)\n", "\n", "    return gui_driver" ] }, { "cell_type": "code", "execution_count": 20, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:20.512507Z", "iopub.status.busy": "2023-01-07T14:53:20.512189Z", "iopub.status.idle": "2023-01-07T14:53:23.012070Z", "shell.execute_reply": "2023-01-07T14:53:23.012351Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "gui_driver = start_webdriver(browser=BROWSER, headless=HEADLESS)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We can now interact with the browser 
programmatically. First, we have it navigate to the URL of our Web server:" ] }, { "cell_type": "code", "execution_count": 21, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.015809Z", "iopub.status.busy": "2023-01-07T14:53:23.015266Z", "iopub.status.idle": "2023-01-07T14:53:23.069537Z", "shell.execute_reply": "2023-01-07T14:53:23.069763Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We see that the home page is actually accessed, together with a (failing) request to get a page icon:" ] }, { "cell_type": "code", "execution_count": 22, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.072518Z", "iopub.status.busy": "2023-01-07T14:53:23.072188Z", "iopub.status.idle": "2023-01-07T14:53:23.075417Z", "shell.execute_reply": "2023-01-07T14:53:23.075627Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
127.0.0.1 - - [07/Jan/2023 15:53:23] \"GET / HTTP/1.1\" 200 -\n",
       "
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
127.0.0.1 - - [07/Jan/2023 15:53:23] \"GET /favicon.ico HTTP/1.1\" 404 -\n",
       "
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "print_httpd_messages()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "To see what the \"headless\" browser displays, we can obtain a screenshot. We see that it actually displays the home page." ] }, { "cell_type": "code", "execution_count": 23, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.078734Z", "iopub.status.busy": "2023-01-07T14:53:23.078404Z", "iopub.status.idle": "2023-01-07T14:53:23.114605Z", "shell.execute_reply": "2023-01-07T14:53:23.114870Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 23, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" }, "tags": [] }, "source": [ "### Filling out Forms\n", "\n", "To interact with the Web page through Selenium and the browser, we can _query_ Selenium for individual elements. For instance, we can access the UI element whose `name` attribute (as defined in HTML) is `\"name\"`." 
] }, { "cell_type": "code", "execution_count": 24, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.117022Z", "iopub.status.busy": "2023-01-07T14:53:23.116741Z", "iopub.status.idle": "2023-01-07T14:53:23.117857Z", "shell.execute_reply": "2023-01-07T14:53:23.118229Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from selenium.webdriver.common.by import By" ] }, { "cell_type": "code", "execution_count": 25, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.120681Z", "iopub.status.busy": "2023-01-07T14:53:23.120232Z", "iopub.status.idle": "2023-01-07T14:53:23.128458Z", "shell.execute_reply": "2023-01-07T14:53:23.128671Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "name = gui_driver.find_element(By.NAME, \"name\")" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Once we have an element, we can interact with it. Since `name` is a text field, we can send it a string using the `send_keys()` method; the string will be translated into appropriate keystrokes." 
] }, { "cell_type": "code", "execution_count": 26, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.131030Z", "iopub.status.busy": "2023-01-07T14:53:23.130707Z", "iopub.status.idle": "2023-01-07T14:53:23.181328Z", "shell.execute_reply": "2023-01-07T14:53:23.181598Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "name.send_keys(\"Jane Doe\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "In the screenshot, we can see that the `name` field is now filled:" ] }, { "cell_type": "code", "execution_count": 27, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.183828Z", "iopub.status.busy": "2023-01-07T14:53:23.183516Z", "iopub.status.idle": "2023-01-07T14:53:23.202919Z", "shell.execute_reply": "2023-01-07T14:53:23.203127Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 27, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Similarly, we can fill out the email, city, and ZIP fields:" ] }, { "cell_type": "code", "execution_count": 28, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.205768Z", "iopub.status.busy": "2023-01-07T14:53:23.205412Z", "iopub.status.idle": "2023-01-07T14:53:23.220415Z", "shell.execute_reply": "2023-01-07T14:53:23.220680Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "email = gui_driver.find_element(By.NAME, \"email\")\n", "email.send_keys(\"j.doe@example.com\")" ] }, { "cell_type": "code", "execution_count": 29, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.222857Z", "iopub.status.busy": "2023-01-07T14:53:23.222491Z", "iopub.status.idle": "2023-01-07T14:53:23.233972Z", 
"shell.execute_reply": "2023-01-07T14:53:23.234171Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "city = gui_driver.find_element(By.NAME, 'city')\n", "city.send_keys(\"Seattle\")" ] }, { "cell_type": "code", "execution_count": 30, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.236273Z", "iopub.status.busy": "2023-01-07T14:53:23.235950Z", "iopub.status.idle": "2023-01-07T14:53:23.249666Z", "shell.execute_reply": "2023-01-07T14:53:23.249878Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "zip = gui_driver.find_element(By.NAME, 'zip')\n", "zip.send_keys(\"98104\")" ] }, { "cell_type": "code", "execution_count": 31, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.251609Z", "iopub.status.busy": "2023-01-07T14:53:23.251204Z", "iopub.status.idle": "2023-01-07T14:53:23.272286Z", "shell.execute_reply": "2023-01-07T14:53:23.272544Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 31, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The check box for terms and conditions is not filled out, but clicked instead using the `click()` method." 
] }, { "cell_type": "code", "execution_count": 32, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.275012Z", "iopub.status.busy": "2023-01-07T14:53:23.274631Z", "iopub.status.idle": "2023-01-07T14:53:23.488093Z", "shell.execute_reply": "2023-01-07T14:53:23.488370Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "terms = gui_driver.find_element(By.NAME, 'terms')\n", "terms.click()" ] }, { "cell_type": "code", "execution_count": 33, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.490961Z", "iopub.status.busy": "2023-01-07T14:53:23.490490Z", "iopub.status.idle": "2023-01-07T14:53:23.511199Z", "shell.execute_reply": "2023-01-07T14:53:23.511467Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 33, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "The form is now fully filled out. By clicking on the `submit` button, we can place the order:" ] }, { "cell_type": "code", "execution_count": 34, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.513743Z", "iopub.status.busy": "2023-01-07T14:53:23.513404Z", "iopub.status.idle": "2023-01-07T14:53:23.550742Z", "shell.execute_reply": "2023-01-07T14:53:23.550947Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "submit = gui_driver.find_element(By.NAME, 'submit')\n", "submit.click()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We see that the order is being processed, and that the Web browser has switched to the confirmation page." 
] }, { "cell_type": "code", "execution_count": 35, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.553123Z", "iopub.status.busy": "2023-01-07T14:53:23.552808Z", "iopub.status.idle": "2023-01-07T14:53:23.555048Z", "shell.execute_reply": "2023-01-07T14:53:23.555281Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
127.0.0.1 - - [07/Jan/2023 15:53:23] INSERT INTO orders VALUES ('tshirt', 'Jane Doe', 'j.doe@example.com', 'Seattle', '98104')\n",
       "
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
127.0.0.1 - - [07/Jan/2023 15:53:23] \"GET /order?item=tshirt&name=Jane+Doe&email=j.doe%40example.com&city=Seattle&zip=98104&terms=on&submit=Place+order HTTP/1.1\" 200 -\n",
       "
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "print_httpd_messages()" ] }, { "cell_type": "code", "execution_count": 36, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.557255Z", "iopub.status.busy": "2023-01-07T14:53:23.556908Z", "iopub.status.idle": "2023-01-07T14:53:23.573496Z", "shell.execute_reply": "2023-01-07T14:53:23.573734Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 36, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Navigating\n", "\n", "Just as we fill out forms, we can also navigate through a website by clicking on links. Let us go back to the home page:" ] }, { "cell_type": "code", "execution_count": 37, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.576291Z", "iopub.status.busy": "2023-01-07T14:53:23.575931Z", "iopub.status.idle": "2023-01-07T14:53:23.592413Z", "shell.execute_reply": "2023-01-07T14:53:23.592740Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.back()" ] }, { "cell_type": "code", "execution_count": 38, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.595374Z", "iopub.status.busy": "2023-01-07T14:53:23.595042Z", "iopub.status.idle": "2023-01-07T14:53:23.616366Z", "shell.execute_reply": "2023-01-07T14:53:23.616631Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 38, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We can query the web driver for all 
elements of a particular type. Querying for HTML anchor elements (`<a>`), for instance, gives us all links on a page." ] }, { "cell_type": "code", "execution_count": 39, "metadata": { "button": false, "execution": { "iopub.execute_input": "2023-01-07T14:53:23.618937Z", "iopub.status.busy": "2023-01-07T14:53:23.618611Z", "iopub.status.idle": "2023-01-07T14:53:23.624182Z", "shell.execute_reply": "2023-01-07T14:53:23.624466Z" }, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "links = gui_driver.find_elements(By.TAG_NAME, \"a\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We can query the attributes of UI elements – for instance, the URL the first anchor on the page links to:" ] }, { "cell_type": "code", "execution_count": 40, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.626902Z", "iopub.status.busy": "2023-01-07T14:53:23.626501Z", "iopub.status.idle": "2023-01-07T14:53:23.640818Z", "shell.execute_reply": "2023-01-07T14:53:23.641122Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "'http://127.0.0.1:8800/terms'" ] }, "execution_count": 40, "metadata": {}, "output_type": "execute_result" } ], "source": [ "links[0].get_attribute('href')" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "What happens if we click on it? Very simple: We switch to the Web page being referenced." 
] }, { "cell_type": "code", "execution_count": 41, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.643732Z", "iopub.status.busy": "2023-01-07T14:53:23.643359Z", "iopub.status.idle": "2023-01-07T14:53:23.667994Z", "shell.execute_reply": "2023-01-07T14:53:23.668273Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "links[0].click()" ] }, { "cell_type": "code", "execution_count": 42, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.670439Z", "iopub.status.busy": "2023-01-07T14:53:23.670048Z", "iopub.status.idle": "2023-01-07T14:53:23.672036Z", "shell.execute_reply": "2023-01-07T14:53:23.672254Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/html": [ "
127.0.0.1 - - [07/Jan/2023 15:53:23] \"GET /terms HTTP/1.1\" 200 -\n",
       "
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "print_httpd_messages()" ] }, { "cell_type": "code", "execution_count": 43, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.674840Z", "iopub.status.busy": "2023-01-07T14:53:23.674465Z", "iopub.status.idle": "2023-01-07T14:53:23.686851Z", "shell.execute_reply": "2023-01-07T14:53:23.687074Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 43, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Okay. Let's get back to our home page again." ] }, { "cell_type": "code", "execution_count": 44, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.689354Z", "iopub.status.busy": "2023-01-07T14:53:23.688986Z", "iopub.status.idle": "2023-01-07T14:53:23.704224Z", "shell.execute_reply": "2023-01-07T14:53:23.704508Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.back()" ] }, { "cell_type": "code", "execution_count": 45, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.706757Z", "iopub.status.busy": "2023-01-07T14:53:23.706421Z", "iopub.status.idle": "2023-01-07T14:53:23.708096Z", "shell.execute_reply": "2023-01-07T14:53:23.708347Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "print_httpd_messages()" ] }, { "cell_type": "code", "execution_count": 46, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.710480Z", "iopub.status.busy": "2023-01-07T14:53:23.710125Z", "iopub.status.idle": "2023-01-07T14:53:23.730281Z", "shell.execute_reply": "2023-01-07T14:53:23.730605Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, 
"execution_count": 46, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Writing Test Cases\n", "\n", "The above calls, interacting with a user interface automatically, are typically used in *Selenium tests* – that is, code snippets that interact with a website, occasionally checking whether everything works as expected. The following code, for instance, places an order just as above. It then retrieves the `title` element and checks whether the title contains a \"Thank you\" message, indicating success." ] }, { "cell_type": "code", "execution_count": 47, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.734526Z", "iopub.status.busy": "2023-01-07T14:53:23.734175Z", "iopub.status.idle": "2023-01-07T14:53:23.735250Z", "shell.execute_reply": "2023-01-07T14:53:23.735568Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "def test_successful_order(driver, url):\n", " name = \"Walter White\"\n", " email = \"white@jpwynne.edu\"\n", " city = \"Albuquerque\"\n", " zip_code = \"87101\"\n", "\n", " driver.get(url)\n", " driver.find_element(By.NAME, \"name\").send_keys(name)\n", " driver.find_element(By.NAME, \"email\").send_keys(email)\n", " driver.find_element(By.NAME, 'city').send_keys(city)\n", " driver.find_element(By.NAME, 'zip').send_keys(zip_code)\n", " driver.find_element(By.NAME, 'terms').click()\n", " driver.find_element(By.NAME, 'submit').click()\n", "\n", " title = driver.find_element(By.ID, 'title')\n", " assert title is not None\n", " assert title.text.find(\"Thank you\") >= 0\n", "\n", " confirmation = driver.find_element(By.ID, \"confirmation\")\n", " assert confirmation is not None\n", "\n", " assert confirmation.text.find(name) >= 0\n", " assert confirmation.text.find(email) >= 0\n", " assert confirmation.text.find(city) >= 
0\n", "    assert confirmation.text.find(zip_code) >= 0\n", "\n", "    return True" ] }, { "cell_type": "code", "execution_count": 48, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:23.737608Z", "iopub.status.busy": "2023-01-07T14:53:23.737263Z", "iopub.status.idle": "2023-01-07T14:53:24.086813Z", "shell.execute_reply": "2023-01-07T14:53:24.087135Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "True" ] }, "execution_count": 48, "metadata": {}, "output_type": "execute_result" } ], "source": [ "test_successful_order(gui_driver, httpd_url)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "In a similar vein, we can set up automated test cases for unsuccessful orders, canceling orders, changing orders, and many more. All these test cases would be automatically run after any change to the program code, ensuring the Web application still works." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Of course, writing such tests takes considerable effort. Hence, in the remainder of this chapter, we will again explore how to automatically generate them." ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "source": [ "## Retrieving User Interface Actions\n", "\n", "To automatically interact with a user interface, we first need to find out which elements there are, and which user interactions (*actions* for short) they support." ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "source": [ "### User Interface Elements\n", "\n", "We start by finding the available user interface elements. Let us get back to the order form."
] }, { "cell_type": "code", "execution_count": 49, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.089711Z", "iopub.status.busy": "2023-01-07T14:53:24.089340Z", "iopub.status.idle": "2023-01-07T14:53:24.106220Z", "shell.execute_reply": "2023-01-07T14:53:24.106441Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)" ] }, { "cell_type": "code", "execution_count": 50, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.108716Z", "iopub.status.busy": "2023-01-07T14:53:24.108243Z", "iopub.status.idle": "2023-01-07T14:53:24.130885Z", "shell.execute_reply": "2023-01-07T14:53:24.131138Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 50, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Using `find_elements(By.TAG_NAME, <tag>)` (or other locators such as `By.NAME` or `By.ID`), we can retrieve all elements of a particular type, such as HTML `input` elements." ] }, { "cell_type": "code", "execution_count": 51, "metadata": { "button": false, "execution": { "iopub.execute_input": "2023-01-07T14:53:24.133587Z", "iopub.status.busy": "2023-01-07T14:53:24.133238Z", "iopub.status.idle": "2023-01-07T14:53:24.137430Z", "shell.execute_reply": "2023-01-07T14:53:24.137626Z" }, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "ui_elements = gui_driver.find_elements(By.TAG_NAME, \"input\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "For each element, we can retrieve its HTML attributes, using `get_attribute()`. We can thus retrieve the `name` and `type` of each input element (if defined)."
] }, { "cell_type": "code", "execution_count": 52, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.140063Z", "iopub.status.busy": "2023-01-07T14:53:24.139604Z", "iopub.status.idle": "2023-01-07T14:53:24.255213Z", "shell.execute_reply": "2023-01-07T14:53:24.255441Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Name: name       | Type: text       | Text: \n", "Name: email      | Type: email      | Text: \n", "Name: city       | Type: text       | Text: \n", "Name: zip        | Type: number     | Text: \n", "Name: terms      | Type: checkbox   | Text: \n", "Name: submit     | Type: submit     | Text: \n" ] } ], "source": [ "for element in ui_elements:\n", "    print(\"Name: %-10s | Type: %-10s | Text: %s\" %\n", "          (element.get_attribute('name'),\n", "           element.get_attribute('type'),\n", "           element.text))" ] }, { "cell_type": "code", "execution_count": 53, "metadata": { "button": false, "execution": { "iopub.execute_input": "2023-01-07T14:53:24.257799Z", "iopub.status.busy": "2023-01-07T14:53:24.257425Z", "iopub.status.idle": "2023-01-07T14:53:24.268730Z", "shell.execute_reply": "2023-01-07T14:53:24.268964Z" }, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "ui_elements = gui_driver.find_elements(By.TAG_NAME, \"a\")" ] }, { "cell_type": "code", "execution_count": 54, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.271490Z", "iopub.status.busy": "2023-01-07T14:53:24.271115Z", "iopub.status.idle": "2023-01-07T14:53:24.291835Z", "shell.execute_reply": "2023-01-07T14:53:24.292178Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Name:            | Type:            | Text: terms and conditions\n" ] } ], "source": [ "for element in ui_elements:\n", "    print(\"Name: %-10s | Type: %-10s | Text: %s\" %\n", "          (element.get_attribute('name'),\n", "           element.get_attribute('type'),\n", " 
element.text))" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### User Interface Actions\n", "\n", "Similarly to what we did in the [chapter on Web fuzzing](WebFuzzer.ipynb), our idea is now to mine a _grammar_ for the user interface – first for an individual user interface *page* (i.e., a single Web page), later for all pages offered by the application. The idea is that a grammar defines _legal sequences of actions_ – clicks and keystrokes – that can be applied to the application." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "We assume the following actions:\n", "\n", "1. `fill(<name>, <text>)` – fill the UI input element named `<name>` with the text `<text>`.\n", "1. `check(<name>, <value>)` – set the UI checkbox `<name>` to the given value `<value>` (True or False).\n", "1. `submit(<name>)` – submit the form by clicking on the UI element `<name>`.\n", "1. `click(<name>)` – click on the UI element `<name>`, typically for following a link." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "This sequence of actions, for instance, would fill out the order form:\n", "\n", "```python\n", "fill('name', \"Walter White\")\n", "fill('email', \"white@jpwynne.edu\")\n", "fill('city', \"Albuquerque\")\n", "fill('zip', \"87101\")\n", "check('terms', True)\n", "submit('submit')\n", "```" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Our set of actions is deliberately defined to be small – for real user interfaces, one would also have to define interactions such as swipes, double clicks, long clicks, right button clicks, modifier keys, and more. Selenium supports all of this; but in the interest of simplicity, we focus on the most important set of interactions."
] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" }, "toc-hr-collapsed": false }, "source": [ "### Retrieving Actions\n", "\n", "As a first step in mining an action grammar, we need to be able to retrieve possible interactions. We introduce a class `GUIGrammarMiner`, which is set to do precisely that." ] }, { "cell_type": "code", "execution_count": 55, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.294816Z", "iopub.status.busy": "2023-01-07T14:53:24.294482Z", "iopub.status.idle": "2023-01-07T14:53:24.295773Z", "shell.execute_reply": "2023-01-07T14:53:24.296036Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIGrammarMiner:\n", "    \"\"\"Retrieve a grammar of possible GUI interaction sequences\"\"\"\n", "\n", "    def __init__(self, driver, stay_on_host: bool = True) -> None:\n", "        \"\"\"Constructor.\n", "        `driver` - a web driver as produced by Selenium.\n", "        `stay_on_host` - if True (default), do not follow links to other hosts.\n", "        \"\"\"\n", "        self.driver = driver\n", "        self.stay_on_host = stay_on_host\n", "        self.grammar: Grammar = {}" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" }, "tags": [] }, "source": [ "#### Excursion: Implementing Retrieving Actions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Our first task is to obtain the set of possible interactions. Given a single UI page, the method `mine_state_actions()` of `GUIGrammarMiner` returns a set of *actions* as defined above. It first gets all `input` elements, followed by `button` elements, finally followed by links (`a` elements), and merges them into a set. 
(We use a `frozenset` here since we want to use the set as an index later.)" ] }, { "cell_type": "code", "execution_count": 56, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.298888Z", "iopub.status.busy": "2023-01-07T14:53:24.298564Z", "iopub.status.idle": "2023-01-07T14:53:24.299856Z", "shell.execute_reply": "2023-01-07T14:53:24.300210Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIGrammarMiner(GUIGrammarMiner):\n", "    def mine_state_actions(self) -> FrozenSet[str]:\n", "        \"\"\"Return a set of all possible actions on the current Web site.\n", "        Can be overridden in subclasses.\"\"\"\n", "        return frozenset(self.mine_input_element_actions()\n", "                         | self.mine_button_element_actions()\n", "                         | self.mine_a_element_actions())\n", "\n", "    def mine_input_element_actions(self) -> Set[str]:\n", "        return set()  # to be defined later\n", "\n", "    def mine_button_element_actions(self) -> Set[str]:\n", "        return set()  # to be defined later\n", "\n", "    def mine_a_element_actions(self) -> Set[str]:\n", "        return set()  # to be defined later" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "##### Input Element Actions\n", "\n", "Mining input actions goes through the set of input elements, and returns an action depending on the input type. If the input field is a text field, for instance, the associated action is `fill()`; for checkboxes, the action is `check()`." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The respective values are placeholders depending on the type; if the input field is a number, for instance, the value becomes `<number>`. As these actions later become part of the grammar, they will be expanded into actual values during grammar expansion."
] }, { "cell_type": "code", "execution_count": 57, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.302440Z", "iopub.status.busy": "2023-01-07T14:53:24.302109Z", "iopub.status.idle": "2023-01-07T14:53:24.303266Z", "shell.execute_reply": "2023-01-07T14:53:24.303505Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from selenium.common.exceptions import StaleElementReferenceException" ] }, { "cell_type": "code", "execution_count": 58, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.307379Z", "iopub.status.busy": "2023-01-07T14:53:24.306943Z", "iopub.status.idle": "2023-01-07T14:53:24.308304Z", "shell.execute_reply": "2023-01-07T14:53:24.308574Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIGrammarMiner(GUIGrammarMiner):\n", "    def mine_input_element_actions(self) -> Set[str]:\n", "        \"\"\"Determine all input actions on the current Web page\"\"\"\n", "\n", "        actions = set()\n", "\n", "        for elem in self.driver.find_elements(By.TAG_NAME, \"input\"):\n", "            try:\n", "                input_type = elem.get_attribute(\"type\")\n", "                input_name = elem.get_attribute(\"name\")\n", "                if input_name is None:\n", "                    input_name = elem.text\n", "\n", "                if input_type in [\"checkbox\", \"radio\"]:\n", "                    actions.add(\"check('%s', <boolean>)\" % html.escape(input_name))\n", "                elif input_type in [\"text\", \"number\", \"email\", \"password\"]:\n", "                    actions.add(\"fill('%s', '<%s>')\" % (html.escape(input_name), html.escape(input_type)))\n", "                elif input_type in [\"button\", \"submit\"]:\n", "                    actions.add(\"submit('%s')\" % html.escape(input_name))\n", "                elif input_type in [\"hidden\"]:\n", "                    pass\n", "                else:\n", "                    # TODO: Handle more types here\n", "                    actions.add(\"fill('%s', <%s>)\" % (html.escape(input_name), html.escape(input_type)))\n", "            except StaleElementReferenceException:\n", "                pass\n", "\n", "        return actions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } 
}, "source": [ "Applied to our order form, we see that the method gets us all input actions:" ] }, { "cell_type": "code", "execution_count": 59, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.310744Z", "iopub.status.busy": "2023-01-07T14:53:24.310381Z", "iopub.status.idle": "2023-01-07T14:53:24.419171Z", "shell.execute_reply": "2023-01-07T14:53:24.419574Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "{\"check('terms', <boolean>)\",\n", " \"fill('city', '<text>')\",\n", " \"fill('email', '<email>')\",\n", " \"fill('name', '<text>')\",\n", " \"fill('zip', '<number>')\",\n", " \"submit('submit')\"}" ] }, "execution_count": 59, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_grammar_miner = GUIGrammarMiner(gui_driver)\n", "gui_grammar_miner.mine_input_element_actions()" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "##### Button Element Actions\n", "\n", "Mining buttons works similarly:" ] }, { "cell_type": "code", "execution_count": 60, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.423419Z", "iopub.status.busy": "2023-01-07T14:53:24.423020Z", "iopub.status.idle": "2023-01-07T14:53:24.424148Z", "shell.execute_reply": "2023-01-07T14:53:24.424486Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIGrammarMiner(GUIGrammarMiner):\n", "    def mine_button_element_actions(self) -> Set[str]:\n", "        \"\"\"Determine all button actions on the current Web page\"\"\"\n", "\n", "        actions = set()\n", "\n", "        for elem in self.driver.find_elements(By.TAG_NAME, \"button\"):\n", "            try:\n", "                button_type = elem.get_attribute(\"type\")\n", "                button_name = elem.get_attribute(\"name\")\n", "                if button_name is None:\n", "                    button_name = elem.text\n", "                if button_type == \"submit\":\n", "                    actions.add(\"submit('%s')\" % html.escape(button_name))\n", "                elif button_type != \"reset\":\n", " 
actions.add(\"click('%s')\" % html.escape(button_name))\n", "            except StaleElementReferenceException:\n", "                pass\n", "\n", "        return actions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Our order form has no `button` elements. (The `submit` button is an `input` element, and was handled above.)" ] }, { "cell_type": "code", "execution_count": 61, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.427154Z", "iopub.status.busy": "2023-01-07T14:53:24.426772Z", "iopub.status.idle": "2023-01-07T14:53:24.431321Z", "shell.execute_reply": "2023-01-07T14:53:24.431580Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "set()" ] }, "execution_count": 61, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_grammar_miner = GUIGrammarMiner(gui_driver)\n", "gui_grammar_miner.mine_button_element_actions()" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "##### Link Element Actions\n", "\n", "When following links, we need to make sure that we stay on the current host – we want to explore a single website only, not the entire Internet. To this end, we inspect the `href` attribute of the link to check whether it still points to the same host. If it does not, we give it a special action `ignore()`, which, as the name suggests, will later be ignored when it comes to executing these actions. We still return an action, though, as we use the set of actions to characterize a state in the application."
] }, { "cell_type": "code", "execution_count": 62, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.433811Z", "iopub.status.busy": "2023-01-07T14:53:24.433444Z", "iopub.status.idle": "2023-01-07T14:53:24.434627Z", "shell.execute_reply": "2023-01-07T14:53:24.434905Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from urllib.parse import urljoin, urlsplit" ] }, { "cell_type": "code", "execution_count": 63, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.437725Z", "iopub.status.busy": "2023-01-07T14:53:24.437414Z", "iopub.status.idle": "2023-01-07T14:53:24.438596Z", "shell.execute_reply": "2023-01-07T14:53:24.438849Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIGrammarMiner(GUIGrammarMiner):\n", "    def mine_a_element_actions(self) -> Set[str]:\n", "        \"\"\"Determine all link actions on the current Web page\"\"\"\n", "\n", "        actions = set()\n", "\n", "        for elem in self.driver.find_elements(By.TAG_NAME, \"a\"):\n", "            try:\n", "                a_href = elem.get_attribute(\"href\")\n", "                if a_href is not None:\n", "                    if self.follow_link(a_href):\n", "                        actions.add(\"click('%s')\" % html.escape(elem.text))\n", "                    else:\n", "                        actions.add(\"ignore('%s')\" % html.escape(elem.text))\n", "            except StaleElementReferenceException:\n", "                pass\n", "\n", "        return actions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "To check whether we can follow a link, the method `follow_link()` checks the URL:" ] }, { "cell_type": "code", "execution_count": 64, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.441900Z", "iopub.status.busy": "2023-01-07T14:53:24.441201Z", "iopub.status.idle": "2023-01-07T14:53:24.442562Z", "shell.execute_reply": "2023-01-07T14:53:24.442817Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class GUIGrammarMiner(GUIGrammarMiner):\n", "    def follow_link(self, link: 
str) -> bool:\n", "        \"\"\"Return True iff we are allowed to follow the `link` URL\"\"\"\n", "\n", "        if not self.stay_on_host:\n", "            return True\n", "\n", "        current_url = self.driver.current_url\n", "        target_url = urljoin(current_url, link)\n", "        return urlsplit(current_url).hostname == urlsplit(target_url).hostname" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "In our application, we would not be allowed to follow a link to `foo.bar`:" ] }, { "cell_type": "code", "execution_count": 65, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.444861Z", "iopub.status.busy": "2023-01-07T14:53:24.444563Z", "iopub.status.idle": "2023-01-07T14:53:24.445754Z", "shell.execute_reply": "2023-01-07T14:53:24.445940Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_grammar_miner = GUIGrammarMiner(gui_driver)" ] }, { "cell_type": "code", "execution_count": 66, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.447505Z", "iopub.status.busy": "2023-01-07T14:53:24.447136Z", "iopub.status.idle": "2023-01-07T14:53:24.451074Z", "shell.execute_reply": "2023-01-07T14:53:24.451332Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "False" ] }, "execution_count": 66, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_grammar_miner.follow_link(\"ftp://foo.bar/\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Following a link to `localhost`, though, works well:" ] }, { "cell_type": "code", "execution_count": 67, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.453187Z", "iopub.status.busy": "2023-01-07T14:53:24.452865Z", "iopub.status.idle": "2023-01-07T14:53:24.456203Z", "shell.execute_reply": "2023-01-07T14:53:24.456450Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "True" ] }, 
"execution_count": 67, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_grammar_miner.follow_link(\"https://127.0.0.1/\")" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "When adapting this for other user interfaces, similar measures would be taken to ensure we stay in the same application." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Running this method on our page gets us the set of links:" ] }, { "cell_type": "code", "execution_count": 68, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.458889Z", "iopub.status.busy": "2023-01-07T14:53:24.458563Z", "iopub.status.idle": "2023-01-07T14:53:24.475363Z", "shell.execute_reply": "2023-01-07T14:53:24.475579Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "{\"click('terms and conditions')\"}" ] }, "execution_count": 68, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_grammar_miner = GUIGrammarMiner(gui_driver)\n", "gui_grammar_miner.mine_a_element_actions()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" }, "tags": [] }, "source": [ "#### End of Excursion" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Let us show `GUIGrammarMiner` in action, using its `mine_state_actions()` method to retrieve all elements from our current page. We see that we obtain input element actions, button element actions, and link element actions." 
] }, { "cell_type": "code", "execution_count": 69, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.477965Z", "iopub.status.busy": "2023-01-07T14:53:24.477611Z", "iopub.status.idle": "2023-01-07T14:53:24.580376Z", "shell.execute_reply": "2023-01-07T14:53:24.580588Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "frozenset({\"check('terms', <boolean>)\",\n", "           \"click('terms and conditions')\",\n", "           \"fill('city', '<text>')\",\n", "           \"fill('email', '<email>')\",\n", "           \"fill('name', '<text>')\",\n", "           \"fill('zip', '<number>')\",\n", "           \"submit('submit')\"})" ] }, "execution_count": 69, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_grammar_miner = GUIGrammarMiner(gui_driver)\n", "gui_grammar_miner.mine_state_actions()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "We assume that we can identify a user interface *state* from the set of interactive elements it contains – that is, the current Web page is identified by the set above. This is in contrast to [Web fuzzing](WebFuzzer.ipynb), where we assumed the URL to uniquely characterize a page – but with JavaScript, the URL can stay unchanged although the page contents change, and UIs other than the Web may have no concept of unique URLs. Therefore, we say that the way a UI can be interacted with uniquely defines its state." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Models for User Interfaces" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### User Interfaces as Finite State Machines\n", "\n", "Now that we can retrieve UI elements from a page, let us go and systematically explore a user interface. The idea is to represent the user interface as a *finite state machine* – that is, a set of *states* that can be reached by interacting with the individual user interface elements."
] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Let us illustrate such a finite state machine by looking at our Web server. The following diagram shows the states our server can be in:" ] }, { "cell_type": "code", "execution_count": 70, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.582596Z", "iopub.status.busy": "2023-01-07T14:53:24.582302Z", "iopub.status.idle": "2023-01-07T14:53:24.583462Z", "shell.execute_reply": "2023-01-07T14:53:24.583705Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "# ignore\n", "from graphviz import Digraph" ] }, { "cell_type": "code", "execution_count": 71, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.585418Z", "iopub.status.busy": "2023-01-07T14:53:24.585145Z", "iopub.status.idle": "2023-01-07T14:53:24.586697Z", "shell.execute_reply": "2023-01-07T14:53:24.586502Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "# ignore\n", "from GrammarFuzzer import dot_escape" ] }, { "cell_type": "code", "execution_count": 72, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.592683Z", "iopub.status.busy": "2023-01-07T14:53:24.592335Z", "iopub.status.idle": "2023-01-07T14:53:24.840179Z", "shell.execute_reply": "2023-01-07T14:53:24.840441Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\\<start\\>\n", "\n", "<start>\n", "\n", "\n", "\n", "\\<Order Form\\>\n", "\n", "<Order Form>\n", "\n", "\n", "\n", "\\<start\\>->\\<Order Form\\>\n", "\n", "\n", "\n", "\n", "\n", "\\<Terms and Conditions\\>\n", "\n", "<Terms and Conditions>\n", "\n", "\n", "\n", "\\<Order Form\\>->\\<Terms and Conditions\\>\n", "\n", "\n", "click('Terms and conditions')\n", "\n", "\n", "\n", "\\<Thank You\\>\n", "\n", "<Thank You>\n", "\n", "\n", "\n", "\\<Order Form\\>->\\<Thank You\\>\n", "\n", 
"\n", "fill(...)\n", "submit('submit')\n", "\n", "\n", "\n", "\\<Terms and Conditions\\>->\\<Order Form\\>\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n", "\\<Thank You\\>->\\<Order Form\\>\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# ignore\n", "dot = Digraph(comment=\"Finite State Machine\")\n", "dot.node(dot_escape(''))\n", "dot.edge(dot_escape(''),\n", " dot_escape(''))\n", "dot.edge(dot_escape(''),\n", " dot_escape(''), \"click('Terms and conditions')\")\n", "dot.edge(dot_escape(''),\n", " dot_escape(''), r\"fill(...)\\lsubmit('submit')\")\n", "dot.edge(dot_escape(''),\n", " dot_escape(''), \"click('order form')\")\n", "dot.edge(dot_escape(''),\n", " dot_escape(''), \"click('order form')\")\n", "display(dot)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Initially, we are in the `` state. From here, we can click on `Terms and Conditions`, and we'll be in the `Terms and Conditions` state, showing the page with the same title. We can also fill out the form and place the order, having us end in the `Thank You` state (again showing the page with the same title). From both `` and ``, we can return to the order form by clicking on the `order form` link." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### State Machines as Grammars\n", "\n", "To systematically explore a user interface, we must retrieve its finite state machine, and eventually cover all states and transitions. In the presence of forms, such an exploration is difficult, as we need a special mechanism to fill out forms and submit the values to get to the next state. There is a trick, though, which allows us to have a single representation for both states and (form) values. 
We can *embed the finite state machine into a grammar*, which is then used for both states and form values." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "To embed a finite state machine into a grammar, we proceed as follows:\n", "\n", "1. Every _state_ $\\langle s \\rangle$ in the finite state machine becomes a _symbol_ $\\langle s \\rangle$ in the grammar.\n", "2. Every _transition_ in the finite state machine from $\\langle s \\rangle$ to $\\langle t \\rangle$ with actions $a_1, a_2, \\dots$ becomes an _alternative_ of $\\langle s \\rangle$ of the form $a_1, a_2, \\dots$ $\\langle t \\rangle$ in the grammar." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "The above finite state machine thus gets encoded into the grammar\n", "\n", "```\n", "<start> ::= <Order Form>\n", "<Order Form> ::= click('Terms and Conditions') <Terms and Conditions> |\n", "                 fill(...) submit('submit') <Thank You>\n", "<Terms and Conditions> ::= click('order form') <Order Form>\n", "<Thank You> ::= click('order form') <Order Form>\n", "```" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Expanding this grammar gets us a stream of actions, navigating through the user interface:\n", "\n", "```\n", "fill(...) submit('submit') click('order form') click('Terms and Conditions') click('order form') ...\n", "```\n", "\n", "This stream is actually _infinite_ (as one can interact with the UI forever); to have it end, one can introduce an alternative `<end>` that simply expands to the empty string, without having any expansion (state) follow." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" }, "tags": [] }, "source": [ "### Retrieving State Grammars\n", "\n", "Let us extend `GUIGrammarMiner` such that it retrieves a grammar from the user interface in its _current state_." 
] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "#### Excursion: Implementing Extracting State Grammars" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" }, "tags": [] }, "source": [ "We first define a constant `GUI_GRAMMAR` that serves as a template for all sorts of input types. We will use this to fill out forms." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "\\todo{}: Have a common base class `GrammarMiner` with `__init__()` and `mine_grammar()`" ] }, { "cell_type": "code", "execution_count": 73, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.843835Z", "iopub.status.busy": "2023-01-07T14:53:24.843446Z", "iopub.status.idle": "2023-01-07T14:53:24.844441Z", "shell.execute_reply": "2023-01-07T14:53:24.844688Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from Grammars import new_symbol" ] }, { "cell_type": "code", "execution_count": 74, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.846722Z", "iopub.status.busy": "2023-01-07T14:53:24.846384Z", "iopub.status.idle": "2023-01-07T14:53:24.847632Z", "shell.execute_reply": "2023-01-07T14:53:24.847899Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from Grammars import nonterminals, START_SYMBOL\n", "from Grammars import extend_grammar, unreachable_nonterminals, crange, srange\n", "from Grammars import syntax_diagram, is_valid_grammar, Grammar" ] }, { "cell_type": "code", "execution_count": 75, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.851039Z", "iopub.status.busy": "2023-01-07T14:53:24.850706Z", "iopub.status.idle": "2023-01-07T14:53:24.851900Z", "shell.execute_reply": "2023-01-07T14:53:24.852139Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIGrammarMiner(GUIGrammarMiner):\n", "    START_STATE = \"<state>\"\n", 
"    UNEXPLORED_STATE = \"<unexplored>\"\n", "    FINAL_STATE = \"<end>\"\n", "\n", "    GUI_GRAMMAR: Grammar = ({\n", "        START_SYMBOL: [START_STATE],\n", "        UNEXPLORED_STATE: [\"\"],\n", "        FINAL_STATE: [\"\"],\n", "\n", "        \"<text>\": [\"<string>\"],\n", "        \"<string>\": [\"<character>\", \"<string><character>\"],\n", "        \"<character>\": [\"<letter>\", \"<digit>\", \"<special>\"],\n", "        \"<letter>\": crange('a', 'z') + crange('A', 'Z'),\n", "\n", "        \"<number>\": [\"<digits>\"],\n", "        \"<digits>\": [\"<digit>\", \"<digits><digit>\"],\n", "        \"<digit>\": crange('0', '9'),\n", "\n", "        \"<special>\": srange(\". !\"),\n", "\n", "        \"<email>\": [\"<letters>@<letters>\"],\n", "        \"<letters>\": [\"<letter>\", \"<letters><letter>\"],\n", "\n", "        \"<boolean>\": [\"True\", \"False\"],\n", "\n", "        # Use a fixed password in case we need to repeat it\n", "        \"<password>\": [\"abcABC.123\"],\n", "\n", "        \"<hidden>\": [\"<string>\"],\n", "    })" ] }, { "cell_type": "code", "execution_count": 76, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.854531Z", "iopub.status.busy": "2023-01-07T14:53:24.854211Z", "iopub.status.idle": "2023-01-07T14:53:24.882601Z", "shell.execute_reply": "2023-01-07T14:53:24.882879Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "start\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "state" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "unexplored\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "end\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "text\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "string" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "string\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", 
"\n", "\n", "\n", "\n", "character\n", "\n", "string\n", "character" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "character\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "letter\n", "\n", "digit\n", "\n", "special" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "letter\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "e\n", "\n", "d\n", "\n", "c\n", "\n", "b\n", "\n", "a\n", "\n", "f\n", "\n", "g\n", "\n", "h\n", "\n", "i\n", "\n", "j\n", "\n", "\n", "o\n", "\n", "n\n", "\n", "m\n", "\n", "l\n", "\n", "k\n", "\n", "p\n", "\n", "q\n", "\n", "r\n", "\n", "s\n", "\n", "t\n", "\n", "\n", "y\n", "\n", "x\n", "\n", "w\n", "\n", "v\n", "\n", "u\n", "\n", "z\n", "\n", "A\n", "\n", "B\n", "\n", "C\n", "\n", "D\n", "\n", "\n", "I\n", "\n", "H\n", "\n", "G\n", "\n", "F\n", "\n", "E\n", "\n", "J\n", "\n", "K\n", "\n", "L\n", "\n", "M\n", "\n", "N\n", "\n", "\n", "S\n", "\n", "R\n", "\n", "Q\n", "\n", "P\n", "\n", "O\n", "\n", "T\n", "\n", "U\n", "\n", "V\n", "\n", "W\n", "\n", "X\n", "\n", "\n", "Y\n", "\n", "Z" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "number\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "digits" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "digits\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "digit\n", "\n", "digits\n", "digit" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "digit\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "0\n", "\n", "1\n", "\n", "\n", "2\n", "\n", "3\n", "\n", 
"\n", "4\n", "\n", "5\n", "\n", "\n", "6\n", "\n", "7\n", "\n", "\n", "8\n", "\n", "9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "special\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", ".\n", "\n", " \n", "\n", "!" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "email\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "letters\n", "@\n", "letters" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "letters\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "letter\n", "\n", "letters\n", "letter" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "boolean\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "True\n", "\n", "False" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "password\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "abcABC.123" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "hidden\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "string" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "syntax_diagram(GUIGrammarMiner.GUI_GRAMMAR)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "The method `mine_state_grammar()` goes through the actions mined from the page (using `mine_state_actions()`) and creates a grammar for the current state. 
For each `click()` and `submit()` action, it assumes a new state follows, and introduces an appropriate state symbol into the grammar. This state symbol is initially marked as `<unexplored>`, but is expanded later, once the corresponding state is seen." ] }, { "cell_type": "code", "execution_count": 77, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.887208Z", "iopub.status.busy": "2023-01-07T14:53:24.886888Z", "iopub.status.idle": "2023-01-07T14:53:24.888037Z", "shell.execute_reply": "2023-01-07T14:53:24.888228Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIGrammarMiner(GUIGrammarMiner):\n", "    def new_state_symbol(self, grammar: Grammar) -> str:\n", "        \"\"\"Return a new symbol for some state in `grammar`\"\"\"\n", "        return new_symbol(grammar, self.START_STATE)\n", "\n", "    def mine_state_grammar(self, grammar: Grammar = {},\n", "                           state_symbol: Optional[str] = None) -> Grammar:\n", "        \"\"\"Return a state grammar for the actions on the current Web site.\n", "        Can be overloaded in subclasses.\"\"\"\n", "\n", "        grammar = extend_grammar(self.GUI_GRAMMAR, grammar)  # type: ignore\n", "\n", "        if state_symbol is None:\n", "            state_symbol = self.new_state_symbol(grammar)\n", "            grammar[state_symbol] = []\n", "\n", "        alternatives = []\n", "        form = \"\"\n", "        submit = None\n", "\n", "        for action in self.mine_state_actions():\n", "            if action.startswith(\"submit\"):\n", "                submit = action\n", "\n", "            elif action.startswith(\"click\"):\n", "                link_target = self.new_state_symbol(grammar)\n", "                grammar[link_target] = [self.UNEXPLORED_STATE]\n", "                alternatives.append(action + '\\n' + link_target)\n", "\n", "            elif action.startswith(\"ignore\"):\n", "                pass\n", "\n", "            else:  # fill(), check() actions\n", "                if len(form) > 0:\n", "                    form += '\\n'\n", "                form += action\n", "\n", "        if submit is not None:\n", "            if len(form) > 0:\n", "                form += '\\n'\n", "            form += submit\n", "\n", "        if len(form) > 0:\n", "            form_target = self.new_state_symbol(grammar)\n", 
"            grammar[form_target] = [self.UNEXPLORED_STATE]\n", "            alternatives.append(form + '\\n' + form_target)\n", "\n", "        alternatives += [self.FINAL_STATE]\n", "\n", "        grammar[state_symbol] = alternatives  # type: ignore\n", "\n", "        # Remove unused parts\n", "        for nonterminal in unreachable_nonterminals(grammar):\n", "            del grammar[nonterminal]\n", "\n", "        assert is_valid_grammar(grammar)\n", "\n", "        return grammar" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "To better see the state structure, the function `fsm_diagram()` shows the resulting state grammar as a finite state machine. (This assumes that the grammar actually encodes a state machine.)" ] }, { "cell_type": "code", "execution_count": 78, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.890065Z", "iopub.status.busy": "2023-01-07T14:53:24.889717Z", "iopub.status.idle": "2023-01-07T14:53:24.891172Z", "shell.execute_reply": "2023-01-07T14:53:24.890821Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from collections import deque" ] }, { "cell_type": "code", "execution_count": 79, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.892877Z", "iopub.status.busy": "2023-01-07T14:53:24.892557Z", "iopub.status.idle": "2023-01-07T14:53:24.893894Z", "shell.execute_reply": "2023-01-07T14:53:24.894132Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from bookutils import unicode_escape" ] }, { "cell_type": "code", "execution_count": 80, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.897519Z", "iopub.status.busy": "2023-01-07T14:53:24.897170Z", "iopub.status.idle": "2023-01-07T14:53:24.898302Z", "shell.execute_reply": "2023-01-07T14:53:24.898557Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "def fsm_diagram(grammar: Grammar, start_symbol: str = START_SYMBOL) -> Any:\n", 
"    \"\"\"Produce a FSM diagram for the state grammar `grammar`.\n", "    `start_symbol` - the start symbol (default: START_SYMBOL)\"\"\"\n", "\n", "    from graphviz import Digraph\n", "    from IPython.display import display\n", "\n", "    def left_align(label: str) -> str:\n", "        \"\"\"Render `label` as left-aligned in dot\"\"\"\n", "        return dot_escape(label.replace('\\n', r'\\l')).replace(r'\\\\l', '\\\\l')\n", "\n", "    dot = Digraph(comment=\"Grammar as Finite State Machine\")\n", "\n", "    symbols = deque([start_symbol])\n", "    symbols_seen = set()\n", "\n", "    while len(symbols) > 0:\n", "        symbol = symbols.popleft()\n", "        symbols_seen.add(symbol)\n", "        dot.node(symbol, dot_escape(unicode_escape(symbol)))\n", "\n", "        for expansion in grammar[symbol]:\n", "            assert type(expansion) == str  # no opts() here\n", "\n", "            nts = nonterminals(expansion)\n", "            if len(nts) > 0:\n", "                target_symbol = nts[-1]\n", "                if target_symbol not in symbols_seen:\n", "                    symbols.append(target_symbol)\n", "\n", "                label = expansion.replace(target_symbol, '')\n", "                dot.edge(symbol, target_symbol, left_align(unicode_escape(label)))\n", "\n", "    return display(dot)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "#### End of Excursion" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Let us show `GUIGrammarMiner()` in action. 
Its method `mine_state_grammar()` extracts the grammar for the current Web page:" ] }, { "cell_type": "code", "execution_count": 81, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:24.900675Z", "iopub.status.busy": "2023-01-07T14:53:24.900371Z", "iopub.status.idle": "2023-01-07T14:53:25.002689Z", "shell.execute_reply": "2023-01-07T14:53:25.002895Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_grammar_miner = GUIGrammarMiner(gui_driver)\n", "state_grammar = gui_grammar_miner.mine_state_grammar()" ] }, { "cell_type": "code", "execution_count": 82, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.005809Z", "iopub.status.busy": "2023-01-07T14:53:25.005482Z", "iopub.status.idle": "2023-01-07T14:53:25.006952Z", "shell.execute_reply": "2023-01-07T14:53:25.007172Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "{'<start>': ['<state>'],\n", " '<unexplored>': [''],\n", " '<end>': [''],\n", " '<text>': ['<string>'],\n", " '<string>': ['<character>', '<string><character>'],\n", " '<character>': ['<letter>', '<digit>', '<special>'],\n", " '<letter>': ['a',\n", "  'b',\n", "  'c',\n", "  'd',\n", "  'e',\n", "  'f',\n", "  'g',\n", "  'h',\n", "  'i',\n", "  'j',\n", "  'k',\n", "  'l',\n", "  'm',\n", "  'n',\n", "  'o',\n", "  'p',\n", "  'q',\n", "  'r',\n", "  's',\n", "  't',\n", "  'u',\n", "  'v',\n", "  'w',\n", "  'x',\n", "  'y',\n", "  'z',\n", "  'A',\n", "  'B',\n", "  'C',\n", "  'D',\n", "  'E',\n", "  'F',\n", "  'G',\n", "  'H',\n", "  'I',\n", "  'J',\n", "  'K',\n", "  'L',\n", "  'M',\n", "  'N',\n", "  'O',\n", "  'P',\n", "  'Q',\n", "  'R',\n", "  'S',\n", "  'T',\n", "  'U',\n", "  'V',\n", "  'W',\n", "  'X',\n", "  'Y',\n", "  'Z'],\n", " '<number>': ['<digits>'],\n", " '<digits>': ['<digit>', '<digits><digit>'],\n", " '<digit>': ['0', '1', '2', '3', '4', '5', '6', '7', '8', '9'],\n", " '<special>': ['.', ' ', '!'],\n", " '<email>': ['<letters>@<letters>'],\n", " '<letters>': ['<letter>', '<letters><letter>'],\n", " '<boolean>': ['True', 'False'],\n", " '<state>': [\"click('terms and conditions')\\n<state-1>\",\n", "  \"fill('zip', '<number>')\\ncheck('terms', <boolean>)\\nfill('name', '<text>')\\nfill('email', '<email>')\\nfill('city', '<text>')\\nsubmit('submit')\\n<state-2>\",\n", 
"  '<end>'],\n", " '<state-1>': ['<unexplored>'],\n", " '<state-2>': ['<unexplored>']}" ] }, "execution_count": 82, "metadata": {}, "output_type": "execute_result" } ], "source": [ "state_grammar" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "To better see the structure of the state grammar, we can visualize it as a state machine. We see that it nicely reflects what we can see from our Web server's home page:" ] }, { "cell_type": "code", "execution_count": 83, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.012583Z", "iopub.status.busy": "2023-01-07T14:53:25.011979Z", "iopub.status.idle": "2023-01-07T14:53:25.264759Z", "shell.execute_reply": "2023-01-07T14:53:25.265004Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "start\n", "\n", "<start>\n", "\n", "\n", "\n", "state\n", "\n", "<state>\n", "\n", "\n", "\n", "start->state\n", "\n", "\n", "\n", "\n", "\n", "state-1\n", "\n", "<state-1>\n", "\n", "\n", "\n", "state->state-1\n", "\n", "\n", "click('terms and conditions')\n", "\n", "\n", "\n", "state-2\n", "\n", "<state-2>\n", "\n", "\n", "\n", "state->state-2\n", "\n", "\n", "fill('zip', '<number>')\n", "check('terms', <boolean>)\n", "fill('name', '<text>')\n", "fill('email', '<email>')\n", "fill('city', '<text>')\n", "submit('submit')\n", "\n", "\n", "\n", "end\n", "\n", "<end>\n", "\n", "\n", "\n", "state->end\n", "\n", "\n", "\n", "\n", "\n", "unexplored\n", "\n", "<unexplored>\n", "\n", "\n", "\n", "state-1->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-2->unexplored\n", "\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "fsm_diagram(state_grammar)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "From the start state (`<state>`), we can either click on \"terms and conditions\", ending in `<state-1>`, or fill out 
the form, ending in `<state-2>`." ] }, { "cell_type": "code", "execution_count": 84, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.267651Z", "iopub.status.busy": "2023-01-07T14:53:25.267282Z", "iopub.status.idle": "2023-01-07T14:53:25.268915Z", "shell.execute_reply": "2023-01-07T14:53:25.269174Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "[\"click('terms and conditions')\\n<state-1>\",\n", " \"fill('zip', '<number>')\\ncheck('terms', <boolean>)\\nfill('name', '<text>')\\nfill('email', '<email>')\\nfill('city', '<text>')\\nsubmit('submit')\\n<state-2>\",\n", " '<end>']" ] }, "execution_count": 84, "metadata": {}, "output_type": "execute_result" } ], "source": [ "state_grammar[GUIGrammarMiner.START_STATE]" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Both these states are yet unexplored:" ] }, { "cell_type": "code", "execution_count": 85, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.271229Z", "iopub.status.busy": "2023-01-07T14:53:25.270866Z", "iopub.status.idle": "2023-01-07T14:53:25.272427Z", "shell.execute_reply": "2023-01-07T14:53:25.272615Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "['<unexplored>']" ] }, "execution_count": 85, "metadata": {}, "output_type": "execute_result" } ], "source": [ "state_grammar['<state-1>']" ] }, { "cell_type": "code", "execution_count": 86, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.274528Z", "iopub.status.busy": "2023-01-07T14:53:25.274199Z", "iopub.status.idle": "2023-01-07T14:53:25.275791Z", "shell.execute_reply": "2023-01-07T14:53:25.275981Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "['<unexplored>']" ] }, "execution_count": 86, "metadata": {}, "output_type": "execute_result" } ], "source": [ "state_grammar['<state-2>']" ] }, { "cell_type": "code", "execution_count": 87, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.277902Z", 
"iopub.status.busy": "2023-01-07T14:53:25.277556Z", "iopub.status.idle": "2023-01-07T14:53:25.279054Z", "shell.execute_reply": "2023-01-07T14:53:25.279273Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "['']" ] }, "execution_count": 87, "metadata": {}, "output_type": "execute_result" } ], "source": [ "state_grammar['<unexplored>']" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Given the grammar, we can use any of our grammar fuzzers to create valid input sequences:" ] }, { "cell_type": "code", "execution_count": 88, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.280908Z", "iopub.status.busy": "2023-01-07T14:53:25.280642Z", "iopub.status.idle": "2023-01-07T14:53:25.281799Z", "shell.execute_reply": "2023-01-07T14:53:25.282048Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from GrammarFuzzer import GrammarFuzzer" ] }, { "cell_type": "code", "execution_count": 89, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.285410Z", "iopub.status.busy": "2023-01-07T14:53:25.285042Z", "iopub.status.idle": "2023-01-07T14:53:25.286575Z", "shell.execute_reply": "2023-01-07T14:53:25.286820Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "fill('zip', '7')\n", "check('terms', True)\n", "fill('name', 'A')\n", "fill('email', 'M@Bo')\n", "fill('city', 'v')\n", "submit('submit')\n", "\n" ] } ], "source": [ "gui_fuzzer = GrammarFuzzer(state_grammar)\n", "while True:\n", "    action = gui_fuzzer.fuzz()\n", "    if action.find('submit(') > 0:\n", "        break\n", "print(action)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "These actions, however, must also be _executed_ such that we can explore the user interface. This is what we do in the next section." 
] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Executing User Interface Actions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "To execute actions, we introduce a `Runner` class, conveniently named `GUIRunner`. Its `run()` method executes the actions as given in an action string." ] }, { "cell_type": "code", "execution_count": 90, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.288655Z", "iopub.status.busy": "2023-01-07T14:53:25.288368Z", "iopub.status.idle": "2023-01-07T14:53:25.289512Z", "shell.execute_reply": "2023-01-07T14:53:25.289706Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from Fuzzer import Runner" ] }, { "cell_type": "code", "execution_count": 91, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.291651Z", "iopub.status.busy": "2023-01-07T14:53:25.291329Z", "iopub.status.idle": "2023-01-07T14:53:25.292681Z", "shell.execute_reply": "2023-01-07T14:53:25.292869Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class GUIRunner(Runner):\n", "    \"\"\"Execute the actions in a given action string\"\"\"\n", "\n", "    def __init__(self, driver) -> None:\n", "        \"\"\"Constructor. `driver` is a Selenium Web driver\"\"\"\n", "        self.driver = driver" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "#### Excursion: Implementing Executing UI Actions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The way we implement `run()` is fairly simple: We introduce four functions named `fill()`, `check()`, `submit()` and `click()`, and run `exec()` on the action string to have the Python interpreter invoke these functions." 
] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Running `exec()` on third-party input is dangerous, as the names of UI elements may contain valid Python code. We restrict access to the four functions named above, and also set `__builtins__` to the empty dictionary such that built-in Python functions are not available during `exec()`. This will prevent accidents, but as we will see in the [chapter on information flow](InformationFlow.ipynb), it is still possible to inject Python code. To prevent such injection attacks, we use `html.escape()` to quote angle and quote characters in all third-party strings." ] }, { "cell_type": "code", "execution_count": 92, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.295804Z", "iopub.status.busy": "2023-01-07T14:53:25.295494Z", "iopub.status.idle": "2023-01-07T14:53:25.296904Z", "shell.execute_reply": "2023-01-07T14:53:25.297083Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIRunner(GUIRunner):\n", "    def run(self, inp: str) -> Tuple[str, str]:\n", "        \"\"\"Execute the action string `inp` on the current Web site.\n", "        Return a pair (`inp`, `outcome`).\"\"\"\n", "\n", "        def fill(name, value):\n", "            self.do_fill(html.unescape(name), html.unescape(value))\n", "\n", "        def check(name, state):\n", "            self.do_check(html.unescape(name), state)\n", "\n", "        def submit(name):\n", "            self.do_submit(html.unescape(name))\n", "\n", "        def click(name):\n", "            self.do_click(html.unescape(name))\n", "\n", "        exec(inp, {'__builtins__': {}},\n", "                  {\n", "                      'fill': fill,\n", "                      'check': check,\n", "                      'submit': submit,\n", "                      'click': click,\n", "                  })\n", "\n", "        return inp, self.PASS" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "To identify elements in an action, we first search for them by their name, and then by the displayed link text." 
] }, { "cell_type": "code", "execution_count": 93, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.298839Z", "iopub.status.busy": "2023-01-07T14:53:25.298542Z", "iopub.status.idle": "2023-01-07T14:53:25.299980Z", "shell.execute_reply": "2023-01-07T14:53:25.300174Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from selenium.common.exceptions import NoSuchElementException\n", "from selenium.common.exceptions import ElementClickInterceptedException, ElementNotInteractableException" ] }, { "cell_type": "code", "execution_count": 94, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.302380Z", "iopub.status.busy": "2023-01-07T14:53:25.302081Z", "iopub.status.idle": "2023-01-07T14:53:25.303251Z", "shell.execute_reply": "2023-01-07T14:53:25.303460Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class GUIRunner(GUIRunner):\n", "    def find_element(self, name: str) -> Any:\n", "        \"\"\"Search for an element named `name` on the current Web site.\n", "        Matches can occur by name or by link text.\"\"\"\n", "\n", "        try:\n", "            return self.driver.find_element(By.NAME, name)\n", "        except NoSuchElementException:\n", "            return self.driver.find_element(By.LINK_TEXT, name)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The implementations of the actions simply defer to the appropriate Selenium methods, introducing explicit delays such that the page can reload and refresh." 
] }, { "cell_type": "code", "execution_count": 95, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.305248Z", "iopub.status.busy": "2023-01-07T14:53:25.304947Z", "iopub.status.idle": "2023-01-07T14:53:25.307022Z", "shell.execute_reply": "2023-01-07T14:53:25.307221Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "import time\n", "\n", "from selenium.webdriver.support.ui import WebDriverWait" ] }, { "cell_type": "code", "execution_count": 96, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.309084Z", "iopub.status.busy": "2023-01-07T14:53:25.308799Z", "iopub.status.idle": "2023-01-07T14:53:25.310058Z", "shell.execute_reply": "2023-01-07T14:53:25.310249Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIRunner(GUIRunner):\n", "    # Delays (in seconds)\n", "    DELAY_AFTER_FILL = 0.1\n", "    DELAY_AFTER_CHECK = 0.1\n", "    DELAY_AFTER_SUBMIT = 1\n", "    DELAY_AFTER_CLICK = 1" ] }, { "cell_type": "code", "execution_count": 97, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.312188Z", "iopub.status.busy": "2023-01-07T14:53:25.311903Z", "iopub.status.idle": "2023-01-07T14:53:25.313252Z", "shell.execute_reply": "2023-01-07T14:53:25.313449Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class GUIRunner(GUIRunner):\n", "    def do_fill(self, name: str, value: str) -> None:\n", "        \"\"\"Fill the text element `name` with `value`\"\"\"\n", "\n", "        element = self.find_element(name)\n", "        element.send_keys(value)\n", "        time.sleep(self.DELAY_AFTER_FILL)  # pause so the page can update" ] }, { "cell_type": "code", "execution_count": 98, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.315547Z", "iopub.status.busy": "2023-01-07T14:53:25.315232Z", "iopub.status.idle": "2023-01-07T14:53:25.316598Z", "shell.execute_reply": "2023-01-07T14:53:25.316911Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class 
GUIRunner(GUIRunner):\n", "    def do_check(self, name: str, state: bool) -> None:\n", "        \"\"\"Set the check element `name` to `state`\"\"\"\n", "\n", "        element = self.find_element(name)\n", "        if bool(state) != bool(element.is_selected()):\n", "            element.click()\n", "        time.sleep(self.DELAY_AFTER_CHECK)  # pause so the page can update" ] }, { "cell_type": "code", "execution_count": 99, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.318957Z", "iopub.status.busy": "2023-01-07T14:53:25.318595Z", "iopub.status.idle": "2023-01-07T14:53:25.320237Z", "shell.execute_reply": "2023-01-07T14:53:25.320513Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class GUIRunner(GUIRunner):\n", "    def do_submit(self, name: str) -> None:\n", "        \"\"\"Click on the submit element `name`\"\"\"\n", "\n", "        element = self.find_element(name)\n", "        element.click()\n", "        time.sleep(self.DELAY_AFTER_SUBMIT)  # pause so the next page can load" ] }, { "cell_type": "code", "execution_count": 100, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.322859Z", "iopub.status.busy": "2023-01-07T14:53:25.322451Z", "iopub.status.idle": "2023-01-07T14:53:25.323909Z", "shell.execute_reply": "2023-01-07T14:53:25.324178Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIRunner(GUIRunner):\n", "    def do_click(self, name: str) -> None:\n", "        \"\"\"Click on the element `name`\"\"\"\n", "\n", "        element = self.find_element(name)\n", "        element.click()\n", "        time.sleep(self.DELAY_AFTER_CLICK)  # pause so the next page can load" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "#### End of Excursion" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Let us try out `GUIRunner` and its `run()` method. 
We create a runner on our Web server, and let it execute a `fill()` action:" ] }, { "cell_type": "code", "execution_count": 101, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.326479Z", "iopub.status.busy": "2023-01-07T14:53:25.326137Z", "iopub.status.idle": "2023-01-07T14:53:25.341023Z", "shell.execute_reply": "2023-01-07T14:53:25.341237Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)" ] }, { "cell_type": "code", "execution_count": 102, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.343240Z", "iopub.status.busy": "2023-01-07T14:53:25.342684Z", "iopub.status.idle": "2023-01-07T14:53:25.344556Z", "shell.execute_reply": "2023-01-07T14:53:25.344764Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_runner = GUIRunner(gui_driver)" ] }, { "cell_type": "code", "execution_count": 103, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.347056Z", "iopub.status.busy": "2023-01-07T14:53:25.346608Z", "iopub.status.idle": "2023-01-07T14:53:25.363074Z", "shell.execute_reply": "2023-01-07T14:53:25.363298Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "(\"fill('name', 'Walter White')\", 'PASS')" ] }, "execution_count": 103, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_runner.run(\"fill('name', 'Walter White')\")" ] }, { "cell_type": "code", "execution_count": 104, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.365094Z", "iopub.status.busy": "2023-01-07T14:53:25.364430Z", "iopub.status.idle": "2023-01-07T14:53:25.383651Z", "shell.execute_reply": "2023-01-07T14:53:25.383990Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 104, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { 
"cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "A `submit()` action submits the order. (Note that our Web server does no effort whatsoever to validate the form.)" ] }, { "cell_type": "code", "execution_count": 105, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.386255Z", "iopub.status.busy": "2023-01-07T14:53:25.385971Z", "iopub.status.idle": "2023-01-07T14:53:25.413916Z", "shell.execute_reply": "2023-01-07T14:53:25.414138Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "(\"submit('submit')\", 'PASS')" ] }, "execution_count": 105, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_runner.run(\"submit('submit')\")" ] }, { "cell_type": "code", "execution_count": 106, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.416302Z", "iopub.status.busy": "2023-01-07T14:53:25.415988Z", "iopub.status.idle": "2023-01-07T14:53:25.430052Z", "shell.execute_reply": "2023-01-07T14:53:25.430317Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 106, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Of course, we can also execute action sequences generated from the grammar. This allows us to fill the form again and again, using values matching the type given in the form." 
] }, { "cell_type": "code", "execution_count": 107, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.432520Z", "iopub.status.busy": "2023-01-07T14:53:25.432222Z", "iopub.status.idle": "2023-01-07T14:53:25.449902Z", "shell.execute_reply": "2023-01-07T14:53:25.450262Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)" ] }, { "cell_type": "code", "execution_count": 108, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.452840Z", "iopub.status.busy": "2023-01-07T14:53:25.452489Z", "iopub.status.idle": "2023-01-07T14:53:25.453627Z", "shell.execute_reply": "2023-01-07T14:53:25.453967Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_fuzzer = GrammarFuzzer(state_grammar)" ] }, { "cell_type": "code", "execution_count": 109, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.457603Z", "iopub.status.busy": "2023-01-07T14:53:25.457107Z", "iopub.status.idle": "2023-01-07T14:53:25.459084Z", "shell.execute_reply": "2023-01-07T14:53:25.459289Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "while True:\n", " action = gui_fuzzer.fuzz()\n", " if action.find('submit(') > 0:\n", " break" ] }, { "cell_type": "code", "execution_count": 110, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.461596Z", "iopub.status.busy": "2023-01-07T14:53:25.461275Z", "iopub.status.idle": "2023-01-07T14:53:25.462754Z", "shell.execute_reply": "2023-01-07T14:53:25.462986Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "fill('zip', '7')\n", "check('terms', False)\n", "fill('name', 'yY')\n", "fill('email', 'xlp@G')\n", "fill('city', '!')\n", "submit('submit')\n", "\n" ] } ], "source": [ "print(action)" ] }, { "cell_type": "code", "execution_count": 111, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.464974Z", 
"iopub.status.busy": "2023-01-07T14:53:25.464640Z", "iopub.status.idle": "2023-01-07T14:53:25.518944Z", "shell.execute_reply": "2023-01-07T14:53:25.519168Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "(\"fill('zip', '7')\\ncheck('terms', False)\\nfill('name', 'yY')\\nfill('email', 'xlp@G')\\nfill('city', '!')\\nsubmit('submit')\\n\",\n", " 'PASS')" ] }, "execution_count": 111, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_runner.run(action)" ] }, { "cell_type": "code", "execution_count": 112, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.521360Z", "iopub.status.busy": "2023-01-07T14:53:25.521004Z", "iopub.status.idle": "2023-01-07T14:53:25.534871Z", "shell.execute_reply": "2023-01-07T14:53:25.535180Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 112, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Exploring User Interfaces\n", "\n", "So far, our grammar retrieval and execution of actions is limited to the current user interface state (i.e., the current page shown). To systematically explore a user interface, we must explore all states, notably those ending in `` – and whenever we reach a new state, again retrieve its grammar such that we may be able to reach other states. Since some states can only be reached by generating inputs, test generation and user interface exploration _take place at the same time._ \n", "\n", "Consequently, we introduce a `GUIFuzzer` class, which generates inputs for all forms and follows all links, and which updates its grammar (i.e., its user interface model as a finite state machine) every time it encounters a new state. 
" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Excursion: Implementing GUIFuzzer" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Exploring states and updating the grammar at the same time is a fairly complex operation, so we need to introduce quite a number of methods before we can put this to use. The `GUIFuzzer` constructor sets three important attributes:\n", "\n", "1. `state_symbol`: This holds the symbol of the current state (e.g. ``).\n", "2. `state`: This holds the set of actions for the current state, as returned by the `GUIGrammarMiner` method `mine_state_actions()`.\n", "3. `states_seen`: This maps the states seen (as in `state`) to the respective symbols.\n", "\n", "Let us show these three attributes after initialization." ] }, { "cell_type": "code", "execution_count": 113, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.537164Z", "iopub.status.busy": "2023-01-07T14:53:25.536874Z", "iopub.status.idle": "2023-01-07T14:53:25.537999Z", "shell.execute_reply": "2023-01-07T14:53:25.538355Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from Grammars import is_nonterminal" ] }, { "cell_type": "code", "execution_count": 114, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.540110Z", "iopub.status.busy": "2023-01-07T14:53:25.539804Z", "iopub.status.idle": "2023-01-07T14:53:25.541503Z", "shell.execute_reply": "2023-01-07T14:53:25.541197Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from GrammarFuzzer import GrammarFuzzer" ] }, { "cell_type": "code", "execution_count": 115, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.544877Z", "iopub.status.busy": "2023-01-07T14:53:25.544570Z", "iopub.status.idle": "2023-01-07T14:53:25.545783Z", "shell.execute_reply": "2023-01-07T14:53:25.546295Z" }, "slideshow": { "slide_type": 
"subslide" } }, "outputs": [], "source": [ "class GUIFuzzer(GrammarFuzzer):\n", " \"\"\"A fuzzer for GUIs, using Selenium.\"\"\"\n", "\n", " def __init__(self, driver, *,\n", " miner: Optional[GUIGrammarMiner] = None,\n", " stay_on_host: bool = True,\n", " log_gui_exploration: bool = False,\n", " disp_gui_exploration: bool = False,\n", " **kwargs) -> None:\n", " \"\"\"Constructor.\n", " `driver` - the Selenium driver to use.\n", " `miner` - the miner to use (default: `GUIGrammarMiner(driver)`)\n", " `stay_on_host` - if True (default), do not explore external links.\n", " `log_gui_exploration` - if set, print out exploration steps.\n", " `disp_gui_exploration` - if set, display screenshot of current Web page\n", " as well as FSM diagrams during exploration.\n", " Other keyword arguments are passed to the `GrammarFuzzer` superclass.\n", " \"\"\"\n", "\n", " self.driver = driver\n", "\n", " if miner is None:\n", " miner = GUIGrammarMiner(driver)\n", "\n", " self.miner = miner\n", " self.stay_on_host = True\n", " self.log_gui_exploration = log_gui_exploration\n", " self.disp_gui_exploration = disp_gui_exploration\n", " self.initial_url = driver.current_url\n", "\n", " self.states_seen = {} # Maps states to symbols\n", " self.state_symbol = self.miner.START_STATE\n", " self.state: FrozenSet[str] = self.miner.mine_state_actions()\n", " self.states_seen[self.state] = self.state_symbol\n", "\n", " grammar = self.miner.mine_state_grammar()\n", " super().__init__(grammar, **kwargs)" ] }, { "cell_type": "code", "execution_count": 116, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.549225Z", "iopub.status.busy": "2023-01-07T14:53:25.548650Z", "iopub.status.idle": "2023-01-07T14:53:25.564343Z", "shell.execute_reply": "2023-01-07T14:53:25.564567Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ 
"The initial state symbol is always ``:" ] }, { "cell_type": "code", "execution_count": 117, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.566919Z", "iopub.status.busy": "2023-01-07T14:53:25.566575Z", "iopub.status.idle": "2023-01-07T14:53:25.775772Z", "shell.execute_reply": "2023-01-07T14:53:25.776016Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "''" ] }, "execution_count": 117, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer = GUIFuzzer(gui_driver)\n", "gui_fuzzer.state_symbol" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The current state is characterized by the available UI actions:" ] }, { "cell_type": "code", "execution_count": 118, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.778307Z", "iopub.status.busy": "2023-01-07T14:53:25.777959Z", "iopub.status.idle": "2023-01-07T14:53:25.779430Z", "shell.execute_reply": "2023-01-07T14:53:25.779682Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "frozenset({\"check('terms', )\",\n", " \"click('terms and conditions')\",\n", " \"fill('city', '')\",\n", " \"fill('email', '')\",\n", " \"fill('name', '')\",\n", " \"fill('zip', '')\",\n", " \"submit('submit')\"})" ] }, "execution_count": 118, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer.state" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "`states_seen` maps this state to its symbol:" ] }, { "cell_type": "code", "execution_count": 119, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.781603Z", "iopub.status.busy": "2023-01-07T14:53:25.781284Z", "iopub.status.idle": "2023-01-07T14:53:25.782819Z", "shell.execute_reply": "2023-01-07T14:53:25.783143Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ 
"''" ] }, "execution_count": 119, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer.states_seen[gui_fuzzer.state]" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The `restart()` method gets us back to the initial URL and resets the state. This is what we use with every new exploration." ] }, { "cell_type": "code", "execution_count": 120, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.785295Z", "iopub.status.busy": "2023-01-07T14:53:25.784985Z", "iopub.status.idle": "2023-01-07T14:53:25.786114Z", "shell.execute_reply": "2023-01-07T14:53:25.786459Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def restart(self) -> None:\n", " \"\"\"Get back to original URL\"\"\"\n", "\n", " self.driver.get(self.initial_url)\n", " self.state = frozenset(self.miner.START_STATE)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "When producing a sequence of actions from the grammar, we want to know which final state we are to be in. We can retrieve this path from the _derivation tree_ produced – it is the last symbol being expanded." 
] }, { "cell_type": "code", "execution_count": 121, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.789787Z", "iopub.status.busy": "2023-01-07T14:53:25.789484Z", "iopub.status.idle": "2023-01-07T14:53:25.790576Z", "shell.execute_reply": "2023-01-07T14:53:25.790908Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "while True:\n", " action = gui_fuzzer.fuzz()\n", " if action.find('click(') >= 0:\n", " break" ] }, { "cell_type": "code", "execution_count": 122, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.792601Z", "iopub.status.busy": "2023-01-07T14:53:25.792312Z", "iopub.status.idle": "2023-01-07T14:53:25.793565Z", "shell.execute_reply": "2023-01-07T14:53:25.793757Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from GrammarFuzzer import display_tree, DerivationTree" ] }, { "cell_type": "code", "execution_count": 123, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:25.798752Z", "iopub.status.busy": "2023-01-07T14:53:25.798406Z", "iopub.status.idle": "2023-01-07T14:53:26.053848Z", "shell.execute_reply": "2023-01-07T14:53:26.054099Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "0\n", "<start>\n", "\n", "\n", "\n", "1\n", "<state>\n", "\n", "\n", "\n", "0->1\n", "\n", "\n", "\n", "\n", "\n", "2\n", "click('terms and conditions')\\n\n", "\n", "\n", "\n", "1->2\n", "\n", "\n", "\n", "\n", "\n", "3\n", "<state-1>\n", "\n", "\n", "\n", "1->3\n", "\n", "\n", "\n", "\n", "\n", "4\n", "<unexplored>\n", "\n", "\n", "\n", "3->4\n", "\n", "\n", "\n", "\n", "\n", "5\n", "\n", "\n", "\n", "4->5\n", "\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "execution_count": 123, "metadata": {}, "output_type": "execute_result" } ], "source": [ "tree = gui_fuzzer.derivation_tree\n", "display_tree(tree)" ] }, { "cell_type": "code", "execution_count": 124, 
"metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.057010Z", "iopub.status.busy": "2023-01-07T14:53:26.056704Z", "iopub.status.idle": "2023-01-07T14:53:26.058124Z", "shell.execute_reply": "2023-01-07T14:53:26.058470Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def fsm_path(self, tree: DerivationTree) -> List[str]:\n", " \"\"\"Return sequence of state symbols.\"\"\"\n", "\n", " (node, children) = tree\n", " if node == self.miner.UNEXPLORED_STATE:\n", " return []\n", " elif children is None or len(children) == 0:\n", " return [node]\n", " else:\n", " return [node] + self.fsm_path(children[-1])" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "This is the path in the finite state machine towards the \"fuzzed\" state:" ] }, { "cell_type": "code", "execution_count": 125, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.060968Z", "iopub.status.busy": "2023-01-07T14:53:26.060622Z", "iopub.status.idle": "2023-01-07T14:53:26.264916Z", "shell.execute_reply": "2023-01-07T14:53:26.265265Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "['', '', '']" ] }, "execution_count": 125, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer = GUIFuzzer(gui_driver)\n", "gui_fuzzer.fsm_path(tree)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "This is its last element:" ] }, { "cell_type": "code", "execution_count": 126, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.267704Z", "iopub.status.busy": "2023-01-07T14:53:26.267389Z", "iopub.status.idle": "2023-01-07T14:53:26.268512Z", "shell.execute_reply": "2023-01-07T14:53:26.268813Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def fsm_last_state_symbol(self, tree: 
DerivationTree) -> str:\n", "        \"\"\"Return current (expected) state symbol\"\"\"\n", "\n", "        for state in reversed(self.fsm_path(tree)):\n", "            if is_nonterminal(state):\n", "                return state\n", "\n", "        assert False" ] }, { "cell_type": "code", "execution_count": 127, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.270649Z", "iopub.status.busy": "2023-01-07T14:53:26.270347Z", "iopub.status.idle": "2023-01-07T14:53:26.475727Z", "shell.execute_reply": "2023-01-07T14:53:26.475988Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "'<state-1>'" ] }, "execution_count": 127, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer = GUIFuzzer(gui_driver)\n", "gui_fuzzer.fsm_last_state_symbol(tree)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "When we run the fuzzer via `run()`, we create an action (via `fuzz()`) and set `state_symbol` to the state we expect to be in after running this action. After actually running the action in the given `GUIRunner`, we determine the actual current state, using `update_state()`." 
] }, { "cell_type": "code", "execution_count": 128, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.479370Z", "iopub.status.busy": "2023-01-07T14:53:26.478945Z", "iopub.status.idle": "2023-01-07T14:53:26.480249Z", "shell.execute_reply": "2023-01-07T14:53:26.480574Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def run(self, runner: GUIRunner) -> Tuple[str, str]: # type: ignore\n", " \"\"\"Run the fuzzer on the given GUIRunner `runner`.\"\"\"\n", " assert isinstance(runner, GUIRunner)\n", "\n", " self.restart()\n", " action = self.fuzz()\n", " self.state_symbol = self.fsm_last_state_symbol(self.derivation_tree)\n", "\n", " if self.log_gui_exploration:\n", " print(\"Action\", action.strip(), \"->\", self.state_symbol)\n", "\n", " result, outcome = runner.run(action)\n", "\n", " if self.state_symbol != self.miner.FINAL_STATE:\n", " self.update_state()\n", "\n", " return self.state_symbol, outcome" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "When updating the current state, we check whether we are in a new or in a previously seen state, and invoke `update_new_state()` or `update_existing_state()`, respectively." 
] }, { "cell_type": "code", "execution_count": 129, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.483138Z", "iopub.status.busy": "2023-01-07T14:53:26.482804Z", "iopub.status.idle": "2023-01-07T14:53:26.484032Z", "shell.execute_reply": "2023-01-07T14:53:26.484263Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def update_state(self) -> None:\n", " \"\"\"Determine current state from current Web page\"\"\"\n", "\n", " if self.disp_gui_exploration:\n", " display(Image(self.driver.get_screenshot_as_png()))\n", "\n", " self.state = self.miner.mine_state_actions()\n", " if self.state not in self.states_seen:\n", " self.states_seen[self.state] = self.state_symbol\n", " self.update_new_state()\n", " else:\n", " self.update_existing_state()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Finding a new state means that we mine a new grammar for the newly found state, and update our existing grammar with it." 
] }, { "cell_type": "code", "execution_count": 130, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.486525Z", "iopub.status.busy": "2023-01-07T14:53:26.486201Z", "iopub.status.idle": "2023-01-07T14:53:26.487488Z", "shell.execute_reply": "2023-01-07T14:53:26.487672Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def set_grammar(self, new_grammar: Grammar) -> None:\n", " \"\"\"Set grammar to `new_grammar`.\"\"\"\n", "\n", " self.grammar = new_grammar\n", "\n", " if self.disp_gui_exploration and rich_output():\n", " display(fsm_diagram(self.grammar))" ] }, { "cell_type": "code", "execution_count": 131, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.490114Z", "iopub.status.busy": "2023-01-07T14:53:26.489815Z", "iopub.status.idle": "2023-01-07T14:53:26.491153Z", "shell.execute_reply": "2023-01-07T14:53:26.491333Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def update_new_state(self) -> None:\n", " \"\"\"Found new state; extend grammar accordingly\"\"\"\n", "\n", " if self.log_gui_exploration:\n", " print(\"In new state\", unicode_escape(self.state_symbol),\n", " unicode_escape(repr(self.state)))\n", "\n", " state_grammar = self.miner.mine_state_grammar(grammar=self.grammar, \n", " state_symbol=self.state_symbol)\n", " del state_grammar[START_SYMBOL]\n", " del state_grammar[self.miner.START_STATE]\n", " self.set_grammar(extend_grammar(self.grammar, state_grammar))\n", "\n", " def update_existing_state(self) -> None:\n", " pass # See below" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "If we find an existing state, we need to _merge_ both states. If, for instance, we find that we are in existing `` rather than in the expected ``, we replace all instances of `` in the grammar by ``. 
The method `replace_symbol()` takes care of the renaming; `update_existing_state()` sets the grammar accordingly." ] }, { "cell_type": "code", "execution_count": 132, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.493046Z", "iopub.status.busy": "2023-01-07T14:53:26.492709Z", "iopub.status.idle": "2023-01-07T14:53:26.494427Z", "shell.execute_reply": "2023-01-07T14:53:26.494192Z" }, "slideshow": { "slide_type": "skip" }, "tags": [] }, "outputs": [], "source": [ "from Grammars import exp_string, exp_opts" ] }, { "cell_type": "code", "execution_count": 133, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.497108Z", "iopub.status.busy": "2023-01-07T14:53:26.496813Z", "iopub.status.idle": "2023-01-07T14:53:26.498145Z", "shell.execute_reply": "2023-01-07T14:53:26.498334Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "def replace_symbol(grammar: Grammar, \n", " old_symbol: str, new_symbol: str) -> Grammar:\n", " \"\"\"Return a grammar in which all occurrences of `old_symbol` are replaced by `new_symbol`\"\"\"\n", "\n", " new_grammar: Grammar = {}\n", "\n", " for symbol in grammar:\n", " new_expansions = []\n", " for expansion in grammar[symbol]:\n", " new_expansion_string = exp_string(expansion).replace(old_symbol, new_symbol)\n", " if len(exp_opts(expansion)) > 0:\n", " new_expansion = (new_expansion_string, exp_opts(expansion))\n", " else:\n", " new_expansion = new_expansion_string # type: ignore\n", " new_expansions.append(new_expansion)\n", "\n", " new_grammar[symbol] = new_expansions # type: ignore\n", "\n", " # Remove unused parts\n", " for nonterminal in unreachable_nonterminals(new_grammar):\n", " del new_grammar[nonterminal]\n", "\n", " return new_grammar" ] }, { "cell_type": "code", "execution_count": 134, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.500875Z", "iopub.status.busy": "2023-01-07T14:53:26.500573Z", "iopub.status.idle": 
"2023-01-07T14:53:26.501789Z", "shell.execute_reply": "2023-01-07T14:53:26.501978Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUIFuzzer(GUIFuzzer):\n", " def update_existing_state(self) -> None:\n", " \"\"\"Update actions of existing state\"\"\"\n", "\n", " if self.log_gui_exploration:\n", " print(\"In existing state\", self.states_seen[self.state])\n", "\n", " if self.state_symbol != self.states_seen[self.state]:\n", " if self.log_gui_exploration:\n", " print(\"Replacing expected state %s by %s\" %\n", " (self.state_symbol, self.states_seen[self.state]))\n", "\n", " new_grammar = replace_symbol(self.grammar, self.state_symbol, \n", " self.states_seen[self.state])\n", " self.state_symbol = self.states_seen[self.state]\n", " self.set_grammar(new_grammar)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "This concludes our definitions for `GUIFuzzer`." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### End of Excursion" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Let us put `GUIFuzzer` to use, enabling its logging mechanisms to see what it is doing." 
] }, { "cell_type": "code", "execution_count": 135, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.503855Z", "iopub.status.busy": "2023-01-07T14:53:26.503553Z", "iopub.status.idle": "2023-01-07T14:53:26.519638Z", "shell.execute_reply": "2023-01-07T14:53:26.519849Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)" ] }, { "cell_type": "code", "execution_count": 136, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.521925Z", "iopub.status.busy": "2023-01-07T14:53:26.521625Z", "iopub.status.idle": "2023-01-07T14:53:26.737911Z", "shell.execute_reply": "2023-01-07T14:53:26.738128Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_fuzzer = GUIFuzzer(gui_driver, log_gui_exploration=True, disp_gui_exploration=True)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Running it the first time yields a new state:" ] }, { "cell_type": "code", "execution_count": 137, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.740286Z", "iopub.status.busy": "2023-01-07T14:53:26.739982Z", "iopub.status.idle": "2023-01-07T14:53:26.754803Z", "shell.execute_reply": "2023-01-07T14:53:26.755066Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Action -> \n" ] }, { "data": { "text/plain": [ "('', 'PASS')" ] }, "execution_count": 137, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer.run(gui_runner)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The next actions fill out the order form." 
] }, { "cell_type": "code", "execution_count": 138, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:26.758019Z", "iopub.status.busy": "2023-01-07T14:53:26.757541Z", "iopub.status.idle": "2023-01-07T14:53:27.124422Z", "shell.execute_reply": "2023-01-07T14:53:27.124760Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Action click('terms and conditions') -> \n" ] }, { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "In new state frozenset({\"ignore('Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.')\", \"click('order form')\"})\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "start\n", "\n", "<start>\n", "\n", "\n", "\n", "state\n", "\n", "<state>\n", "\n", "\n", "\n", "start->state\n", "\n", "\n", "\n", "\n", "\n", "state-1\n", "\n", "<state-1>\n", "\n", "\n", "\n", "state->state-1\n", "\n", "\n", "click('terms and conditions')\n", "\n", "\n", "\n", "state-2\n", "\n", "<state-2>\n", "\n", "\n", "\n", "state->state-2\n", "\n", "\n", "fill('zip', '<number>')\n", "check('terms', <boolean>)\n", "fill('name', '<text>')\n", "fill('email', '<email>')\n", "fill('city', '<text>')\n", "submit('submit')\n", "\n", "\n", "\n", "end\n", "\n", "<end>\n", "\n", "\n", "\n", "state->end\n", "\n", "\n", "\n", "\n", "\n", "state-1->end\n", "\n", "\n", "\n", "\n", "\n", "state-3\n", "\n", "<state-3>\n", "\n", "\n", "\n", "state-1->state-3\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n", "unexplored\n", "\n", "<unexplored>\n", "\n", "\n", "\n", "state-2->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-3->unexplored\n", "\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/plain": [ "None" ] }, "metadata": {}, "output_type": 
"display_data" }, { "data": { "text/plain": [ "('', 'PASS')" ] }, "execution_count": 138, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer.run(gui_runner)" ] }, { "cell_type": "code", "execution_count": 139, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.127333Z", "iopub.status.busy": "2023-01-07T14:53:27.127011Z", "iopub.status.idle": "2023-01-07T14:53:27.731579Z", "shell.execute_reply": "2023-01-07T14:53:27.731993Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Action fill('zip', '7')\n", "check('terms', True)\n", "fill('name', '84')\n", "fill('email', 'M@Chu')\n", "fill('city', 'j')\n", "submit('submit') -> \n" ] }, { "data": { "image/png": "\n", "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "In new state frozenset({\"click('order form')\"})\n" ] }, { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "start\n", "\n", "<start>\n", "\n", "\n", "\n", "state\n", "\n", "<state>\n", "\n", "\n", "\n", "start->state\n", "\n", "\n", "\n", "\n", "\n", "state-1\n", "\n", "<state-1>\n", "\n", "\n", "\n", "state->state-1\n", "\n", "\n", "click('terms and conditions')\n", "\n", "\n", "\n", "state-2\n", "\n", "<state-2>\n", "\n", "\n", "\n", "state->state-2\n", "\n", "\n", "fill('zip', '<number>')\n", "check('terms', <boolean>)\n", "fill('name', '<text>')\n", "fill('email', '<email>')\n", "fill('city', '<text>')\n", "submit('submit')\n", "\n", "\n", "\n", "end\n", "\n", "<end>\n", "\n", "\n", "\n", "state->end\n", "\n", "\n", "\n", "\n", "\n", "state-1->end\n", "\n", "\n", "\n", "\n", "\n", "state-3\n", "\n", "<state-3>\n", "\n", "\n", "\n", "state-1->state-3\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n", "state-2->end\n", "\n", "\n", "\n", "\n", "\n", "state-4\n", "\n", "<state-4>\n", "\n", "\n", "\n", "state-2->state-4\n", 
"\n", "\n", "click('order form')\n", "\n", "\n", "\n", "unexplored\n", "\n", "<unexplored>\n", "\n", "\n", "\n", "state-3->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-4->unexplored\n", "\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/plain": [ "None" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/plain": [ "('', 'PASS')" ] }, "execution_count": 139, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer.run(gui_runner)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "At this point, our GUI model is fairly complete already. In order to systematically cover _all_ states, random exploration is not efficient enough, though." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Covering States\n", "\n", "During exploration as well as during testing, we want to _cover_ all states and transitions between states. How can we achieve this?" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "It turns out that _we already have this._ Our `GrammarCoverageFuzzer` from the [chapter on coverage-based grammar testing](GrammarCoverageFuzzer.ipynb) strives to systematically _cover all expansion alternatives_ in a grammar. In the finite state model, these expansion alternatives translate into transitions between states. Hence, applying the coverage strategy from `GrammarCoverageFuzzer` to our state grammars would automatically cover one transition after another." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "How do we get these features into `GUIFuzzer`? Using _multiple inheritance_, we can create a class `GUICoverageFuzzer` which combines the `run()` method from `GUIFuzzer` with the coverage choices from `GrammarCoverageFuzzer`." 
] }, { "cell_type": "code", "execution_count": 140, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.735108Z", "iopub.status.busy": "2023-01-07T14:53:27.734697Z", "iopub.status.idle": "2023-01-07T14:53:27.947134Z", "shell.execute_reply": "2023-01-07T14:53:27.947348Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from GrammarCoverageFuzzer import GrammarCoverageFuzzer" ] }, { "cell_type": "code", "execution_count": 141, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.949317Z", "iopub.status.busy": "2023-01-07T14:53:27.949015Z", "iopub.status.idle": "2023-01-07T14:53:27.950239Z", "shell.execute_reply": "2023-01-07T14:53:27.950432Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "from bookutils import inheritance_conflicts" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Since the `__init__()` constructor is defined in both superclasses, we need to define our own constructor that serves both:" ] }, { "cell_type": "code", "execution_count": 142, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.955390Z", "iopub.status.busy": "2023-01-07T14:53:27.955058Z", "iopub.status.idle": "2023-01-07T14:53:27.956570Z", "shell.execute_reply": "2023-01-07T14:53:27.956794Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "text/plain": [ "['__init__']" ] }, "execution_count": 142, "metadata": {}, "output_type": "execute_result" } ], "source": [ "inheritance_conflicts(GUIFuzzer, GrammarCoverageFuzzer)" ] }, { "cell_type": "code", "execution_count": 143, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.959250Z", "iopub.status.busy": "2023-01-07T14:53:27.958777Z", "iopub.status.idle": "2023-01-07T14:53:27.960146Z", "shell.execute_reply": "2023-01-07T14:53:27.960338Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "class 
GUICoverageFuzzer(GUIFuzzer, GrammarCoverageFuzzer):\n", "    \"\"\"Systematically explore all states of the current Web page\"\"\"\n", "\n", "    def __init__(self, *args, **kwargs):\n", "        \"\"\"Constructor. All args are passed to the `GUIFuzzer` superclass.\"\"\"\n", "        GUIFuzzer.__init__(self, *args, **kwargs)\n", "        self.reset_coverage()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "With `GUICoverageFuzzer`, we can set up a method `explore_all()` that keeps running the fuzzer until no unexplored states remain or the limit of `max_actions` runs is reached:" ] }, { "cell_type": "code", "execution_count": 144, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.964193Z", "iopub.status.busy": "2023-01-07T14:53:27.963695Z", "iopub.status.idle": "2023-01-07T14:53:27.965335Z", "shell.execute_reply": "2023-01-07T14:53:27.965616Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "class GUICoverageFuzzer(GUICoverageFuzzer):\n", "    def explore_all(self, runner: GUIRunner, max_actions=100) -> None:\n", "        \"\"\"Explore all states of the GUI, up to `max_actions` (default 100).\"\"\"\n", "\n", "        actions = 0\n", "        while (self.miner.UNEXPLORED_STATE in self.grammar and\n", "               actions < max_actions):\n", "            actions += 1\n", "            if self.log_gui_exploration:\n", "                print(\"Run #\" + repr(actions))\n", "            try:\n", "                self.run(runner)\n", "            except (ElementClickInterceptedException,\n", "                    ElementNotInteractableException,\n", "                    NoSuchElementException):\n", "                # The element is covered, not interactable, or missing; skip this run\n", "                pass" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Let us use this to fully explore our Web server:" ] }, { "cell_type": "code", "execution_count": 145, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.968043Z", "iopub.status.busy": "2023-01-07T14:53:27.967646Z", "iopub.status.idle": "2023-01-07T14:53:27.986575Z", "shell.execute_reply": 
"2023-01-07T14:53:27.986783Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)" ] }, { "cell_type": "code", "execution_count": 146, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:27.988754Z", "iopub.status.busy": "2023-01-07T14:53:27.988455Z", "iopub.status.idle": "2023-01-07T14:53:28.198946Z", "shell.execute_reply": "2023-01-07T14:53:28.199160Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_fuzzer = GUICoverageFuzzer(gui_driver)" ] }, { "cell_type": "code", "execution_count": 147, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:28.201041Z", "iopub.status.busy": "2023-01-07T14:53:28.200756Z", "iopub.status.idle": "2023-01-07T14:53:28.993117Z", "shell.execute_reply": "2023-01-07T14:53:28.993293Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_fuzzer.explore_all(gui_runner)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Success! 
We have covered all states:" ] }, { "cell_type": "code", "execution_count": 148, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:28.994948Z", "iopub.status.busy": "2023-01-07T14:53:28.994562Z", "iopub.status.idle": "2023-01-07T14:53:29.253816Z", "shell.execute_reply": "2023-01-07T14:53:29.254173Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "start\n", "\n", "<start>\n", "\n", "\n", "\n", "state\n", "\n", "<state>\n", "\n", "\n", "\n", "start->state\n", "\n", "\n", "\n", "\n", "\n", "state-1\n", "\n", "<state-1>\n", "\n", "\n", "\n", "state->state-1\n", "\n", "\n", "click('terms and conditions')\n", "\n", "\n", "\n", "state-2\n", "\n", "<state-2>\n", "\n", "\n", "\n", "state->state-2\n", "\n", "\n", "fill('zip', '<number>')\n", "check('terms', <boolean>)\n", "fill('name', '<text>')\n", "fill('email', '<email>')\n", "fill('city', '<text>')\n", "submit('submit')\n", "\n", "\n", "\n", "end\n", "\n", "<end>\n", "\n", "\n", "\n", "state->end\n", "\n", "\n", "\n", "\n", "\n", "state-1->state\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n", "state-1->end\n", "\n", "\n", "\n", "\n", "\n", "state-2->state\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n", "state-2->end\n", "\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "fsm_diagram(gui_fuzzer.grammar)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We can retrieve the expansions covered so far, which of course cover all states." 
] }, { "cell_type": "code", "execution_count": 149, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:29.257018Z", "iopub.status.busy": "2023-01-07T14:53:29.256680Z", "iopub.status.idle": "2023-01-07T14:53:29.258283Z", "shell.execute_reply": "2023-01-07T14:53:29.258733Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "{' -> False',\n", " ' -> True',\n", " ' -> ',\n", " ' -> ',\n", " ' -> ',\n", " ' -> 0',\n", " ' -> 4',\n", " ' -> 6',\n", " ' -> 9',\n", " ' -> ',\n", " ' -> ',\n", " ' -> @',\n", " ' -> ',\n", " ' -> D',\n", " ' -> F',\n", " ' -> K',\n", " ' -> O',\n", " ' -> P',\n", " ' -> U',\n", " ' -> W',\n", " ' -> Y',\n", " ' -> b',\n", " ' -> h',\n", " ' -> i',\n", " ' -> l',\n", " ' -> o',\n", " ' -> t',\n", " ' -> x',\n", " ' -> ',\n", " ' -> ',\n", " ' -> ',\n", " ' -> ',\n", " ' -> ',\n", " ' -> ',\n", " \" -> click('order form')\\n\",\n", " ' -> ',\n", " \" -> click('order form')\\n\",\n", " ' -> ',\n", " ' -> ',\n", " ' -> ',\n", " \" -> click('terms and conditions')\\n\",\n", " \" -> fill('zip', '')\\ncheck('terms', )\\nfill('name', '')\\nfill('email', '')\\nfill('city', '')\\nsubmit('submit')\\n\",\n", " ' -> ',\n", " ' -> ',\n", " ' -> ',\n", " ' -> '}" ] }, "execution_count": 149, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer.covered_expansions" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Still, not all expansions have been covered yet; a number of digits and letters remain unused." 
] }, { "cell_type": "code", "execution_count": 150, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:29.261468Z", "iopub.status.busy": "2023-01-07T14:53:29.261109Z", "iopub.status.idle": "2023-01-07T14:53:29.262607Z", "shell.execute_reply": "2023-01-07T14:53:29.262851Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/plain": [ "{' -> 1',\n", " ' -> 2',\n", " ' -> 3',\n", " ' -> 5',\n", " ' -> 7',\n", " ' -> 8',\n", " ' -> A',\n", " ' -> B',\n", " ' -> C',\n", " ' -> E',\n", " ' -> G',\n", " ' -> H',\n", " ' -> I',\n", " ' -> J',\n", " ' -> L',\n", " ' -> M',\n", " ' -> N',\n", " ' -> Q',\n", " ' -> R',\n", " ' -> S',\n", " ' -> T',\n", " ' -> V',\n", " ' -> X',\n", " ' -> Z',\n", " ' -> a',\n", " ' -> c',\n", " ' -> d',\n", " ' -> e',\n", " ' -> f',\n", " ' -> g',\n", " ' -> j',\n", " ' -> k',\n", " ' -> m',\n", " ' -> n',\n", " ' -> p',\n", " ' -> q',\n", " ' -> r',\n", " ' -> s',\n", " ' -> u',\n", " ' -> v',\n", " ' -> w',\n", " ' -> y',\n", " ' -> z',\n", " ' -> !',\n", " ' -> .',\n", " ' -> ',\n", " \" -> click('order form')\\n\",\n", " ' -> ',\n", " \" -> click('order form')\\n\"}" ] }, "execution_count": 150, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_fuzzer.missing_expansion_coverage()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "Running the fuzzer again and again will eventually cover these expansions too, leading to letter and digit coverage within the order form." ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "slide" } }, "source": [ "## Exploring Large Sites\n", "\n", "Our GUI fuzzer is robust enough to handle exploration even on nontrivial sites such as [fuzzingbook.org](https://www.fuzzingbook.org). 
Let us demonstrate this:" ] }, { "cell_type": "code", "execution_count": 151, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:29.265361Z", "iopub.status.busy": "2023-01-07T14:53:29.264910Z", "iopub.status.idle": "2023-01-07T14:53:31.451255Z", "shell.execute_reply": "2023-01-07T14:53:31.451540Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(\"https://www.fuzzingbook.org/html/Fuzzer.html\")" ] }, { "cell_type": "code", "execution_count": 152, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:31.454456Z", "iopub.status.busy": "2023-01-07T14:53:31.454087Z", "iopub.status.idle": "2023-01-07T14:53:31.505432Z", "shell.execute_reply": "2023-01-07T14:53:31.507096Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 152, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "code", "execution_count": 153, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:31.511094Z", "iopub.status.busy": "2023-01-07T14:53:31.510455Z", "iopub.status.idle": "2023-01-07T14:53:31.512205Z", "shell.execute_reply": "2023-01-07T14:53:31.512914Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "book_runner = GUIRunner(gui_driver)" ] }, { "cell_type": "code", "execution_count": 154, "metadata": { "button": false, "execution": { "iopub.execute_input": "2023-01-07T14:53:31.516599Z", "iopub.status.busy": "2023-01-07T14:53:31.516232Z", "iopub.status.idle": "2023-01-07T14:53:37.342542Z", "shell.execute_reply": "2023-01-07T14:53:37.342752Z" }, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "book_fuzzer = GUICoverageFuzzer(gui_driver, log_gui_exploration=True) # , disp_gui_exploration=True)" ] }, { "cell_type": "markdown", "metadata": { 
"slideshow": { "slide_type": "fragment" } }, "source": [ "We explore the first few states of the site, defined in `ACTIONS`:" ] }, { "cell_type": "code", "execution_count": 155, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:37.344819Z", "iopub.status.busy": "2023-01-07T14:53:37.344524Z", "iopub.status.idle": "2023-01-07T14:53:37.345890Z", "shell.execute_reply": "2023-01-07T14:53:37.346103Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "ACTIONS = 5" ] }, { "cell_type": "code", "execution_count": 156, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:37.348184Z", "iopub.status.busy": "2023-01-07T14:53:37.347810Z", "iopub.status.idle": "2023-01-07T14:53:52.607550Z", "shell.execute_reply": "2023-01-07T14:53:52.607871Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Run #1\n", "Action click('use grammars to specify the input format and thus get many more valid inputs') -> \n", "In new state frozenset({\"click('constraints')\", \"ignore('Hodov\\xc3\\xa1n et al, 2018')\", \"ignore('Grammarinator')\", \"ignore('Burkhardt et al, 1967')\", \"ignore('Backus-Naur form')\", \"ignore('CSmith')\", \"click('')\", \"click('coverage')\", \"ignore('typing')\", \"click('coverage-based')\", \"click('fuzzing functions and APIs')\", \"click('probabilistic grammar fuzzing')\", \"click('fuzz configurations')\", \"submit('')\", \"click('fuzzing graphical user interfaces')\", \"ignore('Hanford et al, 1970')\", \"ignore('Use the notebook')\", \"ignore('Domato')\", \"click('probabilistic-based')\", \"ignore('Last change: 2022-11-12 08:04:04+01:00')\", \"click('Chapter introducing fuzzing')\", \"click('create an efficient grammar fuzzer')\", \"click('The Fuzzing Book')\", \"ignore('Purdom et al, 1972')\", \"check('a58a0898-625b-11ed-9297-6298cf1a5790', )\", \"ignore('copy')\", \"ignore('Imprint')\", \"ignore('inspect')\", \"click('basic fuzzing')\", 
\"click('Fuzzer')\", \"click('the GrammarFuzzer class')\", \"ignore('JSON specification')\", \"click('grammar toolbox')\", \"click('generator-based')\", \"click('Cite')\", \"click('use the code provided in this chapter')\", \"click('MutationFuzzer')\", \"ignore('Yang et al, 2011')\", \"ignore('bookutils')\", \"click('fuzzingbook.Grammars')\", \"ignore('random')\", \"ignore('Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License')\", \"ignore('re')\", \"click('later in this book')\", \"ignore('LangFuzz')\", \"ignore('Holler et al, 2012')\", \"check('a598b348-625b-11ed-9297-6298cf1a5790', )\", \"ignore('Le et al, 2014')\", \"ignore('MIT License')\", \"click('"Mutation-Based Fuzzing"')\", \"ignore('Wikipedia page on file formats')\", \"click('next chapter')\", \"click('chapter on coverage')\", \"click('mutation-based fuzzing')\", \"ignore('Chomsky et al, 1956')\", \"ignore('Dak\\xe1\\xb9\\xa3iputra P\\xc4\\x81\\xe1\\xb9\\x87ini, 350 BCE')\", \"ignore('')\", \"ignore('ast')\", \"check('a588f304-625b-11ed-9297-6298cf1a5790', )\", \"ignore('EMI Project')\", \"click('our chapter on coverage-based fuzzing')\", \"ignore('string')\", \"click('probabilities')\"})\n", "Run #2\n", "Action click('use mutations on existing inputs to get more valid inputs') -> \n", "Run #3\n", "Action click('The Fuzzing Book') -> \n", "In new state frozenset({\"click('discussed above')\", \"click('Sitemap')\", \"click('fuzzingbook.Fuzzer')\", \"ignore('os')\", \"check('e1cd81d4-6ff0-11ed-9dea-6298cf1a578e', )\", \"click('')\", \"ignore('typing')\", \"ignore('XKCD comic')\", \"click('chapter on information flow')\", \"click('A Fuzzing Architecture')\", \"click('About this book')\", \"click('reduce failing inputs for efficient debugging')\", \"submit('')\", \"ignore('tempfile')\", \"click('use grammars to specify the input format and thus get many more valid inputs')\", \"click('IV\\nSemantical Fuzzing')\", \"ignore('LLVM Address Sanitizer')\", \"ignore('Use the 
notebook')\", \"click('Intro_Testing')\", \"ignore('assignment')\", \"click('runtime verification')\", \"click('chapter on testing')\", \"click('The Fuzzing Book')\", \"click('V\\nDomain-Specific Fuzzing')\", \"ignore('Imprint')\", \"click('"Introduction to Software Testing"')\", \"click('Cite')\", \"click('II\\nLexical Fuzzing')\", \"click('use the code provided in this chapter')\", \"click('chapter on mining function specifications')\", \"ignore('bookutils')\", \"ignore('random')\", \"click('VI\\nManaging Fuzzing')\", \"ignore('Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License')\", \"ignore('Python tutorial')\", \"ignore('HeartBleed announcement page')\", \"ignore('red-black tree')\", \"ignore('MIT License')\", \"ignore('Last change: 2022-07-25 12:07:43+02:00')\", \"check('e1d4c994-6ff0-11ed-9dea-6298cf1a578e', )\", \"ignore('HeartBleed bug')\", \"click('Index (beta)')\", \"click('Introduction to Testing')\", \"ignore('')\", \"click('I\\nWhetting Your Appetite')\", \"ignore('MyPy')\", \"ignore('subprocess')\", \"ignore('Takanen et al, 2008')\", \"click('ExpectError')\", \"ignore('Miller et al, 1990')\", \"click('Appendices')\", \"click('III\\nSyntactical Fuzzing')\", \"click('use mutations on existing inputs to get more valid inputs')\"})\n", "Run #4\n", "Action click('Cite') -> \n", "Run #5\n", "Action click('runtime verification') -> \n" ] } ], "source": [ "book_fuzzer.explore_all(book_runner, max_actions=ACTIONS)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "After the first `ACTIONS` actions already, we can see that the finite state model is quite complex, with dozens of transitions still left to explore. Most of the yet unexplored states will eventually merge with existing states, yielding one state per chapter. Still, following _all_ links on _all_ pages will take quite some time." 
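, "\n", "\n", "To get a feel for how much work remains, one can count the states whose only expansion is `<unexplored>`. The following is a minimal sketch on a hand-written toy grammar. It assumes that a state grammar is a dictionary mapping state symbols to lists of expansions; `toy_grammar` and `unexplored_states()` are hypothetical names for illustration and not part of `GUIFuzzer`:\n", "\n", "```python\n", "# Toy state grammar (an assumption for illustration, not mined from a real site)\n", "UNEXPLORED = '<unexplored>'\n", "toy_grammar = {\n", "    '<start>': ['<state>'],\n", "    '<state>': ['<end>', \"click('Cite')\\n<state-1>\", \"click('Sitemap')\\n<state-2>\"],\n", "    '<state-1>': ['<end>'],\n", "    '<state-2>': [UNEXPLORED],\n", "    UNEXPLORED: [''],\n", "    '<end>': [''],\n", "}\n", "\n", "def unexplored_states(grammar):\n", "    '''Return all state symbols whose only expansion is UNEXPLORED.'''\n", "    return [symbol for symbol, expansions in grammar.items()\n", "            if expansions == [UNEXPLORED]]\n", "\n", "print(unexplored_states(toy_grammar))\n", "```\n", "\n", "On the toy grammar, only `<state-2>` is reported. Applied to a mined grammar of the same shape, the length of this list would indicate how many states are still waiting to be visited."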
] }, { "cell_type": "code", "execution_count": 157, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:52.618548Z", "iopub.status.busy": "2023-01-07T14:53:52.618142Z", "iopub.status.idle": "2023-01-07T14:53:52.915272Z", "shell.execute_reply": "2023-01-07T14:53:52.915526Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "start\n", "\n", "<start>\n", "\n", "\n", "\n", "state\n", "\n", "<state>\n", "\n", "\n", "\n", "start->state\n", "\n", "\n", "\n", "\n", "\n", "state-1\n", "\n", "<state-1>\n", "\n", "\n", "\n", "state->state-1\n", "\n", "\n", "click('discussed above')\n", "\n", "\n", "\n", "state-2\n", "\n", "<state-2>\n", "\n", "\n", "\n", "state->state-2\n", "\n", "\n", "click('fuzzingbook.Fuzzer')\n", "\n", "\n", "\n", "state-3\n", "\n", "<state-3>\n", "\n", "\n", "\n", "state->state-3\n", "\n", "\n", "click('')\n", "\n", "\n", "\n", "state-4\n", "\n", "<state-4>\n", "\n", "\n", "\n", "state->state-4\n", "\n", "\n", "click('chapter on information flow')\n", "\n", "\n", "\n", "state-5\n", "\n", "<state-5>\n", "\n", "\n", "\n", "state->state-5\n", "\n", "\n", "click('A Fuzzing Architecture')\n", "\n", "\n", "\n", "state-6\n", "\n", "<state-6>\n", "\n", "\n", "\n", "state->state-6\n", "\n", "\n", "click('reduce failing inputs for efficient debugging')\n", "\n", "\n", "\n", "state-7\n", "\n", "<state-7>\n", "\n", "\n", "\n", "state->state-7\n", "\n", "\n", "click('use grammars to specify the input format and thus get many more valid inputs')\n", "\n", "\n", "\n", "state-8\n", "\n", "<state-8>\n", "\n", "\n", "\n", "state->state-8\n", "\n", "\n", "click('Intro_Testing')\n", "\n", "\n", "\n", "state-9\n", "\n", "<state-9>\n", "\n", "\n", "\n", "state->state-9\n", "\n", "\n", "click('runtime verification')\n", "\n", "\n", "\n", "state-10\n", "\n", "<state-10>\n", "\n", "\n", "\n", "state->state-10\n", "\n", "\n", "click('chapter on testing')\n", "\n", 
"\n", "\n", "state-11\n", "\n", "<state-11>\n", "\n", "\n", "\n", "state->state-11\n", "\n", "\n", "click('The Fuzzing Book')\n", "\n", "\n", "\n", "state-12\n", "\n", "<state-12>\n", "\n", "\n", "\n", "state->state-12\n", "\n", "\n", "click('"Introduction to Software Testing"')\n", "\n", "\n", "\n", "state-13\n", "\n", "<state-13>\n", "\n", "\n", "\n", "state->state-13\n", "\n", "\n", "click('Cite')\n", "\n", "\n", "\n", "state-14\n", "\n", "<state-14>\n", "\n", "\n", "\n", "state->state-14\n", "\n", "\n", "click('use the code provided in this chapter')\n", "\n", "\n", "\n", "state-15\n", "\n", "<state-15>\n", "\n", "\n", "\n", "state->state-15\n", "\n", "\n", "click('chapter on mining function specifications')\n", "\n", "\n", "\n", "state-16\n", "\n", "<state-16>\n", "\n", "\n", "\n", "state->state-16\n", "\n", "\n", "click('Introduction to Testing')\n", "\n", "\n", "\n", "state-17\n", "\n", "<state-17>\n", "\n", "\n", "\n", "state->state-17\n", "\n", "\n", "click('ExpectError')\n", "\n", "\n", "\n", "state-18\n", "\n", "<state-18>\n", "\n", "\n", "\n", "state->state-18\n", "\n", "\n", "click('use mutations on existing inputs to get more valid inputs')\n", "\n", "\n", "\n", "state-19\n", "\n", "<state-19>\n", "\n", "\n", "\n", "state->state-19\n", "\n", "\n", "check('e1cd81d4-6ff0-11ed-9dea-6298cf1a578e', <boolean>)\n", "check('e1d4c994-6ff0-11ed-9dea-6298cf1a578e', <boolean>)\n", "submit('')\n", "\n", "\n", "\n", "end\n", "\n", "<end>\n", "\n", "\n", "\n", "state->end\n", "\n", "\n", "\n", "\n", "\n", "unexplored\n", "\n", "<unexplored>\n", "\n", "\n", "\n", "state-1->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-2->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-3->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-4->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-5->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-6->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-7->end\n", "\n", "\n", "\n", "\n", "\n", "state-20\n", "\n", 
"<state-20>\n", "\n", "\n", "\n", "state-7->state-20\n", "\n", "\n", "click('constraints')\n", "\n", "\n", "\n", "state-21\n", "\n", "<state-21>\n", "\n", "\n", "\n", "state-7->state-21\n", "\n", "\n", "click('')\n", "\n", "\n", "\n", "state-22\n", "\n", "<state-22>\n", "\n", "\n", "\n", "state-7->state-22\n", "\n", "\n", "click('coverage')\n", "\n", "\n", "\n", "state-23\n", "\n", "<state-23>\n", "\n", "\n", "\n", "state-7->state-23\n", "\n", "\n", "click('coverage-based')\n", "\n", "\n", "\n", "state-24\n", "\n", "<state-24>\n", "\n", "\n", "\n", "state-7->state-24\n", "\n", "\n", "click('fuzzing functions and APIs')\n", "\n", "\n", "\n", "state-25\n", "\n", "<state-25>\n", "\n", "\n", "\n", "state-7->state-25\n", "\n", "\n", "click('probabilistic grammar fuzzing')\n", "\n", "\n", "\n", "state-26\n", "\n", "<state-26>\n", "\n", "\n", "\n", "state-7->state-26\n", "\n", "\n", "click('fuzz configurations')\n", "\n", "\n", "\n", "state-27\n", "\n", "<state-27>\n", "\n", "\n", "\n", "state-7->state-27\n", "\n", "\n", "click('fuzzing graphical user interfaces')\n", "\n", "\n", "\n", "state-28\n", "\n", "<state-28>\n", "\n", "\n", "\n", "state-7->state-28\n", "\n", "\n", "click('probabilistic-based')\n", "\n", "\n", "\n", "state-29\n", "\n", "<state-29>\n", "\n", "\n", "\n", "state-7->state-29\n", "\n", "\n", "click('Chapter introducing fuzzing')\n", "\n", "\n", "\n", "state-30\n", "\n", "<state-30>\n", "\n", "\n", "\n", "state-7->state-30\n", "\n", "\n", "click('create an efficient grammar fuzzer')\n", "\n", "\n", "\n", "state-31\n", "\n", "<state-31>\n", "\n", "\n", "\n", "state-7->state-31\n", "\n", "\n", "click('The Fuzzing Book')\n", "\n", "\n", "\n", "state-32\n", "\n", "<state-32>\n", "\n", "\n", "\n", "state-7->state-32\n", "\n", "\n", "click('basic fuzzing')\n", "\n", "\n", "\n", "state-33\n", "\n", "<state-33>\n", "\n", "\n", "\n", "state-7->state-33\n", "\n", "\n", "click('Fuzzer')\n", "\n", "\n", "\n", "state-34\n", "\n", "<state-34>\n", "\n", "\n", "\n", 
"state-7->state-34\n", "\n", "\n", "click('the GrammarFuzzer class')\n", "\n", "\n", "\n", "state-35\n", "\n", "<state-35>\n", "\n", "\n", "\n", "state-7->state-35\n", "\n", "\n", "click('grammar toolbox')\n", "\n", "\n", "\n", "state-36\n", "\n", "<state-36>\n", "\n", "\n", "\n", "state-7->state-36\n", "\n", "\n", "click('generator-based')\n", "\n", "\n", "\n", "state-37\n", "\n", "<state-37>\n", "\n", "\n", "\n", "state-7->state-37\n", "\n", "\n", "click('Cite')\n", "\n", "\n", "\n", "state-38\n", "\n", "<state-38>\n", "\n", "\n", "\n", "state-7->state-38\n", "\n", "\n", "click('use the code provided in this chapter')\n", "\n", "\n", "\n", "state-39\n", "\n", "<state-39>\n", "\n", "\n", "\n", "state-7->state-39\n", "\n", "\n", "click('MutationFuzzer')\n", "\n", "\n", "\n", "state-40\n", "\n", "<state-40>\n", "\n", "\n", "\n", "state-7->state-40\n", "\n", "\n", "click('fuzzingbook.Grammars')\n", "\n", "\n", "\n", "state-41\n", "\n", "<state-41>\n", "\n", "\n", "\n", "state-7->state-41\n", "\n", "\n", "click('later in this book')\n", "\n", "\n", "\n", "state-42\n", "\n", "<state-42>\n", "\n", "\n", "\n", "state-7->state-42\n", "\n", "\n", "click('"Mutation-Based Fuzzing"')\n", "\n", "\n", "\n", "state-43\n", "\n", "<state-43>\n", "\n", "\n", "\n", "state-7->state-43\n", "\n", "\n", "click('next chapter')\n", "\n", "\n", "\n", "state-44\n", "\n", "<state-44>\n", "\n", "\n", "\n", "state-7->state-44\n", "\n", "\n", "click('chapter on coverage')\n", "\n", "\n", "\n", "state-45\n", "\n", "<state-45>\n", "\n", "\n", "\n", "state-7->state-45\n", "\n", "\n", "click('mutation-based fuzzing')\n", "\n", "\n", "\n", "state-46\n", "\n", "<state-46>\n", "\n", "\n", "\n", "state-7->state-46\n", "\n", "\n", "click('our chapter on coverage-based fuzzing')\n", "\n", "\n", "\n", "state-47\n", "\n", "<state-47>\n", "\n", "\n", "\n", "state-7->state-47\n", "\n", "\n", "click('probabilities')\n", "\n", "\n", "\n", "state-48\n", "\n", "<state-48>\n", "\n", "\n", "\n", 
"state-7->state-48\n", "\n", "\n", "check('a58a0898-625b-11ed-9297-6298cf1a5790', <boolean>)\n", "check('a598b348-625b-11ed-9297-6298cf1a5790', <boolean>)\n", "check('a588f304-625b-11ed-9297-6298cf1a5790', <boolean>)\n", "submit('')\n", "\n", "\n", "\n", "state-8->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-9->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-10->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-11->end\n", "\n", "\n", "\n", "\n", "\n", "state-49\n", "\n", "<state-49>\n", "\n", "\n", "\n", "state-11->state-49\n", "\n", "\n", "click('discussed above')\n", "\n", "\n", "\n", "state-50\n", "\n", "<state-50>\n", "\n", "\n", "\n", "state-11->state-50\n", "\n", "\n", "click('Sitemap')\n", "\n", "\n", "\n", "state-51\n", "\n", "<state-51>\n", "\n", "\n", "\n", "state-11->state-51\n", "\n", "\n", "click('fuzzingbook.Fuzzer')\n", "\n", "\n", "\n", "state-52\n", "\n", "<state-52>\n", "\n", "\n", "\n", "state-11->state-52\n", "\n", "\n", "click('')\n", "\n", "\n", "\n", "state-53\n", "\n", "<state-53>\n", "\n", "\n", "\n", "state-11->state-53\n", "\n", "\n", "click('chapter on information flow')\n", "\n", "\n", "\n", "state-54\n", "\n", "<state-54>\n", "\n", "\n", "\n", "state-11->state-54\n", "\n", "\n", "click('A Fuzzing Architecture')\n", "\n", "\n", "\n", "state-55\n", "\n", "<state-55>\n", "\n", "\n", "\n", "state-11->state-55\n", "\n", "\n", "click('About this book')\n", "\n", "\n", "\n", "state-56\n", "\n", "<state-56>\n", "\n", "\n", "\n", "state-11->state-56\n", "\n", "\n", "click('reduce failing inputs for efficient debugging')\n", "\n", "\n", "\n", "state-57\n", "\n", "<state-57>\n", "\n", "\n", "\n", "state-11->state-57\n", "\n", "\n", "click('use grammars to specify the input format and thus get many more valid inputs')\n", "\n", "\n", "\n", "state-58\n", "\n", "<state-58>\n", "\n", "\n", "\n", "state-11->state-58\n", "\n", "\n", "click('IV\n", "Semantical Fuzzing')\n", "\n", "\n", "\n", "state-59\n", "\n", "<state-59>\n", "\n", "\n", 
"\n", "state-11->state-59\n", "\n", "\n", "click('Intro_Testing')\n", "\n", "\n", "\n", "state-60\n", "\n", "<state-60>\n", "\n", "\n", "\n", "state-11->state-60\n", "\n", "\n", "click('runtime verification')\n", "\n", "\n", "\n", "state-61\n", "\n", "<state-61>\n", "\n", "\n", "\n", "state-11->state-61\n", "\n", "\n", "click('chapter on testing')\n", "\n", "\n", "\n", "state-62\n", "\n", "<state-62>\n", "\n", "\n", "\n", "state-11->state-62\n", "\n", "\n", "click('The Fuzzing Book')\n", "\n", "\n", "\n", "state-63\n", "\n", "<state-63>\n", "\n", "\n", "\n", "state-11->state-63\n", "\n", "\n", "click('V\n", "Domain-Specific Fuzzing')\n", "\n", "\n", "\n", "state-64\n", "\n", "<state-64>\n", "\n", "\n", "\n", "state-11->state-64\n", "\n", "\n", "click('"Introduction to Software Testing"')\n", "\n", "\n", "\n", "state-65\n", "\n", "<state-65>\n", "\n", "\n", "\n", "state-11->state-65\n", "\n", "\n", "click('Cite')\n", "\n", "\n", "\n", "state-66\n", "\n", "<state-66>\n", "\n", "\n", "\n", "state-11->state-66\n", "\n", "\n", "click('II\n", "Lexical Fuzzing')\n", "\n", "\n", "\n", "state-67\n", "\n", "<state-67>\n", "\n", "\n", "\n", "state-11->state-67\n", "\n", "\n", "click('use the code provided in this chapter')\n", "\n", "\n", "\n", "state-68\n", "\n", "<state-68>\n", "\n", "\n", "\n", "state-11->state-68\n", "\n", "\n", "click('chapter on mining function specifications')\n", "\n", "\n", "\n", "state-69\n", "\n", "<state-69>\n", "\n", "\n", "\n", "state-11->state-69\n", "\n", "\n", "click('VI\n", "Managing Fuzzing')\n", "\n", "\n", "\n", "state-70\n", "\n", "<state-70>\n", "\n", "\n", "\n", "state-11->state-70\n", "\n", "\n", "click('Index (beta)')\n", "\n", "\n", "\n", "state-71\n", "\n", "<state-71>\n", "\n", "\n", "\n", "state-11->state-71\n", "\n", "\n", "click('Introduction to Testing')\n", "\n", "\n", "\n", "state-72\n", "\n", "<state-72>\n", "\n", "\n", "\n", "state-11->state-72\n", "\n", "\n", "click('I\n", "Whetting Your Appetite')\n", "\n", "\n", "\n", 
"state-73\n", "\n", "<state-73>\n", "\n", "\n", "\n", "state-11->state-73\n", "\n", "\n", "click('ExpectError')\n", "\n", "\n", "\n", "state-74\n", "\n", "<state-74>\n", "\n", "\n", "\n", "state-11->state-74\n", "\n", "\n", "click('Appendices')\n", "\n", "\n", "\n", "state-75\n", "\n", "<state-75>\n", "\n", "\n", "\n", "state-11->state-75\n", "\n", "\n", "click('III\n", "Syntactical Fuzzing')\n", "\n", "\n", "\n", "state-76\n", "\n", "<state-76>\n", "\n", "\n", "\n", "state-11->state-76\n", "\n", "\n", "click('use mutations on existing inputs to get more valid inputs')\n", "\n", "\n", "\n", "state-77\n", "\n", "<state-77>\n", "\n", "\n", "\n", "state-11->state-77\n", "\n", "\n", "check('e1cd81d4-6ff0-11ed-9dea-6298cf1a578e', <boolean>)\n", "check('e1d4c994-6ff0-11ed-9dea-6298cf1a578e', <boolean>)\n", "submit('')\n", "\n", "\n", "\n", "state-12->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-13->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-14->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-15->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-16->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-17->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-18->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-19->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-20->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-21->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-22->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-23->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-24->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-25->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-26->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-27->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-28->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-29->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-30->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-31->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-32->unexplored\n", "\n", 
"\n", "\n", "\n", "\n", "state-33->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-34->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-35->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-36->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-37->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-38->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-39->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-40->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-41->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-42->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-43->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-44->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-45->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-46->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-47->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-48->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-49->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-50->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-51->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-52->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-53->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-54->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-55->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-56->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-57->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-58->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-59->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-60->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-61->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-62->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-63->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-64->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-65->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-66->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-67->unexplored\n", "\n", "\n", "\n", "\n", "\n", 
"state-68->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-69->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-70->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-71->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-72->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-73->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-74->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-75->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-76->unexplored\n", "\n", "\n", "\n", "\n", "\n", "state-77->unexplored\n", "\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# Inspect this graph in the notebook to see it in full glory\n", "fsm_diagram(book_fuzzer.grammar)" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We now have all the basic capabilities we need: We can automatically explore large websites; we can explore \"deep\" functionality by filling out forms; and we can have our coverage-based fuzzer automatically focus on yet unexplored states. Still, there is a lot more one can do; the [exercises](#Exercises) will give you some ideas." ] }, { "cell_type": "code", "execution_count": 158, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:52.918074Z", "iopub.status.busy": "2023-01-07T14:53:52.917725Z", "iopub.status.idle": "2023-01-07T14:53:53.303700Z", "shell.execute_reply": "2023-01-07T14:53:53.303963Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.quit()" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Synopsis\n", "\n", "This chapter demonstrates how to programmatically interact with user interfaces, using Selenium on Web browsers. 
It provides an experimental `GUICoverageFuzzer` class that automatically explores a user interface by systematically interacting with all available user interface elements." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The function `start_webdriver()` starts a headless Web browser in the background and returns a _GUI driver_ as a handle for further communication." ] }, { "cell_type": "code", "execution_count": 159, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:53.306485Z", "iopub.status.busy": "2023-01-07T14:53:53.306047Z", "iopub.status.idle": "2023-01-07T14:53:55.829354Z", "shell.execute_reply": "2023-01-07T14:53:55.829592Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver = start_webdriver()" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "We let the browser open the URL of the server we want to investigate (in this case, the vulnerable server from [the chapter on Web fuzzing](WebFuzzer.ipynb)) and obtain a screenshot." ] }, { "cell_type": "code", "execution_count": 160, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:55.832402Z", "iopub.status.busy": "2023-01-07T14:53:55.832008Z", "iopub.status.idle": "2023-01-07T14:53:55.911365Z", "shell.execute_reply": "2023-01-07T14:53:55.911578Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 160, "metadata": {}, "output_type": "execute_result" } ], "source": [ "gui_driver.get(httpd_url)\n", "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The `GUICoverageFuzzer` class explores the user interface and builds a _grammar_ that encodes all states as well as the user interactions required to move from one state to the next. 
It is paired with a `GUIRunner`, which interacts with the GUI driver." ] }, { "cell_type": "code", "execution_count": 161, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:55.913830Z", "iopub.status.busy": "2023-01-07T14:53:55.913519Z", "iopub.status.idle": "2023-01-07T14:53:56.152360Z", "shell.execute_reply": "2023-01-07T14:53:56.152570Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_fuzzer = GUICoverageFuzzer(gui_driver)" ] }, { "cell_type": "code", "execution_count": 162, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:56.154579Z", "iopub.status.busy": "2023-01-07T14:53:56.154272Z", "iopub.status.idle": "2023-01-07T14:53:56.155648Z", "shell.execute_reply": "2023-01-07T14:53:56.155861Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_runner = GUIRunner(gui_driver)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The `explore_all()` method extracts all states and all transitions from a Web user interface." ] }, { "cell_type": "code", "execution_count": 163, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:56.157926Z", "iopub.status.busy": "2023-01-07T14:53:56.157595Z", "iopub.status.idle": "2023-01-07T14:53:57.221839Z", "shell.execute_reply": "2023-01-07T14:53:57.222046Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_fuzzer.explore_all(gui_runner)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The grammar embeds a finite state automaton and is best visualized as such." 
] }, { "cell_type": "code", "execution_count": 164, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:57.227500Z", "iopub.status.busy": "2023-01-07T14:53:57.227141Z", "iopub.status.idle": "2023-01-07T14:53:57.473810Z", "shell.execute_reply": "2023-01-07T14:53:57.474066Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "start\n", "\n", "<start>\n", "\n", "\n", "\n", "state\n", "\n", "<state>\n", "\n", "\n", "\n", "start->state\n", "\n", "\n", "\n", "\n", "\n", "state-1\n", "\n", "<state-1>\n", "\n", "\n", "\n", "state->state-1\n", "\n", "\n", "click('terms and conditions')\n", "\n", "\n", "\n", "state-2\n", "\n", "<state-2>\n", "\n", "\n", "\n", "state->state-2\n", "\n", "\n", "fill('zip', '<number>')\n", "check('terms', <boolean>)\n", "fill('name', '<text>')\n", "fill('email', '<email>')\n", "fill('city', '<text>')\n", "submit('submit')\n", "\n", "\n", "\n", "end\n", "\n", "<end>\n", "\n", "\n", "\n", "state->end\n", "\n", "\n", "\n", "\n", "\n", "state-1->state\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n", "state-1->end\n", "\n", "\n", "\n", "\n", "\n", "state-2->state\n", "\n", "\n", "click('order form')\n", "\n", "\n", "\n", "state-2->end\n", "\n", "\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "fsm_diagram(gui_fuzzer.grammar)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "The GUI Fuzzer `fuzz()` method produces sequences of interactions that follow paths through the finite state machine. Since `GUICoverageFuzzer` is derived from `CoverageFuzzer` (see the [chapter on coverage-based grammar fuzzing](GrammarCoverageFuzzer.ipynb)), it automatically covers (a) as many transitions between states as well as (b) as many form elements as possible. 
In our case, the first set of actions explores the transition via the \"order form\" link; the second set then goes until the `<end>` state." ] }, { "cell_type": "code", "execution_count": 165, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:57.476846Z", "iopub.status.busy": "2023-01-07T14:53:57.476469Z", "iopub.status.idle": "2023-01-07T14:53:57.497382Z", "shell.execute_reply": "2023-01-07T14:53:57.497663Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "fill('zip', '1')\n", "check('terms', False)\n", "fill('name', 'Q')\n", "fill('email', 'K@i')\n", "fill('city', 'lGd')\n", "submit('submit')\n", "click('order form')\n", "click('terms and conditions')\n", "click('order form')\n", "fill('zip', '6')\n", "check('terms', True)\n", "fill('name', 'w')\n", "fill('email', 'S@q')\n", "fill('city', 'h')\n", "submit('submit')\n", "\n" ] } ], "source": [ "gui_driver.get(httpd_url)\n", "actions = gui_fuzzer.fuzz()\n", "print(actions)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "These actions can be fed into the GUI runner, which will execute them on the given GUI driver." 
] }, { "cell_type": "code", "execution_count": 166, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:57.499845Z", "iopub.status.busy": "2023-01-07T14:53:57.499472Z", "iopub.status.idle": "2023-01-07T14:53:57.967216Z", "shell.execute_reply": "2023-01-07T14:53:57.967431Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.get(httpd_url)\n", "result, outcome = gui_runner.run(actions)" ] }, { "cell_type": "code", "execution_count": 167, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:57.969354Z", "iopub.status.busy": "2023-01-07T14:53:57.968636Z", "iopub.status.idle": "2023-01-07T14:53:57.982762Z", "shell.execute_reply": "2023-01-07T14:53:57.983051Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "" ] }, "execution_count": 167, "metadata": {}, "output_type": "execute_result" } ], "source": [ "Image(gui_driver.get_screenshot_as_png())" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Further invocations of `fuzz()` cover more of the model – for instance, exploring the terms and conditions." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "fragment" } }, "source": [ "Internally, `GUIFuzzer` and `GUICoverageFuzzer` use a `GUIGrammarMiner` class, which implements the analysis of the GUI and all its states. Subclassing `GUIGrammarMiner` allows extending the interpretation of GUIs; the `GUIFuzzer` constructor accepts a miner via the `miner` keyword parameter." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "A tool like `GUICoverageFuzzer` will provide \"deep\" exploration of user interfaces, even filling out forms to explore what is behind them. 
Keep in mind, though, that `GUICoverageFuzzer` is experimental: It only supports a subset of HTML form and link features, and does not take JavaScript into account." ] }, { "cell_type": "code", "execution_count": 168, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:57.985223Z", "iopub.status.busy": "2023-01-07T14:53:57.984935Z", "iopub.status.idle": "2023-01-07T14:53:57.986079Z", "shell.execute_reply": "2023-01-07T14:53:57.986332Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "# ignore\n", "from ClassDiagram import display_class_hierarchy\n", "from Fuzzer import Fuzzer, Runner\n", "from Grammars import Grammar, Expansion\n", "from GrammarFuzzer import GrammarFuzzer, DerivationTree" ] }, { "cell_type": "code", "execution_count": 169, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:57.993394Z", "iopub.status.busy": "2023-01-07T14:53:57.992924Z", "iopub.status.idle": "2023-01-07T14:53:58.398620Z", "shell.execute_reply": "2023-01-07T14:53:58.398857Z" }, "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "image/svg+xml": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "GUIFuzzer\n", "\n", "\n", "GUIFuzzer\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "restart()\n", "\n", "\n", "\n", "run()\n", "\n", "\n", "\n", "fsm_last_state_symbol()\n", "\n", "\n", "\n", "fsm_path()\n", "\n", "\n", "\n", "set_grammar()\n", "\n", "\n", "\n", "update_existing_state()\n", "\n", "\n", "\n", "update_new_state()\n", "\n", "\n", "\n", "update_state()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "GrammarFuzzer\n", "\n", "\n", "GrammarFuzzer\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "fuzz()\n", "\n", "\n", "\n", "fuzz_tree()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "GUIFuzzer->GrammarFuzzer\n", "\n", "\n", "\n", "\n", "\n", "Fuzzer\n", "\n", "\n", "Fuzzer\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "fuzz()\n", 
"\n", "\n", "\n", "run()\n", "\n", "\n", "\n", "runs()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "GrammarFuzzer->Fuzzer\n", "\n", "\n", "\n", "\n", "\n", "GUICoverageFuzzer\n", "\n", "\n", "GUICoverageFuzzer\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "explore_all()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "GUICoverageFuzzer->GUIFuzzer\n", "\n", "\n", "\n", "\n", "\n", "GrammarCoverageFuzzer\n", "\n", "\n", "GrammarCoverageFuzzer\n", "\n", "\n", "\n", "\n", "\n", "GUICoverageFuzzer->GrammarCoverageFuzzer\n", "\n", "\n", "\n", "\n", "\n", "SimpleGrammarCoverageFuzzer\n", "\n", "\n", "SimpleGrammarCoverageFuzzer\n", "\n", "\n", "\n", "\n", "\n", "GrammarCoverageFuzzer->SimpleGrammarCoverageFuzzer\n", "\n", "\n", "\n", "\n", "\n", "TrackingGrammarCoverageFuzzer\n", "\n", "\n", "TrackingGrammarCoverageFuzzer\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "SimpleGrammarCoverageFuzzer->TrackingGrammarCoverageFuzzer\n", "\n", "\n", "\n", "\n", "\n", "TrackingGrammarCoverageFuzzer->GrammarFuzzer\n", "\n", "\n", "\n", "\n", "\n", "GUIRunner\n", "\n", "\n", "GUIRunner\n", "\n", "\n", "\n", "DELAY_AFTER_CHECK\n", "\n", "\n", "\n", "DELAY_AFTER_CLICK\n", "\n", "\n", "\n", "DELAY_AFTER_FILL\n", "\n", "\n", "\n", "DELAY_AFTER_SUBMIT\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "run()\n", "\n", "\n", "\n", "do_check()\n", "\n", "\n", "\n", "do_click()\n", "\n", "\n", "\n", "do_fill()\n", "\n", "\n", "\n", "do_submit()\n", "\n", "\n", "\n", "find_element()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "Runner\n", "\n", "\n", "Runner\n", "\n", "\n", "\n", "FAIL\n", "\n", "\n", "\n", "PASS\n", "\n", "\n", "\n", "UNRESOLVED\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "run()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "GUIRunner->Runner\n", "\n", "\n", "\n", "\n", "\n", "GUIGrammarMiner\n", "\n", 
"\n", "GUIGrammarMiner\n", "\n", "\n", "\n", "FINAL_STATE\n", "\n", "\n", "\n", "GUI_GRAMMAR\n", "\n", "\n", "\n", "START_STATE\n", "\n", "\n", "\n", "UNEXPLORED_STATE\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "__init__()\n", "\n", "\n", "\n", "follow_link()\n", "\n", "\n", "\n", "mine_a_element_actions()\n", "\n", "\n", "\n", "mine_button_element_actions()\n", "\n", "\n", "\n", "mine_input_element_actions()\n", "\n", "\n", "\n", "mine_state_actions()\n", "\n", "\n", "\n", "mine_state_grammar()\n", "\n", "\n", "\n", "new_state_symbol()\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "Legend\n", "Legend\n", "• \n", "public_method()\n", "• \n", "private_method()\n", "• \n", "overloaded_method()\n", "Hover over names to see doc\n", "\n", "\n", "\n" ], "text/plain": [ "" ] }, "execution_count": 169, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# ignore\n", "display_class_hierarchy([GUIFuzzer, GUICoverageFuzzer,\n", " GUIRunner, GUIGrammarMiner],\n", " public_methods=[\n", " Fuzzer.__init__,\n", " Fuzzer.fuzz,\n", " Fuzzer.run,\n", " Fuzzer.runs,\n", " Runner.__init__,\n", " Runner.run,\n", " GUIRunner.__init__,\n", " GUIRunner.run,\n", " GrammarFuzzer.__init__,\n", " GrammarFuzzer.fuzz,\n", " GrammarFuzzer.fuzz_tree,\n", " GUIFuzzer.__init__,\n", " GUIFuzzer.restart,\n", " GUIFuzzer.run,\n", " GUIGrammarMiner.__init__,\n", " GrammarCoverageFuzzer.__init__,\n", " GUICoverageFuzzer.__init__,\n", " GUICoverageFuzzer.explore_all,\n", " ],\n", " types={\n", " 'DerivationTree': DerivationTree,\n", " 'Expansion': Expansion,\n", " 'Grammar': Grammar\n", " },\n", " project='fuzzingbook')" ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": true, "run_control": { "read_only": false }, "slideshow": { "slide_type": "slide" } }, "source": [ "## Lessons Learned\n", "\n", "* _Selenium_ is a powerful framework for interacting with user interfaces, especially Web-based user interfaces.\n", "* A _finite state model_ can encode 
user interface states and transitions.\n", "* Encoding user interface models into a _grammar_ integrates generating text (for forms) and generating user interactions (for navigating).\n", "* To systematically explore a user interface, cover all _state transitions_, which is equivalent to covering all _expansion alternatives_ in the corresponding grammar." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "We are done, so we clean up. We shut down our Web server, quit the Web driver (and the associated browser), and finally clean up temporary files left by Selenium." ] }, { "cell_type": "code", "execution_count": 170, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:58.401222Z", "iopub.status.busy": "2023-01-07T14:53:58.400903Z", "iopub.status.idle": "2023-01-07T14:53:58.402232Z", "shell.execute_reply": "2023-01-07T14:53:58.402475Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "httpd_process.terminate()" ] }, { "cell_type": "code", "execution_count": 171, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:58.404420Z", "iopub.status.busy": "2023-01-07T14:53:58.403604Z", "iopub.status.idle": "2023-01-07T14:53:58.779860Z", "shell.execute_reply": "2023-01-07T14:53:58.780101Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "gui_driver.quit()" ] }, { "cell_type": "code", "execution_count": 172, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:58.782267Z", "iopub.status.busy": "2023-01-07T14:53:58.781919Z", "iopub.status.idle": "2023-01-07T14:53:58.783180Z", "shell.execute_reply": "2023-01-07T14:53:58.783400Z" }, "slideshow": { "slide_type": "skip" } }, "outputs": [], "source": [ "import os" ] }, { "cell_type": "code", "execution_count": 173, "metadata": { "execution": { "iopub.execute_input": "2023-01-07T14:53:58.785630Z", "iopub.status.busy": "2023-01-07T14:53:58.785228Z", "iopub.status.idle": 
"2023-01-07T14:53:58.786854Z", "shell.execute_reply": "2023-01-07T14:53:58.787075Z" }, "slideshow": { "slide_type": "fragment" } }, "outputs": [], "source": [ "for temp_file in [ORDERS_DB, \"geckodriver.log\", \"ghostdriver.log\"]:\n", "    if os.path.exists(temp_file):\n", "        os.remove(temp_file)" ] }, { "cell_type": "markdown", "metadata": { "button": false, "new_sheet": false, "run_control": { "read_only": false }, "slideshow": { "slide_type": "slide" } }, "source": [ "## Next Steps\n", "\n", "From here, you can learn how to\n", "\n", "* [fuzz in the large](FuzzingInTheLarge.ipynb), running a myriad of fuzzers on the same system." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Background\n", "\n", "Automatic testing of graphical user interfaces is a rich field – in research as well as in practice.\n", "\n", "Coverage criteria for GUIs as well as how to achieve them were first discussed in \\cite{Memon2001}. Memon also introduced the concept of *GUI Ripping* \\cite{Memon2003} – the process in which the software's GUI is automatically traversed by interacting with all its user interface elements.\n", "\n", "The CrawlJax tool \\cite{Mesbah2012} uses dynamic state changes in Web user interfaces to identify candidate elements to interact with. Like our approach above, it uses the set of interactable user interface elements as a state in a finite-state model.\n", "\n", "The [Alex framework](https://learnlib.github.io/alex/) uses a similar approach to learn automata for web applications. Starting from a set of test inputs, it produces a mixed-mode behavioral model of the application." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "## Exercises\n", "\n", "As powerful as our GUI fuzzer is at this point, there are still several possibilities left for further optimization and extension. Here are some ideas to get you started. Enjoy user interface fuzzing!" 
] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 1: Stay in Local State\n", "\n", "Rather than having each `run()` start at the very beginning, have the miner start from the current state and explore states reachable from there." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 2: Going Back\n", "\n", "Make use of the web driver `back()` method and go back to an earlier state, from which we could again start exploration. (Note that a \"back\" functionality may not be available on non-Web user interfaces.)" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 3: Avoiding Bad Form Values\n", "\n", "Detect that some form values are _invalid_, such that the miner does not produce them again." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 4: Saving Form Values\n", "\n", "Save _successful_ form values, such that the tester does not have to infer them again and again." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 5: Same Names, Same States\n", "\n", "When the miner finds a link with a name it has already seen, it is likely to lead to a state already seen, too; therefore, one could give its exploration a lower priority." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 6: Combinatorial Coverage\n", "\n", "Extend the grammar miner such that for every boolean value, there is a separate value to be covered." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 7: Implicit Delays\n", "\n", "Rather than using _explicit_ (given) delays, use _implicit_ delays and wait for specific elements to appear. 
These elements could stem from previous explorations of the state." ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 8: Oracles\n", "\n", "Extend the grammar miner such that it also produces _oracles_ – for instance, checking for the presence of specific UI elements." ] }, { "attachments": {}, "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "### Exercise 9: More UI Elements\n", "\n", "Run the miner on a website of your choice. Find out which other types of user interface elements and actions need to be supported." ] } ], "metadata": { "ipub": { "bibliography": "fuzzingbook.bib", "toc": true }, "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.10.2" }, "toc": { "base_numbering": 1, "nav_menu": {}, "number_sections": true, "sideBar": true, "skip_h1_title": true, "title_cell": "", "title_sidebar": "Contents", "toc_cell": false, "toc_position": {}, "toc_section_display": true, "toc_window_display": true }, "toc-autonumbering": false, "vscode": { "interpreter": { "hash": "4185989cf89c47c310c2629adcadd634093b57a2c49dffb5ae8d0d14fa302f2b" } } }, "nbformat": 4, "nbformat_minor": 4 }