{
 "cells": [
  {
   "cell_type": "raw",
   "metadata": {
    "raw_mimetype": "text/restructuredtext"
   },
   "source": [
    " .. _libcst-scope-tutorial:\n",
    "\n",
    "==============\n",
    "Scope Analysis\n",
    "==============\n",
    "Scope analysis keeps track of assignments and accesses which could be useful for code automatic refactoring. If you're not familiar with scope analysis, see :ref:`Scope Metadata <libcst-scope-metadata>` for more detail about scope metadata. This tutorial demonstrates some use cases of scope analysis. If you're new to metadata, see :doc:`Metadata Tutorial <metadata_tutorial>` to get started.\n",
    "Given source codes, scope analysis parses all variable :class:`~libcst.metadata.Assignment` (or a :class:`~libcst.metadata.BuiltinAssignment` if it's a builtin) and :class:`~libcst.metadata.Access` to store in :class:`~libcst.metadata.Scope` containers.\n",
    "\n",
    ".. note::\n",
    "   The scope analysis only handles local variable name access and cannot handle simple string type annotation forward references. See :class:`~libcst.metadata.Access`\n",
    "\n",
    "Given the following example source code contains a couple of unused imports (``f``, ``i``, ``m`` and ``n``) and undefined variable references (``func_undefined`` and ``var_undefined``). Scope analysis helps us identifying those unused imports and undefined variables to automatically provide warnings to developers to prevent bugs while they're developing.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "nbsphinx": "hidden"
   },
   "outputs": [],
   "source": [
    "import sys\n",
    "sys.path.append(\"../../\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "source = \"\"\"\\\n",
    "import a, b, c as d, e as f  # expect to keep: a, c as d\n",
    "from g import h, i, j as k, l as m  # expect to keep: h, j as k\n",
    "from n import o  # expect to be removed entirely\n",
    "\n",
    "a()\n",
    "\n",
    "def fun():\n",
    "    d()\n",
    "\n",
    "class Cls:\n",
    "    att = h.something\n",
    "    \n",
    "    def __new__(self) -> \"Cls\":\n",
    "        var = k.method()\n",
    "        func_undefined(var_undefined)\n",
    "\"\"\""
   ]
  },
  {
   "cell_type": "raw",
   "metadata": {
    "raw_mimetype": "text/restructuredtext"
   },
   "source": [
    "With a parsed :class:`~libcst.Module`, we construct a :class:`~libcst.metadata.MetadataWrapper` object and it provides a :func:`~libcst.metadata.MetadataWrapper.resolve` function to resolve metadata given a metadata provider.\n",
    ":class:`~libcst.metadata.ScopeProvider` is used here for analysing scope and there are three types of scopes (:class:`~libcst.metadata.GlobalScope`, :class:`~libcst.metadata.FunctionScope` and :class:`~libcst.metadata.ClassScope`) in this example.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "import libcst as cst\n",
    "\n",
    "\n",
    "wrapper = cst.metadata.MetadataWrapper(cst.parse_module(source))\n",
    "scopes = set(wrapper.resolve(cst.metadata.ScopeProvider).values())\n",
    "for scope in scopes:\n",
    "    print(scope)"
   ]
  },
  {
   "cell_type": "raw",
   "metadata": {
    "raw_mimetype": "text/restructuredtext"
   },
   "source": [
    "Warn on unused imports and undefined references\n",
    "===============================================\n",
    "To find all unused imports, we iterate through :attr:`~libcst.metadata.Scope.assignments` and an assignment is unused when its :attr:`~libcst.metadata.BaseAssignment.references` is empty. To find all undefined references, we iterate through :attr:`~libcst.metadata.Scope.accesses` (we focus on :class:`~libcst.Import`/:class:`~libcst.ImportFrom` assignments) and an access is undefined reference when its :attr:`~libcst.metadata.Access.referents` is empty. When reporting the warning to developer, we'll want to report the line number and column offset along with the suggestion to make it more clear. We can get position information from :class:`~libcst.metadata.PositionProvider` and print the warnings as follows.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "from collections import defaultdict\n",
    "from typing import Dict, Union, Set\n",
    "\n",
    "unused_imports: Dict[Union[cst.Import, cst.ImportFrom], Set[str]] = defaultdict(set)\n",
    "undefined_references: Dict[cst.CSTNode, Set[str]] = defaultdict(set)\n",
    "ranges = wrapper.resolve(cst.metadata.PositionProvider)\n",
    "for scope in scopes:\n",
    "    for assignment in scope.assignments:\n",
    "        node = assignment.node\n",
    "        if isinstance(assignment, cst.metadata.Assignment) and isinstance(\n",
    "            node, (cst.Import, cst.ImportFrom)\n",
    "        ):\n",
    "            if len(assignment.references) == 0:\n",
    "                unused_imports[node].add(assignment.name)\n",
    "                location = ranges[node].start\n",
    "                print(\n",
    "                    f\"Warning on line {location.line:2d}, column {location.column:2d}: Imported name `{assignment.name}` is unused.\"\n",
    "                )\n",
    "\n",
    "    for access in scope.accesses:\n",
    "        if len(access.referents) == 0:\n",
    "            node = access.node\n",
    "            location = ranges[node].start\n",
    "            print(\n",
    "                f\"Warning on line {location.line:2d}, column {location.column:2d}: Name reference `{node.value}` is not defined.\"\n",
    "            )\n"
   ]
  },
  {
   "cell_type": "raw",
   "metadata": {
    "raw_mimetype": "text/restructuredtext"
   },
   "source": [
    "Automatically Remove Unused Import\n",
    "==================================\n",
    "Unused import is a commmon code suggestion provided by lint tool like `flake8 F401 <https://lintlyci.github.io/Flake8Rules/rules/F401.html>`_ ``imported but unused``.\n",
    "Even though reporting unused import is already useful, with LibCST we can provide automatic fix to remove unused import. That can make the suggestion more actionable and save developer's time.\n",
    "\n",
    "An import statement may import multiple names, we want to remove those unused names from the import statement. If all the names in the import statement are not used, we remove the entire import.\n",
    "To remove the unused name, we implement ``RemoveUnusedImportTransformer`` by subclassing :class:`~libcst.CSTTransformer`. We overwrite ``leave_Import`` and ``leave_ImportFrom`` to modify the import statements.\n",
    "When we find the import node in lookup table, we iterate through all ``names`` and keep used names in ``names_to_keep``.\n",
    "If ``names_to_keep`` is empty, all names are unused and we remove the entire import node.\n",
    "Otherwise, we update the import node and just removing partial names."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "class RemoveUnusedImportTransformer(cst.CSTTransformer):\n",
    "    def __init__(\n",
    "        self, unused_imports: Dict[Union[cst.Import, cst.ImportFrom], Set[str]]\n",
    "    ) -> None:\n",
    "        self.unused_imports = unused_imports\n",
    "\n",
    "    def leave_import_alike(\n",
    "        self,\n",
    "        original_node: Union[cst.Import, cst.ImportFrom],\n",
    "        updated_node: Union[cst.Import, cst.ImportFrom],\n",
    "    ) -> Union[cst.Import, cst.ImportFrom, cst.RemovalSentinel]:\n",
    "        if original_node not in self.unused_imports:\n",
    "            return updated_node\n",
    "        names_to_keep = []\n",
    "        for name in updated_node.names:\n",
    "            asname = name.asname\n",
    "            if asname is not None:\n",
    "                name_value = asname.name.value\n",
    "            else:\n",
    "                name_value = name.name.value\n",
    "            if name_value not in self.unused_imports[original_node]:\n",
    "                names_to_keep.append(name.with_changes(comma=cst.MaybeSentinel.DEFAULT))\n",
    "        if len(names_to_keep) == 0:\n",
    "            return cst.RemoveFromParent()\n",
    "        else:\n",
    "            return updated_node.with_changes(names=names_to_keep)\n",
    "\n",
    "    def leave_Import(\n",
    "        self, original_node: cst.Import, updated_node: cst.Import\n",
    "    ) -> cst.Import:\n",
    "        return self.leave_import_alike(original_node, updated_node)\n",
    "\n",
    "    def leave_ImportFrom(\n",
    "        self, original_node: cst.ImportFrom, updated_node: cst.ImportFrom\n",
    "    ) -> cst.ImportFrom:\n",
    "        return self.leave_import_alike(original_node, updated_node)\n"
   ]
  },
  {
   "cell_type": "raw",
   "metadata": {
    "raw_mimetype": "text/restructuredtext"
   },
   "source": [
    "After the transform, we use ``.code`` to generate fixed code and all unused names are fixed as expected! The difflib is used to show only changed part and only import lines are updated as expected."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "import difflib\n",
    "fixed_module = wrapper.module.visit(RemoveUnusedImportTransformer(unused_imports))\n",
    "\n",
    "# Use difflib to show the changes to verify unused imports are removed as expected.\n",
    "print(\n",
    "    \"\".join(\n",
    "        difflib.unified_diff(source.splitlines(1), fixed_module.code.splitlines(1))\n",
    "    )\n",
    ")"
   ]
  }
 ],
 "metadata": {
  "celltoolbar": "Raw Cell Format",
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.3"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}