{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Introduction\n",
    "In [Carola Lilienthal](https://twitter.com/cairolali)'s talk about architecture and technical debt at [Herbstcampus 2017](https://www.herbstcampus.de/veranstaltung-5926-keynote-i%3A-architektur-f%C3%BCr-unser-gehirn.html?id=5926), I was reminded that I wanted to implement some of the examples of her book \"[Long-lived software systems](http://www.llsa.de/)\" (available only in German) with the structural analysis tool [jQAssistant](https://jqassistant.org/). Especially the visualizations of the dependencies between different business subdomains seemed like a great starting point to try out some stuff. In Carola's book, there are visualization like the following:\n",
    "\n",
    "![](resources/analyze_business_domain_dependencies.png)\n",
    "\n",
    "This visualization was created by using [SotoGraph](https://www.hello2morrow.com/products/sotograph), a tool for software architecture validation. It shows the user-defined subdomains as well as the relationships between them. The green connections between the modules show the downward dependencies to other modules and the red one the upward dependencies. This visualization can help if you want to further modularize your system towards microservices or to identify unwanted dependencies between modules.\n",
    "\n",
    "After Carola's talk (aka during the rest of the conference's morning), I immediately created the [Java Type Dependency Analysis](https://www.feststelltaste.de/java-type-dependency-analysis/) with visualizations in [D3](https://d3js.org/). During coding, I realized that there it is only a small step to analyze dependencies between business domains. \n",
    "\n",
    "What's missing is the information which type belong to which business subdomain. We'll find out now!"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# A simple case study\n",
    "Once, I've developed a party planning application called DropOver (that didn't go live, but that's another story) within a small team. We wrote that web application in Java and paid especially attention to structuring the code along the business' subdomain \"partying\". This led to this package structure that resembles the main parts of the application:\n",
    "\n",
    "![](./resources/dropover_package_structure.png)\n",
    "\n",
    "The application's main entry point is a `site` for a party including location, time, the site's `creator` and so on. A user can `comment` on a site as well as add some specific widgets like `todo` lists, `scheduling` or `files` upload and also gets notified by the `mail` feature. And there is a special package `framework` where all the cross-cutting concerns are placed like the dependency injection configuration or common, technical software elements for persisting data.\n",
    "\n",
    "The main point to take away here is that it's easy to determine the business subdomain for a software element thanks to the alignment of the package structure along the business' subdomain: It's the 3rd position in the Java package name: \n",
    "\n",
    "`at.dropover.`**`<subdomain>`**`.`\n",
    "\n",
    "This information can easily be used to retrieve the information about the subdomain."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Software from a graph's perspective\n",
    "I've built the web application, scanned the software artifact (a standard JAR file that we created for integration testing purposes) with the jQAssistant command line tool (with `jqassistant.sh scan -f dropover-classes.jar` in this case) and started the server (with `jqassistant.sh server`). Taking a look at the accompanied Neo4j browser, we can see the graph that jQAssistant stored in Neo4j. E. g. we can display the relationship between the JAR file and the contained Java types:\n",
    "\n",
    "![](./resources/analyze_business_domain_dependencies_dropover_bundle_graph.png)\n",
    "\n",
    "In the following, I set up the connection between my Python glue code and the Neo4j database. The query executed lists simply all Java types of the application (aka the JAR artifact). As mentioned above, we can also get the information about the subdomain derived from the package name:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style>\n",
       "    .dataframe thead tr:only-child th {\n",
       "        text-align: right;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: left;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>subdomain</th>\n",
       "      <th>type</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>scheduling</td>\n",
       "      <td>at.dropover.scheduling.interactor.GetSchedulings</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>scheduling</td>\n",
       "      <td>at.dropover.scheduling.interactor.validation.S...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>site</td>\n",
       "      <td>at.dropover.site.entity.Site</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>files</td>\n",
       "      <td>at.dropover.files.boundary.UploadFileRequestModel</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>scheduling</td>\n",
       "      <td>at.dropover.scheduling.entity.gateway.inmemory...</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "    subdomain                                               type\n",
       "0  scheduling   at.dropover.scheduling.interactor.GetSchedulings\n",
       "1  scheduling  at.dropover.scheduling.interactor.validation.S...\n",
       "2        site                       at.dropover.site.entity.Site\n",
       "3       files  at.dropover.files.boundary.UploadFileRequestModel\n",
       "4  scheduling  at.dropover.scheduling.entity.gateway.inmemory..."
      ]
     },
     "execution_count": 1,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "import py2neo\n",
    "import pandas as pd\n",
    "\n",
    "query=\"\"\"\n",
    "MATCH\n",
    "    (:Jar:Archive)-[:CONTAINS]->(type:Type)\n",
    "RETURN\n",
    "    type.fqn AS type, SPLIT(type.fqn, \".\")[2] AS subdomain\n",
    "\"\"\"\n",
    "\n",
    "graph = py2neo.Graph()\n",
    "subdomaininfo = pd.DataFrame(graph.run(query).data())\n",
    "subdomaininfo.head()"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Gathering dependencies between subdomains\n",
    "The request returns the corresponding subdomain for each type. It collects all direct dependencies from one type to all other types and returns only the 3rd position of the package name (`fqn`: full qualified name) of the type and the direct dependency &ndash; that means the subdomains of each.\n",
    "\n",
    "Combined with the approach in [Java Type Dependency Analysis](https://www.feststelltaste.de/java-type-dependency-analysis/), we can now visualize the dependencies between the various subdomains."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[{'imports': ['comment', 'framework', 'creator', 'site'], 'name': 'comment'},\n",
       " {'imports': ['site', 'mail', 'framework', 'creator'], 'name': 'mail'}]"
      ]
     },
     "execution_count": 2,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "import json\n",
    "\n",
    "query=\"\"\"\n",
    "MATCH^b^b\n",
    "    (:Jar:Archive)-[:CONTAINS]->\n",
    "    (type:Type)-[:DEPENDS_ON]->(directDependency:Type)\n",
    "    <-[:CONTAINS]-(:Jar:Archive)\n",
    "RETURN \n",
    "    SPLIT(type.fqn, \".\")[2] AS name, \n",
    "    COLLECT(DISTINCT SPLIT(directDependency.fqn, \".\")[2]) AS imports\n",
    "\"\"\"\n",
    "\n",
    "graph = py2neo.Graph()\n",
    "json_data = graph.run(query).data()\n",
    "\n",
    "with open ( \"vis/flare-imports.json\", mode='w') as json_file:\n",
    "    json_file.write(json.dumps(json_data, indent=3))\n",
    "\n",
    "json_data[:2]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Discussion\n",
    "In the output (that you can find [here](https://www.feststelltaste.de/wp-content/uploads/demos/business_subdomains_dep/bundle_dotted_imports.html) as well), we can see the dependencies between the various subdomains by hovering / touching one of the subdomains' text labels.\n",
    "\n",
    "![](./resources/analyze_business_domain_dependencies_dropover_bundle_creator.png)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "From the visualization above, we can see that the `creator` subdomain is used by Java source code from the subdomains `comment`, `site`, `scheduling`, `mail` and `framework`. The first four make perfect sense because if you create one of those content types in the application, they are created by some person (they are \"personalized\" content). Whereas `todo` and `files` are user agnostic content types and thus don't have any dependencies on `creator` (well, that's a tricky situation in retrospect; wouldn't do that now). \n",
    "\n",
    "What could look like a mess are the dependencies from and to `framework`. In the pseudo subdomain `framework`, there are some base classes for all the data objects that are persistent in a data store. That explains the outbound dependency of `creator`. The dependencies from `framework` to `creator` are needed for the central dependency injection configuration of the application. So this seems reasonable as well but also can be improved by further dividing the big `framework` package to smaller, more specific ones.\n",
    "\n",
    "Where it get's interesting is the following visualization of the dependencies of the subdomain `site`:\n",
    "\n",
    "![](./resources/analyze_business_domain_dependencies_dropover_bundle_site.png)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "A `site` is the central point of the application. Almost everything else depends on it (except `mail`, which is implemented in an interceptor style / aspect). But `site` is also depending on some of its content. This could be a sign of a design flaw and should be investigated further."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Summary\n",
    "In this blog post, we've learned how you can visualize dependencies between subdomains with jQAssistant / Neo4j. It's an easy way to quickly get an overview over the subdomain's relationships.\n",
    "\n",
    "In the case study, we got lucky because we were able to use the package naming conventions for retrieving the subdomain information. In another post, I'll show how you can get that information for classes that don't follow such a naming convention by using some Python'n'Pandas glue code.\n",
    "\n",
    "Albeit the nice visualization, my recommendation is not to invest too much time into analyzing unwanted dependencies graphically. Tools like jQAssistant provide the means to check for unwanted dependencies automatically during your build process. With this feature, you can let fail the build in case of violations and track the untangling progress of your system, too."
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.6.4"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}