{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "1cf27c95-0b45-4d97-a62d-9950654eb386",
   "metadata": {},
   "source": [
    "# Some corpus statistics (Nestle1904LFT)\n",
    "\n",
    "**Work in progress!**"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "1495a021-daa1-4c2e-80d5-ab7d2d75bc3f",
   "metadata": {
    "jp-MarkdownHeadingCollapsed": true,
    "tags": []
   },
   "source": [
    "## Table of content <a class=\"anchor\" id=\"TOC\"></a>\n",
    "* <a href=\"#bullet1\">1 - Introduction</a>\n",
    "* <a href=\"#bullet2\">2 - Load Text-Fabric app and data</a>\n",
    "* <a href=\"#bullet3\">3 - Performing the queries</a>\n",
    "    * <a href=\"#bullet3x1\">3.1 - The 25 most frequent words in the corpus</a>\n",
    "    * <a href=\"#bullet3x2\">3.2 - Frequency of characters in corpus</a>\n",
    "    * <a href=\"#bullet3x3\">3.3 - Some stats on node types</a>    \n",
    "    * <a href=\"#bullet3x4\">3.4 - The available text formats</a>    \n",
    "    * <a href=\"#bullet3x5\">3.5 - List of feature frequencies</a> \n",
    "    * <a href=\"#bullet3x6\">3.6 - Frequency list of punctuations</a>\n",
    "    * <a href=\"#bullet3x7\">3.7 - Node number ranges</a>\n",
    "    * <a href=\"#bullet3x8\">3.8 - Count the objects per type</a>\n",
    "    * <a href=\"#bullet3x9\">3.9 - Obtain meta data for a feature</a>"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e6830070-1e97-4bdf-aa0c-5eda4e624a84",
   "metadata": {},
   "source": [
    "# 1 - Introduction <a class=\"anchor\" id=\"bullet1\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "This Jupyter Notebook showcases several examples of statistical analysis performed on a Text-Fabric corpus. For demonstration purposes various methods of collecting and presenting the data are employed. "
   ]
  },
  {
   "cell_type": "markdown",
   "id": "a1b900e2-995f-4f36-ad74-d821092ca02c",
   "metadata": {},
   "source": [
    "# 2 - Load Text-Fabric app and data <a class=\"anchor\" id=\"bullet2\"></a>\n",
    "##### [Back to TOC](#TOC)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "id": "6bd6c621-361d-487f-a8df-c27fb1ec9de2",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "%load_ext autoreload\n",
    "%autoreload 2"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "id": "0071a0db-916c-4357-88bd-6b3255af0764",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Loading the Text-Fabric code\n",
    "# Note: it is assumed Text-Fabric is installed in your environment\n",
    "from tf.fabric import Fabric\n",
    "from tf.app import use"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "id": "ed76db5d-5463-4bf1-99ca-7f14b3a0f277",
   "metadata": {
    "scrolled": true,
    "tags": []
   },
   "outputs": [
    {
     "data": {
      "text/markdown": [
       "**Locating corpus resources ...**"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/html": [
       "<b title=\"local release\">app:</b> <span title=\"rv0.6=#e68bd68c7c4c862c1464d995d51e27db7691254f offline under C:/Users/tonyj/text-fabric-data/github\">~/text-fabric-data/github/tonyjurg/Nestle1904LFT/app</span>"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/html": [
       "<b title=\"local release\">data:</b> <span title=\"rv0.6=#e68bd68c7c4c862c1464d995d51e27db7691254f offline under C:/Users/tonyj/text-fabric-data/github\">~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6</span>"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/html": [
       "\n",
       "            <b>TF:</b> <a target=\"_blank\" href=\"https://annotation.github.io/text-fabric/tf/cheatsheet.html\" title=\"text-fabric api\">TF API 12.2.2</a>, <a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/master/app\" title=\"tonyjurg/Nestle1904LFT app\">tonyjurg/Nestle1904LFT/app  v3</a>, <a target=\"_blank\" href=\"https://annotation.github.io/text-fabric/tf/about/searchusage.html\" title=\"Search Templates Introduction and Reference\">Search Reference</a><br>\n",
       "            <b>Data:</b> <a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs//about.md\" title=\"provenance of Nestle 1904 (Low Fat Tree)\">tonyjurg - Nestle1904LFT 0.6</a>, <a target=\"_blank\" href=\"https://annotation.github.io/text-fabric/tf/writing/greek.html\" title=\"How TF features represent text\">Character table</a>, <a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/home.md\" title=\"tonyjurg - Nestle1904LFT feature documentation\">Feature docs</a><br>\n",
       "            <details class=\"nodeinfo\"><summary><b>Node types</b></summary>\n",
       "<table class=\"nodeinfo\">\n",
       "    <tr>\n",
       "        <th>Name</th>\n",
       "        <th># of nodes</th>\n",
       "        <th># slots / node</th>\n",
       "        <th>% coverage</th>\n",
       "    </tr>\n",
       "\n",
       "<tr>\n",
       "    <th>book</th>\n",
       "    <td>27</td>\n",
       "    <td>5102.93</td>\n",
       "    <td><b>100</b></td>\n",
       "</tr>\n",
       "\n",
       "<tr>\n",
       "    <th>chapter</th>\n",
       "    <td>260</td>\n",
       "    <td>529.92</td>\n",
       "    <td><b>100</b></td>\n",
       "</tr>\n",
       "\n",
       "<tr>\n",
       "    <th>verse</th>\n",
       "    <td>7943</td>\n",
       "    <td>17.35</td>\n",
       "    <td><b>100</b></td>\n",
       "</tr>\n",
       "\n",
       "<tr>\n",
       "    <th>sentence</th>\n",
       "    <td>8011</td>\n",
       "    <td>17.20</td>\n",
       "    <td><b>100</b></td>\n",
       "</tr>\n",
       "\n",
       "<tr>\n",
       "    <th>wg</th>\n",
       "    <td>105430</td>\n",
       "    <td>6.85</td>\n",
       "    <td><i>524</i></td>\n",
       "</tr>\n",
       "\n",
       "<tr>\n",
       "    <th><i>word</i></th>\n",
       "    <td>137779</td>\n",
       "    <td>1.00</td>\n",
       "    <td><b>100</b></td>\n",
       "</tr>\n",
       "</table></details>\n",
       "            <b>Sets:</b> no custom sets<br>\n",
       "            <b>Features:</b><br>\n",
       "<details><summary><b>Nestle 1904 (Low Fat Tree)</b></summary>\n",
       "    <div class=\"fcorpus\">\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/after.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/after.tf\">after</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Characters (eg. punctuations) following the word</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/book.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/book.tf\">book</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Book name (in English language)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/booknumber.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/booknumber.tf\">booknumber</a>\n",
       "</div>\n",
       "<div class=\"fmono\">int</div>\n",
       "\n",
       "<span> ✅ NT book number (Matthew=1, Mark=2, ..., Revelation=27)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/bookshort.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/bookshort.tf\">bookshort</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Book name (abbreviated)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/case.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/case.tf\">case</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical case (Nominative, Genitive, Dative, Accusative, Vocative)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/chapter.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/chapter.tf\">chapter</a>\n",
       "</div>\n",
       "<div class=\"fmono\">int</div>\n",
       "\n",
       "<span> ✅ Chapter number inside book</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/clausetype.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/clausetype.tf\">clausetype</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Clause type details (e.g. Verbless, Minor)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/containedclause.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/containedclause.tf\">containedclause</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> 🆗 Contained clause (WG number)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/degree.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/degree.tf\">degree</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Degree (e.g. Comparitative, Superlative)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/gloss.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/gloss.tf\">gloss</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ English gloss</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/gn.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/gn.tf\">gn</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical gender (Masculine, Feminine, Neuter)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/headverse.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/headverse.tf\">headverse</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Start verse number of a sentence</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/junction.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/junction.tf\">junction</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Junction data related to a wordgroup</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/lemma.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/lemma.tf\">lemma</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Lexeme (lemma)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/lex_dom.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/lex_dom.tf\">lex_dom</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Lexical domain according to Semantic Dictionary of Biblical Greek, SDBG (not present everywhere?)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/ln.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/ln.tf\">ln</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Lauw-Nida lexical classification (not present everywhere?)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/markafter.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/markafter.tf\">markafter</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> 🆗 Text critical marker after word</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/markbefore.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/markbefore.tf\">markbefore</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> 🆗 Text critical marker before word</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/markorder.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/markorder.tf\">markorder</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span>  Order of punctuation and text critical marker</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/monad.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/monad.tf\">monad</a>\n",
       "</div>\n",
       "<div class=\"fmono\">int</div>\n",
       "\n",
       "<span> ✅ Monad (smallest token matching word order in the corpus)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/mood.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/mood.tf\">mood</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical mood of the verb (passive, etc)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/morph.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/morph.tf\">morph</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Morphological tag (Sandborg-Petersen morphology)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/nodeID.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/nodeID.tf\">nodeID</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Node ID (as in the XML source data)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/normalized.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/normalized.tf\">normalized</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Surface word with accents normalized and trailing punctuations removed</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/nu.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/nu.tf\">nu</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical number (Singular, Plural)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/number.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/number.tf\">number</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical number of the verb (e.g. singular, plural)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/otype.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/otype.tf\">otype</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> </span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/person.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/person.tf\">person</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical person of the verb (first, second, third)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/punctuation.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/punctuation.tf\">punctuation</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Punctuation after word</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/ref.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/ref.tf\">ref</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Value of the ref ID (taken from XML sourcedata)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/reference.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/reference.tf\">reference</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Reference (to nodeID in XML source data, not yet post-processes)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/roleclausedistance.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/roleclausedistance.tf\">roleclausedistance</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ⚠️ Distance to the wordgroup defining the syntactical role of this word</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/sentence.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/sentence.tf\">sentence</a>\n",
       "</div>\n",
       "<div class=\"fmono\">int</div>\n",
       "\n",
       "<span> ✅ Sentence number (counted per chapter)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/sp.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/sp.tf\">sp</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Part of Speech (abbreviated)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/sp_full.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/sp_full.tf\">sp_full</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Part of Speech (long description)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/strongs.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/strongs.tf\">strongs</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Strongs number</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/subj_ref.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/subj_ref.tf\">subj_ref</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> 🆗 Subject reference (to nodeID in XML source data, not yet post-processes)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/tense.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/tense.tf\">tense</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical tense of the verb (e.g. Present, Aorist)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/type.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/type.tf\">type</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical type  of noun or pronoun (e.g. Common, Personal)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/unicode.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/unicode.tf\">unicode</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Word as it apears in the text in Unicode (incl. punctuations)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/verse.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/verse.tf\">verse</a>\n",
       "</div>\n",
       "<div class=\"fmono\">int</div>\n",
       "\n",
       "<span> ✅ Verse number inside chapter</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/voice.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/voice.tf\">voice</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Gramatical voice of the verb (e.g. active,passive)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wgclass.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wgclass.tf\">wgclass</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Class of the wordgroup (e.g. cl, np, vp)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wglevel.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wglevel.tf\">wglevel</a>\n",
       "</div>\n",
       "<div class=\"fmono\">int</div>\n",
       "\n",
       "<span> 🆗 Number of the parent wordgroups for a wordgroup</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wgnum.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wgnum.tf\">wgnum</a>\n",
       "</div>\n",
       "<div class=\"fmono\">int</div>\n",
       "\n",
       "<span> ✅ Wordgroup number (counted per book)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wgrole.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wgrole.tf\">wgrole</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Syntactical role of the wordgroup (abbreviated)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wgrolelong.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wgrolelong.tf\">wgrolelong</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Syntactical role of the wordgroup (full)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wgrule.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wgrule.tf\">wgrule</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Wordgroup rule information (e.g. Np-Appos, ClCl2, PrepNp)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wgtype.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wgtype.tf\">wgtype</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Wordgroup type details (e.g. group, apposition)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/word.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/word.tf\">word</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Word as it appears in the text (excl. punctuations)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wordlevel.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wordlevel.tf\">wordlevel</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> 🆗 Number of the parent wordgroups for a word</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wordrole.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wordrole.tf\">wordrole</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Syntactical role of the word (abbreviated)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wordrolelong.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wordrolelong.tf\">wordrolelong</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Syntactical role of the word (full)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wordtranslit.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wordtranslit.tf\">wordtranslit</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> 🆗 Transliteration of the text (in latin letters, excl. punctuations)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat \">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/wordunacc.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/wordunacc.tf\">wordunacc</a>\n",
       "</div>\n",
       "<div class=\"fmono\">str</div>\n",
       "\n",
       "<span> ✅ Word without accents (excl. punctuations)</span>\n",
       "\n",
       "</div>\n",
       "\n",
       "<div class=\"frow\">\n",
       "    <div class=\"fnamecat edge\">\n",
       "<a target=\"_blank\" href=\"https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/oslots.md\" title=\"~/text-fabric-data/github/tonyjurg/Nestle1904LFT/tf/0.6/oslots.tf\">oslots</a>\n",
       "</div>\n",
       "<div class=\"fmono\">none</div>\n",
       "\n",
       "<span> </span>\n",
       "\n",
       "</div>\n",
       "\n",
       "    </div>\n",
       "</details>\n",
       "\n",
       "            <b>Settings:</b><br><details ><summary><b>specified</b></summary><ol><li><b>apiVersion</b>: <code>3</code></li><li><b>appName</b>: <code>tonyjurg/Nestle1904LFT</code></li><li><details><summary><b>appPath</b>:</summary><code>C:/Users/tonyj/text-fabric-data/github/tonyjurg/Nestle1904LFT/app</code></details></li><li><b>commit</b>: <code>e68bd68c7c4c862c1464d995d51e27db7691254f</code></li><li><b>css</b>: <code>''</code></li><li><details><summary><b>dataDisplay</b>:</summary><ul><li><details><summary><b>excludedFeatures</b>:</summary><ul><li><code>orig_order</code></li><li><code>verse</code></li><li><code>book</code></li><li><code>chapter</code></li></ul></details></li><li><details><summary><b>noneValues</b>:</summary><ul><li><code>none</code></li><li><code>unknown</code></li><li><i>no value</i></li><li><code>NA</code></li><li><code>''</code></li></ul></details></li><li><b>showVerseInTuple</b>: <code>0</code></li><li><b>textFormat</b>: <code>text-orig-full</code></li></ul></details></li><li><details><summary><b>docs</b>:</summary><ul><li><b>docBase</b>: <code>https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/</code></li><li><b>docPage</b>: <code>about</code></li><li><b>docRoot</b>: <code>https://github.com/tonyjurg/Nestle1904LFT</code></li><li><details><summary><b>featureBase</b>:</summary><code>https://github.com/tonyjurg/Nestle1904LFT/blob/main/docs/features/&lt;feature&gt;.md</code></details></li></ul></details></li><li><b>interfaceDefaults</b>: {<b>fmt</b>: <code>layout-orig-full</code>}</li><li><b>isCompatible</b>: <code>True</code></li><li><b>local</b>: <code>local</code></li><li><details><summary><b>localDir</b>:</summary><code>C:/Users/tonyj/text-fabric-data/github/tonyjurg/Nestle1904LFT/_temp</code></details></li><li><details><summary><b>provenanceSpec</b>:</summary><ul><li><b>corpus</b>: <code>Nestle 1904 (Low Fat Tree)</code></li><li><b>doi</b>: <code>10.5281/zenodo.10182594</code></li><li><b>org</b>: <code>tonyjurg</code></li><li><b>relative</b>: <code>/tf</code></li><li><b>repo</b>: <code>Nestle1904LFT</code></li><li><b>repro</b>: <code>Nestle1904LFT</code></li><li><b>version</b>: <code>0.6</code></li><li><b>webBase</b>: <code>https://learner.bible/text/show_text/nestle1904/</code></li><li><b>webHint</b>: <code>Show this on the Bible Online Learner website</code></li><li><b>webLang</b>: <code>en</code></li><li><details><summary><b>webUrl</b>:</summary><code>https://learner.bible/text/show_text/nestle1904/&lt;1&gt;/&lt;2&gt;/&lt;3&gt;</code></details></li><li><b>webUrlLex</b>: <code>{webBase}/word?version={version}&amp;id=&lt;lid&gt;</code></li></ul></details></li><li><b>release</b>: <code>v0.6</code></li><li><details><summary><b>typeDisplay</b>:</summary><ul><li><details><summary><b>book</b>:</summary><ul><li><b>condense</b>: <code>True</code></li><li><b>hidden</b>: <code>True</code></li><li><b>label</b>: <code>{book}</code></li><li><b>style</b>: <code>''</code></li></ul></details></li><li><details><summary><b>chapter</b>:</summary><ul><li><b>condense</b>: <code>True</code></li><li><b>hidden</b>: <code>True</code></li><li><b>label</b>: <code>{chapter}</code></li><li><b>style</b>: <code>''</code></li></ul></details></li><li><details><summary><b>sentence</b>:</summary><ul><li><b>hidden</b>: <code>0</code></li><li><b>label</b>: <code>#{sentence} (start: {book} {chapter}:{headverse})</code></li><li><b>style</b>: <code>''</code></li></ul></details></li><li><details><summary><b>verse</b>:</summary><ul><li><b>condense</b>: <code>True</code></li><li><b>excludedFeatures</b>: <code>chapter verse</code></li><li><b>label</b>: <code>{book} {chapter}:{verse}</code></li><li><b>style</b>: <code>''</code></li></ul></details></li><li><details><summary><b>wg</b>:</summary><ul><li><b>hidden</b>: <code>0</code></li><li><details><summary><b>label</b>:</summary><code>#{wgnum}: {wgtype} {wgclass} {clausetype} {wgrole} {wgrule} {junction}</code></details></li><li><b>style</b>: <code>''</code></li></ul></details></li><li><details><summary><b>word</b>:</summary><ul><li><b>base</b>: <code>True</code></li><li><b>features</b>: <code>lemma</code></li><li><b>featuresBare</b>: <code>gloss</code></li><li><b>surpress</b>: <code>chapter verse</code></li></ul></details></li></ul></details></li><li><b>writing</b>: <code>grc</code></li></ol></details>\n"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/html": [
       "<style>tr.tf.ltr, td.tf.ltr, th.tf.ltr { text-align: left ! important;}\n",
       "tr.tf.rtl, td.tf.rtl, th.tf.rtl { text-align: right ! important;}\n",
       "@font-face {\n",
       "  font-family: \"Gentium Plus\";\n",
       "  src: local('Gentium Plus'), local('GentiumPlus'),\n",
       "    url('/browser/static/fonts/GentiumPlus-R.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/GentiumPlus-R.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"Ezra SIL\";\n",
       "  src: local('Ezra SIL'), local('EzraSIL'),\n",
       "    url('/browser/static/fonts/SILEOT.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SILEOT.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"SBL Hebrew\";\n",
       "  src: local('SBL Hebrew'), local('SBLHebrew'),\n",
       "    url('/browser/static/fonts/SBL_Hbrw.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SBL_Hbrw.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"Estrangelo Edessa\";\n",
       "  src: local('Estrangelo Edessa'), local('EstrangeloEdessa');\n",
       "    url('/browser/static/fonts/SyrCOMEdessa.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SyrCOMEdessa.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: AmiriQuran;\n",
       "  font-style: normal;\n",
       "  font-weight: 400;\n",
       "  src: local('Amiri Quran'), local('AmiriQuran'),\n",
       "    url('/browser/static/fonts/AmiriQuran.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/AmiriQuran.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: AmiriQuranColored;\n",
       "  font-style: normal;\n",
       "  font-weight: 400;\n",
       "  src: local('Amiri Quran Colored'), local('AmiriQuranColored'),\n",
       "    url('/browser/static/fonts/AmiriQuranColored.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/AmiriQuranColored.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"Santakku\";\n",
       "  src: local('Santakku'),\n",
       "    url('/browser/static/fonts/Santakku.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/Santakku.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"SantakkuM\";\n",
       "  src: local('SantakkuM'),\n",
       "    url('/browser/static/fonts/SantakkuM.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SantakkuM.woff?raw=true') format('woff');\n",
       "}\n",
       "/* bypassing some classical notebook settings */\n",
       "div#notebook {\n",
       "  line-height: unset;\n",
       "}\n",
       "/* neutral text */\n",
       ".txtn,.txtn a:visited,.txtn a:link {\n",
       "    font-family: sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* transcription text */\n",
       ".txtt,.txtt a:visited,.txtt a:link {\n",
       "    font-family: monospace;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* source text */\n",
       ".txto,.txto a:visited,.txto a:link {\n",
       "    font-family: serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* phonetic text */\n",
       ".txtp,.txtp a:visited,.txtp a:link {\n",
       "    font-family: Gentium, sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* original script text */\n",
       ".txtu,.txtu a:visited,.txtu a:link {\n",
       "    font-family: Gentium, sans-serif;\n",
       "    font-size: medium;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* hebrew */\n",
       ".txtu.hbo,.lex.hbo {\n",
       "    font-family: \"Ezra SIL\", \"SBL Hebrew\", sans-serif;\n",
       "    font-size: large;\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* syriac */\n",
       ".txtu.syc,.lex.syc {\n",
       "    font-family: \"Estrangelo Edessa\", sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* neo aramaic */\n",
       ".txtu.cld,.lex.cld {\n",
       "    font-family: \"CharisSIL-R\", sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* standard arabic */\n",
       ".txtu.ara,.lex.ara {\n",
       "    font-family: \"AmiriQuran\", sans-serif;\n",
       "    font-size: large;\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* cuneiform */\n",
       ".txtu.akk,.lex.akk {\n",
       "    font-family: Santakku, sans-serif;\n",
       "    font-size: large;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* greek */\n",
       ".txtu.grc,.lex.grc a:link {\n",
       "    font-family: Gentium, sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "a:hover {\n",
       "    text-decoration: underline | important;\n",
       "    color: #0000ff | important;\n",
       "}\n",
       ".ltr {\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".rtl {\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".ubd {\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".col {\n",
       "   display: inline-block;\n",
       "}\n",
       ".features {\n",
       "    font-family: monospace;\n",
       "    font-size: medium;\n",
       "    font-weight: bold;\n",
       "    color: var(--features);\n",
       "    display: flex;\n",
       "    flex-flow: column nowrap;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "    padding: 2px;\n",
       "    margin: 2px;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    border: var(--meta-width) solid var(--meta-color);\n",
       "    border-radius: var(--meta-width);\n",
       "}\n",
       ".features div,.features span {\n",
       "    padding: 0;\n",
       "    margin: -2px 0;\n",
       "}\n",
       ".features .f {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    font-weight: normal;\n",
       "    color: #5555bb;\n",
       "}\n",
       ".features .xft {\n",
       "  color: #000000;\n",
       "  background-color: #eeeeee;\n",
       "  font-size: medium;\n",
       "  margin: 2px 0px;\n",
       "}\n",
       ".features .xft .f {\n",
       "  color: #000000;\n",
       "  background-color: #eeeeee;\n",
       "  font-size: small;\n",
       "  font-weight: normal;\n",
       "}\n",
       ".tfsechead {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    font-weight: bold;\n",
       "    color: var(--tfsechead);\n",
       "    unicode-bidi: embed;\n",
       "    text-align: start;\n",
       "}\n",
       ".structure {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    font-weight: bold;\n",
       "    color: var(--structure);\n",
       "    unicode-bidi: embed;\n",
       "    text-align: start;\n",
       "}\n",
       ".comments {\n",
       "    display: flex;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "    flex-flow: column nowrap;\n",
       "}\n",
       ".nd, a:link.nd {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    color: var(--node);\n",
       "    vertical-align: super;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".nde, a:link.nde {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    color: var(--node);\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".etf {\n",
       "    font-size: normal;\n",
       "    border-radius: 0.2rem;\n",
       "    border: 1pt solid white;\n",
       "    padding: 0 0.2rem ! important;\n",
       "    margin: 0 0.2rem ! important;\n",
       "}\n",
       ".etfx {\n",
       "    font-size: x-large;\n",
       "}\n",
       ".lex {\n",
       "  color: var(--lex-color);;\n",
       "}\n",
       "#colormapplus, #colormapmin, .ecolormapmin {\n",
       "  font-weight: bold;\n",
       "  border-radius: 0.1rem;\n",
       "  background-color: #eeeeff;\n",
       "  padding: 0 1rem;\n",
       "  margin: 0 1rem;\n",
       "}\n",
       ".clr {\n",
       "  font-style: italic;\n",
       "  font-size: small;\n",
       "}\n",
       ".clmap,.eclmap {\n",
       "  padding: 0;\n",
       "}\n",
       ".children,.children.ltr {\n",
       "    display: flex;\n",
       "    border: 0;\n",
       "    background-color: #ffffff;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "}\n",
       ".children.stretch {\n",
       "    align-items: stretch;\n",
       "}\n",
       ".children.hor {\n",
       "    flex-flow: row nowrap;\n",
       "}\n",
       ".children.hor.wrap {\n",
       "    flex-flow: row wrap;\n",
       "}\n",
       ".children.ver {\n",
       "    flex-flow: column nowrap;\n",
       "}\n",
       ".children.ver.wrap {\n",
       "    flex-flow: column wrap;\n",
       "}\n",
       ".contnr {\n",
       "    width: fit-content;\n",
       "    display: flex;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "    flex-flow: column nowrap;\n",
       "    background: #ffffff none repeat scroll 0 0;\n",
       "    padding:  10px 2px 2px 2px;\n",
       "    margin: 16px 2px 2px 2px;\n",
       "    border-style: solid;\n",
       "    font-size: small;\n",
       "}\n",
       ".contnr.trm {\n",
       "    background-attachment: local;\n",
       "}\n",
       ".contnr.cnul {\n",
       "    padding:  0;\n",
       "    margin: 0;\n",
       "    border-style: solid;\n",
       "    font-size: xx-small;\n",
       "}\n",
       ".contnr.cnul,.lbl.cnul {\n",
       "    border-color: var(--border-color-nul);\n",
       "    border-width: var(--border-width-nul);\n",
       "    border-radius: var(--border-width-nul);\n",
       "}\n",
       ".contnr.c0,.lbl.c0 {\n",
       "    border-color: var(--border-color0);\n",
       "    border-width: var(--border-width0);\n",
       "    border-radius: var(--border-width0);\n",
       "}\n",
       ".contnr.c1,.lbl.c1 {\n",
       "    border-color: var(--border-color1);\n",
       "    border-width: var(--border-width1);\n",
       "    border-radius: var(--border-width1);\n",
       "}\n",
       ".contnr.c2,.lbl.c2 {\n",
       "    border-color: var(--border-color2);\n",
       "    border-width: var(--border-width2);\n",
       "    border-radius: var(--border-width2);\n",
       "}\n",
       ".contnr.c3,.lbl.c3 {\n",
       "    border-color: var(--border-color3);\n",
       "    border-width: var(--border-width3);\n",
       "    border-radius: var(--border-width3);\n",
       "}\n",
       ".contnr.c4,.lbl.c4 {\n",
       "    border-color: var(--border-color4);\n",
       "    border-width: var(--border-width4);\n",
       "    border-radius: var(--border-width4);\n",
       "}\n",
       "span.plain {\n",
       "    /*display: inline-block;*/\n",
       "    display: inline-flex;\n",
       "    flex-flow: row wrap;\n",
       "    white-space: pre-wrap;\n",
       "}\n",
       "span.break {\n",
       "  flex-basis: 100%;\n",
       "  height: 0;\n",
       "}\n",
       ".plain {\n",
       "    background-color: #ffffff;\n",
       "}\n",
       ".plain.l,.contnr.l,.contnr.l>.lbl {\n",
       "    border-left-style: dotted\n",
       "}\n",
       ".plain.r,.contnr.r,.contnr.r>.lbl {\n",
       "    border-right-style: dotted\n",
       "}\n",
       ".plain.lno,.contnr.lno,.contnr.lno>.lbl {\n",
       "    border-left-style: none\n",
       "}\n",
       ".plain.rno,.contnr.rno,.contnr.rno>.lbl {\n",
       "    border-right-style: none\n",
       "}\n",
       ".plain.l {\n",
       "    padding-left: 4px;\n",
       "    margin-left: 2px;\n",
       "    border-width: var(--border-width-plain);\n",
       "}\n",
       ".plain.r {\n",
       "    padding-right: 4px;\n",
       "    margin-right: 2px;\n",
       "    border-width: var(--border-width-plain);\n",
       "}\n",
       ".lbl {\n",
       "    font-family: monospace;\n",
       "    margin-top: -24px;\n",
       "    margin-left: 20px;\n",
       "    background: #ffffff none repeat scroll 0 0;\n",
       "    padding: 0 6px;\n",
       "    border-style: solid;\n",
       "    display: block;\n",
       "    color: var(--label)\n",
       "}\n",
       ".lbl.trm {\n",
       "    background-attachment: local;\n",
       "    margin-top: 2px;\n",
       "    margin-left: 2px;\n",
       "    padding: 2px 2px;\n",
       "    border-style: none;\n",
       "}\n",
       ".lbl.cnul {\n",
       "    font-size: xx-small;\n",
       "}\n",
       ".lbl.c0 {\n",
       "    font-size: small;\n",
       "}\n",
       ".lbl.c1 {\n",
       "    font-size: small;\n",
       "}\n",
       ".lbl.c2 {\n",
       "    font-size: medium;\n",
       "}\n",
       ".lbl.c3 {\n",
       "    font-size: medium;\n",
       "}\n",
       ".lbl.c4 {\n",
       "    font-size: large;\n",
       "}\n",
       ".occs, a:link.occs {\n",
       "    font-size: small;\n",
       "}\n",
       "\n",
       "/* PROVENANCE */\n",
       "\n",
       "div.prov {\n",
       "\tmargin: 40px;\n",
       "\tpadding: 20px;\n",
       "\tborder: 2px solid var(--fog-rim);\n",
       "}\n",
       "div.pline {\n",
       "\tdisplay: flex;\n",
       "\tflex-flow: row nowrap;\n",
       "\tjustify-content: stretch;\n",
       "\talign-items: baseline;\n",
       "}\n",
       "div.p2line {\n",
       "\tmargin-left: 2em;\n",
       "\tdisplay: flex;\n",
       "\tflex-flow: row nowrap;\n",
       "\tjustify-content: stretch;\n",
       "\talign-items: baseline;\n",
       "}\n",
       "div.psline {\n",
       "\tdisplay: flex;\n",
       "\tflex-flow: row nowrap;\n",
       "\tjustify-content: stretch;\n",
       "\talign-items: baseline;\n",
       "\tbackground-color: var(--gold-mist-back);\n",
       "}\n",
       "div.pname {\n",
       "\tflex: 0 0 5rem;\n",
       "\tfont-weight: bold;\n",
       "}\n",
       "div.pval {\n",
       "    flex: 1 1 auto;\n",
       "}\n",
       "\n",
       "/* KEYBOARD */\n",
       ".ccoff {\n",
       "  background-color: inherit;\n",
       "}\n",
       ".ccon {\n",
       "  background-color: yellow ! important;\n",
       "}\n",
       ".ccon,.ccoff {\n",
       "  padding: 0.2rem;\n",
       "  margin: 0.2rem;\n",
       "  border: 0.1rem solid var(--letter-box-border);\n",
       "  border-radius: 0.1rem;\n",
       "}\n",
       ".ccline {\n",
       "  font-size: xx-large ! important;\n",
       "  font-weight: bold;\n",
       "  line-height: 2em ! important;\n",
       "}\n",
       "/* TF header */\n",
       "\n",
       "summary {\n",
       "  /* needed to override the normalize.less\n",
       "   * in the classical Jupyter Notebook\n",
       "   */\n",
       "  display: list-item ! important;\n",
       "}\n",
       "\n",
       ".fcorpus {\n",
       "  display: flex;\n",
       "  flex-flow: column nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "  overflow: auto;\n",
       "}\n",
       ".frow {\n",
       "  display: flex;\n",
       "  flex-flow: row nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "}\n",
       ".fmeta {\n",
       "  display: flex;\n",
       "  flex-flow: column nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "}\n",
       ".fmetarow {\n",
       "  display: flex;\n",
       "  flex-flow: row nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "}\n",
       ".fmetakey {\n",
       "  min-width: 8em;\n",
       "  font-family: monospace;\n",
       "}\n",
       ".fnamecat {\n",
       "  min-width: 8em;\n",
       "}\n",
       ".fnamecat.edge {\n",
       "  font-weight: bold;\n",
       "  font-style: italic;\n",
       "}\n",
       ".fmono {\n",
       "    font-family: monospace;\n",
       "}\n",
       "\n",
       ":root {\n",
       "\t--node:               hsla(120, 100%,  20%, 1.0  );\n",
       "\t--label:              hsla(  0, 100%,  20%, 1.0  );\n",
       "\t--tfsechead:          hsla(  0, 100%,  25%, 1.0  );\n",
       "\t--structure:          hsla(120, 100%,  25%, 1.0  );\n",
       "\t--features:           hsla(  0,   0%,  30%, 1.0  );\n",
       "  --text-color:         hsla( 60,  80%,  10%, 1.0  );\n",
       "  --lex-color:          hsla(220,  90%,  60%, 1.0  );\n",
       "  --meta-color:         hsla(  0,   0%,  90%, 0.7  );\n",
       "  --meta-width:         3px;\n",
       "  --border-color-nul:   hsla(  0,   0%,  90%, 0.5  );\n",
       "  --border-color0:      hsla(  0,   0%,  90%, 0.9  );\n",
       "  --border-color1:      hsla(  0,   0%,  80%, 0.9  );\n",
       "  --border-color2:      hsla(  0,   0%,  70%, 0.9  );\n",
       "  --border-color3:      hsla(  0,   0%,  80%, 0.8  );\n",
       "  --border-color4:      hsla(  0,   0%,  60%, 0.9  );\n",
       "\t--letter-box-border:  hsla(  0,   0%,  80%, 0.5  );\n",
       "  --border-width-nul:   2px;\n",
       "  --border-width0:      2px;\n",
       "  --border-width1:      3px;\n",
       "  --border-width2:      4px;\n",
       "  --border-width3:      6px;\n",
       "  --border-width4:      5px;\n",
       "  --border-width-plain: 2px;\n",
       "}\n",
       ".hl {\n",
       "  background-color: var(--hl-strong);\n",
       "}\n",
       "span.hl {\n",
       "\tbackground-color: var(--hl-strong);\n",
       "\tborder-width: 0;\n",
       "\tborder-radius: 2px;\n",
       "\tborder-style: solid;\n",
       "}\n",
       "div.contnr.hl,div.lbl.hl {\n",
       "  background-color: var(--hl-strong);\n",
       "}\n",
       "div.contnr.hl {\n",
       "  border-color: var(--hl-rim) ! important;\n",
       "\tborder-width: 4px ! important;\n",
       "}\n",
       "\n",
       "span.hlbx {\n",
       "\tborder-color: var(--hl-rim);\n",
       "\tborder-width: 4px ! important;\n",
       "\tborder-style: solid;\n",
       "\tborder-radius: 6px;\n",
       "  padding: 4px;\n",
       "  margin: 4px;\n",
       "}\n",
       ".ehl {\n",
       "  background-color: var(--ehl-strong);\n",
       "}\n",
       "\n",
       ":root {\n",
       "\t--hl-strong:        hsla( 60, 100%,  70%, 0.9  );\n",
       "\t--hl-rim:           hsla( 55,  80%,  50%, 1.0  );\n",
       "\t--ehl-strong:       hsla(240, 100%,  70%, 0.9  );\n",
       "}\n",
       "</style>"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/html": [
       "\n",
       "<script>\n",
       "globalThis.copyChar = (el, c) => {\n",
       "    for (const el of document.getElementsByClassName('ccon')) {\n",
       "        el.className = 'ccoff'\n",
       "    }\n",
       "    el.className = 'ccon'\n",
       "    navigator.clipboard.writeText(String.fromCharCode(c))\n",
       "}\n",
       "</script>\n"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/html": [
       "<div><b>TF API:</b> names <a target=\"_blank\" href=\"https://annotation.github.io/text-fabric/tf/cheatsheet.html\" title=\"doc\">N F E L T S C TF Fs Fall Es Eall Cs Call</a> directly usable</div><hr>"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# load the N1904 app and data\n",
    "N1904 = use (\"tonyjurg/Nestle1904LFT\", version=\"0.6\", hoist=globals())"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "id": "d5da5d1a-6827-49b3-ad37-7ca29ba59b45",
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<style>tr.tf.ltr, td.tf.ltr, th.tf.ltr { text-align: left ! important;}\n",
       "tr.tf.rtl, td.tf.rtl, th.tf.rtl { text-align: right ! important;}\n",
       "@font-face {\n",
       "  font-family: \"Gentium Plus\";\n",
       "  src: local('Gentium Plus'), local('GentiumPlus'),\n",
       "    url('/browser/static/fonts/GentiumPlus-R.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/GentiumPlus-R.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"Ezra SIL\";\n",
       "  src: local('Ezra SIL'), local('EzraSIL'),\n",
       "    url('/browser/static/fonts/SILEOT.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SILEOT.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"SBL Hebrew\";\n",
       "  src: local('SBL Hebrew'), local('SBLHebrew'),\n",
       "    url('/browser/static/fonts/SBL_Hbrw.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SBL_Hbrw.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"Estrangelo Edessa\";\n",
       "  src: local('Estrangelo Edessa'), local('EstrangeloEdessa');\n",
       "    url('/browser/static/fonts/SyrCOMEdessa.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SyrCOMEdessa.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: AmiriQuran;\n",
       "  font-style: normal;\n",
       "  font-weight: 400;\n",
       "  src: local('Amiri Quran'), local('AmiriQuran'),\n",
       "    url('/browser/static/fonts/AmiriQuran.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/AmiriQuran.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: AmiriQuranColored;\n",
       "  font-style: normal;\n",
       "  font-weight: 400;\n",
       "  src: local('Amiri Quran Colored'), local('AmiriQuranColored'),\n",
       "    url('/browser/static/fonts/AmiriQuranColored.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/AmiriQuranColored.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"Santakku\";\n",
       "  src: local('Santakku'),\n",
       "    url('/browser/static/fonts/Santakku.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/Santakku.woff?raw=true') format('woff');\n",
       "}\n",
       "\n",
       "@font-face {\n",
       "  font-family: \"SantakkuM\";\n",
       "  src: local('SantakkuM'),\n",
       "    url('/browser/static/fonts/SantakkuM.woff') format('woff'),\n",
       "    url('https://github.com/annotation/text-fabric/blob/master/tf/browser/static/fonts/SantakkuM.woff?raw=true') format('woff');\n",
       "}\n",
       "/* bypassing some classical notebook settings */\n",
       "div#notebook {\n",
       "  line-height: unset;\n",
       "}\n",
       "/* neutral text */\n",
       ".txtn,.txtn a:visited,.txtn a:link {\n",
       "    font-family: sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* transcription text */\n",
       ".txtt,.txtt a:visited,.txtt a:link {\n",
       "    font-family: monospace;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* source text */\n",
       ".txto,.txto a:visited,.txto a:link {\n",
       "    font-family: serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* phonetic text */\n",
       ".txtp,.txtp a:visited,.txtp a:link {\n",
       "    font-family: Gentium, sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* original script text */\n",
       ".txtu,.txtu a:visited,.txtu a:link {\n",
       "    font-family: Gentium, sans-serif;\n",
       "    font-size: medium;\n",
       "    text-decoration: none;\n",
       "    color: var(--text-color);\n",
       "}\n",
       "/* hebrew */\n",
       ".txtu.hbo,.lex.hbo {\n",
       "    font-family: \"Ezra SIL\", \"SBL Hebrew\", sans-serif;\n",
       "    font-size: large;\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* syriac */\n",
       ".txtu.syc,.lex.syc {\n",
       "    font-family: \"Estrangelo Edessa\", sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* neo aramaic */\n",
       ".txtu.cld,.lex.cld {\n",
       "    font-family: \"CharisSIL-R\", sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* standard arabic */\n",
       ".txtu.ara,.lex.ara {\n",
       "    font-family: \"AmiriQuran\", sans-serif;\n",
       "    font-size: large;\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* cuneiform */\n",
       ".txtu.akk,.lex.akk {\n",
       "    font-family: Santakku, sans-serif;\n",
       "    font-size: large;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "/* greek */\n",
       ".txtu.grc,.lex.grc a:link {\n",
       "    font-family: Gentium, sans-serif;\n",
       "    font-size: medium;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       "a:hover {\n",
       "    text-decoration: underline | important;\n",
       "    color: #0000ff | important;\n",
       "}\n",
       ".ltr {\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".rtl {\n",
       "    direction: rtl ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".ubd {\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".col {\n",
       "   display: inline-block;\n",
       "}\n",
       ".features {\n",
       "    font-family: monospace;\n",
       "    font-size: medium;\n",
       "    font-weight: bold;\n",
       "    color: var(--features);\n",
       "    display: flex;\n",
       "    flex-flow: column nowrap;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "    padding: 2px;\n",
       "    margin: 2px;\n",
       "    direction: ltr;\n",
       "    unicode-bidi: embed;\n",
       "    border: var(--meta-width) solid var(--meta-color);\n",
       "    border-radius: var(--meta-width);\n",
       "}\n",
       ".features div,.features span {\n",
       "    padding: 0;\n",
       "    margin: -2px 0;\n",
       "}\n",
       ".features .f {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    font-weight: normal;\n",
       "    color: #5555bb;\n",
       "}\n",
       ".features .xft {\n",
       "  color: #000000;\n",
       "  background-color: #eeeeee;\n",
       "  font-size: medium;\n",
       "  margin: 2px 0px;\n",
       "}\n",
       ".features .xft .f {\n",
       "  color: #000000;\n",
       "  background-color: #eeeeee;\n",
       "  font-size: small;\n",
       "  font-weight: normal;\n",
       "}\n",
       ".tfsechead {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    font-weight: bold;\n",
       "    color: var(--tfsechead);\n",
       "    unicode-bidi: embed;\n",
       "    text-align: start;\n",
       "}\n",
       ".structure {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    font-weight: bold;\n",
       "    color: var(--structure);\n",
       "    unicode-bidi: embed;\n",
       "    text-align: start;\n",
       "}\n",
       ".comments {\n",
       "    display: flex;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "    flex-flow: column nowrap;\n",
       "}\n",
       ".nd, a:link.nd {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    color: var(--node);\n",
       "    vertical-align: super;\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".nde, a:link.nde {\n",
       "    font-family: sans-serif;\n",
       "    font-size: small;\n",
       "    color: var(--node);\n",
       "    direction: ltr ! important;\n",
       "    unicode-bidi: embed;\n",
       "}\n",
       ".etf {\n",
       "    font-size: normal;\n",
       "    border-radius: 0.2rem;\n",
       "    border: 1pt solid white;\n",
       "    padding: 0 0.2rem ! important;\n",
       "    margin: 0 0.2rem ! important;\n",
       "}\n",
       ".etfx {\n",
       "    font-size: x-large;\n",
       "}\n",
       ".lex {\n",
       "  color: var(--lex-color);;\n",
       "}\n",
       "#colormapplus, #colormapmin, .ecolormapmin {\n",
       "  font-weight: bold;\n",
       "  border-radius: 0.1rem;\n",
       "  background-color: #eeeeff;\n",
       "  padding: 0 1rem;\n",
       "  margin: 0 1rem;\n",
       "}\n",
       ".clr {\n",
       "  font-style: italic;\n",
       "  font-size: small;\n",
       "}\n",
       ".clmap,.eclmap {\n",
       "  padding: 0;\n",
       "}\n",
       ".children,.children.ltr {\n",
       "    display: flex;\n",
       "    border: 0;\n",
       "    background-color: #ffffff;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "}\n",
       ".children.stretch {\n",
       "    align-items: stretch;\n",
       "}\n",
       ".children.hor {\n",
       "    flex-flow: row nowrap;\n",
       "}\n",
       ".children.hor.wrap {\n",
       "    flex-flow: row wrap;\n",
       "}\n",
       ".children.ver {\n",
       "    flex-flow: column nowrap;\n",
       "}\n",
       ".children.ver.wrap {\n",
       "    flex-flow: column wrap;\n",
       "}\n",
       ".contnr {\n",
       "    width: fit-content;\n",
       "    display: flex;\n",
       "    justify-content: flex-start;\n",
       "    align-items: flex-start;\n",
       "    align-content: flex-start;\n",
       "    flex-flow: column nowrap;\n",
       "    background: #ffffff none repeat scroll 0 0;\n",
       "    padding:  10px 2px 2px 2px;\n",
       "    margin: 16px 2px 2px 2px;\n",
       "    border-style: solid;\n",
       "    font-size: small;\n",
       "}\n",
       ".contnr.trm {\n",
       "    background-attachment: local;\n",
       "}\n",
       ".contnr.cnul {\n",
       "    padding:  0;\n",
       "    margin: 0;\n",
       "    border-style: solid;\n",
       "    font-size: xx-small;\n",
       "}\n",
       ".contnr.cnul,.lbl.cnul {\n",
       "    border-color: var(--border-color-nul);\n",
       "    border-width: var(--border-width-nul);\n",
       "    border-radius: var(--border-width-nul);\n",
       "}\n",
       ".contnr.c0,.lbl.c0 {\n",
       "    border-color: var(--border-color0);\n",
       "    border-width: var(--border-width0);\n",
       "    border-radius: var(--border-width0);\n",
       "}\n",
       ".contnr.c1,.lbl.c1 {\n",
       "    border-color: var(--border-color1);\n",
       "    border-width: var(--border-width1);\n",
       "    border-radius: var(--border-width1);\n",
       "}\n",
       ".contnr.c2,.lbl.c2 {\n",
       "    border-color: var(--border-color2);\n",
       "    border-width: var(--border-width2);\n",
       "    border-radius: var(--border-width2);\n",
       "}\n",
       ".contnr.c3,.lbl.c3 {\n",
       "    border-color: var(--border-color3);\n",
       "    border-width: var(--border-width3);\n",
       "    border-radius: var(--border-width3);\n",
       "}\n",
       ".contnr.c4,.lbl.c4 {\n",
       "    border-color: var(--border-color4);\n",
       "    border-width: var(--border-width4);\n",
       "    border-radius: var(--border-width4);\n",
       "}\n",
       "span.plain {\n",
       "    /*display: inline-block;*/\n",
       "    display: inline-flex;\n",
       "    flex-flow: row wrap;\n",
       "    white-space: pre-wrap;\n",
       "}\n",
       "span.break {\n",
       "  flex-basis: 100%;\n",
       "  height: 0;\n",
       "}\n",
       ".plain {\n",
       "    background-color: #ffffff;\n",
       "}\n",
       ".plain.l,.contnr.l,.contnr.l>.lbl {\n",
       "    border-left-style: dotted\n",
       "}\n",
       ".plain.r,.contnr.r,.contnr.r>.lbl {\n",
       "    border-right-style: dotted\n",
       "}\n",
       ".plain.lno,.contnr.lno,.contnr.lno>.lbl {\n",
       "    border-left-style: none\n",
       "}\n",
       ".plain.rno,.contnr.rno,.contnr.rno>.lbl {\n",
       "    border-right-style: none\n",
       "}\n",
       ".plain.l {\n",
       "    padding-left: 4px;\n",
       "    margin-left: 2px;\n",
       "    border-width: var(--border-width-plain);\n",
       "}\n",
       ".plain.r {\n",
       "    padding-right: 4px;\n",
       "    margin-right: 2px;\n",
       "    border-width: var(--border-width-plain);\n",
       "}\n",
       ".lbl {\n",
       "    font-family: monospace;\n",
       "    margin-top: -24px;\n",
       "    margin-left: 20px;\n",
       "    background: #ffffff none repeat scroll 0 0;\n",
       "    padding: 0 6px;\n",
       "    border-style: solid;\n",
       "    display: block;\n",
       "    color: var(--label)\n",
       "}\n",
       ".lbl.trm {\n",
       "    background-attachment: local;\n",
       "    margin-top: 2px;\n",
       "    margin-left: 2px;\n",
       "    padding: 2px 2px;\n",
       "    border-style: none;\n",
       "}\n",
       ".lbl.cnul {\n",
       "    font-size: xx-small;\n",
       "}\n",
       ".lbl.c0 {\n",
       "    font-size: small;\n",
       "}\n",
       ".lbl.c1 {\n",
       "    font-size: small;\n",
       "}\n",
       ".lbl.c2 {\n",
       "    font-size: medium;\n",
       "}\n",
       ".lbl.c3 {\n",
       "    font-size: medium;\n",
       "}\n",
       ".lbl.c4 {\n",
       "    font-size: large;\n",
       "}\n",
       ".occs, a:link.occs {\n",
       "    font-size: small;\n",
       "}\n",
       "\n",
       "/* PROVENANCE */\n",
       "\n",
       "div.prov {\n",
       "\tmargin: 40px;\n",
       "\tpadding: 20px;\n",
       "\tborder: 2px solid var(--fog-rim);\n",
       "}\n",
       "div.pline {\n",
       "\tdisplay: flex;\n",
       "\tflex-flow: row nowrap;\n",
       "\tjustify-content: stretch;\n",
       "\talign-items: baseline;\n",
       "}\n",
       "div.p2line {\n",
       "\tmargin-left: 2em;\n",
       "\tdisplay: flex;\n",
       "\tflex-flow: row nowrap;\n",
       "\tjustify-content: stretch;\n",
       "\talign-items: baseline;\n",
       "}\n",
       "div.psline {\n",
       "\tdisplay: flex;\n",
       "\tflex-flow: row nowrap;\n",
       "\tjustify-content: stretch;\n",
       "\talign-items: baseline;\n",
       "\tbackground-color: var(--gold-mist-back);\n",
       "}\n",
       "div.pname {\n",
       "\tflex: 0 0 5rem;\n",
       "\tfont-weight: bold;\n",
       "}\n",
       "div.pval {\n",
       "    flex: 1 1 auto;\n",
       "}\n",
       "\n",
       "/* KEYBOARD */\n",
       ".ccoff {\n",
       "  background-color: inherit;\n",
       "}\n",
       ".ccon {\n",
       "  background-color: yellow ! important;\n",
       "}\n",
       ".ccon,.ccoff {\n",
       "  padding: 0.2rem;\n",
       "  margin: 0.2rem;\n",
       "  border: 0.1rem solid var(--letter-box-border);\n",
       "  border-radius: 0.1rem;\n",
       "}\n",
       ".ccline {\n",
       "  font-size: xx-large ! important;\n",
       "  font-weight: bold;\n",
       "  line-height: 2em ! important;\n",
       "}\n",
       "/* TF header */\n",
       "\n",
       "summary {\n",
       "  /* needed to override the normalize.less\n",
       "   * in the classical Jupyter Notebook\n",
       "   */\n",
       "  display: list-item ! important;\n",
       "}\n",
       "\n",
       ".fcorpus {\n",
       "  display: flex;\n",
       "  flex-flow: column nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "  overflow: auto;\n",
       "}\n",
       ".frow {\n",
       "  display: flex;\n",
       "  flex-flow: row nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "}\n",
       ".fmeta {\n",
       "  display: flex;\n",
       "  flex-flow: column nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "}\n",
       ".fmetarow {\n",
       "  display: flex;\n",
       "  flex-flow: row nowrap;\n",
       "  justify-content: flex-start;\n",
       "  align-items: flex-start;\n",
       "  align-content: flex-start;\n",
       "}\n",
       ".fmetakey {\n",
       "  min-width: 8em;\n",
       "  font-family: monospace;\n",
       "}\n",
       ".fnamecat {\n",
       "  min-width: 8em;\n",
       "}\n",
       ".fnamecat.edge {\n",
       "  font-weight: bold;\n",
       "  font-style: italic;\n",
       "}\n",
       ".fmono {\n",
       "    font-family: monospace;\n",
       "}\n",
       "\n",
       ":root {\n",
       "\t--node:               hsla(120, 100%,  20%, 1.0  );\n",
       "\t--label:              hsla(  0, 100%,  20%, 1.0  );\n",
       "\t--tfsechead:          hsla(  0, 100%,  25%, 1.0  );\n",
       "\t--structure:          hsla(120, 100%,  25%, 1.0  );\n",
       "\t--features:           hsla(  0,   0%,  30%, 1.0  );\n",
       "  --text-color:         hsla( 60,  80%,  10%, 1.0  );\n",
       "  --lex-color:          hsla(220,  90%,  60%, 1.0  );\n",
       "  --meta-color:         hsla(  0,   0%,  90%, 0.7  );\n",
       "  --meta-width:         3px;\n",
       "  --border-color-nul:   hsla(  0,   0%,  90%, 0.5  );\n",
       "  --border-color0:      hsla(  0,   0%,  90%, 0.9  );\n",
       "  --border-color1:      hsla(  0,   0%,  80%, 0.9  );\n",
       "  --border-color2:      hsla(  0,   0%,  70%, 0.9  );\n",
       "  --border-color3:      hsla(  0,   0%,  80%, 0.8  );\n",
       "  --border-color4:      hsla(  0,   0%,  60%, 0.9  );\n",
       "\t--letter-box-border:  hsla(  0,   0%,  80%, 0.5  );\n",
       "  --border-width-nul:   2px;\n",
       "  --border-width0:      2px;\n",
       "  --border-width1:      3px;\n",
       "  --border-width2:      4px;\n",
       "  --border-width3:      6px;\n",
       "  --border-width4:      5px;\n",
       "  --border-width-plain: 2px;\n",
       "}\n",
       ".hl {\n",
       "  background-color: var(--hl-strong);\n",
       "}\n",
       "span.hl {\n",
       "\tbackground-color: var(--hl-strong);\n",
       "\tborder-width: 0;\n",
       "\tborder-radius: 2px;\n",
       "\tborder-style: solid;\n",
       "}\n",
       "div.contnr.hl,div.lbl.hl {\n",
       "  background-color: var(--hl-strong);\n",
       "}\n",
       "div.contnr.hl {\n",
       "  border-color: var(--hl-rim) ! important;\n",
       "\tborder-width: 4px ! important;\n",
       "}\n",
       "\n",
       "span.hlbx {\n",
       "\tborder-color: var(--hl-rim);\n",
       "\tborder-width: 4px ! important;\n",
       "\tborder-style: solid;\n",
       "\tborder-radius: 6px;\n",
       "  padding: 4px;\n",
       "  margin: 4px;\n",
       "}\n",
       ".ehl {\n",
       "  background-color: var(--ehl-strong);\n",
       "}\n",
       "\n",
       ":root {\n",
       "\t--hl-strong:        hsla( 60, 100%,  70%, 0.9  );\n",
       "\t--hl-rim:           hsla( 55,  80%,  50%, 1.0  );\n",
       "\t--ehl-strong:       hsla(240, 100%,  70%, 0.9  );\n",
       "}\n",
       "</style>"
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# The following will push the Text-Fabric stylesheet to this notebook (to facilitate proper display with notebook viewer)\n",
    "N1904.dh(N1904.getCss())"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "id": "80c5a250-0785-46ed-bd51-c8e3e29205f6",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# Set default view in a way to limit noise as much as possible.\n",
    "N1904.displaySetup(condensed=True, multiFeatures=False, queryFeatures=False)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "58ef1678-a19d-4c0c-80f3-84f8471a90e2",
   "metadata": {
    "tags": []
   },
   "source": [
    "# 3 - Performing the queries <a class=\"anchor\" id=\"bullet3\"></a>\n",
    "##### [Back to TOC](#TOC)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "b59c83bd-329d-4820-8bcc-ca92e1c55f6d",
   "metadata": {},
   "source": [
    "## 3.1 - The 25 most frequent words in the corpus<a class=\"anchor\" id=\"bullet3x1\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "The method [`freqList`](https://annotation.github.io/text-fabric/tf/core/nodefeature.html#tf.core.nodefeature.NodeFeature.freqList) returns A tuple of (value, frequency), items, ordered by frequency, highest frequencies first."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "id": "1d4b1b93-08e5-41f4-a587-66e444a3e271",
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Amount\tword\n",
      "8545\tκαὶ\n",
      "2769\tὁ\n",
      "2684\tἐν\n",
      "2620\tδὲ\n",
      "2497\tτοῦ\n",
      "1755\tεἰς\n",
      "1658\tτὸ\n",
      "1556\tτὸν\n",
      "1518\tτὴν\n",
      "1411\tαὐτοῦ\n",
      "1300\tτῆς\n",
      "1281\tὅτι\n",
      "1221\tτῷ\n",
      "1201\tτῶν\n",
      "1069\tοἱ\n",
      "941\tἡ\n",
      "921\tγὰρ\n",
      "902\tμὴ\n",
      "859\tτῇ\n",
      "849\tαὐτῷ\n",
      "817\tτὰ\n",
      "767\tοὐκ\n",
      "722\tτοὺς\n",
      "689\tΘεοῦ\n",
      "670\tπρὸς\n"
     ]
    }
   ],
   "source": [
    "print(\"Amount\\tword\")\n",
    "for (w, amount) in F.word.freqList(\"word\")[0:25]:\n",
    "    print(f\"{amount}\\t{w}\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "211b2bde-002b-4243-87c9-4bd850868354",
   "metadata": {
    "jupyter": {
     "outputs_hidden": true
    },
    "tags": []
   },
   "source": [
    "## 3.2 - Frequency of characters in corpus <a class=\"anchor\" id=\"bullet3x2\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "This code generates a table that displays the frequency of characters within the Text-Fabric corpus. The API call 'C.characters.data' produces a Python dictionary structure that contains the data. The remaining code unpacks and sorts this structure to present the results in a formated table. \n",
    "\n",
    "Note the first line of the output is 'Format:  text-orig-full'. This "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "id": "b8e8ce2d-43db-48dd-ace9-2156c7046692",
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Format:  text-critical\n",
      "╒═════════════╤═════════════╕\n",
      "│ character   │   frequency │\n",
      "╞═════════════╪═════════════╡\n",
      "│ ν           │       56230 │\n",
      "├─────────────┼─────────────┤\n",
      "│ α           │       51892 │\n",
      "├─────────────┼─────────────┤\n",
      "│ τ           │       50599 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ο           │       45151 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ε           │       38597 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ς           │       27090 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ι           │       26131 │\n",
      "├─────────────┼─────────────┤\n",
      "│ σ           │       24095 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ρ           │       22871 │\n",
      "├─────────────┼─────────────┤\n",
      "│ κ           │       22630 │\n",
      "├─────────────┼─────────────┤\n",
      "│ π           │       20308 │\n",
      "├─────────────┼─────────────┤\n",
      "│ μ           │       19218 │\n",
      "├─────────────┼─────────────┤\n",
      "│ λ           │       18228 │\n",
      "├─────────────┼─────────────┤\n",
      "│ δ           │       12476 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ἐ           │       12116 │\n",
      "╘═════════════╧═════════════╛\n"
     ]
    },
    {
     "data": {
      "text/markdown": [
       "**Warning: table truncated!**"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Format:  text-normalized\n",
      "╒═════════════╤═════════════╕\n",
      "│ character   │   frequency │\n",
      "╞═════════════╪═════════════╡\n",
      "│             │      137779 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ν           │       56230 │\n",
      "├─────────────┼─────────────┤\n",
      "│ α           │       52127 │\n",
      "├─────────────┼─────────────┤\n",
      "│ τ           │       50599 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ο           │       45516 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ε           │       38807 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ς           │       27090 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ι           │       26404 │\n",
      "├─────────────┼─────────────┤\n",
      "│ σ           │       24095 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ρ           │       22871 │\n",
      "├─────────────┼─────────────┤\n",
      "│ κ           │       22630 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ί           │       21518 │\n",
      "├─────────────┼─────────────┤\n",
      "│ π           │       20308 │\n",
      "├─────────────┼─────────────┤\n",
      "│ μ           │       19218 │\n",
      "├─────────────┼─────────────┤\n",
      "│ λ           │       18228 │\n",
      "╘═════════════╧═════════════╛\n"
     ]
    },
    {
     "data": {
      "text/markdown": [
       "**Warning: table truncated!**"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Format:  text-orig-full\n",
      "╒═════════════╤═════════════╕\n",
      "│ character   │   frequency │\n",
      "╞═════════════╪═════════════╡\n",
      "│             │      137779 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ν           │       56230 │\n",
      "├─────────────┼─────────────┤\n",
      "│ α           │       51892 │\n",
      "├─────────────┼─────────────┤\n",
      "│ τ           │       50599 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ο           │       45151 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ε           │       38597 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ς           │       27090 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ι           │       26131 │\n",
      "├─────────────┼─────────────┤\n",
      "│ σ           │       24095 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ρ           │       22871 │\n",
      "├─────────────┼─────────────┤\n",
      "│ κ           │       22630 │\n",
      "├─────────────┼─────────────┤\n",
      "│ π           │       20308 │\n",
      "├─────────────┼─────────────┤\n",
      "│ μ           │       19218 │\n",
      "├─────────────┼─────────────┤\n",
      "│ λ           │       18228 │\n",
      "├─────────────┼─────────────┤\n",
      "│ δ           │       12476 │\n",
      "╘═════════════╧═════════════╛\n"
     ]
    },
    {
     "data": {
      "text/markdown": [
       "**Warning: table truncated!**"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Format:  text-transliterated\n",
      "╒═════════════╤═════════════╕\n",
      "│ character   │   frequency │\n",
      "╞═════════════╪═════════════╡\n",
      "│             │      137779 │\n",
      "├─────────────┼─────────────┤\n",
      "│ e           │       93371 │\n",
      "├─────────────┼─────────────┤\n",
      "│ o           │       87008 │\n",
      "├─────────────┼─────────────┤\n",
      "│ a           │       75119 │\n",
      "├─────────────┼─────────────┤\n",
      "│ i           │       62778 │\n",
      "├─────────────┼─────────────┤\n",
      "│ t           │       60011 │\n",
      "├─────────────┼─────────────┤\n",
      "│ n           │       56230 │\n",
      "├─────────────┼─────────────┤\n",
      "│ s           │       52132 │\n",
      "├─────────────┼─────────────┤\n",
      "│ u           │       39287 │\n",
      "├─────────────┼─────────────┤\n",
      "│ k           │       27300 │\n",
      "├─────────────┼─────────────┤\n",
      "│ p           │       25081 │\n",
      "├─────────────┼─────────────┤\n",
      "│ r           │       22871 │\n",
      "├─────────────┼─────────────┤\n",
      "│ h           │       20033 │\n",
      "├─────────────┼─────────────┤\n",
      "│ m           │       19218 │\n",
      "├─────────────┼─────────────┤\n",
      "│ l           │       18228 │\n",
      "╘═════════════╧═════════════╛\n"
     ]
    },
    {
     "data": {
      "text/markdown": [
       "**Warning: table truncated!**"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Format:  text-unaccented\n",
      "╒═════════════╤═════════════╕\n",
      "│ character   │   frequency │\n",
      "╞═════════════╪═════════════╡\n",
      "│             │      137779 │\n",
      "├─────────────┼─────────────┤\n",
      "│ α           │       75119 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ε           │       66656 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ο           │       65731 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ι           │       62834 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ν           │       56230 │\n",
      "├─────────────┼─────────────┤\n",
      "│ τ           │       50599 │\n",
      "├─────────────┼─────────────┤\n",
      "│ υ           │       39287 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ς           │       27090 │\n",
      "├─────────────┼─────────────┤\n",
      "│ η           │       26715 │\n",
      "├─────────────┼─────────────┤\n",
      "│ σ           │       24095 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ρ           │       23046 │\n",
      "├─────────────┼─────────────┤\n",
      "│ κ           │       22630 │\n",
      "├─────────────┼─────────────┤\n",
      "│ ω           │       21277 │\n",
      "├─────────────┼─────────────┤\n",
      "│ π           │       20308 │\n",
      "╘═════════════╧═════════════╛\n"
     ]
    },
    {
     "data": {
      "text/markdown": [
       "**Warning: table truncated!**"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "# Library to format table\n",
    "from tabulate import tabulate\n",
    "\n",
    "# The following API call will result in a Python dictionary structure\n",
    "FrequencyDictionary=C.characters.data\n",
    "\n",
    "# Present the results\n",
    "KeyList = list(FrequencyDictionary.keys())\n",
    "for Key in KeyList:\n",
    "    print('Format: ',Key)\n",
    "    # 'key' refers to the pre-defined formats the text will be displayed\n",
    "    FrequencyList=FrequencyDictionary[Key]\n",
    "    SortedFrequencyList=sorted(FrequencyList, key=lambda x: x[1], reverse=True)\n",
    "    \n",
    "    # In this example the table will be truncated to the first 15 entries\n",
    "    max_rows = 15  # Set your desired number of rows here\n",
    "    TruncatedTable = SortedFrequencyList[:max_rows]\n",
    "    \n",
    "    headers = [\"character\", \"frequency\"]\n",
    "    print(tabulate(TruncatedTable, headers=headers, tablefmt='fancy_grid'))\n",
    "    \n",
    "    # Add a warning using markdown (API call A.dm) allowing it to be printed in bold type\n",
    "    N1904.dm(\"**Warning: table truncated!**\")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "75627859-1d9c-4d99-9020-d2302f6de408",
   "metadata": {},
   "source": [
    "## 3.3 - Some stats on node types <a class=\"anchor\" id=\"bullet3x3\"></a>\n",
    "##### [Back to TOC](#TOC)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "id": "b5ce40f1-9a22-444f-955a-c5545797a056",
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(('book', 5102.925925925926, 137780, 137806),\n",
       " ('chapter', 529.9192307692308, 137807, 138066),\n",
       " ('verse', 17.345965000629484, 146078, 154020),\n",
       " ('sentence', 17.198726750717764, 138067, 146077),\n",
       " ('wg', 7.583849727185382, 154021, 267467),\n",
       " ('word', 1, 1, 137779))"
      ]
     },
     "execution_count": 8,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "C.levels.data"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "f6ad9acc-92e3-47b9-bfaf-8c06ed33ada4",
   "metadata": {
    "tags": []
   },
   "source": [
    "## 3.4 - The available text formats <a class=\"anchor\" id=\"bullet3x4\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "Not particular a statistic function, but still important in relation to the corpus. The output of this command provides details on available formats to present the text of the corpus. See also [module tf.advanced.options\n",
    "Display Settings](https://annotation.github.io/text-fabric/tf/advanced/options.html)."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "id": "97137d58-68cb-4383-a545-5668e603493f",
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "data": {
      "text/markdown": [
       "format | level | template\n",
       "--- | --- | ---\n",
       "`text-critical` | **word** | `{unicode} `\n",
       "`text-normalized` | **word** | `{normalized}{after}`\n",
       "`text-orig-full` | **word** | `{word}{after}`\n",
       "`text-transliterated` | **word** | `{wordtranslit}{after}`\n",
       "`text-unaccented` | **word** | `{wordunacc}{after}`\n"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "N1904.showFormats()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "1bae482a-9abb-4280-a52c-b3011037fded",
   "metadata": {},
   "source": [
    "The same result (although formatted different) can be obtained by the following call:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "id": "acaaf356-eeae-4101-b5ef-090607dca5fc",
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "{'text-critical': 'word',\n",
       " 'text-normalized': 'word',\n",
       " 'text-orig-full': 'word',\n",
       " 'text-transliterated': 'word',\n",
       " 'text-unaccented': 'word'}"
      ]
     },
     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "T.formats"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "76294b50-192f-47e0-95c2-09a1ca79fe17",
   "metadata": {},
   "source": [
    "Note that this data originates from file `otext.tf`:\n",
    "\n",
    "> \n",
    "```\n",
    "@config\n",
    "...\n",
    "@fmt:text-orig-full={word}{after}\n",
    "...\n",
    "```\n"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "d23c6817",
   "metadata": {},
   "source": [
    "## 3.5 - List of feature frequencies <a class=\"anchor\" id=\"bullet3x5\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "This code generates a lot of output! For that reason we will cut it off after 5 lines per feature."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "id": "75b2827e-81e1-4e28-a46c-bd50bc56a5aa",
   "metadata": {
    "tags": []
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Feature: after \n",
      "\n",
      "\t value\t frequency\n",
      "\t   \t 119270\n",
      "\t ,  \t 9462\n",
      "\t .  \t 5717\n",
      "\t ·  \t 2359\n",
      "\t ;  \t 971\n",
      "\n",
      "\n",
      "Feature: book \n",
      "\n",
      "\t value\t frequency\n",
      "\t Luke \t 21785\n",
      "\t Matthew \t 20529\n",
      "\t Acts \t 20307\n",
      "\t John \t 17582\n",
      "\t Mark \t 12695\n",
      "\n",
      "\n",
      "Feature: booknumber \n",
      "\n",
      "\t value\t frequency\n",
      "\t 3 \t 19457\n",
      "\t 5 \t 18394\n",
      "\t 1 \t 18300\n",
      "\t 4 \t 15644\n",
      "\t 2 \t 11278\n",
      "\n",
      "\n",
      "Feature: bookshort \n",
      "\n",
      "\t value\t frequency\n",
      "\t Luke \t 19457\n",
      "\t Acts \t 18394\n",
      "\t Matt \t 18300\n",
      "\t John \t 15644\n",
      "\t Mark \t 11278\n",
      "\n",
      "\n",
      "Feature: case \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 58261\n",
      "\t nominative \t 24197\n",
      "\t accusative \t 23031\n",
      "\t genitive \t 19515\n",
      "\t dative \t 12126\n",
      "\n",
      "\n",
      "Feature: chapter \n",
      "\n",
      "\t value\t frequency\n",
      "\t 1 \t 12922\n",
      "\t 2 \t 10923\n",
      "\t 3 \t 9652\n",
      "\t 4 \t 9631\n",
      "\t 5 \t 8788\n",
      "\n",
      "\n",
      "Feature: clausetype \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 102662\n",
      "\t VerbElided \t 1009\n",
      "\t Verbless \t 929\n",
      "\t Minor \t 830\n",
      "\n",
      "\n",
      "Feature: containedclause \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 8372\n",
      "\t 2 \t 148\n",
      "\t 172 \t 69\n",
      "\t 97 \t 69\n",
      "\t 389 \t 68\n",
      "\n",
      "\n",
      "Feature: degree \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 137266\n",
      "\t comparative \t 313\n",
      "\t superlative \t 200\n",
      "\n",
      "\n",
      "Feature: gloss \n",
      "\n",
      "\t value\t frequency\n",
      "\t the \t 9857\n",
      "\t and \t 6212\n",
      "\t - \t 5496\n",
      "\t in \t 2320\n",
      "\t And \t 2218\n",
      "\n",
      "\n",
      "Feature: gn \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 63804\n",
      "\t masculine \t 41486\n",
      "\t feminine \t 18736\n",
      "\t neuter \t 13753\n",
      "\n",
      "\n",
      "Feature: headverse \n",
      "\n",
      "\t value\t frequency\n",
      "\t 1 \t 298\n",
      "\t 7 \t 270\n",
      "\t 12 \t 267\n",
      "\t 9 \t 264\n",
      "\t 13 \t 260\n",
      "\n",
      "\n",
      "Feature: junction \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 103128\n",
      "\t apposition \t 2302\n",
      "\n",
      "\n",
      "Feature: lemma \n",
      "\n",
      "\t value\t frequency\n",
      "\t ὁ \t 19783\n",
      "\t καί \t 8978\n",
      "\t αὐτός \t 5561\n",
      "\t σύ \t 2892\n",
      "\t δέ \t 2787\n",
      "\n",
      "\n",
      "Feature: lex_dom \n",
      "\n",
      "\t value\t frequency\n",
      "\t 092004 \t 26322\n",
      "\t  \t 10487\n",
      "\t 089017 \t 4370\n",
      "\t 093001 \t 3672\n",
      "\t 033006 \t 3225\n",
      "\n",
      "\n",
      "Feature: ln \n",
      "\n",
      "\t value\t frequency\n",
      "\t 92.24 \t 19781\n",
      "\t  \t 10488\n",
      "\t 92.11 \t 4718\n",
      "\t 89.92 \t 2903\n",
      "\t 89.87 \t 2756\n",
      "\n",
      "\n",
      "Feature: markafter \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 137728\n",
      "\t — \t 31\n",
      "\t ) \t 11\n",
      "\t ]] \t 7\n",
      "\t ( \t 1\n",
      "\n",
      "\n",
      "Feature: markbefore \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 137745\n",
      "\t — \t 16\n",
      "\t ( \t 10\n",
      "\t [[ \t 7\n",
      "\t [ \t 1\n",
      "\n",
      "\n",
      "Feature: markorder \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 137694\n",
      "\t 0 \t 34\n",
      "\t 3 \t 32\n",
      "\t 2 \t 10\n",
      "\t 1 \t 9\n",
      "\n",
      "\n",
      "Feature: monad \n",
      "\n",
      "\t value\t frequency\n",
      "\t 1 \t 1\n",
      "\t 2 \t 1\n",
      "\t 3 \t 1\n",
      "\t 4 \t 1\n",
      "\t 5 \t 1\n",
      "\n",
      "\n",
      "Feature: mood \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 109422\n",
      "\t indicative \t 15617\n",
      "\t participle \t 6653\n",
      "\t infinitive \t 2285\n",
      "\t imperative \t 1877\n",
      "\n",
      "\n",
      "Feature: morph \n",
      "\n",
      "\t value\t frequency\n",
      "\t CONJ \t 16316\n",
      "\t PREP \t 10568\n",
      "\t ADV \t 3808\n",
      "\t N-NSM \t 3475\n",
      "\t N-GSM \t 2935\n",
      "\n",
      "\n",
      "Feature: nodeID \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 52046\n",
      "\t common \t 14186\n",
      "\t personal \t 6040\n",
      "\t proper \t 2192\n",
      "\t relative \t 885\n",
      "\n",
      "\n",
      "Feature: normalized \n",
      "\n",
      "\t value\t frequency\n",
      "\t καί \t 8576\n",
      "\t ὁ \t 2769\n",
      "\t δέ \t 2764\n",
      "\t ἐν \t 2684\n",
      "\t τοῦ \t 2497\n",
      "\n",
      "\n",
      "Feature: nu \n",
      "\n",
      "\t value\t frequency\n",
      "\t singular \t 69846\n",
      "\t  \t 38842\n",
      "\t plural \t 29091\n",
      "\n",
      "\n",
      "Feature: number \n",
      "\n",
      "\t value\t frequency\n",
      "\t singular \t 69846\n",
      "\t  \t 38842\n",
      "\t plural \t 29091\n",
      "\n",
      "\n",
      "Feature: person \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 118360\n",
      "\t third \t 12747\n",
      "\t second \t 3729\n",
      "\t first \t 2943\n",
      "\n",
      "\n",
      "Feature: punctuation \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 119270\n",
      "\t , \t 9462\n",
      "\t . \t 5717\n",
      "\t · \t 2359\n",
      "\t ; \t 971\n",
      "\n",
      "\n",
      "Feature: ref \n",
      "\n",
      "\t value\t frequency\n",
      "\t 1CO 10:1!1 \t 1\n",
      "\t 1CO 10:1!10 \t 1\n",
      "\t 1CO 10:1!11 \t 1\n",
      "\t 1CO 10:1!12 \t 1\n",
      "\t 1CO 10:1!13 \t 1\n",
      "\n",
      "\n",
      "Feature: reference \n",
      "\n",
      "\t value\t frequency\n",
      "\t 1CO 10:1!1 \t 1\n",
      "\t 1CO 10:1!10 \t 1\n",
      "\t 1CO 10:1!11 \t 1\n",
      "\t 1CO 10:1!12 \t 1\n",
      "\t 1CO 10:1!13 \t 1\n",
      "\n",
      "\n",
      "Feature: roleclausedistance \n",
      "\n",
      "\t value\t frequency\n",
      "\t 0 \t 56129\n",
      "\t 1 \t 37597\n",
      "\t 2 \t 22297\n",
      "\t 3 \t 12084\n",
      "\t 4 \t 5277\n",
      "\n",
      "\n",
      "Feature: sentence \n",
      "\n",
      "\t value\t frequency\n",
      "\t 3 \t 1130\n",
      "\t 4 \t 987\n",
      "\t 1 \t 810\n",
      "\t 5 \t 774\n",
      "\t 6 \t 707\n",
      "\n",
      "\n",
      "Feature: sp \n",
      "\n",
      "\t value\t frequency\n",
      "\t noun \t 28455\n",
      "\t verb \t 28357\n",
      "\t det \t 19786\n",
      "\t conj \t 18227\n",
      "\t pron \t 16177\n",
      "\n",
      "\n",
      "Feature: sp_full \n",
      "\n",
      "\t value\t frequency\n",
      "\t Noun \t 28455\n",
      "\t Verb \t 28357\n",
      "\t Determiner \t 19786\n",
      "\t Conjunction \t 18227\n",
      "\t Pronoun \t 16177\n",
      "\n",
      "\n",
      "Feature: strongs \n",
      "\n",
      "\t value\t frequency\n",
      "\t 3588 \t 19783\n",
      "\t 2532 \t 8978\n",
      "\t 846 \t 5561\n",
      "\t 4771 \t 2892\n",
      "\t 1161 \t 2787\n",
      "\n",
      "\n",
      "Feature: subj_ref \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 121204\n",
      "\t n46003022002 \t 172\n",
      "\t n66001009002 \t 131\n",
      "\t n45001001001 \t 104\n",
      "\t n47010001004 \t 104\n",
      "\n",
      "\n",
      "Feature: tense \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 109422\n",
      "\t aorist \t 11803\n",
      "\t present \t 11579\n",
      "\t imperfect \t 1689\n",
      "\t future \t 1626\n",
      "\n",
      "\n",
      "Feature: type \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 93321\n",
      "\t common \t 23644\n",
      "\t personal \t 11521\n",
      "\t proper \t 4639\n",
      "\t demonstrative \t 1722\n",
      "\n",
      "\n",
      "Feature: unicode \n",
      "\n",
      "\t value\t frequency\n",
      "\t καὶ \t 8541\n",
      "\t ὁ \t 2768\n",
      "\t ἐν \t 2683\n",
      "\t δὲ \t 2619\n",
      "\t τοῦ \t 2497\n",
      "\n",
      "\n",
      "Feature: verse \n",
      "\n",
      "\t value\t frequency\n",
      "\t 10 \t 4928\n",
      "\t 12 \t 4910\n",
      "\t 4 \t 4800\n",
      "\t 9 \t 4800\n",
      "\t 1 \t 4793\n",
      "\n",
      "\n",
      "Feature: voice \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 109422\n",
      "\t active \t 20742\n",
      "\t passive \t 3493\n",
      "\t middle \t 2408\n",
      "\t middlepassive \t 1714\n",
      "\n",
      "\n",
      "Feature: wgclass \n",
      "\n",
      "\t value\t frequency\n",
      "\t np \t 33710\n",
      "\t cl \t 30857\n",
      "\t cl* \t 16378\n",
      "\t  \t 12760\n",
      "\t pp \t 11169\n",
      "\n",
      "\n",
      "Feature: wglevel \n",
      "\n",
      "\t value\t frequency\n",
      "\t 5 \t 16862\n",
      "\t 4 \t 16527\n",
      "\t 6 \t 15520\n",
      "\t 7 \t 12162\n",
      "\t 3 \t 10442\n",
      "\n",
      "\n",
      "Feature: wgnum \n",
      "\n",
      "\t value\t frequency\n",
      "\t 2 \t 27\n",
      "\t 3 \t 27\n",
      "\t 4 \t 27\n",
      "\t 5 \t 27\n",
      "\t 6 \t 27\n",
      "\n",
      "\n",
      "Feature: wgrole \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 69235\n",
      "\t adv \t 16710\n",
      "\t o \t 9329\n",
      "\t s \t 6710\n",
      "\t p \t 1770\n",
      "\n",
      "\n",
      "Feature: wgrolelong \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 69263\n",
      "\t Adverbial \t 16710\n",
      "\t Object \t 9329\n",
      "\t Subject \t 6710\n",
      "\t Predicate \t 1770\n",
      "\n",
      "\n",
      "Feature: wgrule \n",
      "\n",
      "\t value\t frequency\n",
      "\t DetNP \t 15696\n",
      "\t  \t 14701\n",
      "\t PrepNp \t 11044\n",
      "\t NPofNP \t 6819\n",
      "\t Conj-CL \t 5571\n",
      "\n",
      "\n",
      "Feature: wgtype \n",
      "\n",
      "\t value\t frequency\n",
      "\t  \t 92932\n",
      "\t group \t 9699\n",
      "\t apposition \t 2799\n",
      "\n",
      "\n",
      "Feature: word \n",
      "\n",
      "\t value\t frequency\n",
      "\t καὶ \t 8545\n",
      "\t ὁ \t 2769\n",
      "\t ἐν \t 2684\n",
      "\t δὲ \t 2620\n",
      "\t τοῦ \t 2497\n",
      "\n",
      "\n",
      "Feature: wordlevel \n",
      "\n",
      "\t value\t frequency\n",
      "\t 6 \t 21857\n",
      "\t 7 \t 20984\n",
      "\t 5 \t 20538\n",
      "\t 8 \t 16755\n",
      "\t 9 \t 12772\n",
      "\n",
      "\n",
      "Feature: wordrole \n",
      "\n",
      "\t value\t frequency\n",
      "\t adv \t 41598\n",
      "\t v \t 25817\n",
      "\t s \t 22908\n",
      "\t o \t 21929\n",
      "\t  \t 9347\n",
      "\n",
      "\n",
      "Feature: wordrolelong \n",
      "\n",
      "\t value\t frequency\n",
      "\t Adverbial \t 41598\n",
      "\t Verbal \t 25817\n",
      "\t Subject \t 22908\n",
      "\t Object \t 21929\n",
      "\t  \t 9347\n",
      "\n",
      "\n",
      "Feature: wordtranslit \n",
      "\n",
      "\t value\t frequency\n",
      "\t kai \t 8576\n",
      "\t en \t 3152\n",
      "\t o \t 3149\n",
      "\t to \t 2885\n",
      "\t de \t 2769\n",
      "\n",
      "\n",
      "Feature: wordunacc \n",
      "\n",
      "\t value\t frequency\n",
      "\t και \t 8576\n",
      "\t ο \t 3019\n",
      "\t δε \t 2764\n",
      "\t εν \t 2752\n",
      "\t του \t 2497\n",
      "\n",
      "\n"
     ]
    }
   ],
   "source": [
    "FeatureList=Fall()\n",
    "LinesToPrint=5\n",
    "for Feature in FeatureList: \n",
    "    if Feature!='otype':\n",
    "        print ('Feature:',Feature,'\\n\\n\\t value\\t frequency')\n",
    "        FeatureFrequenceLists=Fs(Feature).freqList()\n",
    "        PrintedLine=0\n",
    "        for item, freq in FeatureFrequenceLists:\n",
    "            PrintedLine+=1\n",
    "            print ('\\t',item,'\\t',freq)\n",
    "            if PrintedLine==LinesToPrint: break\n",
    "        print ('\\n')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "cba64820-a3e6-4b40-8a25-e0f95f2fd66e",
   "metadata": {
    "tags": []
   },
   "source": [
    "## 3.6 - Frequency list of punctuations <a class=\"anchor\" id=\"bullet3x6\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "Make a list of punctuations with their Unicode values. Here, the function used is for printing markdown-formatted strings, although the desired result has not yet been achieved."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "id": "c797fa57-d536-4471-b44d-d3a45653f34a",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/markdown": [
       " String | Unicode | Frequency\n",
       "--- | --- | ---"
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/markdown": [
       " ` ` | 32 | 119272 "
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/markdown": [
       " `,` | 44 | 9441 "
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/markdown": [
       " `.` | 46 | 5712 "
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/markdown": [
       " `·` | 183 | 2355 "
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/markdown": [
       " `;` | 59 | 969 "
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    },
    {
     "data": {
      "text/markdown": [
       " `—` | 8212 | 30 "
      ],
      "text/plain": [
       "<IPython.core.display.Markdown object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "result = F.after.freqList()\n",
    "N1904.dm(\" String | Unicode | Frequency\\n--- | --- | ---\")\n",
    "for (string, freq) in result:\n",
    "    # important: string does contain two characters in case of punctuations\n",
    "    frequency=str(freq)             #convert it to a string\n",
    "    unicode_value = str(ord(string[0])) #convert it to a string\n",
    "    N1904.dm(\" `{}` | {} | {} \".format(string[0],unicode_value,frequency))  "
   ]
  },
  {
   "cell_type": "markdown",
   "id": "b3cbf04f",
   "metadata": {},
   "source": [
    "## 3.7 - Node number ranges <a class=\"anchor\" id=\"bullet3x7\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "The node number ranges are readily available by calling `F.otype.all` which returns a list of all node types. "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 26,
   "id": "20dd1920",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "book (137780, 137806)\n",
      "chapter (137807, 138066)\n",
      "verse (146078, 154020)\n",
      "sentence (138067, 146077)\n",
      "wg (154021, 268899)\n",
      "word (1, 137779)\n"
     ]
    }
   ],
   "source": [
    "for NodeType in F.otype.all:\n",
    "    print (NodeType, F.otype.sInterval(NodeType))"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "86e62381-0fdd-4e56-8855-11e8c73aec7e",
   "metadata": {},
   "source": [
    "## 3.8 - Count the objects per type <a class=\"anchor\" id=\"bullet3x8\"></a>\n",
    "##### [Back to TOC](#TOC)\n",
    "\n",
    "Using the same API call, we can produce also another list where we are counting the number of nodes for each type."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 27,
   "id": "dc4b5cae-9f19-4a42-aa9e-6decf3df4c2f",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "     27 books\n",
      "    260 chapters\n",
      "   7943 verses\n",
      "   8011 sentences\n",
      " 114879 wgs\n",
      " 137779 words\n"
     ]
    }
   ],
   "source": [
    "for otype in F.otype.all:\n",
    "    i = 0\n",
    "    for n in F.otype.s(otype):\n",
    "        i += 1\n",
    "    print (\"{:>7} {}s\".format(i, otype))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "id": "c5730f29-e9d8-4483-9493-b31b7efbdafd",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "\n",
       "    <div class=\"pline\">      <div class=\"pname\">Job:</div><div class=\"pval\">Ellipsis</div>\n",
       "    </div>\n",
       "    <div class=\"pline\">\n",
       "      <div class=\"pname\">Author:</div><div class=\"pval\">program author</div>\n",
       "    </div>\n",
       "    <div class=\"pline\">\n",
       "      <div class=\"pname\">Created:</div><div class=\"pval\">2023-07-28T23:07:21+02:00</div>\n",
       "    </div>\n",
       "        <div class=\"pline\">\n",
       "      <div class=\"pname\">Data:</div>\n",
       "      <div class=\"pval\">Nestle 1904</div>\n",
       "    </div>\n",
       "    <div class=\"p2line\">\n",
       "      <div class=\"pname\">version</div>\n",
       "      <div class=\"pval\">0.5</div>\n",
       "    </div>\n",
       "    <div class=\"p2line\">\n",
       "      <div class=\"pname\">release</div>\n",
       "      <div class=\"pval\">none</div>\n",
       "    </div>\n",
       "    <div class=\"p2line\">\n",
       "      <div class=\"pname\">download</div>\n",
       "      <div class=\"pval\"><a href=\"https://github.com/tonyjurg/Nestle1904LFT/tree/None/tf\">tonyjurg/Nestle1904LFT/tf v:0.5(unknown release or commit)</a></div>\n",
       "    </div>\n",
       "    <div class=\"p2line\">\n",
       "      <div class=\"pname\">DOI</div>\n",
       "      <div class=\"pval\">no DOI</div>\n",
       "    </div>\n",
       "    \n",
       "    <div class=\"pline\">\n",
       "      <div class=\"pname\">Tool:</div>\n",
       "      <div class=\"pval\">Text-Fabric 11.4.10 <a href=\"https://doi.org/10.5281/zenodo.592193\">10.5281/zenodo.592193</a></div>\n",
       "    </div>\n",
       "        <div class=\"pline\">\n",
       "      <div class=\"pname\">TF App:</div>\n",
       "      <div class=\"pval\">tonyjurg/Nestle1904LFT on GitHub</div>\n",
       "    </div>\n",
       "    <div class=\"p2line\">\n",
       "      <div class=\"pname\">commit</div>\n",
       "      <div class=\"pval\"><a href=\"https://github.com/tonyjurg/Nestle1904LFT/tree/f2eb5e2b0f8805ad720d91a5cb9e2aa2fdc6c99a\">f2eb5e2b0f8805ad720d91a5cb9e2aa2fdc6c99a</a></div>\n",
       "    </div>  "
      ],
      "text/plain": [
       "<IPython.core.display.HTML object>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "N1904.showProvenance(...)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "68b0b53c-fc49-4ad1-8945-5f26dfd818dc",
   "metadata": {},
   "source": [
    "## 3.9 - Obtain meta data for a feature <a class=\"anchor\" id=\"bullet3x9\"></a>\n",
    "##### [Back to TOC](#TOC)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "6de718fc-1823-413a-9c09-b9f70f014c7a",
   "metadata": {},
   "outputs": [],
   "source": [
    "This can be usefull if you want to process all feature in a script."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "id": "07370a48-c263-4910-bd1a-bc4e46c73a07",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "{'Availability': 'Creative Commons Attribution 4.0 International (CC BY 4.0)', 'Converter_author': 'Tony Jurg, ReMa Student Vrije Universiteit Amsterdam, Netherlands', 'Converter_execution': 'Tony Jurg, ReMa Student Vrije Universiteit Amsterdam, Netherlands', 'Converter_version': '0.3', 'Convertor_source': 'https://github.com/tonyjurg/Nestle1904LFT/tree/main/tools', 'Data source': 'MACULA Greek Linguistic Datasets, available at https://github.com/Clear-Bible/macula-greek/tree/main/Nestle1904/lowfat', 'Editors': 'Eberhard Nestle', 'Name': 'Greek New Testament (Nestle 1904 based on Low Fat Tree)', 'TextFabric version': '11.4.10', 'description': 'Word as it appears in the text (excl. punctuations)', 'valueType': 'str', 'writtenBy': 'Text-Fabric', 'dateWritten': '2023-06-19T15:13:46Z'}\n"
     ]
    }
   ],
   "source": [
    "# Just print the structured tuple returned by the function call\n",
    "FeatureName='word'\n",
    "MetaData=Fs(FeatureName).meta\n",
    "print (MetaData)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "c9fc2cac-b1f6-430e-b900-82fd7fece295",
   "metadata": {},
   "source": [
    "Now do some very basic calculation with the data:"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "id": "cbe58101-e241-44e0-9a36-aeef8aa47bc6",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "feature  word is of type str.\n"
     ]
    }
   ],
   "source": [
    "print ('feature ',FeatureName, end='')\n",
    "if MetaData['valueType']=='str':\n",
    "    print (' is of type str.')\n",
    "else:\n",
    "    print (' is not of type str.')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "08c67b53-bd6c-42e6-a0cf-b7f609cd9879",
   "metadata": {},
   "source": [
    "# trying the various formats"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "cf68d1b9-cbec-470a-8726-31ef9a475603",
   "metadata": {},
   "outputs": [],
   "source": [
    "origText=T.text(node,fmt='text-orig-full')\n",
    "critText=T.text(node,fmt='text-critical-signs')\n",
    "\n",
    "        'fmt:text-orig-full':     '{word}{after}',\n",
    "        'fmt:text-normalized':    '{normalized}{after}',\n",
    "        'fmt:text-unaccented':    '{wordunacc}{after}',\n",
    "        'fmt:text-transliterated':'{wordtranslit}{after}', \n",
    "        'fmt:text-critical':  "
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.11.5"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}