{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "\n", "\n", "\n", "You might want to consider the [start](search.ipynb) of this tutorial.\n", "\n", "Short introductions to other TF datasets:\n", "\n", "* [Dead Sea Scrolls](https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/lorentz2020/dss.ipynb),\n", "* [Old Babylonian Letters](https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/lorentz2020/oldbabylonian.ipynb),\n", "or the\n", "* [Quran](https://nbviewer.jupyter.org/github/annotation/tutorials/blob/master/lorentz2020/quran.ipynb)\n" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "%load_ext autoreload\n", "%autoreload 2" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:06:39.818664Z", "start_time": "2018-05-24T10:06:39.796588Z" } }, "outputs": [], "source": [ "from tf.app import use" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "data": { "text/html": [ "TF-app: ~/text-fabric-data/etcbc/bhsa/app" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/bhsa/tf/2021" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/phono/tf/2021" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/parallels/tf/2021" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "This is Text-Fabric 9.3.2\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", "122 features found and 0 ignored\n" ] }, { "data": { "text/html": [ "Text-Fabric: Text-Fabric API 9.3.2, etcbc/bhsa/app v3, Search Reference
Data: BHSA, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", "
\n", "
\n", "crossref\n", "
\n", "
int
\n", "
\n", " 🆗 links between similar passages\n", "
\n", "\n", "
\n", "
author:
\n", "
BHSA Data: Constantijn Sikkel; Parallels Notebook: Dirk Roorda, Martijn Naaijer
\n", "
\n", "\n", "
\n", "
coreData:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:40:46Z
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
Parallels notebook, see https://github.com/ETCBC/parallels
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "\n", "
BHSA = Biblia Hebraica Stuttgartensia Amstelodamensis\n", "
\n", "\n", "
\n", "
\n", "book\n", "
\n", "
str
\n", "
\n", " ✅ book name in Latin (Genesis; Numeri; Reges1; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:55Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "book@ll\n", "
\n", "
str
\n", "
\n", " ✅ book name in amharic (ኣማርኛ)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:20:27Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
language:
\n", "
ኣማርኛ
\n", "
\n", "\n", "
\n", "
languageCode:
\n", "
am
\n", "
\n", "\n", "
\n", "
languageEnglish:
\n", "
amharic
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
book names from wikipedia and other sources
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "chapter\n", "
\n", "
int
\n", "
\n", " ✅ chapter number (1; 2; 3; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:55Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "code\n", "
\n", "
int
\n", "
\n", " ✅ identifier of a clause atom relationship (0; 74; 367; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:56Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "det\n", "
\n", "
str
\n", "
\n", " ✅ determinedness of phrase(atom) (det; und; NA.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:56Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "domain\n", "
\n", "
str
\n", "
\n", " ✅ text type of clause (? (Unknown); N (narrative); D (discursive); Q (Quotation).)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:57Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "freq_lex\n", "
\n", "
int
\n", "
\n", " ✅ frequency of lexemes\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:24:45Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
computed on the basis of the ETCBC core set of features
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "function\n", "
\n", "
str
\n", "
\n", " ✅ syntactic function of phrase (Cmpl; Objc; Pred; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:57Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_cons\n", "
\n", "
str
\n", "
\n", " ✅ word consonantal-transliterated (B R>CJT BR> >LHJM ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:57Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_cons_utf8\n", "
\n", "
str
\n", "
\n", " ✅ word consonantal-Hebrew (ב ראשׁית ברא אלהים)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:58Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_lex\n", "
\n", "
str
\n", "
\n", " ✅ lexeme pointed-transliterated (B.:- R;>CIJT B.@R@> >:ELOH ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:58Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_lex_utf8\n", "
\n", "
str
\n", "
\n", " ✅ lexeme pointed-Hebrew (בְּ רֵאשִׁית בָּרָא אֱלֹה)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:17:59Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_word\n", "
\n", "
str
\n", "
\n", " ✅ word pointed-transliterated (B.:- R;>CI73JT B.@R@74> >:ELOHI92JM)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:04Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_word_utf8\n", "
\n", "
str
\n", "
\n", " ✅ word pointed-Hebrew (בְּ רֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:04Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gloss\n", "
\n", "
str
\n", "
\n", " 🆗 english translation of lexeme (beginning create god(s))\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:13Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn\n", "
\n", "
str
\n", "
\n", " ✅ grammatical gender (m; f; NA; unknown.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:05Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "label\n", "
\n", "
str
\n", "
\n", " ✅ (half-)verse label (half verses: A; B; C; verses: GEN 01,02)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:06Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "language\n", "
\n", "
str
\n", "
\n", " ✅ of word or lexeme (Hebrew; Aramaic.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:13Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex\n", "
\n", "
str
\n", "
\n", " ✅ lexeme consonantal-transliterated (B R>CJT/ BR>[ >LHJM/)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:14Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex_utf8\n", "
\n", "
str
\n", "
\n", " ✅ lexeme consonantal-Hebrew (ב ראשׁית֜ ברא אלהים֜)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:15Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ls\n", "
\n", "
str
\n", "
\n", " ✅ lexical set, subclassification of part-of-speech (card; ques; mult)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:15Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nametype\n", "
\n", "
str
\n", "
\n", " ⚠️ named entity type (pers; mens; gens; topo; ppde.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:15Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nme\n", "
\n", "
str
\n", "
\n", " ✅ nominal ending consonantal-transliterated (absent; n/a; JM, ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:08Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu\n", "
\n", "
str
\n", "
\n", " ✅ grammatical number (sg; du; pl; NA; unknown.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:08Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "number\n", "
\n", "
int
\n", "
\n", " ✅ sequence number of an object within its context\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:09Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "otype\n", "
\n", "
str
\n", "
\n", " \n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:15Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "pargr\n", "
\n", "
str
\n", "
\n", " 🆗 hierarchical paragraph number (1; 1.2; 1.2.3.4; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:22:50Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional paragraph file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "pdp\n", "
\n", "
str
\n", "
\n", " ✅ phrase dependent part-of-speech (art; verb; subs; nmpr, ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:10Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "pfm\n", "
\n", "
str
\n", "
\n", " ✅ preformative consonantal-transliterated (absent; n/a; J, ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:11Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "prs\n", "
\n", "
str
\n", "
\n", " ✅ pronominal suffix consonantal-transliterated (absent; n/a; W; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:11Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "prs_gn\n", "
\n", "
str
\n", "
\n", " ✅ pronominal suffix gender (m; f; NA; unknown.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:11Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "prs_nu\n", "
\n", "
str
\n", "
\n", " ✅ pronominal suffix number (sg; du; pl; NA; unknown.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:12Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "prs_ps\n", "
\n", "
str
\n", "
\n", " ✅ pronominal suffix person (p1; p2; p3; NA; unknown.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:12Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps\n", "
\n", "
str
\n", "
\n", " ✅ grammatical person (p1; p2; p3; NA; unknown.)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:12Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "qere\n", "
\n", "
str
\n", "
\n", " ✅ word pointed-transliterated masoretic reading correction\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional ketiv/qere file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "qere_trailer\n", "
\n", "
str
\n", "
\n", " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional ketiv/qere file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "qere_trailer_utf8\n", "
\n", "
str
\n", "
\n", " ✅ interword material -pointed-transliterated (Masoretic correction)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional ketiv/qere file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "qere_utf8\n", "
\n", "
str
\n", "
\n", " ✅ word pointed-Hebrew masoretic reading correction\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:23:29Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional ketiv/qere file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rank_lex\n", "
\n", "
int
\n", "
\n", " ✅ ranking of lexemes based on freqnuecy\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:24:46Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
computed on the basis of the ETCBC core set of features
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rela\n", "
\n", "
str
\n", "
\n", " ✅ linguistic relation between clause/(sub)phrase(atom) (ADJ; MOD; ATR; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:13Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "sp\n", "
\n", "
str
\n", "
\n", " ✅ part-of-speech (art; verb; subs; nmpr, ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:16Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "st\n", "
\n", "
str
\n", "
\n", " ✅ state of a noun (a (absolute); c (construct); e (emphatic).)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:14Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "tab\n", "
\n", "
int
\n", "
\n", " ✅ clause atom: its level in the linguistic embedding\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:16Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "trailer\n", "
\n", "
str
\n", "
\n", " ✅ interword material pointed-transliterated (& 00 05 00_P ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:01Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "trailer_utf8\n", "
\n", "
str
\n", "
\n", " ✅ interword material pointed-Hebrew (־ ׃)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:01Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "txt\n", "
\n", "
str
\n", "
\n", " ✅ text type of clause and surrounding (repetion of ? N D Q as in feature domain)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:16Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "typ\n", "
\n", "
str
\n", "
\n", " ✅ clause/phrase(atom) type (VP; NP; Ellp; Ptcp; WayX)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:16Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "uvf\n", "
\n", "
str
\n", "
\n", " ✅ univalent final consonant consonantal-transliterated (absent; N; J; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:17Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vbe\n", "
\n", "
str
\n", "
\n", " ✅ verbal ending consonantal-transliterated (n/a; W; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:17Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vbs\n", "
\n", "
str
\n", "
\n", " ✅ root formation consonantal-transliterated (absent; n/a; H; ...)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:17Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "verse\n", "
\n", "
int
\n", "
\n", " ✅ verse number\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:18Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "voc_lex\n", "
\n", "
str
\n", "
\n", " ✅ vocalized lexeme pointed-transliterated (B.: R;>CIJT BR> >:ELOHIJM)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:16Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "voc_lex_utf8\n", "
\n", "
str
\n", "
\n", " ✅ vocalized lexeme pointed-Hebrew (בְּ רֵאשִׁית ברא אֱלֹהִים)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:17Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
from additional lexicon file provided by the ETCBC
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vs\n", "
\n", "
str
\n", "
\n", " ✅ verbal stem (qal; piel; hif; apel; pael)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:18Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vt\n", "
\n", "
str
\n", "
\n", " ✅ verbal tense (perf; impv; wayq; infc)\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:18Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "mother\n", "
\n", "
none
\n", "
\n", " ✅ linguistic dependency between textual objects\n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:18:22Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "oslots\n", "
\n", "
none
\n", "
\n", " \n", "
\n", "\n", "
\n", "
author:
\n", "
Eep Talstra Centre for Bible and Computer
\n", "
\n", "\n", "
\n", "
dataset:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
datasetName:
\n", "
Biblia Hebraica Stuttgartensia Amstelodamensis
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:21:17Z
\n", "
\n", "\n", "
\n", "
email:
\n", "
shebanq@ancient-data.org
\n", "
\n", "\n", "
\n", "
encoders:
\n", "
Constantijn Sikkel (QDF), Ulrik Petersen (MQL) and Dirk Roorda (TF)
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
website:
\n", "
https://shebanq.ancient-data.org
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "\n", "
Phonetic Transcriptions\n", "
\n", "\n", "
\n", "
\n", "phono\n", "
\n", "
str
\n", "
\n", " 🆗 phonological transcription (bᵊ rēšˌîṯ bārˈā ʔᵉlōhˈîm)\n", "
\n", "\n", "
\n", "
author:
\n", "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", "
\n", "\n", "
\n", "
coreData:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:25:55Z
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "phono_trailer\n", "
\n", "
str
\n", "
\n", " 🆗 interword material in phonological transcription\n", "
\n", "\n", "
\n", "
author:
\n", "
BHSA Data: Constantijn Sikkel; Phono Notebook: Dirk Roorda
\n", "
\n", "\n", "
\n", "
coreData:
\n", "
BHSA
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2021-12-09T14:25:55Z
\n", "
\n", "\n", "
\n", "
provenance:
\n", "
computed by the phono notebook, see https://github.com/ETCBC/phono
\n", "
\n", "\n", "
\n", "
version:
\n", "
2021
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
Text-Fabric API: names N F E L T S C TF directly usable

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A = use(\"ETCBC/bhsa\", hoist=globals())" ] }, { "cell_type": "markdown", "metadata": { "tags": [] }, "source": [ "# Relationships\n", "\n", "There are relationship between objects that are about their identities, the way they\n", "occupy space, and the way they are connected.\n", "\n", "Are two objects the same, do they occupy the same slots, do they overlap, is one embedded in the other,\n", "does one come before the other? Is there an edge from one to another?\n", "\n", "Although these relationships are easy to define, and even easy to implement,\n", "they may be very costly to use.\n", "When searching, most of them have to be computed very many times.\n", "\n", "Some of them have been precomputed and stored in an index, e.g. the embedding relationships.\n", "They can be used without penalty.\n", "\n", "Other relations are not suitable for pre-computing: most inequality relations are of that kind.\n", "It would require an enormous amount of storage to pre-compute for each node the set of nodes that\n", "occupy different slots. This type of relation will not be used in narrowing down the search space,\n", "which means that it may take more time to get the results.\n", "\n", "We are going to test all of our relationships here.\n", "\n", "Let us first see what relationships we have:" ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:05:33.516913Z", "start_time": "2018-05-24T09:05:33.512571Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " = left equal to right (as node)\n", " # left unequal to right (as node)\n", " < left before right (in canonical node ordering)\n", " > left after right (in canonical node ordering)\n", " == left occupies same slots as right\n", " && left has overlapping slots with right\n", " ## left and right do not have the same slot set\n", " || left and right do not have common slots\n", " [[ left embeds right\n", " ]] left embedded in right\n", " << left completely before right\n", " >> left completely after right\n", " =: left and right start at the same slot\n", " := left and right end at the same slot\n", " :: left and right start and end at the same slot\n", " <: left immediately before right\n", " :> left immediately after right\n", " =k: left and right start at k-nearly the same slot\n", " :k= left and right end at k-nearly the same slot\n", " :k: left and right start and end at k-near slots\n", " left k-nearly after right\n", " .f. left.f = right.f\n", " .f=g. left.f = right.g\n", " .f~r~g. left.f matches right.g\n", " .f#g. left.f # right.g\n", " .f>g. left.f > right.g\n", " .f edge feature \"crossref\" with value specification allowed\n", " edge feature \"crossref\" with value specification allowed (either direction)\n", " -crossrefLCS> edge feature \"crossrefLCS\" with value specification allowed\n", " edge feature \"crossrefLCS\" with value specification allowed (either direction)\n", " -crossrefSET> edge feature \"crossrefSET\" with value specification allowed\n", " edge feature \"crossrefSET\" with value specification allowed (either direction)\n", "-distributional_parent> edge feature \"distributional_parent\"\n", " edge feature \"distributional_parent\" (either direction)\n", " -functional_parent> edge feature \"functional_parent\"\n", " edge feature \"functional_parent\" (either direction)\n", " -mother> edge feature \"mother\"\n", " edge feature \"mother\" (either direction)\n", " -omap@2017-2021> edge feature \"omap@2017-2021\" with value specification allowed\n", " edge feature \"omap@2017-2021\" with value specification allowed (either direction)\n", " -omap@c-2021> edge feature \"omap@c-2021\" with value specification allowed\n", " edge feature \"omap@c-2021\" with value specification allowed (either direction)\n", "The warp feature \"oslots\" cannot be used in searches.\n", "One of the above relations on nodes and/or slots will suit you better.\n" ] } ], "source": [ "S.relationsLegend()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The top of the list are identity and spatial relationships.\n", "We are going to discuss them. They are documented in\n", "[relationships](https://annotation.github.io/text-fabric/tf/about/searchusage.html#relational-operators)\n", "\n", "The bottom of the list are relationships defined by the edge features of your dataset.\n", "We have discussed them in [advanced](searchAdvanced.ipynb)." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Identity and order\n", "\n", "## `=` equal as node\n", "\n", "The `=` means that both parts are the same node. Left and right are not two things with similar properties,\n", "no, they are one and the same thing.\n", "\n", "Useful if the thing you search for it part of two wildly different patterns." ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:05:59.853397Z", "start_time": "2018-05-24T09:05:58.186929Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.39s 48 results\n" ] } ], "source": [ "query = \"\"\"\n", "v1:verse\n", " sentence\n", " clause rela=Objc\n", " phrase\n", " word sp=verb gn=f nu=pl\n", "v2:verse\n", " sentence\n", " c1:clause\n", " c2:clause\n", " c3:clause\n", " c1 < c2\n", " c2 < c3\n", "v1 = v2\n", "\"\"\"\n", "results = A.search(query)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We show the results with the first clause in magenta and the second sentence in cyan.\n", "\n", "Note that the first and the second sentence may be the same sentence!\n", "\n", "**And observe that the last clause is the same one as the first one, hence they have the same color.**" ] }, { "cell_type": "code", "execution_count": 6, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:05:59.853397Z", "start_time": "2018-05-24T09:05:58.186929Z" } }, "outputs": [ { "data": { "text/html": [ "\n", "
npsentenceclausephrasewordsentenceclauseclauseclause
1Jeremiah 44:15וַיַּעֲנ֣וּ אֶֽת־יִרְמְיָ֗הוּ כָּל־הָאֲנָשִׁ֤ים הַיֹּֽדְעִים֙ כִּֽי־מְקַטְּרֹ֤ות נְשֵׁיהֶם֙ לֵאלֹהִ֣ים אֲחֵרִ֔ים וְכָל־הַנָּשִׁ֥ים הָעֹמְדֹ֖ות קָהָ֣ל גָּדֹ֑ול וְכָל־הָעָ֛ם הַיֹּשְׁבִ֥ים בְּאֶֽרֶץ־מִצְרַ֖יִם בְּפַתְרֹ֥וס לֵאמֹֽר׃ כִּֽי־מְקַטְּרֹ֤ות נְשֵׁיהֶם֙ לֵאלֹהִ֣ים אֲחֵרִ֔ים מְקַטְּרֹ֤ות מְקַטְּרֹ֤ות וַיַּעֲנ֣וּ אֶֽת־יִרְמְיָ֗הוּ כָּל־הָאֲנָשִׁ֤ים הַיֹּֽדְעִים֙ כִּֽי־מְקַטְּרֹ֤ות נְשֵׁיהֶם֙ לֵאלֹהִ֣ים אֲחֵרִ֔ים וְכָל־הַנָּשִׁ֥ים הָעֹמְדֹ֖ות קָהָ֣ל גָּדֹ֑ול וְכָל־הָעָ֛ם הַיֹּשְׁבִ֥ים בְּאֶֽרֶץ־מִצְרַ֖יִם בְּפַתְרֹ֥וס לֵאמֹֽר׃ וַיַּעֲנ֣וּ אֶֽת־יִרְמְיָ֗הוּ כָּל־הָאֲנָשִׁ֤ים וְכָל־הַנָּשִׁ֥ים קָהָ֣ל גָּדֹ֑ול וְכָל־הָעָ֛ם הַיֹּֽדְעִים֙ כִּֽי־מְקַטְּרֹ֤ות נְשֵׁיהֶם֙ לֵאלֹהִ֣ים אֲחֵרִ֔ים
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "skipCols = \"1 6\" # the verses\n", "colorMap = {2: \"magenta\", 6: \"cyan\"}\n", "A.table(results, end=1, colorMap=colorMap, skipCols=skipCols)" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:06:03.554488Z", "start_time": "2018-05-24T09:06:03.528480Z" } }, "outputs": [ { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause
phrase
sp=conj
phrase
gn=mnu=plsp=verb
phrase
phrase
gn=mnu=sgsp=subs
sp=art
gn=mnu=plsp=subs
clause
rela=Attr
phrase
sp=art
phrase
gn=mnu=plsp=verb
clause
rela=Objc
phrase
phrase
phrase
gn=fnu=plsp=subs
phrase
sp=prep
gn=mnu=plsp=subs
gn=mnu=plsp=adjv
clause
phrase
sp=conj
gn=mnu=sgsp=subs
sp=art
gn=fnu=plsp=subs
clause
rela=Attr
phrase
sp=art
phrase
gn=fnu=plsp=verb
clause
phrase
gn=mnu=sgsp=subs
gn=mnu=sgsp=adjv
sp=conj
gn=mnu=sgsp=subs
sp=art
gn=mnu=sgsp=subs
clause
rela=Attr
phrase
sp=art
phrase
gn=mnu=plsp=verb
phrase
sp=prep
nu=sgsp=subs
phrase
sp=prep
clause
rela=Adju
phrase
sp=prep
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.show(results, end=1, colorMap=colorMap, skipCols=skipCols)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## # unequal as node\n", "\n", "`n # m` if `n` and `m` are not the same node.\n", "\n", "If you write a template, and you know that one node should come before another one,\n", "consider using `<` or `>`, which will constrain the results better.\n", "\n", "We have seen this in action in the search for gapped phrases." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `<` and `>` canonical order\n", "\n", "`n < m` if `n` comes before `m` in the\n", "[canonical ordering](https://annotation.github.io/text-fabric/tf/core/nodes.html)\n", "of nodes.\n", "\n", "We have seen them in action before." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Space occupation\n", "\n", "We show that the following relationships also work with custom sets, as introduced in the\n", "[search Advanced tutorial](searchAdvanced.ipynb).\n", "\n", "We make two custom sets, `common` and `rare`, consisting of nodes whose contained slots have all\n", "common lexemes or some rare lexemes, like we did in\n", "[search Advanced](searchAdvanced.ipynb)." ] }, { "cell_type": "code", "execution_count": 8, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T08:06:08.187551Z", "start_time": "2018-05-24T08:06:00.209985Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "669195 members in set common\n", "425320 members in set rare\n" ] } ], "source": [ "COMMON_RANK = 100\n", "RARE_RANK = 500\n", "\n", "common = set()\n", "rare = set()\n", "\n", "for n in N.walk():\n", " nTp = F.otype.v(n)\n", " if nTp == \"lex\":\n", " continue\n", " if nTp == \"word\":\n", " ranks = [F.rank_lex.v(n)]\n", " else:\n", " ranks = [F.rank_lex.v(w) for w in L.d(n, otype=\"word\")]\n", " maxRank = max(ranks)\n", " minRank = min(ranks)\n", " if maxRank < COMMON_RANK:\n", " common.add(n)\n", " if maxRank > RARE_RANK:\n", " rare.add(n)\n", "\n", "print(f\"{len(common):>6} members in set common\")\n", "print(f\"{len(rare):>6} members in set rare\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now we can do all kinds of searches within the domain of `common` and `rare` things.\n", "\n", "We give the names to all the sets and put them in a dictionary." ] }, { "cell_type": "code", "execution_count": 9, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T08:07:11.688552Z", "start_time": "2018-05-24T08:07:11.685127Z" } }, "outputs": [], "source": [ "customSets = dict(\n", " common=common,\n", " rare=rare,\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "**Expert remark**\n", "\n", "Note that these sets contain both slot nodes and non-slot nodes.\n", "The code that implements the basic relationship is heavily optimized and contains case distinctions as to whether nodes are slot\n", "or non-slot.\n", "For ordinary node types, it is clear on beforehand whether its nodes are slot or non-slot, but custom sets may contain both.\n", "\n", "So our `frequent` and `infrequent` sets are good tests whether the basic relationships are correctly implemented." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `==` same slots\n", "\n", "Two objects are extensionally equal if they occupy exactly the same slots." ] }, { "cell_type": "code", "execution_count": 10, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.16s 3583 results\n" ] } ], "source": [ "query = \"\"\"\n", "v:verse\n", " s:sentence\n", "v == s\n", "\"\"\"\n", "results = A.search(query)" ] }, { "cell_type": "code", "execution_count": 11, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
npsentence
1Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃
2Genesis 1:16וַיַּ֣עַשׂ אֱלֹהִ֔ים אֶת־שְׁנֵ֥י הַמְּאֹרֹ֖ת הַגְּדֹלִ֑ים אֶת־הַמָּאֹ֤ור הַגָּדֹל֙ לְמֶמְשֶׁ֣לֶת הַיֹּ֔ום וְאֶת־הַמָּאֹ֤ור הַקָּטֹן֙ לְמֶמְשֶׁ֣לֶת הַלַּ֔יְלָה וְאֵ֖ת הַכֹּוכָבִֽים׃
3Genesis 2:1וַיְכֻלּ֛וּ הַשָּׁמַ֥יִם וְהָאָ֖רֶץ וְכָל־צְבָאָֽם׃
4Genesis 2:9וַיַּצְמַ֞ח יְהוָ֤ה אֱלֹהִים֙ מִן־הָ֣אֲדָמָ֔ה כָּל־עֵ֛ץ נֶחְמָ֥ד לְמַרְאֶ֖ה וְטֹ֣וב לְמַאֲכָ֑ל וְעֵ֤ץ הַֽחַיִּים֙ בְּתֹ֣וךְ הַגָּ֔ן וְעֵ֕ץ הַדַּ֖עַת טֹ֥וב וָרָֽע׃
5Genesis 3:5כִּ֚י יֹדֵ֣עַ אֱלֹהִ֔ים כִּ֗י בְּיֹום֙ אֲכָלְכֶ֣ם מִמֶּ֔נּוּ וְנִפְקְח֖וּ עֵֽינֵיכֶ֑ם וִהְיִיתֶם֙ כֵּֽאלֹהִ֔ים יֹדְעֵ֖י טֹ֥וב וָרָֽע׃
6Genesis 3:23וַֽיְשַׁלְּחֵ֛הוּ יְהוָ֥ה אֱלֹהִ֖ים מִגַּן־עֵ֑דֶן לַֽעֲבֹד֙ אֶת־הָ֣אֲדָמָ֔ה אֲשֶׁ֥ר לֻקַּ֖ח מִשָּֽׁם׃
7Genesis 4:24כִּ֥י שִׁבְעָתַ֖יִם יֻקַּם־קָ֑יִן וְלֶ֖מֶךְ שִׁבְעִ֥ים וְשִׁבְעָֽה׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.table(results, end=7, skipCols=\"1\")\n", "A.show(results, start=1, end=1, skipCols=\"1\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now a similar query with the custom sets:" ] }, { "cell_type": "code", "execution_count": 12, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.54s 135 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
npsentence
1Exodus 6:10וַיְדַבֵּ֥ר יְהוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
2Exodus 6:28וַיְהִ֗י בְּיֹ֨ום דִּבֶּ֧ר יְהוָ֛ה אֶל־מֹשֶׁ֖ה בְּאֶ֥רֶץ מִצְרָֽיִם׃ פ
3Exodus 13:1וַיְדַבֵּ֥ר יְהוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
4Exodus 14:1וַיְדַבֵּ֥ר יְהֹוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
5Exodus 16:11וַיְדַבֵּ֥ר יְהוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
6Exodus 20:1וַיְדַבֵּ֣ר אֱלֹהִ֔ים אֵ֛ת כָּל־הַדְּבָרִ֥ים הָאֵ֖לֶּה לֵאמֹֽר׃ ס
7Exodus 25:1וַיְדַבֵּ֥ר יְהוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
8Exodus 30:11וַיְדַבֵּ֥ר יְהוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
9Exodus 30:17וַיְדַבֵּ֥ר יְהוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
10Exodus 30:22וַיְדַבֵּ֥ר יְהוָ֖ה אֶל־מֹשֶׁ֥ה לֵּאמֹֽר׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
otype=verse
sentence
otype=sentence
clause
otype=clause
phrase
otype=phrase
otype=word
phrase
otype=phrase
phrase
otype=phrase
otype=word
phrase
otype=phrase
otype=word
otype=word
clause
otype=clause
phrase
otype=phrase
otype=word
otype=word
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "v:common otype=verse\n", " s:common otype=sentence\n", "v == s\n", "\"\"\"\n", "resultsCustom = A.search(query, sets=customSets)\n", "A.table(resultsCustom, end=10, skipCols=\"1\")\n", "A.show(resultsCustom, start=1, end=1, skipCols=\"1\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "As a check we compute manually the maximum rank of the lexemes in the clauses yielded by the query without the custom sets:" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Rejected 3448 non-common results, leaving 135 ones\n" ] } ], "source": [ "rejected = 0\n", "for (verse, sentence) in results:\n", " maxRank = max(F.rank_lex.v(w) for w in L.d(verse, otype=\"word\"))\n", " if maxRank >= COMMON_RANK:\n", " rejected += 1\n", "\n", "print(f\"Rejected {rejected} non-common results, leaving {len(results) - rejected} ones\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `&&` overlap\n", "\n", "Two objects overlap if and only if they share at least one slot.\n", "This is quite costly to use in some cases.\n", "\n", "We are going to find the sentences that overlap with two verses." ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [], "source": [ "query = \"\"\"\n", "sentence\n", "/with/\n", "v1:verse\n", "&& ..\n", "v2:verse\n", "&& ..\n", "v1 < v2\n", "/-/\n", "\"\"\"" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Explanation: the query looks for sentences and delivers results that are tuples with only a sentence node.\n", "\n", "This is because the stuff within the `/with/` quantifier does not contribute to the result tuples.\n", "\n", "The `/with/` quantifier poses a few restrictions on its parent, the `sentence`.\n", "\n", "From within the quantifier you can refer to the parent by `..`.\n", "\n", "The condition is that there are verses `v1` and `v2` that have overlap with the sentence, and that `v1` comes before `v2`.\n", "\n", "The result is a tuple of exactly the sentences that span multiple verses." ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.58s 887 results\n" ] } ], "source": [ "results = A.search(query)" ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [ { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npsentence
1Genesis 1:17וַיִּתֵּ֥ן אֹתָ֛ם אֱלֹהִ֖ים בִּרְקִ֣יעַ הַשָּׁמָ֑יִם לְהָאִ֖יר עַל־הָאָֽרֶץ׃ וְלִמְשֹׁל֙ בַּיֹּ֣ום וּבַלַּ֔יְלָה וּֽלֲהַבְדִּ֔יל בֵּ֥ין הָאֹ֖ור וּבֵ֣ין הַחֹ֑שֶׁךְ
2Genesis 1:29הִנֵּה֩ נָתַ֨תִּי לָכֶ֜ם אֶת־כָּל־עֵ֣שֶׂב׀ זֹרֵ֣עַ זֶ֗רַע אֲשֶׁר֙ עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ וְאֶת־כָּל־הָעֵ֛ץ אֲשֶׁר־בֹּ֥ו פְרִי־עֵ֖ץ זֹרֵ֣עַ זָ֑רַע וּֽלְכָל־חַיַּ֣ת הָ֠אָרֶץ וּלְכָל־עֹ֨וף הַשָּׁמַ֜יִם וּלְכֹ֣ל׀ רֹומֵ֣שׂ עַל־הָאָ֗רֶץ אֲשֶׁר־בֹּו֙ נֶ֣פֶשׁ חַיָּ֔ה אֶת־כָּל־יֶ֥רֶק עֵ֖שֶׂב לְאָכְלָ֑ה
3Genesis 2:4בְּיֹ֗ום עֲשֹׂ֛ות יְהוָ֥ה אֱלֹהִ֖ים אֶ֥רֶץ וְשָׁמָֽיִם׃ וַיִּיצֶר֩ יְהוָ֨ה אֱלֹהִ֜ים אֶת־הָֽאָדָ֗ם עָפָר֙ מִן־הָ֣אֲדָמָ֔ה
4Genesis 7:2מִכֹּ֣ל׀ הַבְּהֵמָ֣ה הַטְּהֹורָ֗ה תִּֽקַּח־לְךָ֛ שִׁבְעָ֥ה שִׁבְעָ֖ה אִ֣ישׁ וְאִשְׁתֹּ֑ו וּמִן־הַבְּהֵמָ֡ה אֲ֠שֶׁר לֹ֣א טְהֹרָ֥ה הִ֛וא שְׁנַ֖יִם אִ֥ישׁ וְאִשְׁתֹּֽו׃ גַּ֣ם מֵעֹ֧וף הַשָּׁמַ֛יִם שִׁבְעָ֥ה שִׁבְעָ֖ה זָכָ֣ר וּנְקֵבָ֑ה לְחַיֹּ֥ות זֶ֖רַע עַל־פְּנֵ֥י כָל־הָאָֽרֶץ׃
5Genesis 7:8מִן־הַבְּהֵמָה֙ הַטְּהֹורָ֔ה וּמִן־הַ֨בְּהֵמָ֔ה אֲשֶׁ֥ר אֵינֶ֖נָּה טְהֹרָ֑ה וּמִ֨ן־הָעֹ֔וף וְכֹ֥ל אֲשֶׁר־רֹמֵ֖שׂ עַל־הָֽאֲדָמָֽה׃ שְׁנַ֨יִם שְׁנַ֜יִם בָּ֧אוּ אֶל־נֹ֛חַ אֶל־הַתֵּבָ֖ה זָכָ֣ר וּנְקֵבָ֑ה כַּֽאֲשֶׁ֛ר צִוָּ֥ה אֱלֹהִ֖ים אֶת־נֹֽחַ׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause וַיִּתֵּ֥ן אֹתָ֛ם אֱלֹהִ֖ים בִּרְקִ֣יעַ הַשָּׁמָ֑יִם
clause לְהָאִ֖יר עַל־הָאָֽרֶץ׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause וְלִמְשֹׁל֙ בַּיֹּ֣ום וּבַלַּ֔יְלָה
clause וּֽלֲהַבְדִּ֔יל בֵּ֥ין הָאֹ֖ור וּבֵ֣ין הַחֹ֑שֶׁךְ
sentence
clause וַיַּ֥רְא אֱלֹהִ֖ים
clause כִּי־טֹֽוב׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 2

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause וַיֹּ֣אמֶר אֱלֹהִ֗ים
sentence
clause הִנֵּה֩ נָתַ֨תִּי לָכֶ֜ם אֶת־כָּל־עֵ֣שֶׂב׀
clause זֹרֵ֣עַ זֶ֗רַע
clause אֲשֶׁר֙ עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ
clause וְאֶת־כָּל־הָעֵ֛ץ
clause אֲשֶׁר־בֹּ֥ו פְרִי־עֵ֖ץ
clause זֹרֵ֣עַ זָ֑רַע
sentence
clause לָכֶ֥ם יִֽהְיֶ֖ה לְאָכְלָֽה׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause וּֽלְכָל־חַיַּ֣ת הָ֠אָרֶץ וּלְכָל־עֹ֨וף הַשָּׁמַ֜יִם וּלְכֹ֣ל׀ רֹומֵ֣שׂ עַל־הָאָ֗רֶץ
clause אֲשֶׁר־בֹּו֙ נֶ֣פֶשׁ חַיָּ֔ה
clause אֶת־כָּל־יֶ֥רֶק עֵ֖שֶׂב לְאָכְלָ֑ה
sentence
clause וַֽיְהִי־כֵֽן׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 3

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause אֵ֣לֶּה תֹולְדֹ֧ות הַשָּׁמַ֛יִם וְהָאָ֖רֶץ
clause בְּהִבָּֽרְאָ֑ם
sentence
clause בְּיֹ֗ום
clause עֲשֹׂ֛ות יְהוָ֥ה אֱלֹהִ֖ים אֶ֥רֶץ וְשָׁמָֽיִם׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause וַיִּיצֶר֩ יְהוָ֨ה אֱלֹהִ֜ים אֶת־הָֽאָדָ֗ם עָפָר֙ מִן־הָ֣אֲדָמָ֔ה
sentence
clause וַיִּפַּ֥ח בְּאַפָּ֖יו נִשְׁמַ֣ת חַיִּ֑ים
sentence
clause וַֽיְהִ֥י הָֽאָדָ֖ם לְנֶ֥פֶשׁ חַיָּֽה׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.table(results, end=5)\n", "A.show(results, condensed=False, baseTypes=\"clause\", start=1, end=3)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now with custom sets:" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [], "source": [ "query = \"\"\"\n", "common otype=sentence\n", "/with/\n", "v1:verse\n", "&& ..\n", "v2:verse\n", "&& ..\n", "v1 < v2\n", "/-/\n", "\"\"\"" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.50s 6 results\n" ] } ], "source": [ "resultsCommon = A.search(query, sets=customSets)" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "
npsentence
1Numbers 27:8אִ֣ישׁ כִּֽי־יָמ֗וּת וּבֵן֙ אֵ֣ין לֹ֔ו וְאִם־אֵ֥ין לֹ֖ו בַּ֑ת וְאִם־אֵ֥ין לֹ֖ו אַחִ֑ים וְאִם־אֵ֣ין אַחִים֮ לְאָבִיו֒
2Jeremiah 7:24וְלֹ֤א שָֽׁמְעוּ֙ לְמִן־הַיֹּ֗ום אֲשֶׁ֨ר יָצְא֤וּ אֲבֹֽותֵיכֶם֙ מֵאֶ֣רֶץ מִצְרַ֔יִם עַ֖ד הַיֹּ֣ום הַזֶּ֑ה
3Jeremiah 21:8וְאֶל־הָעָ֤ם הַזֶּה֙ תֹּאמַ֔ר וּלְבֵית֙ מֶ֣לֶךְ יְהוּדָ֔ה
4Jeremiah 21:11שִׁמְע֖וּ דְּבַר־יְהוָֽה׃ בֵּ֣ית דָּוִ֗ד
5Ezekiel 37:18וְכַֽאֲשֶׁר֙ יֹאמְר֣וּ אֵלֶ֔יךָ בְּנֵ֥י עַמְּךָ֖ לֵאמֹ֑ר דַּבֵּ֣ר אֲלֵהֶ֗ם
6Psalms 25:1אֵלֶ֥יךָ יְ֝הוָ֗ה נַפְשִׁ֥י אֶשָּֽׂא׃ אֱֽלֹהַ֗י
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.table(resultsCommon)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The following query has the same results but is less insightful." ] }, { "cell_type": "code", "execution_count": 20, "metadata": {}, "outputs": [], "source": [ "query = \"\"\"\n", "sentence\n", " =: w1:word\n", " w2:word\n", " :=\n", "\n", "v1:verse\n", " wv1:word\n", "\n", "v2:verse\n", " wv2:word\n", "\n", "w1 = wv1\n", "w2 = wv2\n", "v1 < v2\n", "\"\"\"" ] }, { "cell_type": "code", "execution_count": 21, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.95s 887 results\n" ] } ], "source": [ "fastResults = A.search(query)" ] }, { "cell_type": "code", "execution_count": 22, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "True" ] }, "execution_count": 22, "metadata": {}, "output_type": "execute_result" } ], "source": [ "sorted((x[0],) for x in fastResults) == sorted(results)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `##` not the same slots\n", "\n", "True when the two objects in question do not occupy exactly the same set of slots.\n", "This is a very loose relationship.\n", "\n", "We look for sentences that start with a sentence atom that is not co-extensive with its sentence." ] }, { "cell_type": "code", "execution_count": 23, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.11s 702 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npsentencesentence_atom
1Genesis 1:29הִנֵּה֩ נָתַ֨תִּי לָכֶ֜ם אֶת־כָּל־עֵ֣שֶׂב׀ זֹרֵ֣עַ זֶ֗רַע אֲשֶׁר֙ עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ וְאֶת־כָּל־הָעֵ֛ץ אֲשֶׁר־בֹּ֥ו פְרִי־עֵ֖ץ זֹרֵ֣עַ זָ֑רַע וּֽלְכָל־חַיַּ֣ת הָ֠אָרֶץ וּלְכָל־עֹ֨וף הַשָּׁמַ֜יִם וּלְכֹ֣ל׀ רֹומֵ֣שׂ עַל־הָאָ֗רֶץ אֲשֶׁר־בֹּו֙ נֶ֣פֶשׁ חַיָּ֔ה אֶת־כָּל־יֶ֥רֶק עֵ֖שֶׂב לְאָכְלָ֑ה הִנֵּה֩ נָתַ֨תִּי לָכֶ֜ם אֶת־כָּל־עֵ֣שֶׂב׀ זֹרֵ֣עַ זֶ֗רַע אֲשֶׁר֙ עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ וְאֶת־כָּל־הָעֵ֛ץ אֲשֶׁר־בֹּ֥ו פְרִי־עֵ֖ץ זֹרֵ֣עַ זָ֑רַע
2Genesis 2:4בְּיֹ֗ום עֲשֹׂ֛ות יְהוָ֥ה אֱלֹהִ֖ים אֶ֥רֶץ וְשָׁמָֽיִם׃ וַיִּיצֶר֩ יְהוָ֨ה אֱלֹהִ֜ים אֶת־הָֽאָדָ֗ם עָפָר֙ מִן־הָ֣אֲדָמָ֔ה בְּיֹ֗ום עֲשֹׂ֛ות יְהוָ֥ה אֱלֹהִ֖ים אֶ֥רֶץ וְשָׁמָֽיִם׃
3Genesis 3:3וּמִפְּרִ֣י הָעֵץ֮ אֲשֶׁ֣ר בְּתֹוךְ־הַגָּן֒ לֹ֤א תֹֽאכְלוּ֙ מִמֶּ֔נּוּ וּמִפְּרִ֣י הָעֵץ֮ אֲשֶׁ֣ר בְּתֹוךְ־הַגָּן֒
4Genesis 13:1וַיַּעַל֩ אַבְרָ֨ם מִמִּצְרַ֜יִם ה֠וּא וְאִשְׁתֹּ֧ו וְכָל־אֲשֶׁר־לֹ֛ו הַנֶּֽגְבָּה׃ וַיַּעַל֩ אַבְרָ֨ם מִמִּצְרַ֜יִם ה֠וּא וְאִשְׁתֹּ֧ו וְכָל־אֲשֶׁר־לֹ֛ו
5Genesis 13:9הִפָּ֥רֶד נָ֖א מֵעָלָ֑י אִם־הַשְּׂמֹ֣אל וְאִם־הַיָּמִ֖ין הִפָּ֥רֶד נָ֖א מֵעָלָ֑י אִם־הַשְּׂמֹ֣אל
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause
phrase וַ
phrase יֹּ֣אמֶר
phrase אֱלֹהִ֗ים
sentence
clause
phrase הִנֵּה֩
phrase נָתַ֨תִּי
phrase לָכֶ֜ם
phrase אֶת־כָּל־עֵ֣שֶׂב׀
clause
phrase זֹרֵ֣עַ
phrase זֶ֗רַע
clause
phrase אֲשֶׁר֙
phrase עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ
clause
phrase וְאֶת־כָּל־הָעֵ֛ץ
clause
phrase אֲשֶׁר־
phrase בֹּ֥ו
phrase פְרִי־עֵ֖ץ
clause
phrase זֹרֵ֣עַ
phrase זָ֑רַע
sentence
clause
phrase לָכֶ֥ם
phrase יִֽהְיֶ֖ה
phrase לְאָכְלָֽה׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause
phrase וּֽ
phrase לְכָל־חַיַּ֣ת הָ֠אָרֶץ וּלְכָל־עֹ֨וף הַשָּׁמַ֜יִם וּלְכֹ֣ל׀ רֹומֵ֣שׂ עַל־הָאָ֗רֶץ
clause
phrase אֲשֶׁר־
phrase בֹּו֙
phrase נֶ֣פֶשׁ חַיָּ֔ה
clause
phrase אֶת־כָּל־יֶ֥רֶק עֵ֖שֶׂב
phrase לְאָכְלָ֑ה
sentence
clause
phrase וַֽ
phrase יְהִי־
phrase כֵֽן׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "s:sentence\n", "=: sa:sentence_atom\n", "s ## sa\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, end=5)\n", "A.show(results, baseTypes=\"phrase\", start=1, end=1)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now in the common domain." ] }, { "cell_type": "code", "execution_count": 24, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.30s 20 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npsentencesentence_atom
1Numbers 23:19הַה֤וּא אָמַר֙ וְדִבֶּ֖ר הַה֤וּא אָמַר֙
2Numbers 27:8אִ֣ישׁ כִּֽי־יָמ֗וּת וּבֵן֙ אֵ֣ין לֹ֔ו וְאִם־אֵ֥ין לֹ֖ו בַּ֑ת וְאִם־אֵ֥ין לֹ֖ו אַחִ֑ים וְאִם־אֵ֣ין אַחִים֮ לְאָבִיו֒ אִ֣ישׁ כִּֽי־יָמ֗וּת וּבֵן֙ אֵ֣ין לֹ֔ו
3Judges 7:4אֲשֶׁר֩ אֹמַ֨ר אֵלֶ֜יךָ ה֚וּא יֵלֵ֣ךְ אִתָּ֔ךְ אֲשֶׁר֩ אֹמַ֨ר אֵלֶ֜יךָ
4Judges 7:4וְכֹ֨ל אֲשֶׁר־אֹמַ֜ר אֵלֶ֗יךָ ה֖וּא לֹ֥א יֵלֵֽךְ׃ וְכֹ֨ל אֲשֶׁר־אֹמַ֜ר אֵלֶ֗יךָ
52_Samuel 5:6וַיֹּ֨אמֶר לְדָוִ֤ד לֵאמֹר֙ לֵאמֹ֔ר וַיֹּ֨אמֶר לְדָוִ֤ד לֵאמֹר֙
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
otype=verse
sentence
otype=sentence
clause
otype=clause
phrase לֹ֣א
otype=phrase
phrase אִ֥ישׁ
otype=phrase
phrase אֵל֙
otype=phrase
sentence
otype=sentence
clause
otype=clause
phrase וִֽ
otype=phrase
phrase יכַזֵּ֔ב
otype=phrase
sentence
otype=sentence
clause
otype=clause
phrase וּ
otype=phrase
phrase בֶן־אָדָ֖ם
otype=phrase
sentence
otype=sentence
clause
otype=clause
phrase וְ
otype=phrase
phrase יִתְנֶחָ֑ם
otype=phrase
sentence
otype=sentence
clause
otype=clause
phrase הַ
otype=phrase
phrase ה֤וּא
otype=phrase
phrase אָמַר֙
otype=phrase
sentence
otype=sentence
clause
otype=clause
phrase וְ
otype=phrase
phrase לֹ֣א
otype=phrase
phrase יַעֲשֶׂ֔ה
otype=phrase
sentence
otype=sentence
clause
otype=clause
phrase וְ
otype=phrase
phrase דִבֶּ֖ר
otype=phrase
sentence
otype=sentence
clause
otype=clause
phrase וְ
otype=phrase
phrase לֹ֥א
otype=phrase
phrase יְקִימֶֽנָּה׃
otype=phrase
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "s:common otype=sentence\n", "=: sa:sentence_atom\n", "s ## sa\n", "\"\"\"\n", "resultsCustom = A.search(query, sets=customSets)\n", "A.table(resultsCustom, end=5)\n", "A.show(resultsCustom, baseTypes=\"phrase\", start=1, end=1)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `||` disjoint slots\n", "\n", "True when the two objects in question do not share any slots.\n", "This is a rather loose relationship.\n", "\n", "This can be used for locating gaps: a textual object that lies inside a gap of another object.\n", "See also [gaps](searchGaps.ipynb).\n", "\n", "Here we check whether there are phrases with disjoint subphrases." ] }, { "cell_type": "code", "execution_count": 25, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.67s 168504 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
npphrasesubphrasesubphrase
1Genesis 1:1אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ אֵ֥ת הַשָּׁמַ֖יִם אֵ֥ת הָאָֽרֶץ׃
2Genesis 1:2תֹ֨הוּ֙ וָבֹ֔הוּ תֹ֨הוּ֙ בֹ֔הוּ
3Genesis 1:2עַל־פְּנֵ֣י תְהֹ֑ום פְּנֵ֣י תְהֹ֑ום
4Genesis 1:2ר֣וּחַ אֱלֹהִ֔ים ר֣וּחַ אֱלֹהִ֔ים
5Genesis 1:2עַל־פְּנֵ֥י הַמָּֽיִם׃ פְּנֵ֥י הַמָּֽיִם׃
6Genesis 1:4בֵּ֥ין הָאֹ֖ור וּבֵ֥ין הַחֹֽשֶׁךְ׃ בֵּ֥ין הָאֹ֖ור בֵ֥ין הַחֹֽשֶׁךְ׃
7Genesis 1:5יֹ֥ום אֶחָֽד׃ פ יֹ֥ום אֶחָֽד׃ פ
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "p:phrase\n", " s1:subphrase\n", " < s2:subphrase\n", "s1 || s2\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, end=7)\n", "A.show(results, start=1, end=1)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now in the common domain." ] }, { "cell_type": "code", "execution_count": 26, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.42s 10999 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
npphrasesubphrasesubphrase
1Genesis 1:2עַל־פְּנֵ֥י הַמָּֽיִם׃ פְּנֵ֥י הַמָּֽיִם׃
2Genesis 1:5יֹ֥ום אֶחָֽד׃ פ יֹ֥ום אֶחָֽד׃ פ
3Genesis 1:29עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ פְּנֵ֣י כָל־הָאָ֔רֶץ
4Genesis 1:29עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ פְּנֵ֣י כָל־
5Genesis 1:29עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ פְּנֵ֣י הָאָ֔רֶץ
6Genesis 1:29עַל־פְּנֵ֣י כָל־הָאָ֔רֶץ כָל־הָאָ֔רֶץ
7Genesis 2:4יְהוָ֥ה אֱלֹהִ֖ים יְהוָ֥ה אֱלֹהִ֖ים
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
otype=verse
sentence
otype=sentence
clause
otype=clause
phrase
otype=phrase
otype=word
phrase
otype=phrase
otype=word
otype=word
phrase
otype=phrase
phrase
otype=phrase
otype=word
otype=word
otype=word
sentence
otype=sentence
clause
otype=clause
phrase
otype=phrase
otype=word
phrase
otype=phrase
phrase
otype=phrase
otype=word
otype=word
otype=word
sentence
otype=sentence
clause
otype=clause
phrase
otype=phrase
otype=word
phrase
otype=phrase
phrase
otype=phrase
phrase
otype=phrase
otype=word
otype=word
otype=word
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "p:common otype=phrase\n", " s1:subphrase\n", " < s2:subphrase\n", "s1 || s2\n", "\"\"\"\n", "resultsCustom = A.search(query, sets=customSets)\n", "A.table(resultsCustom, end=7)\n", "A.show(resultsCustom, start=1, end=1)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We see that Genesis 1:1 has fallen out. Let's check the ranks of the lexemes of its last phrase:" ] }, { "cell_type": "code", "execution_count": 27, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "[('אֵ֥ת', 4),\n", " ('הַ', 1),\n", " ('שָּׁמַ֖יִם', 122),\n", " ('וְ', 0),\n", " ('אֵ֥ת', 4),\n", " ('הָ', 1),\n", " ('אָֽרֶץ', 22)]" ] }, "execution_count": 27, "metadata": {}, "output_type": "execute_result" } ], "source": [ "firstSentence = F.otype.s(\"sentence\")[0]\n", "lastPhrase = L.d(firstSentence, otype=\"phrase\")[-1]\n", "[(F.g_word_utf8.v(w), F.rank_lex.v(w)) for w in L.d(lastPhrase, otype=\"word\")]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "There you have it: the heavens are rare!" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `[[` and `]]` embedding\n", "\n", "`n [[ m` if object `n` embeds `m`.\n", "\n", "`n ]] m` if object `n` lies embedded in `m`.\n", "\n", "These relations are used implicitly in templates when there is indentation:\n", "\n", "```\n", "s:sentence\n", " p:phrase\n", " w1:word gn=f\n", " w2:word gn=m\n", "```\n", "\n", "The template above implicitly states the following embeddings:\n", "\n", "* `s ]] p`\n", "* `p ]] w1`\n", "* `p ]] w2`\n", "\n", "We have seen these relations in action." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Positioning\n", "\n", "## `<<` and `>>` before and after (slot-wise)\n", "\n", "These relations test whether one object comes before or after an other,\n", "in the sense that the slots\n", "occupied by the one object lie completely\n", "before or after the slots occupied by the other object." ] }, { "cell_type": "code", "execution_count": 28, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:23:27.279338Z", "start_time": "2018-05-24T09:23:27.084721Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 1.62s 136137 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npsentenceclausephraseclause
1Genesis 1:11תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב מַזְרִ֣יעַ זֶ֔רַע עֵ֣ץ פְּרִ֞י עֹ֤שֶׂה פְּרִי֙ לְמִינֹ֔ו אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו עַל־הָאָ֑רֶץ תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב עֵ֣ץ פְּרִ֞י עֹ֤שֶׂה אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו
2Genesis 1:11תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב מַזְרִ֣יעַ זֶ֔רַע עֵ֣ץ פְּרִ֞י עֹ֤שֶׂה פְּרִי֙ לְמִינֹ֔ו אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו עַל־הָאָ֑רֶץ תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב עֵ֣ץ פְּרִ֞י פְּרִי֙ אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו
3Genesis 1:11תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב מַזְרִ֣יעַ זֶ֔רַע עֵ֣ץ פְּרִ֞י עֹ֤שֶׂה פְּרִי֙ לְמִינֹ֔ו אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו עַל־הָאָ֑רֶץ תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב עֵ֣ץ פְּרִ֞י לְמִינֹ֔ו אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו
4Genesis 1:11תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב מַזְרִ֣יעַ זֶ֔רַע עֵ֣ץ פְּרִ֞י עֹ֤שֶׂה פְּרִי֙ לְמִינֹ֔ו אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו עַל־הָאָ֑רֶץ מַזְרִ֣יעַ זֶ֔רַע עֹ֤שֶׂה אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו
5Genesis 1:11תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב מַזְרִ֣יעַ זֶ֔רַע עֵ֣ץ פְּרִ֞י עֹ֤שֶׂה פְּרִי֙ לְמִינֹ֔ו אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו עַל־הָאָ֑רֶץ מַזְרִ֣יעַ זֶ֔רַע פְּרִי֙ אֲשֶׁ֥ר זַרְעֹו־בֹ֖ו
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause
phrase וַ
phrase יֹּ֣אמֶר
phrase אֱלֹהִ֗ים
sentence
clause
phrase תַּֽדְשֵׁ֤א
phrase הָאָ֨רֶץ֙
phrase דֶּ֔שֶׁא עֵ֚שֶׂב
clause
phrase מַזְרִ֣יעַ
phrase זֶ֔רַע
clause
phrase עֵ֣ץ פְּרִ֞י
clause
phrase עֹ֤שֶׂה
phrase פְּרִי֙
phrase לְמִינֹ֔ו
clause
phrase אֲשֶׁ֥ר
phrase זַרְעֹו־
phrase בֹ֖ו
clause
phrase עַל־הָאָ֑רֶץ
sentence
clause
phrase וַֽ
phrase יְהִי־
phrase כֵֽן׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "sentence\n", " c1:clause\n", " p:phrase\n", " c2:clause\n", " c1 << p\n", " c2 >> p\n", "\"\"\"\n", "results = A.search(query)\n", "colorMap = {2: \"lightyellow\", 3: \"cyan\", 4: \"magenta\", 5: \"blue\"}\n", "A.table(results, end=5, baseTypes=\"phrase\", colorMap=colorMap)\n", "A.show(results, condensed=False, baseTypes=\"phrase\", start=1, end=1, colorMap=colorMap)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "In the common domain:" ] }, { "cell_type": "code", "execution_count": 29, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.81s 509 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npsentenceclausephraseclause
1Genesis 3:5כִּ֚י יֹדֵ֣עַ אֱלֹהִ֔ים כִּ֗י בְּיֹום֙ אֲכָלְכֶ֣ם מִמֶּ֔נּוּ וְנִפְקְח֖וּ עֵֽינֵיכֶ֑ם וִהְיִיתֶם֙ כֵּֽאלֹהִ֔ים יֹדְעֵ֖י טֹ֥וב וָרָֽע׃ כִּ֚י יֹדֵ֣עַ אֱלֹהִ֔ים נִפְקְח֖וּ וִהְיִיתֶם֙ כֵּֽאלֹהִ֔ים
2Genesis 3:5כִּ֚י יֹדֵ֣עַ אֱלֹהִ֔ים כִּ֗י בְּיֹום֙ אֲכָלְכֶ֣ם מִמֶּ֔נּוּ וְנִפְקְח֖וּ עֵֽינֵיכֶ֑ם וִהְיִיתֶם֙ כֵּֽאלֹהִ֔ים יֹדְעֵ֖י טֹ֥וב וָרָֽע׃ כִּ֗י בְּיֹום֙ נִפְקְח֖וּ וִהְיִיתֶם֙ כֵּֽאלֹהִ֔ים
3Genesis 3:5כִּ֚י יֹדֵ֣עַ אֱלֹהִ֔ים כִּ֗י בְּיֹום֙ אֲכָלְכֶ֣ם מִמֶּ֔נּוּ וְנִפְקְח֖וּ עֵֽינֵיכֶ֑ם וִהְיִיתֶם֙ כֵּֽאלֹהִ֔ים יֹדְעֵ֖י טֹ֥וב וָרָֽע׃ אֲכָלְכֶ֣ם מִמֶּ֔נּוּ נִפְקְח֖וּ וִהְיִיתֶם֙ כֵּֽאלֹהִ֔ים
4Genesis 3:12הָֽאִשָּׁה֙ אֲשֶׁ֣ר נָתַ֣תָּה עִמָּדִ֔י הִ֛וא נָֽתְנָה־לִּ֥י מִן־הָעֵ֖ץ וָאֹכֵֽל׃ הָֽאִשָּׁה֙ עִמָּדִ֔י וָאֹכֵֽל׃
5Genesis 14:17וַיֵּצֵ֣א מֶֽלֶךְ־סְדֹם֮ לִקְרָאתֹו֒ אַחֲרֵ֣י שׁוּבֹ֗ו מֵֽהַכֹּות֙ אֶת־כְּדָר־לָעֹ֔מֶר וְאֶת־הַמְּלָכִ֖ים אֲשֶׁ֣ר אִתֹּ֑ו אֶל־עֵ֣מֶק שָׁוֵ֔ה אַחֲרֵ֣י שׁוּבֹ֗ו אֶת־כְּדָר־לָעֹ֔מֶר וְאֶת־הַמְּלָכִ֖ים אֲשֶׁ֣ר אִתֹּ֑ו
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
otype=verse
sentence
otype=sentence
clause
otype=clause
phrase כִּ֚י
otype=phrase
phrase יֹדֵ֣עַ
otype=phrase
phrase אֱלֹהִ֔ים
otype=phrase
clause
otype=clause
phrase כִּ֗י
otype=phrase
phrase בְּיֹום֙
otype=phrase
clause
otype=clause
phrase אֲכָלְכֶ֣ם
otype=phrase
phrase מִמֶּ֔נּוּ
otype=phrase
clause
otype=clause
phrase וְ
otype=phrase
phrase נִפְקְח֖וּ
otype=phrase
phrase עֵֽינֵיכֶ֑ם
otype=phrase
clause
otype=clause
phrase וִ
otype=phrase
phrase הְיִיתֶם֙
otype=phrase
phrase כֵּֽאלֹהִ֔ים
otype=phrase
clause
otype=clause
phrase יֹדְעֵ֖י טֹ֥וב וָרָֽע׃
otype=phrase
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "sentence\n", " c1:common otype=clause\n", " p:rare otype=phrase\n", " c2:common otype=clause\n", " c1 << p\n", " c2 >> p\n", "\"\"\"\n", "resultsCustom = A.search(query, sets=customSets)\n", "A.table(resultsCustom, end=5, baseTypes=\"phrase\", colorMap=colorMap)\n", "A.show(\n", " resultsCustom,\n", " condensed=False,\n", " baseTypes=\"phrase\",\n", " start=1,\n", " end=1,\n", " colorMap=colorMap,\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `=:` same start slots\n", "This relation holds when the left and right hand sides are nodes that have the same first slot.\n", "It serves to enforce the the children of a parent are textually the first things inside that\n", "parent. We have seen it in action before.\n", "\n", "## `:=` same end slots\n", "This relation holds when the left and right hand sides are nodes that have the same last slot\n", "It serves to enforce the the children of a parent are textually the last things inside that\n", "parent. We have seen it in action before.\n", "\n", "## `::` same boundary slots\n", "This relation holds when `=:` and `:=` both hold between the left and right hand sides.\n", "It serves to look for parents with single children, or at least, where the parent is textually spanned by a single child." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let us look for a phrase, whose start and end slots coincide with its containing clause.\n", "But only if the phrase does not coincide with its parent clause." ] }, { "cell_type": "code", "execution_count": 30, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:31:17.618695Z", "start_time": "2018-05-24T09:31:17.079362Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.15s 52 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npclausephrase
1Genesis 10:21גַּם־ה֑וּא אֲבִי֙ כָּל־בְּנֵי־עֵ֔בֶר אֲחִ֖י יֶ֥פֶת הַגָּדֹֽול׃ גַּם־ה֑וּא אֲחִ֖י יֶ֥פֶת הַגָּדֹֽול׃
2Genesis 24:24בַּת־בְּתוּאֵ֖ל אָנֹ֑כִי בֶּן־מִלְכָּ֕ה בַּת־בְּתוּאֵ֖ל בֶּן־מִלְכָּ֕ה
3Genesis 31:53אֱלֹהֵ֨י אַבְרָהָ֜ם וֵֽאלֹהֵ֤י נָחֹור֙ יִשְׁפְּט֣וּ בֵינֵ֔ינוּ אֱלֹהֵ֖י אֲבִיהֶ֑ם אֱלֹהֵ֨י אַבְרָהָ֜ם וֵֽאלֹהֵ֤י נָחֹור֙ אֱלֹהֵ֖י אֲבִיהֶ֑ם
4Exodus 28:1לְכַהֲנֹו־לִ֑י אַהֲרֹ֕ן נָדָ֧ב וַאֲבִיה֛וּא אֶלְעָזָ֥ר וְאִיתָמָ֖ר בְּנֵ֥י אַהֲרֹֽן׃ לְכַהֲנֹו־אַהֲרֹ֕ן נָדָ֧ב וַאֲבִיה֛וּא אֶלְעָזָ֥ר וְאִיתָמָ֖ר בְּנֵ֥י אַהֲרֹֽן׃
5Exodus 28:14מִגְבָּלֹ֛ת תַּעֲשֶׂ֥ה אֹתָ֖ם מַעֲשֵׂ֣ה עֲבֹ֑ת מִגְבָּלֹ֛ת מַעֲשֵׂ֣ה עֲבֹ֑ת
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
clause
phrase גַּם־ה֑וּא
phrase אֲבִי֙ כָּל־בְּנֵי־עֵ֔בֶר
phrase אֲחִ֖י יֶ֥פֶת הַגָּדֹֽול׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 2

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
clause
phrase בַּת־בְּתוּאֵ֖ל
phrase אָנֹ֑כִי
phrase בֶּן־מִלְכָּ֕ה
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 3

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
clause
phrase אֱלֹהֵ֨י אַבְרָהָ֜ם וֵֽאלֹהֵ֤י נָחֹור֙
phrase יִשְׁפְּט֣וּ
phrase בֵינֵ֔ינוּ
phrase אֱלֹהֵ֖י אֲבִיהֶ֑ם
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 4

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
clause
phrase לְכַהֲנֹו־
phrase לִ֑י
phrase אַהֲרֹ֕ן נָדָ֧ב וַאֲבִיה֛וּא אֶלְעָזָ֥ר וְאִיתָמָ֖ר בְּנֵ֥י אַהֲרֹֽן׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 5

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
clause
phrase מִגְבָּלֹ֛ת
phrase תַּעֲשֶׂ֥ה
phrase אֹתָ֖ם
phrase מַעֲשֵׂ֣ה עֲבֹ֑ת
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "c:clause\n", " :: p:phrase\n", "c ## p\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, start=1, end=5, baseTypes=\"phrase\")\n", "A.show(results, start=1, end=5, condenseType=\"clause\", baseTypes=\"phrase\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Here you see an extra phrase in such clauses, lying embedded in the clause-spanning phrase.\n", "\n", "A nice case of **Mind the gap!**." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "In the common domain:" ] }, { "cell_type": "code", "execution_count": 31, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:15:45.123089Z", "start_time": "2018-05-24T09:15:19.569287Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.59s 1 result\n" ] }, { "data": { "text/html": [ "\n", "
npclausephrase
1Leviticus 11:9אֶת־זֶה֙ תֹּֽאכְל֔וּ מִכֹּ֖ל אֶת־זֶה֙ מִכֹּ֖ל
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
clause
otype=clause
phrase אֶת־זֶה֙
otype=phrase
phrase תֹּֽאכְל֔וּ
otype=phrase
phrase מִכֹּ֖ל
otype=phrase
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "c:common otype=clause\n", " :: p:common otype=phrase\n", "c ## p\n", "\"\"\"\n", "resultsCustom = A.search(query, sets=customSets)\n", "A.table(resultsCustom, start=1, end=10, baseTypes=\"phrase\")\n", "A.show(resultsCustom, start=1, end=5, condenseType=\"clause\", baseTypes=\"phrase\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `<:` adjacent before\n", "This relation holds when the left hand sides ends in a slot that lies before the first slot of the right hand side.\n", "It serves to enforce an ordering between siblings of a parent.\n", "\n", "## `:>` adjacent after\n", "This relation holds when the left hand sides starts in a slot that lies after the last slot of the right hand side." ] }, { "cell_type": "code", "execution_count": 32, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:33:15.105847Z", "start_time": "2018-05-24T09:33:14.237419Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.64s 162653 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "
npclausephrasephrase
1Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ בְּרֵאשִׁ֖ית בָּרָ֣א
2Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ בָּרָ֣א אֱלֹהִ֑ים
3Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "clause\n", " phrase\n", " <: phrase\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, start=1, end=3, baseTypes=\"phrase\")" ] }, { "cell_type": "code", "execution_count": 33, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:33:15.105847Z", "start_time": "2018-05-24T09:33:14.237419Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.64s 162653 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "
npclausephrasephrase
1Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ בָּרָ֣א בְּרֵאשִׁ֖ית
2Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ אֱלֹהִ֑ים בָּרָ֣א
3Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ אֱלֹהִ֑ים
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "clause\n", " phrase\n", " :> phrase\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, start=1, end=3, baseTypes=\"phrase\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Playing with common and rare:" ] }, { "cell_type": "code", "execution_count": 34, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:33:15.105847Z", "start_time": "2018-05-24T09:33:14.237419Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.65s 25956 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "
npclausephrasephrase
1Genesis 1:2וְהָאָ֗רֶץ הָיְתָ֥ה תֹ֨הוּ֙ וָבֹ֔הוּ הָיְתָ֥ה תֹ֨הוּ֙ וָבֹ֔הוּ
2Genesis 1:4כִּי־טֹ֑וב כִּי־טֹ֑וב
3Genesis 1:4וַיַּבְדֵּ֣ל אֱלֹהִ֔ים בֵּ֥ין הָאֹ֖ור וּבֵ֥ין הַחֹֽשֶׁךְ׃ וַיַּבְדֵּ֣ל
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "clause\n", " common otype=phrase\n", " <: rare otype=phrase\n", "\"\"\"\n", "resultsCommon = A.search(query, sets=customSets)\n", "A.table(resultsCommon, start=1, end=3, baseTypes=\"phrase\")" ] }, { "cell_type": "code", "execution_count": 35, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:33:15.105847Z", "start_time": "2018-05-24T09:33:14.237419Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.63s 9684 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "
npclausephrasephrase
1Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ אֱלֹהִ֑ים בָּרָ֣א
2Genesis 1:2וְר֣וּחַ אֱלֹהִ֔ים מְרַחֶ֖פֶת עַל־פְּנֵ֥י הַמָּֽיִם׃ עַל־פְּנֵ֥י הַמָּֽיִם׃ מְרַחֶ֖פֶת
3Genesis 1:4וַיַּבְדֵּ֣ל אֱלֹהִ֔ים בֵּ֥ין הָאֹ֖ור וּבֵ֥ין הַחֹֽשֶׁךְ׃ אֱלֹהִ֔ים יַּבְדֵּ֣ל
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "clause\n", " common otype=phrase\n", " :> rare otype=phrase\n", "\"\"\"\n", "resultsCommon = A.search(query, sets=customSets)\n", "A.table(resultsCommon, start=1, end=3, baseTypes=\"phrase\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Another example: are there clauses with multiple clause atoms without a gap between the two?" ] }, { "cell_type": "code", "execution_count": 36, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:33:15.105847Z", "start_time": "2018-05-24T09:33:14.237419Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.16s 0 results\n" ] }, { "data": { "text/html": [ "
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "clause\n", " clause_atom\n", " <: clause_atom\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, start=1, end=10, baseTypes=\"clause_atom\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Conclusion: there is always textual material between clause atoms of the same clause.\n", "If we lift the adjacency to sequentially before (`<<`) we do get results:" ] }, { "cell_type": "code", "execution_count": 37, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T09:34:14.004381Z", "start_time": "2018-05-24T09:34:13.002485Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.18s 2707 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npclauseclause_atomclause_atom
1Genesis 1:7וַיַּבְדֵּ֗ל בֵּ֤ין הַמַּ֨יִם֙ וּבֵ֣ין הַמַּ֔יִם וַיַּבְדֵּ֗ל בֵּ֤ין הַמַּ֨יִם֙ וּבֵ֣ין הַמַּ֔יִם
2Genesis 1:11תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב עֵ֣ץ פְּרִ֞י תַּֽדְשֵׁ֤א הָאָ֨רֶץ֙ דֶּ֔שֶׁא עֵ֚שֶׂב עֵ֣ץ פְּרִ֞י
3Genesis 1:11עֹ֤שֶׂה פְּרִי֙ לְמִינֹ֔ו עַל־הָאָ֑רֶץ עֹ֤שֶׂה פְּרִי֙ לְמִינֹ֔ו עַל־הָאָ֑רֶץ
4Genesis 1:12וַתֹּוצֵ֨א הָאָ֜רֶץ דֶּ֠שֶׁא עֵ֣שֶׂב וְעֵ֧ץ וַתֹּוצֵ֨א הָאָ֜רֶץ דֶּ֠שֶׁא עֵ֣שֶׂב וְעֵ֧ץ
5Genesis 1:12עֹ֥שֶׂה פְּרִ֛י לְמִינֵ֑הוּ עֹ֥שֶׂה פְּרִ֛י לְמִינֵ֑הוּ
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
sentence
clause
phrase וַ
phrase יַּ֣עַשׂ
phrase אֱלֹהִים֮
phrase אֶת־הָרָקִיעַ֒
sentence
clause
phrase וַ
phrase יַּבְדֵּ֗ל
phrase בֵּ֤ין הַמַּ֨יִם֙
clause
phrase אֲשֶׁר֙
phrase מִתַּ֣חַת לָרָקִ֔יעַ
clause
phrase וּבֵ֣ין הַמַּ֔יִם
clause
phrase אֲשֶׁ֖ר
phrase מֵעַ֣ל לָרָקִ֑יעַ
sentence
clause
phrase וַֽ
phrase יְהִי־
phrase כֵֽן׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "clause\n", " clause_atom\n", " << clause_atom\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, start=1, end=5, baseTypes=\"clause_atom\")\n", "A.show(results, start=1, end=1, baseTypes=\"clause_atom\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Nearness\n", "\n", "The relations with `:` in their name always have a requirement somewhere that a slot of the\n", "left hand node equals a slot of the right hand node, or that the two are adjacent.\n", "\n", "All these relationships can be relaxed by a **nearness number**.\n", "If you put a number `k` inside the relationship symbols, those restrictions will be relaxed to\n", "*the one slot and the other slot should have a mutual distance of at most `k`*.\n", "\n", "## `=k:` same start within `k` slots\n", "\n", "Here is an example.\n", "\n", "First we look for clauses, with a phrase in it that starts at the\n", "same slot as the clause." ] }, { "cell_type": "code", "execution_count": 38, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:04.112246Z", "start_time": "2018-05-24T10:07:03.896523Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.16s 126 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "chapter book=Genesis chapter=1\n", " clause\n", " =: phrase\n", "\"\"\"\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now we add a bit of freedom, but not much: 0. Indeed, this is no extra\n", "freedom, and it should give the same number of results." ] }, { "cell_type": "code", "execution_count": 39, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:07.275070Z", "start_time": "2018-05-24T10:07:07.067280Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.20s 126 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "chapter book=Genesis chapter=1\n", " clause\n", " =0: phrase\n", "\"\"\"\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now we add real freedom: 1 and 2" ] }, { "cell_type": "code", "execution_count": 40, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:08.830980Z", "start_time": "2018-05-24T10:07:08.610457Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.13s 236 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "chapter book=Genesis chapter=1\n", " clause\n", " =1: phrase\n", "\"\"\"\n", ")" ] }, { "cell_type": "code", "execution_count": 41, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:10.594598Z", "start_time": "2018-05-24T10:07:10.370887Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.12s 315 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "chapter book=Genesis chapter=1\n", " clause\n", " =2: phrase\n", "\"\"\"\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let us see some cases:" ] }, { "cell_type": "code", "execution_count": 42, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:12.554216Z", "start_time": "2018-05-24T10:07:12.507558Z" } }, "outputs": [ { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
npclausephrase
1Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ בְּרֵאשִׁ֖ית
2Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ בָּרָ֣א
3Genesis 1:2וְהָאָ֗רֶץ הָיְתָ֥ה תֹ֨הוּ֙ וָבֹ֔הוּ וְ
4Genesis 1:2וְהָאָ֗רֶץ הָיְתָ֥ה תֹ֨הוּ֙ וָבֹ֔הוּ הָאָ֗רֶץ
5Genesis 1:2וְחֹ֖שֶׁךְ עַל־פְּנֵ֣י תְהֹ֑ום וְ
6Genesis 1:2וְחֹ֖שֶׁךְ עַל־פְּנֵ֣י תְהֹ֑ום חֹ֖שֶׁךְ
7Genesis 1:2וְחֹ֖שֶׁךְ עַל־פְּנֵ֣י תְהֹ֑ום עַל־פְּנֵ֣י תְהֹ֑ום
8Genesis 1:2וְר֣וּחַ אֱלֹהִ֔ים מְרַחֶ֖פֶת עַל־פְּנֵ֥י הַמָּֽיִם׃ וְ
9Genesis 1:2וְר֣וּחַ אֱלֹהִ֔ים מְרַחֶ֖פֶת עַל־פְּנֵ֥י הַמָּֽיִם׃ ר֣וּחַ אֱלֹהִ֔ים
10Genesis 1:3וַיֹּ֥אמֶר אֱלֹהִ֖ים וַ
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
book=Genesischapter=1
sentence
clause
phrase בְּרֵאשִׁ֖ית
phrase בָּרָ֣א
phrase אֱלֹהִ֑ים
phrase אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 2

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
book=Genesischapter=1
sentence
clause
phrase בְּרֵאשִׁ֖ית
phrase בָּרָ֣א
phrase אֱלֹהִ֑ים
phrase אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 3

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
book=Genesischapter=1
sentence
clause
phrase וְ
phrase הָאָ֗רֶץ
phrase הָיְתָ֥ה
phrase תֹ֨הוּ֙ וָבֹ֔הוּ
sentence
clause
phrase וְ
phrase חֹ֖שֶׁךְ
phrase עַל־פְּנֵ֣י תְהֹ֑ום
sentence
clause
phrase וְ
phrase ר֣וּחַ אֱלֹהִ֔ים
phrase מְרַחֶ֖פֶת
phrase עַל־פְּנֵ֥י הַמָּֽיִם׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 4

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
book=Genesischapter=1
sentence
clause
phrase וְ
phrase הָאָ֗רֶץ
phrase הָיְתָ֥ה
phrase תֹ֨הוּ֙ וָבֹ֔הוּ
sentence
clause
phrase וְ
phrase חֹ֖שֶׁךְ
phrase עַל־פְּנֵ֣י תְהֹ֑ום
sentence
clause
phrase וְ
phrase ר֣וּחַ אֱלֹהִ֔ים
phrase מְרַחֶ֖פֶת
phrase עַל־פְּנֵ֥י הַמָּֽיִם׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.table(results, start=1, end=10, baseTypes=\"phrase\", skipCols=\"1\")\n", "A.show(\n", " results,\n", " condensed=False,\n", " start=1,\n", " end=4,\n", " colorMap={2: \"yellow\", 3: \"cyan\"},\n", " baseTypes=\"phrase\",\n", " skipCols=\"1\",\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The first and second result show the same clause, with its first and second phrase respectively.\n", "\n", "Note that we look for phrases that lie embedded in their clause.\n", "So we do not get phrases of a preceding clause.\n", "\n", "But if we want, we can get those as well." ] }, { "cell_type": "code", "execution_count": 43, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:18.484171Z", "start_time": "2018-05-24T10:07:18.052778Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.03s 485 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "chapter book=Genesis chapter=1\n", " c:clause\n", " p:phrase\n", "\n", " c =2: p\n", "\"\"\"\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We have more results now. Here is a closer look:" ] }, { "cell_type": "code", "execution_count": 44, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:21.149115Z", "start_time": "2018-05-24T10:07:21.113629Z" } }, "outputs": [ { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "
npclausephrase
1Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ בְּרֵאשִׁ֖ית
2Genesis 1:1בְּרֵאשִׁ֖ית בָּרָ֣א אֱלֹהִ֑ים אֵ֥ת הַשָּׁמַ֖יִם וְאֵ֥ת הָאָֽרֶץ׃ בָּרָ֣א
3Genesis 1:2וְהָאָ֗רֶץ הָיְתָ֥ה תֹ֨הוּ֙ וָבֹ֔הוּ וְ
4Genesis 1:2וְהָאָ֗רֶץ הָיְתָ֥ה תֹ֨הוּ֙ וָבֹ֔הוּ הָאָ֗רֶץ
5Genesis 1:2וְחֹ֖שֶׁךְ עַל־פְּנֵ֣י תְהֹ֑ום וְ
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 12

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
book=Genesischapter=1
sentence
clause
phrase וַ
phrase יֹּ֥אמֶר
phrase אֱלֹהִ֖ים
sentence
clause
phrase יְהִ֣י
phrase אֹ֑ור
sentence
clause
phrase וַֽ
phrase יְהִי־
phrase אֹֽור׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 13

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
book=Genesischapter=1
sentence
clause
phrase וַ
phrase יֹּ֥אמֶר
phrase אֱלֹהִ֖ים
sentence
clause
phrase יְהִ֣י
phrase אֹ֑ור
sentence
clause
phrase וַֽ
phrase יְהִי־
phrase אֹֽור׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 14

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
verse
book=Genesischapter=1
sentence
clause
phrase וַ
phrase יֹּ֥אמֶר
phrase אֱלֹהִ֖ים
sentence
clause
phrase יְהִ֣י
phrase אֹ֑ור
sentence
clause
phrase וַֽ
phrase יְהִי־
phrase אֹֽור׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.table(results, start=1, end=5, baseTypes=\"phrase\", skipCols=\"1\")\n", "A.show(\n", " results,\n", " condensed=False,\n", " start=12,\n", " end=14,\n", " colorMap={2: \"yellow\", 3: \"cyan\"},\n", " baseTypes=\"phrase\",\n", " skipCols=\"1\",\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Here you see in results 13 and 14 a phrase of the previous clause." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Lets also play with common and rare:" ] }, { "cell_type": "code", "execution_count": 45, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:04.112246Z", "start_time": "2018-05-24T10:07:03.896523Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.30s 10313 results\n" ] } ], "source": [ "resultsCommon = A.search(\n", " \"\"\"\n", "verse\n", " clause\n", " =: rare otype=phrase\n", "\"\"\",\n", " sets=customSets,\n", ")" ] }, { "cell_type": "code", "execution_count": 46, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:04.112246Z", "start_time": "2018-05-24T10:07:03.896523Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.35s 10313 results\n" ] } ], "source": [ "resultsCommon = A.search(\n", " \"\"\"\n", "verse\n", " clause\n", " =0: rare otype=phrase\n", "\"\"\",\n", " sets=customSets,\n", ")" ] }, { "cell_type": "code", "execution_count": 47, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:04.112246Z", "start_time": "2018-05-24T10:07:03.896523Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.46s 26113 results\n" ] } ], "source": [ "resultsCommon = A.search(\n", " \"\"\"\n", "verse\n", " clause\n", " =1: rare otype=phrase\n", "\"\"\",\n", " sets=customSets,\n", ")" ] }, { "cell_type": "code", "execution_count": 48, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:07:04.112246Z", "start_time": "2018-05-24T10:07:03.896523Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.54s 39382 results\n" ] } ], "source": [ "resultsCommon = A.search(\n", " \"\"\"\n", "verse\n", " clause\n", " =2: rare otype=phrase\n", "\"\"\",\n", " sets=customSets,\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## `:k=` same end within `k` slots\n", "\n", "## `:k:` same start and end within `k` slots" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# All steps\n", "\n", "* **[start](start.ipynb)** your first step in mastering the bible computationally\n", "* **[display](display.ipynb)** become an expert in creating pretty displays of your text structures\n", "* **[search](search.ipynb)** turbo charge your hand-coding with search templates\n", "\n", "---\n", "\n", "[advanced](searchAdvanced.ipynb)\n", "[sets](searchSets.ipynb)\n", "relations\n", "\n", "You are comfortable in space now.\n", "\n", "Ready to enter a whole new dimension?\n", "\n", "[quantifiers](searchQuantifiers.ipynb)\n", "[from MQL](searchFromMQL.ipynb)\n", "[rough](searchRough.ipynb)\n", "[gaps](searchGaps.ipynb)\n", "\n", "---\n", "\n", "* **[export Excel](exportExcel.ipynb)** make tailor-made spreadsheets out of your results\n", "* **[share](share.ipynb)** draw in other people's data and let them use yours\n", "* **[export](export.ipynb)** export your dataset as an Emdros database\n", "* **[annotate](annotate.ipynb)** annotate plain text by means of other tools and import the annotations as TF features\n", "* **[map](map.ipynb)** map somebody else's annotations to a new version of the corpus\n", "* **[volumes](volumes.ipynb)** work with selected books only\n", "* **[trees](trees.ipynb)** work with the BHSA data as syntax trees\n", "\n", "CC-BY Dirk Roorda" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3 (ipykernel)", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.11.1" }, "widgets": { "application/vnd.jupyter.widget-state+json": { "state": {}, "version_major": 2, "version_minor": 0 } } }, "nbformat": 4, "nbformat_minor": 4 }