{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "\n", "\n", "\n", "\n", "---\n", "\n", "To get started: consult [start](start.ipynb)\n", "\n", "---\n", "\n", "# Sharing data features\n", "\n", "## Explore additional data\n", "\n", "Once you analyse a corpus, it is likely that you produce data that others can reuse.\n", "Maybe you have defined a set of proper name occurrences, or special numerals, or you have computed part-of-speech assignments.\n", "\n", "It is possible to turn these insights into *new features*, i.e. new `.tf` files with values assigned to specific nodes.\n", "\n", "## Make your own data\n", "\n", "New data is a product of your own methods and computations in the first place.\n", "But how do you turn that data into new TF features?\n", "It turns out that the last step is not that difficult.\n", "\n", "If you can shape your data as a mapping (dictionary) from node numbers (integers) to values\n", "(strings or integers), then TF can turn that data into a feature file for you with one command.\n", "\n", "## Share your new data\n", "You can then easily share your new features on GitHub, so that your colleagues everywhere\n", "can try it out for themselves.\n", "\n", "You can add such data on the fly, by passing a `mod={org}/{repo}/{path}` parameter,\n", "or a bunch of them separated by commas.\n", "\n", "If the data is there, it will be auto-downloaded and stored on your machine.\n", "\n", "Let's do it." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "%load_ext autoreload\n", "%autoreload 2" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:06:39.818664Z", "start_time": "2018-05-24T10:06:39.796588Z" }, "lines_to_next_cell": 2 }, "outputs": [], "source": [ "import os\n", "from tf.app import use" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "lines_to_next_cell": 2 }, "outputs": [ { "data": { "text/html": [ "TF-app: ~/text-fabric-data/etcbc/dss/app" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/dss/tf/0.9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/dss/parallels/tf/0.9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "This is Text-Fabric 9.2.2\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", "67 features found and 1 ignored\n" ] }, { "data": { "text/html": [ "Text-Fabric: Text-Fabric API 9.2.2, etcbc/dss/app v3, Search Reference
Data: DSS, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", "
\n", "
\n", "sim\n", "
\n", "
int
\n", "
\n", " similarity between lines, as a percentage of the common material wrt the combined material\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2019-05-09
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2019-06-11T14:51:21Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
sourceCreatedBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
sourceCreatedDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
sourceDescription:
\n", "
Dead Sea Scrolls: biblical and non-biblical scrolls
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "\n", "
Dead Sea Scrolls\n", "
\n", "\n", "
\n", "
\n", "after\n", "
\n", "
str
\n", "
\n", " space behind the word, if any\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:55Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
(space)
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "alt\n", "
\n", "
int
\n", "
\n", " alternative reading\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "biblical\n", "
\n", "
int
\n", "
\n", " whether we are in biblical material or not\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
applies:
\n", "
scroll fragment line cluster word
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
remark:
\n", "
for lines it means that the material is taken from the bib source while there is also material for this line in the nonbib source. But the nonbib material is either identical or virtually absent, in which case the bib material is a reconstruction and marked as such.
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1=biblical, 2=biblical but also with nonbiblical material
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "book\n", "
\n", "
str
\n", "
\n", " acronym of the book in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "chapter\n", "
\n", "
str
\n", "
\n", " label of the chapter in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cl\n", "
\n", "
str
\n", "
\n", " class (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
advb, art, artp, card, cmn, conj, gent, indp, intj, intr, mult, nega, objm, ord, prep, prp, rela, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cl2\n", "
\n", "
str
\n", "
\n", " class (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
d, h, n, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cor\n", "
\n", "
int
\n", "
\n", " correction made by an ancient or modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 = modern, 2 = ancient, 3 = ancient supralinear
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fragment\n", "
\n", "
str
\n", "
\n", " label of a fragment of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "full\n", "
\n", "
str
\n", "
\n", " full transcription (Unicode) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fulle\n", "
\n", "
str
\n", "
\n", " full transcription (ETCBC transliteration) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fullo\n", "
\n", "
str
\n", "
\n", " full transcription (original source) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_cons\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glex\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glexe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:59Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glexo\n", "
\n", "
str
\n", "
\n", " representation (original source) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:59Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glyph\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:00Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glyphe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:02Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glypho\n", "
\n", "
str
\n", "
\n", " representation (original source) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:04Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn\n", "
\n", "
str
\n", "
\n", " gender (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
b, c, f, m, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn2\n", "
\n", "
str
\n", "
\n", " gender (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
c, f, m, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn3\n", "
\n", "
str
\n", "
\n", " gender (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
c, f, m
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "halfverse\n", "
\n", "
str
\n", "
\n", " label of the half-verse in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "intl\n", "
\n", "
int
\n", "
\n", " interlinear material, the value indicates the sequence number of the interlinear line\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lang\n", "
\n", "
str
\n", "
\n", " language of a word or sign, only if it is not Hebrew\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
g=greek, a=aramaic
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:07Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lexe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:07Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lexo\n", "
\n", "
str
\n", "
\n", " representation (original source) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "line\n", "
\n", "
str
\n", "
\n", " label of a line of a fragment of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "md\n", "
\n", "
str
\n", "
\n", " mood (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
coho, cons, juss, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "merr\n", "
\n", "
str
\n", "
\n", " errors in parsing the morphology tag\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "morpho\n", "
\n", "
str
\n", "
\n", " morphological tag (by Abegg)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nr\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu\n", "
\n", "
str
\n", "
\n", " number (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
d, p, s, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu2\n", "
\n", "
str
\n", "
\n", " number (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
p, s, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu3\n", "
\n", "
str
\n", "
\n", " number (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
s
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "otype\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: biblical and non-biblical scrolls\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps\n", "
\n", "
str
\n", "
\n", " person (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps2\n", "
\n", "
str
\n", "
\n", " person (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps3\n", "
\n", "
str
\n", "
\n", " person (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punc\n", "
\n", "
str
\n", "
\n", " trailing punctuation (Unicode) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punce\n", "
\n", "
str
\n", "
\n", " trailing punctuation (ETCBC transliteration) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punco\n", "
\n", "
str
\n", "
\n", " trailing punctuation (original source) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rec\n", "
\n", "
int
\n", "
\n", " reconstructed by a modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rem\n", "
\n", "
int
\n", "
\n", " removed by an ancient or modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 = modern, 2 = ancient
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "script\n", "
\n", "
str
\n", "
\n", " script in which the word or sign is written if it is not Hebrew\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
paleohebrew greekcapital
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "scroll\n", "
\n", "
str
\n", "
\n", " acronym of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "sp\n", "
\n", "
str
\n", "
\n", " part of speech (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
adjv, numr, pron, ptcl, subs, suff, unknown, verb
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "sp_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "srcLn\n", "
\n", "
int
\n", "
\n", " the line number of the word in the source data file\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "st\n", "
\n", "
str
\n", "
\n", " state (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:13Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
a, c, d, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "type\n", "
\n", "
str
\n", "
\n", " type of sign or cluster\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:13Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "unc\n", "
\n", "
int
\n", "
\n", " uncertain material in various degrees: higher degree is less certain\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 2 3 4
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vac\n", "
\n", "
int
\n", "
\n", " empty, unwritten space\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "verse\n", "
\n", "
str
\n", "
\n", " label of the verse in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vs\n", "
\n", "
str
\n", "
\n", " verbal stem (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
aphel, apoel, haphel, hifil, hishtafel, hishtaphel, hithaphel, hithpaal, hithpeel, hithpolel, hitopel, hitpael, hitpalpel, hitpoel, hofal, hophal, hotpaal, hpealal, ishtaphel, ithpaal, ithpeel, ithpoel, nifal, nitpael, pael, palel, passive, peal, peil, piel, pilpel, poal, poel, polal, polel, pual, pulal, qal, shaphel, tifil, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vs_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vt\n", "
\n", "
str
\n", "
\n", " verbal tense/aspect (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:16Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
impf, impv, infa, infc, perf, ptca, ptcp, unknown, wayy
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vt_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:16Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "occ\n", "
\n", "
none
\n", "
\n", " edge feature from a lexeme to its occurrences\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:17Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "oslots\n", "
\n", "
none
\n", "
\n", " Dead Sea Scrolls: biblical and non-biblical scrolls\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:17Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
Text-Fabric API: names N F E L T S C TF directly usable

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A = use(\"etcbc/dss\", hoist=globals())" ] }, { "cell_type": "markdown", "metadata": { "lines_to_next_cell": 2 }, "source": [ "# Making data\n", "\n", "We illustrate the data creation part by creating a new feature, `cert`.\n", "The idea is that we mark every consonant sign voor certainty.\n", "\n", "A certain consonant gets `cert=100`.\n", "\n", "If the consonant has the uncertain feature `unc`, then 10 times its value is subtracted from 100.\n", "\n", "If the consonant has the feature `rec`, it loses 45 points.\n", "\n", "Ancient removal `rem2` leads to minus 20, modern removal `rem` to minus 40.\n", "\n", "Ancient correction `cor2` leads to minus 12, modern correction `cor` to minus 18.\n", "\n", "Alternate marking `alt` leads to minus 25.\n", "\n", "The minimum is `1`.\n", "\n", "We extend the `cert` measure to words, fragments and scrolls by averaging over signs that\n", "have received a `cert` measure." ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [], "source": [ "def measure(s):\n", " c = 100\n", " d = F.unc.v(s)\n", " if d:\n", " c -= 10 * d\n", " d = F.rec.v(s)\n", " if d:\n", " c -= 45\n", " d = F.rem.v(s)\n", " if d == 1:\n", " c -= 40\n", " elif d == 2:\n", " c -= 20\n", " d = F.cor.v(s)\n", " if d == 2 or d == 3:\n", " c -= 12\n", " elif d == 1:\n", " c -= 18\n", " d = F.alt.v(s)\n", " if d:\n", " c -= 25\n", " if c < 1:\n", " c = 1\n", " return c" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 3.85s 1625373 certainties determined\n" ] } ], "source": [ "CONS = \"cons\"\n", "cert = {}\n", "\n", "A.indent(reset=True)\n", "\n", "for sc in F.otype.s(\"scroll\"):\n", " fN = 0\n", " fSum = 0\n", " for f in L.d(sc, otype=\"fragment\"):\n", " lN = 0\n", " lSum = 0\n", " for ln in L.d(f, otype=\"line\"):\n", " wN = 0\n", " wSum = 0\n", " for w in L.d(ln, otype=\"word\"):\n", " sN = 0\n", " sSum = 0\n", " for s in L.d(w, otype=\"sign\"):\n", " if F.type.v(s) != CONS:\n", " continue\n", " sCert = measure(s)\n", " cert[s] = sCert\n", " sN += 1\n", " sSum += sCert\n", " if sN:\n", " wCert = int(round(sSum / sN))\n", " cert[w] = wCert\n", " wN += 1\n", " wSum += wCert\n", " if wN:\n", " lCert = int(round(wSum / wN))\n", " cert[ln] = lCert\n", " lN += 1\n", " lSum += lCert\n", " if lN:\n", " fCert = int(round(lSum / lN))\n", " cert[f] = fCert\n", " fN += 1\n", " fSum += fCert\n", " if fN:\n", " scCert = int(round(fSum / fN))\n", " cert[sc] = scCert\n", "\n", "A.info(f\"{len(cert)} certainties determined\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Saving data\n", "\n", "The [documentation](https://annotation.github.io/text-fabric/tf/core/fabric.html#tf.core.fabric.FabricCore.save) explains how to save this data into a text-fabric\n", "data file.\n", "\n", "We choose a location where to save it, the `exercises` folder in the `dss` repository in the `dss` organization.\n", "\n", "In order to do this, we restart the TF API, but now with the desired output location in the `locations` parameter." ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "GITHUB = os.path.expanduser(\"~/github\")\n", "ORG = \"etcbc\"\n", "REPO = \"dss\"\n", "PATH = \"exercises\"\n", "VERSION = A.version" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Note the version: we have built the version against a specific version of the data:" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'0.9'" ] }, "execution_count": 7, "metadata": {}, "output_type": "execute_result" } ], "source": [ "A.version" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Later on, we pass this version on, so that users of our data will get the shared data in exactly the same version as their core data." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We have to specify a bit of metadata for this feature:" ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [], "source": [ "metaData = {\n", " \"cert\": dict(\n", " valueType=\"int\",\n", " description=\"measure of certainty of material, between 1 and 100 (most certain)\",\n", " creator=\"Dirk Roorda\",\n", " ),\n", "}" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now we can give the save command:" ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.00s Exporting 1 node and 0 edge and 0 config features to ~/github/etcbc/dss/exercises/tf/0.9:\n", " | 1.31s T cert to ~/github/etcbc/dss/exercises/tf/0.9\n", " 1.31s Exported 1 node features and 0 edge features and 0 config features to ~/github/etcbc/dss/exercises/tf/0.9\n" ] }, { "data": { "text/plain": [ "True" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "TF.save(\n", " nodeFeatures=dict(cert=cert),\n", " metaData=metaData,\n", " location=f\"{GITHUB}/{ORG}/{REPO}/{PATH}/tf\",\n", " module=VERSION,\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Sharing data\n", "\n", "How to share your own data is explained in the\n", "[documentation](https://annotation.github.io/text-fabric/tf/about/datasharing.html).\n", "\n", "Here we show it step by step for the `cert` feature.\n", "\n", "If you commit your changes to the exercises repo, and have done a `git push origin master`,\n", "you already have shared your data!\n", "\n", "If you want to make a stable release, so that you can keep developing, while your users fall back\n", "on the stable data, you can make a new release.\n", "\n", "Go to the GitHub website for that, go to your repo, and click *Releases* and follow the nudges.\n", "\n", "If you want to make it even smoother for your users, you can zip the data and attach it as a binary to the release just created.\n", "\n", "We need to zip the data in exactly the right directory structure. Text-Fabric can do that for us:" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "This is a TF dataset\n", "Create release data for etcbc/dss/exercises/tf\n", "Found 5 versions\n", "zip files end up in ~/Downloads/etcbc-release/dss\n", "zipping etcbc/dss 0.3 with 1 features ==> exercises-tf-0.3.zip\n", "zipping etcbc/dss 0.4 with 1 features ==> exercises-tf-0.4.zip\n", "zipping etcbc/dss 0.5 with 1 features ==> exercises-tf-0.5.zip\n", "zipping etcbc/dss 0.6 with 1 features ==> exercises-tf-0.6.zip\n", "zipping etcbc/dss 0.9 with 1 features ==> exercises-tf-0.9.zip\n" ] } ], "source": [ "%%sh\n", "\n", "text-fabric-zip etcbc/dss/exercises/tf" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "All versions have been zipped, but it works OK if you only attach the newest version to the newest release.\n", "\n", "If a user asks for an older version in this release, the system can still find it." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Use the data\n", "\n", "We can use the data by calling it up when we say `use('ETCBC/dss', ...)`.\n", "\n", "Here is how:\n", "\n", "(use the line without `clone` if the data is really published,\n", "use the line with `clone` if you want to test your local copy of the feature)." ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "data": { "text/html": [ "TF-app: ~/text-fabric-data/etcbc/dss/app" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/dss/tf/0.9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/github/etcbc/dss/exercises/tf/0.9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/dss/parallels/tf/0.9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "This is Text-Fabric 9.2.2\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", "68 features found and 1 ignored\n", " | 6.84s T cert from ~/github/etcbc/dss/exercises/tf/0.9\n" ] }, { "data": { "text/html": [ "Text-Fabric: Text-Fabric API 9.2.2, etcbc/dss/app v3, Search Reference
Data: DSS, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", "
\n", "
\n", "sim\n", "
\n", "
int
\n", "
\n", " similarity between lines, as a percentage of the common material wrt the combined material\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2019-05-09
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2019-06-11T14:51:21Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
sourceCreatedBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
sourceCreatedDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
sourceDescription:
\n", "
Dead Sea Scrolls: biblical and non-biblical scrolls
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "\n", "
etcbc/dss/exercises/tf\n", "
\n", "\n", "
\n", "
\n", "cert\n", "
\n", "
int
\n", "
\n", " measure of certainty of material, between 1 and 100 (most certain)\n", "
\n", "\n", "
\n", "
creator:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-01-31T10:58:59Z
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "\n", "
Dead Sea Scrolls\n", "
\n", "\n", "
\n", "
\n", "after\n", "
\n", "
str
\n", "
\n", " space behind the word, if any\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:55Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
(space)
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "alt\n", "
\n", "
int
\n", "
\n", " alternative reading\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "biblical\n", "
\n", "
int
\n", "
\n", " whether we are in biblical material or not\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
applies:
\n", "
scroll fragment line cluster word
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
remark:
\n", "
for lines it means that the material is taken from the bib source while there is also material for this line in the nonbib source. But the nonbib material is either identical or virtually absent, in which case the bib material is a reconstruction and marked as such.
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1=biblical, 2=biblical but also with nonbiblical material
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "book\n", "
\n", "
str
\n", "
\n", " acronym of the book in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "chapter\n", "
\n", "
str
\n", "
\n", " label of the chapter in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cl\n", "
\n", "
str
\n", "
\n", " class (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
advb, art, artp, card, cmn, conj, gent, indp, intj, intr, mult, nega, objm, ord, prep, prp, rela, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cl2\n", "
\n", "
str
\n", "
\n", " class (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
d, h, n, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cor\n", "
\n", "
int
\n", "
\n", " correction made by an ancient or modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 = modern, 2 = ancient, 3 = ancient supralinear
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fragment\n", "
\n", "
str
\n", "
\n", " label of a fragment of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "full\n", "
\n", "
str
\n", "
\n", " full transcription (Unicode) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fulle\n", "
\n", "
str
\n", "
\n", " full transcription (ETCBC transliteration) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fullo\n", "
\n", "
str
\n", "
\n", " full transcription (original source) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_cons\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glex\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glexe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:59Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glexo\n", "
\n", "
str
\n", "
\n", " representation (original source) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:59Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glyph\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:00Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glyphe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:02Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glypho\n", "
\n", "
str
\n", "
\n", " representation (original source) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:04Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn\n", "
\n", "
str
\n", "
\n", " gender (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
b, c, f, m, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn2\n", "
\n", "
str
\n", "
\n", " gender (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
c, f, m, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn3\n", "
\n", "
str
\n", "
\n", " gender (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
c, f, m
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "halfverse\n", "
\n", "
str
\n", "
\n", " label of the half-verse in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "intl\n", "
\n", "
int
\n", "
\n", " interlinear material, the value indicates the sequence number of the interlinear line\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lang\n", "
\n", "
str
\n", "
\n", " language of a word or sign, only if it is not Hebrew\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
g=greek, a=aramaic
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:07Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lexe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:07Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lexo\n", "
\n", "
str
\n", "
\n", " representation (original source) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "line\n", "
\n", "
str
\n", "
\n", " label of a line of a fragment of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "md\n", "
\n", "
str
\n", "
\n", " mood (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
coho, cons, juss, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "merr\n", "
\n", "
str
\n", "
\n", " errors in parsing the morphology tag\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "morpho\n", "
\n", "
str
\n", "
\n", " morphological tag (by Abegg)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nr\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu\n", "
\n", "
str
\n", "
\n", " number (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
d, p, s, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu2\n", "
\n", "
str
\n", "
\n", " number (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
p, s, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu3\n", "
\n", "
str
\n", "
\n", " number (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
s
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "otype\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: biblical and non-biblical scrolls\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps\n", "
\n", "
str
\n", "
\n", " person (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps2\n", "
\n", "
str
\n", "
\n", " person (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps3\n", "
\n", "
str
\n", "
\n", " person (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punc\n", "
\n", "
str
\n", "
\n", " trailing punctuation (Unicode) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punce\n", "
\n", "
str
\n", "
\n", " trailing punctuation (ETCBC transliteration) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punco\n", "
\n", "
str
\n", "
\n", " trailing punctuation (original source) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rec\n", "
\n", "
int
\n", "
\n", " reconstructed by a modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rem\n", "
\n", "
int
\n", "
\n", " removed by an ancient or modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 = modern, 2 = ancient
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "script\n", "
\n", "
str
\n", "
\n", " script in which the word or sign is written if it is not Hebrew\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
paleohebrew greekcapital
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "scroll\n", "
\n", "
str
\n", "
\n", " acronym of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "sp\n", "
\n", "
str
\n", "
\n", " part of speech (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
adjv, numr, pron, ptcl, subs, suff, unknown, verb
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "sp_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "srcLn\n", "
\n", "
int
\n", "
\n", " the line number of the word in the source data file\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "st\n", "
\n", "
str
\n", "
\n", " state (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:13Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
a, c, d, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "type\n", "
\n", "
str
\n", "
\n", " type of sign or cluster\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:13Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "unc\n", "
\n", "
int
\n", "
\n", " uncertain material in various degrees: higher degree is less certain\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 2 3 4
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vac\n", "
\n", "
int
\n", "
\n", " empty, unwritten space\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "verse\n", "
\n", "
str
\n", "
\n", " label of the verse in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vs\n", "
\n", "
str
\n", "
\n", " verbal stem (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
aphel, apoel, haphel, hifil, hishtafel, hishtaphel, hithaphel, hithpaal, hithpeel, hithpolel, hitopel, hitpael, hitpalpel, hitpoel, hofal, hophal, hotpaal, hpealal, ishtaphel, ithpaal, ithpeel, ithpoel, nifal, nitpael, pael, palel, passive, peal, peil, piel, pilpel, poal, poel, polal, polel, pual, pulal, qal, shaphel, tifil, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vs_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vt\n", "
\n", "
str
\n", "
\n", " verbal tense/aspect (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:16Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
impf, impv, infa, infc, perf, ptca, ptcp, unknown, wayy
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vt_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:16Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "occ\n", "
\n", "
none
\n", "
\n", " edge feature from a lexeme to its occurrences\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:17Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "oslots\n", "
\n", "
none
\n", "
\n", " Dead Sea Scrolls: biblical and non-biblical scrolls\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:17Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
Text-Fabric API: names N F E L T S C TF directly usable

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A = use(\n", " \"etcbc/dss\", hoist=globals(), mod=\"etcbc/dss/exercises/tf:clone\"\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Above you see a new section in the feature list: **etcbc/dss/exercises/tf** with our foreign feature in it: `cert`.\n", "\n", "Now, suppose did not know much about this feature, then we would like to do a few basic checks:" ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "((100, 730507),\n", " (55, 667515),\n", " (60, 50110),\n", " (80, 38107),\n", " (70, 20927),\n", " (90, 18767),\n", " (88, 6423),\n", " (85, 4794),\n", " (78, 4499),\n", " (87, 4226),\n", " (95, 4070),\n", " (93, 3997),\n", " (92, 3529),\n", " (58, 3017),\n", " (82, 2839),\n", " (98, 2833),\n", " (72, 2655),\n", " (68, 2628),\n", " (89, 2562),\n", " (56, 2510),\n", " (65, 2496),\n", " (96, 2436),\n", " (57, 2358),\n", " (97, 2275),\n", " (62, 2117),\n", " (63, 2089),\n", " (66, 2076),\n", " (75, 2011),\n", " (73, 1943),\n", " (94, 1904),\n", " (83, 1818),\n", " (84, 1796),\n", " (91, 1766),\n", " (79, 1762),\n", " (61, 1759),\n", " (64, 1754),\n", " (86, 1694),\n", " (74, 1549),\n", " (59, 1479),\n", " (77, 1402),\n", " (43, 1401),\n", " (67, 1284),\n", " (81, 1272),\n", " (76, 1234),\n", " (69, 1193),\n", " (71, 1118),\n", " (99, 1068),\n", " (30, 566),\n", " (35, 352),\n", " (48, 230),\n", " (40, 133),\n", " (50, 106),\n", " (15, 100),\n", " (37, 67),\n", " (45, 49),\n", " (51, 29),\n", " (53, 23),\n", " (49, 22),\n", " (47, 18),\n", " (25, 17),\n", " (54, 14),\n", " (52, 12),\n", " (28, 11),\n", " (42, 11),\n", " (44, 9),\n", " (38, 8),\n", " (46, 7),\n", " (20, 6),\n", " (23, 3),\n", " (36, 3),\n", " (1, 1),\n", " (18, 1),\n", " (21, 1),\n", " (22, 1),\n", " (29, 1),\n", " (31, 1),\n", " (34, 1),\n", " (39, 1))" ] }, "execution_count": 12, "metadata": {}, "output_type": "execute_result" } ], "source": [ "F.cert.freqList()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Which nodes have the lowest uncertainty?" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "{'sign'}" ] }, "execution_count": 13, "metadata": {}, "output_type": "execute_result" } ], "source": [ "{F.otype.v(n) for n in N.walk() if F.cert.v(n) and F.cert.v(n) < 10}" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Only signs are this uncertain.\n", "\n", "Let's look for pretty uncertain fragments:" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.01s 0 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "fragment cert<50\n", "\"\"\"\n", ")" ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.01s 380 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "fragment cert<60\n", "\"\"\"\n", ")" ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [ { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
npfragment
11QSb f121QSb f12
21Q21 f31Q21 f3
32Q18 f12Q18 f1
42Q18 f22Q18 f2
52Q29 f12Q29 f1
64Q163 f264Q163 f26
74Q201 f1vi4Q201 f1vi
84Q202 f1vi4Q202 f1vi
94Q204 f1vii4Q204 f1vii
104Q204 f5i4Q204 f5i
114Q205 f1xii4Q205 f1xii
124Q221 f34Q221 f3
134Q249b f14Q249b f1
144Q249c f14Q249c f1
154Q249d f14Q249d f1
164Q249f f1_34Q249f f1_3
174Q249g f1_24Q249g f1_2
184Q249g f3_74Q249g f3_7
194Q249h f1_24Q249h f1_2
204Q249i f14Q249i f1
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.table(results, start=1, end=20)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Same for scrolls:" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.00s 0 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "scroll cert<50\n", "\"\"\"\n", ")" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.00s 27 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "scroll cert<60\n", "\"\"\"\n", ")" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "text/html": [ "

result 1

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 2Q18
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 2

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q249b
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 3

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q249c
cert=57
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 4

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q249d
cert=57
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 5

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q249f
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 6

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q249g
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 7

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q249h
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 8

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q249j
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 9

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q483
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 10

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 2Q4
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 11

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 2Q9
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 12

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 2Q10
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 13

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 2Q11
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 14

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 3Q1
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 15

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 3Q3
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 16

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q8b
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 17

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q69a
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 18

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q71
cert=59
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 19

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q103a
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 20

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q116
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 21

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q151
cert=57
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 22

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q153
cert=57
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 23

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q483
cert=57
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 24

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 4Q576
cert=56
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 25

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 5Q7
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 26

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 8Q1
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 27

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
scroll 11Q9
cert=58
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.show(results)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Lines with certainty of 50:" ] }, { "cell_type": "code", "execution_count": 20, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.04s 2061 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "line cert<57\n", "\"\"\"\n", ")" ] }, { "cell_type": "code", "execution_count": 21, "metadata": {}, "outputs": [ { "data": { "text/html": [ "

result 100

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
line
cert=55
cluster
word מן
cert=55
word ורד
cert=55
word א
cert=55
word ו
cert=55
word כול
cert=55
cert=55
word חור
cert=55
word כ
cert=55
word עמר
cert=55
word נקא
cert=55
word ו
cert=55
word עבי
cert=55
word ו
cert=55
cert=55
word ו
cert=55
word כדי
cert=55
word פתח
cert=55
cert=55
cert=55
word כו
cert=57
word ל
cert=57
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 101

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
line
cert=55
cluster
word בית
cert=55
word א
cert=55
word כ
cert=55
word שמש
cert=55
word א
cert=55
word ε
word ׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 102

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
line
cert=55
cluster
word ε
word ב
cert=55
cert=55
cert=55
word אנה
cert=55
cert=55
word אמר
cert=55
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.show(results, start=100, end=102)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "With highlights and drilled down to sign level:" ] }, { "cell_type": "code", "execution_count": 22, "metadata": {}, "outputs": [], "source": [ "highlights = {}\n", "\n", "for s in F.otype.s(\"sign\"):\n", " if not F.cert.v(s):\n", " continue\n", " color = \"lightsalmon\" if F.cert.v(s) < 56 else \"mediumaquamarine\"\n", " highlights[s] = color" ] }, { "cell_type": "code", "execution_count": 23, "metadata": {}, "outputs": [ { "data": { "text/html": [ "

line 100

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
line:1559891
cert=55
cluster:1441915
word:1677987 מן
cert=55
204705 מ
cert=55
204706 ן
cert=55
word:1677988 ורד
cert=55
204707 ו
cert=55
204708 ר
cert=55
204709 ד
cert=55
word:1677989 א
cert=55
204710 א
cert=55
word:1677990 ו
cert=55
204711 ו
cert=55
word:1677991 כול
cert=55
204712 כ
cert=55
204713 ו
cert=55
204714 ל
cert=55
word:1677992 שער׳ה
cert=55
204715 ש
cert=55
204716 ע
cert=55
204717 ר
cert=55
204718 ׳
204719 ה
cert=55
word:1677993 חור
cert=55
204720 ח
cert=55
204721 ו
cert=55
204722 ר
cert=55
word:1677994 כ
cert=55
204723 כ
cert=55
word:1677995 עמר
cert=55
204724 ע
cert=55
204725 מ
cert=55
204726 ר
cert=55
word:1677996 נקא
cert=55
204727 נ
cert=55
204728 ק
cert=55
204729 א
cert=55
word:1677997 ו
cert=55
204730 ו
cert=55
word:1677998 עבי
cert=55
204731 ע
cert=55
204732 ב
cert=55
204733 י
cert=55
word:1677999 ו
cert=55
204734 ו
cert=55
word:1678000 הדיר
cert=55
204735 ה
cert=55
204736 ד
cert=55
204737 י
cert=55
204738 ר
cert=55
word:1678001 ו
cert=55
204739 ו
cert=55
word:1678002 כדי
cert=55
204740 כ
cert=55
204741 ד
cert=55
204742 י
cert=55
word:1678003 פתח
cert=55
204743 פ
cert=55
204744 ת
cert=55
204745 ח
cert=55
word:1678004 עינו׳הי
cert=55
204746 ע
cert=55
204747 י
cert=55
204748 נ
cert=55
204749 ו
cert=55
204750 ׳
204751 ה
cert=55
204752 י
cert=55
word:1678005 אניר
cert=55
204753 א
cert=55
204754 נ
cert=55
204755 י
cert=55
204756 ר
cert=55
word:1678006 כול
cert=57
204757 כ
cert=55
204758 ו
cert=55
word:1678006 כול
cert=57
204759 ל
cert=60
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 101

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
line:1559892
cert=55
cluster:1441916
word:1678007 בית
cert=55
204760 ב
cert=55
204761 י
cert=55
204762 ת
cert=55
word:1678008 א
cert=55
204763 א
cert=55
word:1678009 כ
cert=55
204764 כ
cert=55
word:1678010 שמש
cert=55
204765 ש
cert=55
204766 מ
cert=55
204767 ש
cert=55
word:1678011 א
cert=55
204768 א
cert=55
word:1678012 ε
204769
204770 ε
word:1678013 ׃
204771
204772 ׃
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 102

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
line:1559893
cert=55
cluster:1441917
word:1678014 ε
204773
204774 ε
word:1678015 ב
cert=55
204775 ב
cert=55
word:1678016 אדין
cert=55
204776 א
cert=55
204777 ד
cert=55
204778 י
cert=55
204779 ן
cert=55
word:1678017 ענית
cert=55
204780 ע
cert=55
204781 נ
cert=55
204782 י
cert=55
204783 ת
cert=55
word:1678018 אנה
cert=55
204784 א
cert=55
204785 נ
cert=55
204786 ה
cert=55
word:1678019 חנוך
cert=55
204787 ח
cert=55
204788 נ
cert=55
204789 ו
cert=55
204790 ך
cert=55
word:1678020 אמר
cert=55
204791 א
cert=55
204792 מ
cert=55
204793 ר
cert=55
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.show(\n", " results,\n", " start=100,\n", " end=102,\n", " withNodes=True,\n", " condensed=True,\n", " highlights=highlights,\n", " baseTypes=\"sign\",\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# All together!\n", "\n", "If more researchers have shared data modules, you can draw them all in.\n", "\n", "Then you can design queries that use features from all these different sources.\n", "\n", "In that way, you build your own research on top of the work of others." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Hover over the features to see where they come from, and you'll see they come from your local GitHub repo." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "---\n", "\n", "All chapters:\n", "\n", "* **[start](start.ipynb)** become an expert in creating pretty displays of your text structures\n", "* **[display](display.ipynb)** become an expert in creating pretty displays of your text structures\n", "* **[search](search.ipynb)** turbo charge your hand-coding with search templates\n", "* **[exportExcel](exportExcel.ipynb)** make tailor-made spreadsheets out of your results\n", "* **share** draw in other people's data and let them use yours\n", "* **[similarLines](similarLines.ipynb)** spot the similarities between lines\n", "\n", "---\n", "\n", "See the [cookbook](cookbook) for recipes for small, concrete tasks.\n", "\n", "CC-BY Dirk Roorda" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3 (ipykernel)", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.10.2" }, "widgets": { "application/vnd.jupyter.widget-state+json": { "state": {}, "version_major": 2, "version_minor": 0 } } }, "nbformat": 4, "nbformat_minor": 4 }