{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "\n", "\n", "\n", "\n", "---\n", "\n", "To get started: consult [start](start.ipynb)\n", "\n", "---\n", "\n", "# Export to Excel\n", "\n", "In a notebook, you can perform searches and view them in a tabular display and zoom in on items with\n", "pretty displays.\n", "\n", "But there are times that you want to take your results outside Text-Fabric, outside a notebook, outside Python, and just\n", "work with them in other programs, such as Excel.\n", "\n", "You want to do that not only with query results, but with all kinds of lists of tuples of nodes.\n", "\n", "There is a function for that, `A.export()`, and here we show what it can do." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "%load_ext autoreload\n", "%autoreload 2" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:06:39.818664Z", "start_time": "2018-05-24T10:06:39.796588Z" } }, "outputs": [], "source": [ "import os\n", "from tf.app import use" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "data": { "text/html": [ "TF-app: ~/text-fabric-data/etcbc/dss/app" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/dss/tf/0.9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/etcbc/dss/parallels/tf/0.9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "This is Text-Fabric 9.2.2\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", "67 features found and 1 ignored\n" ] }, { "data": { "text/html": [ "Text-Fabric: Text-Fabric API 9.2.2, etcbc/dss/app v3, Search Reference
Data: DSS, Character table, Feature docs
Features:
\n", "
Parallel Passages\n", "
\n", "\n", "
\n", "
\n", "sim\n", "
\n", "
int
\n", "
\n", " similarity between lines, as a percentage of the common material wrt the combined material\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2019-05-09
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2019-06-11T14:51:21Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
sourceCreatedBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
sourceCreatedDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
sourceDescription:
\n", "
Dead Sea Scrolls: biblical and non-biblical scrolls
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "\n", "
Dead Sea Scrolls\n", "
\n", "\n", "
\n", "
\n", "after\n", "
\n", "
str
\n", "
\n", " space behind the word, if any\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:55Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
(space)
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "alt\n", "
\n", "
int
\n", "
\n", " alternative reading\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "biblical\n", "
\n", "
int
\n", "
\n", " whether we are in biblical material or not\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
applies:
\n", "
scroll fragment line cluster word
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
remark:
\n", "
for lines it means that the material is taken from the bib source while there is also material for this line in the nonbib source. But the nonbib material is either identical or virtually absent, in which case the bib material is a reconstruction and marked as such.
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1=biblical, 2=biblical but also with nonbiblical material
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "book\n", "
\n", "
str
\n", "
\n", " acronym of the book in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "chapter\n", "
\n", "
str
\n", "
\n", " label of the chapter in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cl\n", "
\n", "
str
\n", "
\n", " class (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:56Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
advb, art, artp, card, cmn, conj, gent, indp, intj, intr, mult, nega, objm, ord, prep, prp, rela, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cl2\n", "
\n", "
str
\n", "
\n", " class (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
d, h, n, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "cor\n", "
\n", "
int
\n", "
\n", " correction made by an ancient or modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 = modern, 2 = ancient, 3 = ancient supralinear
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fragment\n", "
\n", "
str
\n", "
\n", " label of a fragment of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "full\n", "
\n", "
str
\n", "
\n", " full transcription (Unicode) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fulle\n", "
\n", "
str
\n", "
\n", " full transcription (ETCBC transliteration) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:57Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "fullo\n", "
\n", "
str
\n", "
\n", " full transcription (original source) of a word including flags and brackets\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "g_cons\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glex\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:58Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glexe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:59Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glexo\n", "
\n", "
str
\n", "
\n", " representation (original source) of a lexeme leaving out non-letters\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:01:59Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glyph\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:00Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glyphe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:02Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "glypho\n", "
\n", "
str
\n", "
\n", " representation (original source) of a word or sign\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:04Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn\n", "
\n", "
str
\n", "
\n", " gender (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
b, c, f, m, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn2\n", "
\n", "
str
\n", "
\n", " gender (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
c, f, m, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn3\n", "
\n", "
str
\n", "
\n", " gender (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
c, f, m
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "gn_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "halfverse\n", "
\n", "
str
\n", "
\n", " label of the half-verse in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "intl\n", "
\n", "
int
\n", "
\n", " interlinear material, the value indicates the sequence number of the interlinear line\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lang\n", "
\n", "
str
\n", "
\n", " language of a word or sign, only if it is not Hebrew\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
g=greek, a=aramaic
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex\n", "
\n", "
str
\n", "
\n", " representation (Unicode) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:06Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lex_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:07Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lexe\n", "
\n", "
str
\n", "
\n", " representation (ETCBC transliteration) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:07Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "lexo\n", "
\n", "
str
\n", "
\n", " representation (original source) of a lexeme\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "line\n", "
\n", "
str
\n", "
\n", " label of a line of a fragment of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "md\n", "
\n", "
str
\n", "
\n", " mood (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
coho, cons, juss, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "merr\n", "
\n", "
str
\n", "
\n", " errors in parsing the morphology tag\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "morpho\n", "
\n", "
str
\n", "
\n", " morphological tag (by Abegg)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:08Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nr\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu\n", "
\n", "
str
\n", "
\n", " number (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
d, p, s, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu2\n", "
\n", "
str
\n", "
\n", " number (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
p, s, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu3\n", "
\n", "
str
\n", "
\n", " number (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
s
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "nu_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:09Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "otype\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: biblical and non-biblical scrolls\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps\n", "
\n", "
str
\n", "
\n", " person (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps2\n", "
\n", "
str
\n", "
\n", " person (for part 2) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps3\n", "
\n", "
str
\n", "
\n", " person (for part 3) (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1, 2, 3
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "ps_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:10Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punc\n", "
\n", "
str
\n", "
\n", " trailing punctuation (Unicode) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punce\n", "
\n", "
str
\n", "
\n", " trailing punctuation (ETCBC transliteration) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "punco\n", "
\n", "
str
\n", "
\n", " trailing punctuation (original source) of a word\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rec\n", "
\n", "
int
\n", "
\n", " reconstructed by a modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:11Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "rem\n", "
\n", "
int
\n", "
\n", " removed by an ancient or modern editor\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 = modern, 2 = ancient
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "script\n", "
\n", "
str
\n", "
\n", " script in which the word or sign is written if it is not Hebrew\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
paleohebrew greekcapital
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "scroll\n", "
\n", "
str
\n", "
\n", " acronym of a scroll\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "sp\n", "
\n", "
str
\n", "
\n", " part of speech (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
adjv, numr, pron, ptcl, subs, suff, unknown, verb
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "sp_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "srcLn\n", "
\n", "
int
\n", "
\n", " the line number of the word in the source data file\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:12Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "st\n", "
\n", "
str
\n", "
\n", " state (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:13Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
a, c, d, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "type\n", "
\n", "
str
\n", "
\n", " type of sign or cluster\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:13Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "unc\n", "
\n", "
int
\n", "
\n", " uncertain material in various degrees: higher degree is less certain\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1 2 3 4
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vac\n", "
\n", "
int
\n", "
\n", " empty, unwritten space\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
1
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "verse\n", "
\n", "
str
\n", "
\n", " label of the verse in which the word occurs\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vs\n", "
\n", "
str
\n", "
\n", " verbal stem (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
aphel, apoel, haphel, hifil, hishtafel, hishtaphel, hithaphel, hithpaal, hithpeel, hithpolel, hitopel, hitpael, hitpalpel, hitpoel, hofal, hophal, hotpaal, hpealal, ishtaphel, ithpaal, ithpeel, ithpoel, nifal, nitpael, pael, palel, passive, peal, peil, piel, pilpel, poal, poel, polal, polel, pual, pulal, qal, shaphel, tifil, unknown
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vs_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:15Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vt\n", "
\n", "
str
\n", "
\n", " verbal tense/aspect (morphology tag)\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:16Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
values:
\n", "
impf, impv, infa, infc, perf, ptca, ptcp, unknown, wayy
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "vt_etcbc\n", "
\n", "
str
\n", "
\n", " Dead Sea Scrolls: additions based on BHSA and machine learning\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss-additions
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Martijn Naaijer, ETCBC
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2020
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:16Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martijn Naaijer's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "occ\n", "
\n", "
none
\n", "
\n", " edge feature from a lexeme to its occurrences\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:17Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n", "oslots\n", "
\n", "
none
\n", "
\n", " Dead Sea Scrolls: biblical and non-biblical scrolls\n", "
\n", "\n", "
\n", "
acronym:
\n", "
dss
\n", "
\n", "\n", "
\n", "
convertedBy:
\n", "
Jarod Jacobs, Martijn Naaijer and Dirk Roorda
\n", "
\n", "\n", "
\n", "
createdBy:
\n", "
Martin G. Abegg, Jr., James E. Bowley, and Edward M. Cook
\n", "
\n", "\n", "
\n", "
createdDate:
\n", "
2015
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2020-12-29T15:02:17Z
\n", "
\n", "\n", "
\n", "
license:
\n", "
Creative Commons Attribution-NonCommercial 4.0 International License
\n", "
\n", "\n", "
\n", "
licenseUrl:
\n", "
http://creativecommons.org/licenses/by-nc/4.0/
\n", "
\n", "\n", "
\n", "
source:
\n", "
Martin Abegg's data files, personal communication
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "
\n", "\n", "
\n", "
\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
Text-Fabric API: names N F E L T S C TF directly usable

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A = use(\"etcbc/dss\", hoist=globals())" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Inspect the contents of a file\n", "We write a function that can peek into file on your system, and show the first few lines.\n", "We'll use it to inspect the exported files that we are going to produce." ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [], "source": [ "EXPORT_FILE = os.path.expanduser(\"~/Downloads/results.tsv\")\n", "UPTO = 10\n", "\n", "\n", "def checkout():\n", " with open(EXPORT_FILE, encoding=\"utf_16\") as fh:\n", " for (i, line) in enumerate(fh):\n", " if i >= UPTO:\n", " break\n", " print(line.rstrip(\"\\n\"))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Encoding\n", "\n", "Our exported `.tsv` files open in Excel without hassle, even if they contain non-latin characters.\n", "That is because TF writes such files in an\n", "encoding that works well with Excel: `utf_16_le`.\n", "You can just open them in Excel, there is no need for conversion before or after opening these files.\n", "\n", "Should you want to process these files by means of a (Python) program,\n", "take care to read them with encoding `utf_16`." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Example query\n", "\n", "We first run a query in order to export the results." ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T07:46:55.998382Z", "start_time": "2018-05-24T07:46:55.137956Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.86s 399 results\n" ] }, { "data": { "text/html": [ "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "\n", "
nplinewordsign
1CD 12:23המחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן מתהלכים מ
2CD 12:23המחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן מתהלכים כ
3CD 12:23המחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן מתהלכים י
4CD 12:23המחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן מתהלכים ם
5CD 15:11המשפטים עד עמד׳ו לפני המבקר שמה יתפתה ב׳ו בדרש׳ו את׳ו ׃ יתפתה ת
6CD 19:4ויתהלכו על פי התורה ׃   וכמשפט היסודים כסרך התורה יתהלכו י
71QS 7:24הרבים ללכת בשרירות לב׳ו לוא ישוב אל עצת היחד עוד ׃ ואיש מאנשי היחד אשר יתערב יתערב י
81QS 7:24הרבים ללכת בשרירות לב׳ו לוא ישוב אל עצת היחד עוד ׃ ואיש מאנשי היחד אשר יתערב יתערב ת
91QSa 1:11ורע ׃ ובכן תקבל להעיד עלי׳ו משפטות התורא ולהתיצב במשמע משפטים ׃ התיצב ת
101QSb 4:2ימנה את׳ו והתערב ל׳ו׃ וכליל # ε אנוש ובתענוגות בני אדם ε ך ׃ התערב ה
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "query = \"\"\"\n", "line\n", " word vs=hitpael\n", " sign unc\n", "\"\"\"\n", "results = A.search(query)\n", "A.table(results, end=10)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Bare export\n", "\n", "You can export the table of results to Excel.\n", "\n", "The following command writes a tab-separated file `results.tsv` to your downloads directory.\n", "\n", "You can specify arguments `toDir=directory` and `toFile=file name` to write to a different file.\n", "If the directory does not exist, it will be created.\n", "\n", "We stick to the default, however." ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "A.export(results)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Check out the contents:" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "R\tS1\tS2\tS3\tNODE1\tTYPE1\tTEXT1\tNODE2\tTYPE2\tTEXT2\tvs2\tNODE3\tTYPE3\tTEXT3\tunc3\n", "1\tCD\t12\t23\t1553232\tline\tהמחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן \t1610602\tword\tמתהלכים \thitpael\t10574\tsign\tמ\t2\n", "2\tCD\t12\t23\t1553232\tline\tהמחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן \t1610602\tword\tמתהלכים \thitpael\t10578\tsign\tכ\t2\n", "3\tCD\t12\t23\t1553232\tline\tהמחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן \t1610602\tword\tמתהלכים \thitpael\t10579\tsign\tי\t1\n", "4\tCD\t12\t23\t1553232\tline\tהמחנות המתהלכים באלה בקץ הרשעה עד עמוד משוח משיח אהרן \t1610602\tword\tמתהלכים \thitpael\t10580\tsign\tם \t2\n", "5\tCD\t15\t11\t1553289\tline\tהמשפטים עד עמד׳ו לפני המבקר שמה יתפתה ב׳ו בדרש׳ו את׳ו ׃ \t1611517\tword\tיתפתה \thitpael\t13025\tsign\tת\t2\n", "6\tCD\t19\t4\t1553321\tline\tויתהלכו על פי התורה ׃   וכמשפט היסודים כסרך התורה \t1611956\tword\tיתהלכו \thitpael\t14285\tsign\tי\t1\n", "7\t1QS\t7\t24\t1553567\tline\tהרבים ללכת בשרירות לב׳ו לוא ישוב אל עצת היחד עוד ׃ ואיש מאנשי היחד אשר יתערב \t1616682\tword\tיתערב \thitpael\t27838\tsign\tי\t1\n", "8\t1QS\t7\t24\t1553567\tline\tהרבים ללכת בשרירות לב׳ו לוא ישוב אל עצת היחד עוד ׃ ואיש מאנשי היחד אשר יתערב \t1616682\tword\tיתערב \thitpael\t27839\tsign\tת\t2\n", "9\t1QSa\t1\t11\t1553680\tline\tורע ׃ ובכן תקבל להעיד עלי׳ו משפטות התורא ולהתיצב במשמע משפטים ׃ \t1618956\tword\tהתיצב \thitpael\t34422\tsign\tת\t2\n" ] } ], "source": [ "checkout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "You see the following columns:\n", "\n", "* *`R`* the sequence number of the result tuple in the result list\n", "* *`S1 S2 S3`* the section as scroll name, fragment, and line number, in separate columns\n", "* *`NODEi TYPEi`* the node and its type, for each node **i** in the result tuple\n", "* *`TEXTi`* the full text of node *`i`*, if the node type admits a concise text representation\n", "* *`vs2`* *`unc3`* the value of feature *`vs`* on the word and *`unc`* on the sign,\n", "since our query mentions them on those nodes." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Poorer exports\n", "\n", "If you do not need the full text of the lines, you can leave them out by specifying a smaller *condense type*.\n", "\n", "The export function provides text for all nodes whose type is not too big.\n", "What is too big is determined by the condense type.\n", "\n", "In this corpus, the default condense type is line. Node types bigger than lines will not get text.\n", "\n", "Now, if we change the `condenseType` to something smaller than line, e.g. `word`, the line text will be suppressed." ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "R\tS1\tS2\tS3\tNODE1\tTYPE1\tNODE2\tTYPE2\tTEXT2\tvs2\tNODE3\tTYPE3\tTEXT3\tunc3\n", "1\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thitpael\t10574\tsign\tמ\t2\n", "2\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thitpael\t10578\tsign\tכ\t2\n", "3\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thitpael\t10579\tsign\tי\t1\n", "4\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thitpael\t10580\tsign\tם \t2\n", "5\tCD\t15\t11\t1553289\tline\t1611517\tword\tיתפתה \thitpael\t13025\tsign\tת\t2\n", "6\tCD\t19\t4\t1553321\tline\t1611956\tword\tיתהלכו \thitpael\t14285\tsign\tי\t1\n", "7\t1QS\t7\t24\t1553567\tline\t1616682\tword\tיתערב \thitpael\t27838\tsign\tי\t1\n", "8\t1QS\t7\t24\t1553567\tline\t1616682\tword\tיתערב \thitpael\t27839\tsign\tת\t2\n", "9\t1QSa\t1\t11\t1553680\tline\t1618956\tword\tהתיצב \thitpael\t34422\tsign\tת\t2\n" ] } ], "source": [ "A.export(results, condenseType=\"word\")\n", "checkout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Other exports\n", "\n", "If we want to see the text in another format, we can specify it.\n", "\n", "Here is the Abegg encoding." ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "R\tS1\tS2\tS3\tNODE1\tTYPE1\tTEXT1\tNODE2\tTYPE2\tTEXT2\tvs2\tNODE3\tTYPE3\tTEXT3\tunc3\n", "1\tCD\t12\t23\t1553232\tline\thmjnwt hmthlkyM balh bqX hrCoh od omwd mCwj mCyj ahrN \t1610602\tword\tmthlkyM \thitpael\t10574\tsign\tm\t2\n", "2\tCD\t12\t23\t1553232\tline\thmjnwt hmthlkyM balh bqX hrCoh od omwd mCwj mCyj ahrN \t1610602\tword\tmthlkyM \thitpael\t10578\tsign\tk\t2\n", "3\tCD\t12\t23\t1553232\tline\thmjnwt hmthlkyM balh bqX hrCoh od omwd mCwj mCyj ahrN \t1610602\tword\tmthlkyM \thitpael\t10579\tsign\ty\t1\n", "4\tCD\t12\t23\t1553232\tline\thmjnwt hmthlkyM balh bqX hrCoh od omwd mCwj mCyj ahrN \t1610602\tword\tmthlkyM \thitpael\t10580\tsign\tM \t2\n", "5\tCD\t15\t11\t1553289\tline\thmCpfyM od omd/w lpny hmbqr Cmh ytpth b/w bdrC/w at/w . \t1611517\tword\tytpth \thitpael\t13025\tsign\tt\t2\n", "6\tCD\t19\t4\t1553321\tline\twythlkw ol py htwrh . □ wkmCpf hyswdyM ksrK htwrh \t1611956\tword\tythlkw \thitpael\t14285\tsign\ty\t1\n", "7\t1QS\t7\t24\t1553567\tline\thrbyM llkt bCryrwt lb/w lwa yCwb al oxt hyjd owd . wayC manCy hyjd aCr ytorb \t1616682\tword\tytorb \thitpael\t27838\tsign\ty\t1\n", "8\t1QS\t7\t24\t1553567\tline\thrbyM llkt bCryrwt lb/w lwa yCwb al oxt hyjd owd . wayC manCy hyjd aCr ytorb \t1616682\tword\tytorb \thitpael\t27839\tsign\tt\t2\n", "9\t1QSa\t1\t11\t1553680\tline\twro . wbkN tqbl lhoyd oly/w mCpfwt htwra wlhtyxb bmCmo mCpfyM . \t1618956\tword\thtyxb \thitpael\t34422\tsign\tt\t2\n" ] } ], "source": [ "A.export(results, fmt=\"text-source-full\")\n", "checkout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Additional features\n", "\n", "If we want to export additional features, we just have to mention them.\n", "In order to do so and not change the result set, put a `*` behind the feature.\n", "\n", "The `*` means: *always true, no matter what's in the feature, even if there is nothing in there*.\n", "\n", "Let's ask for the original lexeme and morph tags and whether the sign is a reconstruction." ] }, { "cell_type": "code", "execution_count": 10, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T07:46:55.998382Z", "start_time": "2018-05-24T07:46:55.137956Z" } }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 2.39s 399 results\n" ] } ], "source": [ "query = \"\"\"\n", "line\n", " word vs=hitpael lexo* morpho*\n", " sign unc rec*\n", "\"\"\"\n", "results = A.search(query)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The same number of results.\n", "\n", "We do the export again and peek at the results." ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "R\tS1\tS2\tS3\tNODE1\tTYPE1\tNODE2\tTYPE2\tTEXT2\tlexo2\tmorpho2\tvs2\tNODE3\tTYPE3\tTEXT3\trec3\tunc3\n", "1\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thlK\tvtPmpa\thitpael\t10574\tsign\tמ\t\t2\n", "2\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thlK\tvtPmpa\thitpael\t10578\tsign\tכ\t\t2\n", "3\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thlK\tvtPmpa\thitpael\t10579\tsign\tי\t\t1\n", "4\tCD\t12\t23\t1553232\tline\t1610602\tword\tמתהלכים \thlK\tvtPmpa\thitpael\t10580\tsign\tם \t\t2\n", "5\tCD\t15\t11\t1553289\tline\t1611517\tword\tיתפתה \tpth_1\tvti3ms\thitpael\t13025\tsign\tת\t\t2\n", "6\tCD\t19\t4\t1553321\tline\t1611956\tword\tיתהלכו \thlK\tvti3mp\thitpael\t14285\tsign\tי\t\t1\n", "7\t1QS\t7\t24\t1553567\tline\t1616682\tword\tיתערב \torb_2\tvti3ms\thitpael\t27838\tsign\tי\t\t1\n", "8\t1QS\t7\t24\t1553567\tline\t1616682\tword\tיתערב \torb_2\tvti3ms\thitpael\t27839\tsign\tת\t\t2\n", "9\t1QSa\t1\t11\t1553680\tline\t1618956\tword\tהתיצב \tyxb\tvtc\thitpael\t34422\tsign\tת\t\t2\n" ] } ], "source": [ "A.export(results, condenseType=\"word\")\n", "checkout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "As you see, you have an extra columns *`lexo2`*, *`morpho2`* and *`rec3`*.\n", "\n", "This gives you a lot of control over the generation of spreadsheets." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Not from queries\n", "\n", "You can also export lists of node tuples that are not obtained by a query:" ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "[(1607869, 100001, 200001),\n", " (1607870, 100002, 200002),\n", " (1607871, 100003, 200003),\n", " (1607872, 100004, 200004),\n", " (1607873, 100005, 200005),\n", " (1607874, 100006, 200006),\n", " (1607875, 100007, 200007),\n", " (1607876, 100008, 200008),\n", " (1607877, 100009, 200009),\n", " (1607878, 100010, 200010)]" ] }, "execution_count": 12, "metadata": {}, "output_type": "execute_result" } ], "source": [ "words = F.otype.s(\"word\")[1000:1010]\n", "signs1 = F.otype.s(\"sign\")[100000:100010]\n", "signs2 = F.otype.s(\"sign\")[200000:200010]\n", "tuples = list(zip(words, signs1, signs2))\n", "\n", "tuples" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Ten rows, each row has a word node and two sign nodes.\n", "\n", "The word and the signs in each row do not have any meaningful relationship!\n", "\n", "Let's do a bare export:" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "R\tS1\tS2\tS3\tNODE1\tTYPE1\tTEXT1\tNODE2\tTYPE2\tTEXT2\tlexo2\tmorpho2\tvs2\tNODE3\tTYPE3\tTEXT3\trec3\tunc3\n", "1\tCD\t4\t9\t1607869\tword\tהקים \t100001\tsign\tד\t\t\t\t200001\tsign\tל \t1\t\n", "2\tCD\t4\t9\t1607870\tword\tאל \t100002\tsign\tע\t\t\t\t200002\tsign\tת\t1\t\n", "3\tCD\t4\t9\t1607871\tword\tל\t100003\tsign\tת\t\t\t\t200003\tsign\tר\t1\t\n", "4\tCD\t4\t9\t1607872\tword\tראשנים \t100004\tsign\t׳\t\t\t\t200004\tsign\tי \t1\t\n", "5\tCD\t4\t9\t1607873\tword\tל\t100005\tsign\tי \t\t\t\t200005\tsign\tע\t1\t\n", "6\tCD\t4\t9\t1607874\tword\tכפר \t100006\tsign\tס\t\t\t\t200006\tsign\tש\t1\t\n", "7\tCD\t4\t10\t1607875\tword\tעל \t100007\tsign\tפ\t\t\t\t200007\tsign\tר \t1\t\n", "8\tCD\t4\t10\t1607876\tword\tעונותי׳הם \t100008\tsign\tר\t\t\t\t200008\tsign\tל\t1\t\n", "9\tCD\t4\t10\t1607877\tword\tכן \t100009\tsign\tת\t\t\t\t200009\tsign\t׳\t1\t\n" ] } ], "source": [ "A.export(tuples)\n", "checkout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Wait a minute: why are the `lexo2` and `morpho2` and `rec3` and `unc` columns showing up?\n", "\n", "It is because we have run a query before where we asked for these features.\n", "\n", "If we do not want to be influenced by previous things we've run, we need to reset the display:" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [], "source": [ "A.displayReset(\"tupleFeatures\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Again:" ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "R\tS1\tS2\tS3\tNODE1\tTYPE1\tTEXT1\tNODE2\tTYPE2\tTEXT2\tNODE3\tTYPE3\tTEXT3\n", "1\tCD\t4\t9\t1607869\tword\tהקים \t100001\tsign\tד\t200001\tsign\tל \n", "2\tCD\t4\t9\t1607870\tword\tאל \t100002\tsign\tע\t200002\tsign\tת\n", "3\tCD\t4\t9\t1607871\tword\tל\t100003\tsign\tת\t200003\tsign\tר\n", "4\tCD\t4\t9\t1607872\tword\tראשנים \t100004\tsign\t׳\t200004\tsign\tי \n", "5\tCD\t4\t9\t1607873\tword\tל\t100005\tsign\tי \t200005\tsign\tע\n", "6\tCD\t4\t9\t1607874\tword\tכפר \t100006\tsign\tס\t200006\tsign\tש\n", "7\tCD\t4\t10\t1607875\tword\tעל \t100007\tsign\tפ\t200007\tsign\tר \n", "8\tCD\t4\t10\t1607876\tword\tעונותי׳הם \t100008\tsign\tר\t200008\tsign\tל\n", "9\tCD\t4\t10\t1607877\tword\tכן \t100009\tsign\tת\t200009\tsign\t׳\n" ] } ], "source": [ "A.export(tuples)\n", "checkout()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Display setup\n", "\n", "When we exported query results, we could mention features in the query with a `*` so that they got exported.\n", "If we do not have a previous query we can achieve the same effect by specifying the desired export features per column.\n", "\n", "The display option `tupleFeatures` takes care of that." ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [], "source": [ "A.displaySetup(\n", " tupleFeatures=(\n", " (0, \"fulle lexe type\"),\n", " (1, \"glyphe type\"),\n", " (2, \"glyphe type\"),\n", " )\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We assign extra features per member of the tuple.\n", "\n", "In the above case:\n", "\n", "* the first (`0`) member (the word node), gets features `fulle` (full transcription in ETCBC encoding),\n", " `glyphe` (just the actual signs), `type` (type of word);\n", "* the second and third member (the sign nodes), get features `glyphe` and `type` (type of sign)." ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "R\tS1\tS2\tS3\tNODE1\tTYPE1\tTEXT1\tfulle1\tlexe1\ttype1\tNODE2\tTYPE2\tTEXT2\tglyphe2\ttype2\tNODE3\tTYPE3\tTEXT3\tglyphe3\ttype3\n", "1\tCD\t4\t9\t1607869\tword\tהקים \tHQJm\tQWm\tglyph\t100001\tsign\tד\tD\tcons\t200001\tsign\tל \tL\tcons\n", "2\tCD\t4\t9\t1607870\tword\tאל \t>L\t>;L_5\tglyph\t100002\tsign\tע\t<\tcons\t200002\tsign\tת\tT\tcons\n", "3\tCD\t4\t9\t1607871\tword\tל\tL\tL:\tglyph\t100003\tsign\tת\tT\tcons\t200003\tsign\tר\tR\tcons\n", "4\tCD\t4\t9\t1607872\tword\tראשנים \tR>#NJm\tRI>COWn\tglyph\t100004\tsign\t׳\t'\tsep\t200004\tsign\tי \tJ\tcons\n", "5\tCD\t4\t9\t1607873\tword\tל\tL\tL:\tglyph\t100005\tsign\tי \tJ\tcons\t200005\tsign\tע\t<\tcons\n", "6\tCD\t4\t9\t1607874\tword\tכפר \tKPR\tKPR\tglyph\t100006\tsign\tס\tS\tcons\t200006\tsign\tש\t#\tcons\n", "7\tCD\t4\t10\t1607875\tword\tעל \t