{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "\n", "\n", "\n", "---\n", "\n", "To get started: consult [start](start.ipynb)\n", "\n", "---\n", "\n", "# Sharing data features\n", "\n", "## Explore additional data\n", "\n", "Once you analyse a corpus, it is likely that you produce data that others can reuse.\n", "Maybe you have defined a set of proper name occurrences, or special numerals, or you have computed part-of-speech assignments.\n", "\n", "It is possible to turn these insights into *new features*, i.e. new `.tf` files with values assigned to specific nodes.\n", "\n", "## Make your own data\n", "\n", "New data is a product of your own methods and computations in the first place.\n", "But how do you turn that data into new TF features?\n", "It turns out that the last step is not that difficult.\n", "\n", "If you can shape your data as a mapping (dictionary) from node numbers (integers) to values\n", "(strings or integers), then TF can turn that data into a feature file for you with one command.\n", "\n", "## Share your new data\n", "You can then easily share your new features on GitHub, so that your colleagues everywhere\n", "can try it out for themselves.\n", "\n", "You can add such data on the fly, by passing a `mod={org}/{repo}/{path}` parameter,\n", "or a bunch of them separated by commas.\n", "\n", "If the data is there, it will be auto-downloaded and stored on your machine.\n", "\n", "Let's do it.\n" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "%load_ext autoreload\n", "%autoreload 2" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "ExecuteTime": { "end_time": "2018-05-24T10:06:39.818664Z", "start_time": "2018-05-24T10:06:39.796588Z" } }, "outputs": [], "source": [ "import re\n", "import collections\n", "import os\n", "\n", "from tf.app import use" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "data": { "text/html": [ "TF-app: ~/text-fabric-data/github/CLARIAH/wp6-missieven/app" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/github/CLARIAH/wp6-missieven/tf/1.0" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "Text-Fabric: Text-Fabric API 10.2.6, CLARIAH/wp6-missieven/app v3, Search Reference
Data: WP6-MISSIEVEN, Character table, Feature docs
Features:
\n", "
General Missives Dutch East India Company 1600-1800\n", "
\n", "\n", "
\n", "
\n", "author\n", "
\n", "
str
\n", "\n", " authors of the letter, surnames only\n", "\n", "
\n", "\n", "
\n", "
\n", "authorFull\n", "
\n", "
str
\n", "\n", " authors of the letter, full names\n", "\n", "
\n", "\n", "
\n", "
\n", "col\n", "
\n", "
int
\n", "\n", " column number of a column in a row in a table\n", "\n", "
\n", "\n", "
\n", "
\n", "day\n", "
\n", "
int
\n", "\n", " day part of the date of the letter\n", "\n", "
\n", "\n", "
\n", "
\n", "isden\n", "
\n", "
int
\n", "\n", " whether a word is the denominator in fraction, e.g. 4 in 1/4\n", "\n", "
\n", "\n", "
\n", "
\n", "isemph\n", "
\n", "
str
\n", "\n", " whether a word is emphasized by typography\n", "\n", "
\n", "\n", "
\n", "
\n", "isfolio\n", "
\n", "
int
\n", "\n", " a folio reference\n", "\n", "
\n", "\n", "
\n", "
\n", "isnote\n", "
\n", "
int
\n", "\n", " whether a word belongs to footnote text\n", "\n", "
\n", "\n", "
\n", "
\n", "isnum\n", "
\n", "
int
\n", "\n", " whether a word is the numerator in fraction, e.g. 1 in 1/4\n", "\n", "
\n", "\n", "
\n", "
\n", "isorig\n", "
\n", "
int
\n", "\n", " whether a word belongs to original text\n", "\n", "
\n", "\n", "
\n", "
\n", "isq\n", "
\n", "
int
\n", "\n", " whether a word is a numerical fraction, e.g. 1/4\n", "\n", "
\n", "\n", "
\n", "
\n", "isref\n", "
\n", "
int
\n", "\n", " whether a word belongs to the text of reference\n", "\n", "
\n", "\n", "
\n", "
\n", "isremark\n", "
\n", "
int
\n", "\n", " whether a word belongs to the text of editorial remarks\n", "\n", "
\n", "\n", "
\n", "
\n", "isspecial\n", "
\n", "
int
\n", "\n", " whether a word has special typography possibly with OCR mistakes as well\n", "\n", "
\n", "\n", "
\n", "
\n", "issub\n", "
\n", "
int
\n", "\n", " whether a word has subscript typography possibly indicating the denominator of a fraction\n", "\n", "
\n", "\n", "
\n", "
\n", "issuper\n", "
\n", "
int
\n", "\n", " whether a word has superscript typography possibly indicating the numerator of a fraction\n", "\n", "
\n", "\n", "
\n", "
\n", "isund\n", "
\n", "
str
\n", "\n", " whether a word is underlined by typography\n", "\n", "
\n", "\n", "
\n", "
\n", "mark\n", "
\n", "
int
\n", "\n", " footnote mark (not necessarily the same as shown on the printed page\n", "\n", "
\n", "\n", "
\n", "
\n", "month\n", "
\n", "
int
\n", "\n", " month part of the date of the letter\n", "\n", "
\n", "\n", "
\n", "
\n", "n\n", "
\n", "
int
\n", "\n", " number of a volume, letter, page, para, line, table\n", "\n", "
\n", "\n", "
\n", "
\n", "otype\n", "
\n", "
str
\n", "\n", " \n", "\n", "
\n", "\n", "
\n", "
\n", "page\n", "
\n", "
str
\n", "\n", " number of the first page of this letter in this volume\n", "\n", "
\n", "\n", "
\n", "
\n", "place\n", "
\n", "
str
\n", "\n", " place from where the letter was sent\n", "\n", "
\n", "\n", "
\n", "
\n", "punc\n", "
\n", "
str
\n", "\n", " punctuation and/or whitespace following a wordup to the next word\n", "\n", "
\n", "\n", "
\n", "
\n", "puncn\n", "
\n", "
str
\n", "\n", " punctuation and/or whitespace following a word,up to the next word, footnote text only\n", "\n", "
\n", "\n", "
\n", "
\n", "punco\n", "
\n", "
str
\n", "\n", " punctuation and/or whitespace following a word,up to the next word, original text only\n", "\n", "
\n", "\n", "
\n", "
\n", "puncr\n", "
\n", "
str
\n", "\n", " punctuation and/or whitespace following a word,up to the next word, remark text only\n", "\n", "
\n", "\n", "
\n", "
\n", "rawdate\n", "
\n", "
str
\n", "\n", " the date the letter was sent\n", "\n", "
\n", "\n", "
\n", "
\n", "row\n", "
\n", "
int
\n", "\n", " row number of a row of column in a table\n", "\n", "
\n", "\n", "
\n", "
\n", "seq\n", "
\n", "
str
\n", "\n", " ('sequence number of this letter among the letters of the same author in this volume',)\n", "\n", "
\n", "\n", "
\n", "
\n", "status\n", "
\n", "
str
\n", "\n", " status of the letter, e.g. secret, copy\n", "\n", "
\n", "\n", "
\n", "
\n", "title\n", "
\n", "
str
\n", "\n", " title of the letter\n", "\n", "
\n", "\n", "
\n", "
\n", "trans\n", "
\n", "
str
\n", "\n", " transcription of a word\n", "\n", "
\n", "\n", "
\n", "
\n", "transn\n", "
\n", "
str
\n", "\n", " transcription of a word, only for footnote text\n", "\n", "
\n", "\n", "
\n", "
\n", "transo\n", "
\n", "
str
\n", "\n", " transcription of a word, only for original text\n", "\n", "
\n", "\n", "
\n", "
\n", "transr\n", "
\n", "
str
\n", "\n", " transcription of a word, only for remark text\n", "\n", "
\n", "\n", "
\n", "
\n", "vol\n", "
\n", "
int
\n", "\n", " volume number\n", "\n", "
\n", "\n", "
\n", "
\n", "weblink\n", "
\n", "
str
\n", "\n", " the page-specific part of web links for page nodes\n", "\n", "
\n", "\n", "
\n", "
\n", "x\n", "
\n", "
int
\n", "\n", " column offset of a column in a row in a table\n", "\n", "
\n", "\n", "
\n", "
\n", "year\n", "
\n", "
int
\n", "\n", " year part of the date of the letter\n", "\n", "
\n", "\n", "
\n", "
\n", "note\n", "
\n", "
none
\n", "\n", " edge between a word and the footnotes associated with it\n", "\n", "
\n", "\n", "
\n", "
\n", "oslots\n", "
\n", "
none
\n", "\n", " \n", "\n", "
\n", "\n", "
\n", "
\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
Text-Fabric API: names N F E L T S C TF directly usable

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A = use(\"CLARIAH/wp6-missieven\", hoist=globals())\n", "VERSION = A.version" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Making data\n", "\n", "We illustrate the data creation part by creating a new feature, `number`.\n", "The idea is that we compute a number value for each word that looks like a number,\n", "but that contains OCR errors." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We keep things simple.\n", "\n", "We are interested in words that contain only digits and letters, and where the number of digits is greater than de number of letters.\n", "We exclude words that consist of digits only.\n", "\n", "We only work in original letter content.\n", "\n", "Let's find them by hand coding." ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "[11761, 28520, 30481, 31702, 36287, 37982, 37988, 106832, 112548, 119347]\n" ] }, { "data": { "text/plain": [ "4727" ] }, "execution_count": 4, "metadata": {}, "output_type": "execute_result" } ], "source": [ "results = []\n", "\n", "digitRe = re.compile(r\"[0-9]\")\n", "\n", "for w in F.otype.s(\"word\"):\n", " chars = F.transo.v(w)\n", " if not chars:\n", " continue\n", " (letters, nDigits) = digitRe.subn(\"\", chars)\n", " nLetters = len(chars) - nDigits\n", " if nLetters and nDigits > nLetters:\n", " results.append(w)\n", "\n", "print(results[0:10])\n", "len(results)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "It happens quite a bit.\n", "\n", "Let's have a quick look at the text of the results" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "0001b\n", "0001b\n", "0001b\n", "0001b\n", "0001b\n", "000©\n", "006½\n", "022H\n", "024½\n", "03£\n", "042V2\n", "051|\n", "052J\n", "053f\n", "053f\n", "062|\n", "0753A\n", "084|\n", "086j\n", "087|\n" ] } ], "source": [ "print(\"\\n\".join(sorted(F.transo.v(w) for w in results)[0:20]))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We want to map characters to digits.\n", "To get a feel for that, inventorize the characters that occur in these words.\n", "\n", "For each character, count how often it occurs and give at most 10 examples." ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "61" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "inventory = collections.defaultdict(list)\n", "\n", "for w in results:\n", " for c in (trans := F.transo.v(w)):\n", " if not c.isdigit():\n", " inventory[c].append(trans)\n", "\n", "len(inventory)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Quite a bit of different characters." ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "? ( 15x) 12?, 144?, 1617?, 16?, 18?, 19?, 286?, 29?, 31?, 413?\n", "A ( 9x) 0753A, 13A, 273A, 343A, 3933A, 423A, 43A, 4743A, 553A\n", "C ( 1x) 540C3\n", "D ( 1x) 1685De\n", "E ( 1x) 194845En\n", "H ( 4x) 022H, 22H, 2328H, 252H\n", "I ( 5x) 217IM, I299v, I85, I85, I85\n", "J ( 96x) 052J, 1079J, 1092J, 10J, 10J, 110J, 115J, 1191J, 11J, 121378J\n", "M ( 3x) 217IM, 4047M, 564M\n", "O ( 4x) 1671Op, 27O4508, O86V2, ÏO011\n", "P ( 1x) P10\n", "S ( 1x) 16S6\n", "U ( 1x) 1U8\n", "V ( 76x) 042V2, 1014V2, 1019V2, 1062V4, 1062V4, 10V5, 12V2, 1364V2, 13V2, 14V2\n", "a ( 5x) 10a, 11a, 11a, 13a, 1684dat\n", "b ( 26x) 0001b, 0001b, 0001b, 0001b, 0001b, 1156bls, 121b, 121b, 121b, 121b\n", "c ( 59x) 10c, 12c, 12c, 13c, 13c, 13c, 14c, 14c, 14c, 15c\n", "d ( 14x) 100d, 14101de, 1684dat, 29d, d08, d08, d08, d08, d08, d08\n", "e (2952x) 10e, 10e, 10e, 10e, 10e, 10e, 10e, 10e, 10e, 10e\n", "f ( 58x) 053f, 053f, 09f, 102f, 108f, 121f, 1222f, 137f, 14f, 14f\n", "g ( 9x) 16g, 22g, 28g, 36g, 430g, 6000g, 600g, 705g, 74g\n", "h ( 2x) 42h, 605h\n", "i ( 4x) 302061in, 496159in, 7897io, 8337tis\n", "j ( 24x) 086j, 1023j, 12j, 14j, 14j, 16j, 176j, 236j, 30j, 31j\n", "l ( 1x) 1156bls\n", "m ( 1x) 366m\n", "n ( 44x) 10n, 10n, 10n, 10n, 14n, 14n, 150599en, 15n, 15n, 15n\n", "o ( 9x) 24o, 24o, 36o, 36o, 438834V36o, 48o, 5622tot, 5957V3óo, 7897io\n", "p ( 2x) 1419p, 1671Op\n", "q ( 1x) 2901§§q\n", "r ( 24x) 128r, 1300r, 1394rv, 1427r, 149r, 189r, 202r, 20r, 2182r, 256r\n", "s ( 6x) 1156bls, 167s, 336s, 4395Vs, 50s, 8337tis\n", "t ( 8x) 1684dat, 22t, 4t0, 520t, 5622tot, 5622tot, 6t0, 8337tis\n", "u ( 2x) 21u, 417u\n", "v ( 20x) 124v, 1394rv, 1426v, 148v, 15v, 15v, 16v, 19v, 212v, 286v\n", "x ( 4x) 10x, 18x, 31x, 34x\n", "| ( 232x) 051|, 062|, 084|, 087|, 1034|, 104|, 104|, 106|, 108|, 10|\n", "£ ( 51x) 03£, 10£, 10£, 10£, 11£, 12£, 14£, 14£, 14£, 16£\n", "§ ( 49x) 090§, 10§, 10§, 10§, 1216§, 1372|§, 139§, 146§, 14§, 166§\n", "© ( 1x) 000©\n", "® ( 25x) 1000®, 10®, 10®, 125®, 15®, 16®, 1719®, 1®11, 2000®, 20®\n", "° ( 10x) 16°, 17°, 20°, 24°, 24°, 25°, 28°, 30°, 51°, °1677\n", "± ( 4x) 16±, 28±, 32±, 97±\n", "¼ ( 1x) 254¼\n", "½ ( 7x) 006½, 024½, 117½, 144½, 22½, 27½, 699½\n", "Ï ( 2x) 143Ï, ÏO011\n", "Ö ( 1x) Ö00\n", "Ü ( 4x) 2328Ü, 516Ü, 659Ü, 929Ü\n", "è ( 2x) è60, è70\n", "ï ( 8x) 10ï, 166ï, 24ï, 28ï, 292ï, 29ï, 42ï, 8ï4\n", "ó ( 3x) 169Vó, 25ó, 5957V3óo\n", "ö ( 2x) 189öf, 2ö00\n", "ƒ ( 765x) 12ƒ, 14ƒ, 1753ƒ, 17ƒ, 19ƒ, 8ƒ294, ƒ10, ƒ10, ƒ100, ƒ1002\n", "— ( 2x) 1—151, 568—\n", "‘ ( 1x) 6440‘\n", "’ ( 68x) 36’, d’480, ’19, ’20, ’29, ’34, ’34, ’35, ’35, ’35\n", "“ ( 1x) 29“\n", "” ( 1x) 1681”\n", "„ ( 36x) 12143„, 13757„, 1637„, 3096„, 3546„, 44246„, 615„, „10, „114, „116\n", "™ ( 1x) 30™\n", "⌊ ( 1x) 1706⌊\n" ] } ], "source": [ "for c in sorted(inventory):\n", " examples = inventory[c]\n", " n = len(examples)\n", " showExamples = \", \".join(sorted(examples)[0:10])\n", " print(f\"{c} ({n:>4}x) {showExamples}\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We decide to translate a few characters to numerals:" ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [], "source": [ "charMapping = {\n", " \"o\": 0,\n", " \"ó\": 0,\n", " \"ö\": 0,\n", " \"Ö\": 0,\n", " \"I\": 1,\n", " \"J\": 1,\n", " \"ï\": 1,\n", " \"è\": 6,\n", "}" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now we translate all numerals with this mapping, and if the result is numeric and does not start with a 0,\n", "we save the result in a mapping from nodes to numbers." ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "114" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "def cmap(chars):\n", " n = \"\".join(str(charMapping.get(c, c)) for c in chars)\n", " return int(n) if not n.startswith(\"0\") and n.isdigit() else None\n", "\n", "\n", "number = {w: n for w in results if (n := cmap(F.transo.v(w)))}\n", "len(number)" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "{11761: 1151, 368089: 670, 379197: 94001, 379568: 131, 396613: 141, 396656: 20621, 407164: 121, 430354: 121, 432757: 128181, 432879: 1241, 434920: 141, 462917: 621, 464624: 1241, 465415: 631, 472907: 3191, 473135: 9581, 483858: 8191, 486913: 10791, 498619: 8541, 533953: 261, 533968: 331, 535684: 6121, 557983: 77841, 618358: 261, 618871: 4021, 618877: 501, 627195: 261, 653407: 1741, 667437: 15301, 675324: 65931, 750255: 3231, 750445: 5021, 1019955: 10921, 1047395: 1371, 1068377: 52141, 1070934: 49141, 1079667: 2000, 1080766: 72771, 1118656: 4061, 1173348: 161, 1178433: 101, 1196647: 191, 1200319: 201, 1211567: 660, 1230723: 3501, 1234154: 171, 1237203: 111, 1237391: 141, 1250144: 8421, 1253186: 32091, 1271818: 121, 1282202: 75621, 1327325: 121, 1346403: 131, 1352127: 421, 1352309: 421, 1372543: 371, 1379628: 161, 1393864: 2228491, 1443457: 161, 1443464: 361, 1443641: 361, 1443657: 361, 1443666: 101, 1451420: 2981, 1548082: 1101, 1554393: 421, 1653139: 2501, 1669175: 151, 1682688: 4041, 1682700: 1441, 1714540: 721, 1833190: 1213781, 1851679: 1441, 1877221: 98771, 1877228: 977381, 1877230: 167081, 1948091: 925981, 1957857: 15361, 1965567: 181, 2089027: 541, 2126313: 701, 2126473: 621, 2126645: 901, 2126699: 731, 2126709: 911, 2126717: 761, 2126753: 561, 2207671: 1321, 2207675: 361, 2207742: 361, 2351417: 151, 2379398: 121, 2945183: 240, 2968542: 480, 2968588: 240, 2993386: 360, 2993418: 360, 3037496: 250, 3704420: 185, 3820516: 9961, 4086262: 101, 4131362: 185, 4188174: 241, 4262757: 2991, 4277217: 281, 4355285: 291, 4355770: 2921, 4394040: 421, 4412289: 814, 4522464: 1661, 4792505: 121, 4993558: 185, 5146359: 11911}\n" ] } ], "source": [ "print(number)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Saving data\n", "\n", "In [annotate](annotate.ipynb) we saw how to save features.\n", "We do the same for the `number` feature." ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [], "source": [ "GITHUB = os.path.expanduser(\"~/github\")\n", "ORG = A.context.org\n", "REPO = A.context.repo\n", "PATH = \"exercises/numerics\"" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Later on, we pass this version on, so that users of our data will get the shared data in exactly the same version as their core data." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We have to specify a bit of metadata for this feature:" ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [], "source": [ "metaData = {\n", " \"number\": dict(\n", " valueType=\"int\",\n", " description=\"numeric value of corrected number-like strings\",\n", " creator=\"Dirk Roorda\",\n", " ),\n", "}" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now we can give the save command:" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 0.00s Exporting 1 node and 0 edge and 0 config features to ~/github/CLARIAH/wp6-missieven/exercises/numerics/tf/1.0:\n", " | 0.00s T number to ~/github/CLARIAH/wp6-missieven/exercises/numerics/tf/1.0\n", " 0.00s Exported 1 node features and 0 edge features and 0 config features to ~/github/CLARIAH/wp6-missieven/exercises/numerics/tf/1.0\n" ] }, { "data": { "text/plain": [ "True" ] }, "execution_count": 14, "metadata": {}, "output_type": "execute_result" } ], "source": [ "location = f\"{GITHUB}/{ORG}/{REPO}/{PATH}/tf\"\n", "TF.save(\n", " nodeFeatures=dict(number=number),\n", " metaData=metaData,\n", " location=location,\n", " module=VERSION,\n", " silent=\"auto\",\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Here is the data in text-fabric format: a feature file" ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "@node\n", "@creator=Dirk Roorda\n", "@description=numeric value of corrected number-like strings\n", "@valueType=int\n", "@writtenBy=Text-Fabric\n", "@dateWritten=2022-10-11T14:56:42Z\n", "\n", "11761\t1151\n", "368089\t670\n", "379197\t94001\n", "379568\t131\n", "396613\t141\n", "396656\t20621\n", "407164\t121\n", "430354\t121\n", "432757\t128181\n", "432879\t1241\n", "434920\t141\n", "462917\t621\n", "464624\t1241\n", "465415\t631\n", "472907\t3191\n", "473135\t9581\n", "483858\t8191\n", "486913\t10791\n", "498619\t8541\n", "533953\t261\n", "533968\t331\n", "535684\t6121\n", "557983\t77841\n", "618358\t261\n", "618871\t4021\n", "618877\t501\n", "627195\t261\n", "653407\t1741\n", "667437\t15301\n", "675324\t65931\n", "750255\t3231\n", "750445\t5021\n", "1019955\t10921\n", "1047395\t1371\n", "1068377\t52141\n", "1070934\t49141\n", "1079667\t2000\n", "1080766\t72771\n", "1118656\t4061\n", "1173348\t161\n", "1178433\t101\n", "1196647\t191\n", "1200319\t201\n", "1211567\t660\n", "1230723\t3501\n", "1234154\t171\n", "1237203\t111\n", "1237391\t141\n", "1250144\t8421\n", "1253186\t32091\n", "1271818\t121\n", "1282202\t75621\n", "1327325\t121\n", "1346403\t131\n", "1352127\t421\n", "1352309\t421\n", "1372543\t371\n", "1379628\t161\n", "1393864\t2228491\n", "1443457\t161\n", "1443464\t361\n", "1443641\t361\n", "1443657\t361\n", "1443666\t101\n", "1451420\t2981\n", "1548082\t1101\n", "1554393\t421\n", "1653139\t2501\n", "1669175\t151\n", "1682688\t4041\n", "1682700\t1441\n", "1714540\t721\n", "1833190\t1213781\n", "1851679\t1441\n", "1877221\t98771\n", "1877228\t977381\n", "1877230\t167081\n", "1948091\t925981\n", "1957857\t15361\n", "1965567\t181\n", "2089027\t541\n", "2126313\t701\n", "2126473\t621\n", "2126645\t901\n", "2126699\t731\n", "2126709\t911\n", "2126717\t761\n", "2126753\t561\n", "2207671\t1321\n", "2207675\t361\n", "2207742\t361\n", "2351417\t151\n", "2379398\t121\n", "2945183\t240\n", "2968542\t480\n", "2968588\t240\n", "2993386\t360\n", "2993418\t360\n", "3037496\t250\n", "3704420\t185\n", "3820516\t9961\n", "4086262\t101\n", "4131362\t185\n", "4188174\t241\n", "4262757\t2991\n", "4277217\t281\n", "4355285\t291\n", "4355770\t2921\n", "4394040\t421\n", "4412289\t814\n", "4522464\t1661\n", "4792505\t121\n", "4993558\t185\n", "5146359\t11911\n", "\n" ] } ], "source": [ "with open(f\"{location}/{VERSION}/number.tf\") as fh:\n", " print(fh.read())" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Sharing data\n", "\n", "How to share your own data is explained in the\n", "[documentation](https://annotation.github.io/text-fabric/tf/about/datasharing.html).\n", "\n", "Here we show it step by step for the `number` feature.\n", "\n", "If you commit your changes to the exercises repo, and have done a `git push origin master`,\n", "you already have shared your data!\n", "\n", "**Keep it simple for small datasets:\n", "For small feature datasets, you are done.**\n", "\n", "If it gets serious, there is support for releases and efficient data transfer.\n", "Here is how:\n", "\n", "**Note (releases)**\n", "\n", "If you want to make a stable release, so that you can keep developing, while your users fall back\n", "on the stable data, you can make a new release.\n", "\n", "Go to the GitHub website for that, go to your repo, and click *Releases* and follow the nudges.\n", "\n", "**Note (release binaries)**\n", "\n", "If you want to make it even smoother for your users, you can zip the data and attach it as a binary to the release just created.\n", "\n", "We need to zip the data in exactly the right directory structure. Text-Fabric can do that for us.\n" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "This is a TF dataset\n", "Create release data for CLARIAH/wp6-missieven/exercises/numerics/tf\n", "Found 2 versions\n", "zip files end up in ~/Downloads/None/CLARIAH-release/wp6-missieven\n", "zipping CLARIAH/wp6-missieven 0.9.1 with 1 features ==> exercises-numerics-tf-0.9.1.zip\n", "zipping CLARIAH/wp6-missieven 1.0 with 1 features ==> exercises-numerics-tf-1.0.zip\n" ] } ], "source": [ "%%sh\n", "\n", "text-fabric-zip CLARIAH/wp6-missieven/exercises/numerics/tf" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "All versions have been zipped, but it works OK if you only attach the newest version to the newest release.\n", "\n", "If a user asks for an older version in this release, the system can still find it." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Use the data\n", "\n", "We can use the data by calling it up when we say `use('CLARIAH/wp6-missieven', ...)`\n", "where we put in a data module argument on the dots.\n", "We will also call up the entity data we created in the [annotate](annotate.ipynb) chapter.\n", "\n", "Note that for each module we can specify flags like `:latest`, `:hot`, `clone`.\n", "\n", "If you are the author of the data, and want to test it, use `:clone`: it takes the data from where you saved it.\n", "\n", "If you are a new user of the data, use `:hot` (get latest commit) or `:latest` (get latest release)\n", "to download the data.\n", "\n", "If you have downloaded the data before, leave out the flag." ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "data": { "text/html": [ "TF-app: ~/text-fabric-data/github/CLARIAH/wp6-missieven/app" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/github/CLARIAH/wp6-missieven/tf/1.0" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/github/CLARIAH/wp6-missieven/exercises/entities/tf/1.0" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "data: ~/text-fabric-data/github/CLARIAH/wp6-missieven/exercises/numerics/tf/1.0" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "name": "stdout", "output_type": "stream", "text": [ "This is Text-Fabric 10.2.6\n", "Api reference : https://annotation.github.io/text-fabric/tf/cheatsheet.html\n", "\n", "49 features found and 0 ignored\n", " 4.57s All features loaded/computed - for details use TF.isLoaded()\n", " 0.42s All additional features loaded - for details use TF.isLoaded()\n" ] }, { "data": { "text/html": [ "Text-Fabric: Text-Fabric API 10.2.6, CLARIAH/wp6-missieven/app v3, Search Reference
Data: WP6-MISSIEVEN, Character table, Feature docs
Features:
\n", "
CLARIAH/wp6-missieven/exercises/numerics/tf\n", "
\n", "\n", "
\n", "
\n", "number\n", "
\n", "
int
\n", "\n", "
\n", " numeric value of corrected number-like strings\n", "
\n", "\n", "
\n", "
creator:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T14:18:27Z
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "\n", "
CLARIAH/wp6-missieven/exercises/entities/tf\n", "
\n", "\n", "
\n", "
\n", "entityComment\n", "
\n", "
str
\n", "\n", "
\n", " comment to a named entity\n", "
\n", "\n", "
\n", "
creator:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T14:17:20Z
\n", "
\n", "\n", "
\n", "
upgraded:
\n", "
‼️ from version 0.4 to 1.0
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "entityId\n", "
\n", "
str
\n", "\n", "
\n", " identifier of a named entity\n", "
\n", "\n", "
\n", "
creator:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T14:17:20Z
\n", "
\n", "\n", "
\n", "
upgraded:
\n", "
‼️ from version 0.4 to 1.0
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "entityKind\n", "
\n", "
str
\n", "\n", "
\n", " kind of a named entity\n", "
\n", "\n", "
\n", "
creator:
\n", "
Dirk Roorda
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T14:17:20Z
\n", "
\n", "\n", "
\n", "
upgraded:
\n", "
‼️ from version 0.4 to 1.0
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "\n", "
General Missives Dutch East India Company 1600-1800\n", "
\n", "\n", "
\n", "
\n", "author\n", "
\n", "
str
\n", "\n", "
\n", " authors of the letter, surnames only\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
comma-space-separated values
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "authorFull\n", "
\n", "
str
\n", "\n", "
\n", " authors of the letter, full names\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
comma-space-separated values
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "col\n", "
\n", "
int
\n", "\n", "
\n", " column number of a column in a row in a table\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "day\n", "
\n", "
int
\n", "\n", "
\n", " day part of the date of the letter\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
numeral between 1 and 31 inclusive
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isden\n", "
\n", "
int
\n", "\n", "
\n", " whether a word is the denominator in fraction, e.g. 4 in 1/4\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isemph\n", "
\n", "
str
\n", "\n", "
\n", " whether a word is emphasized by typography\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isfolio\n", "
\n", "
int
\n", "\n", "
\n", " a folio reference\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isnote\n", "
\n", "
int
\n", "\n", "
\n", " whether a word belongs to footnote text\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isnum\n", "
\n", "
int
\n", "\n", "
\n", " whether a word is the numerator in fraction, e.g. 1 in 1/4\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isorig\n", "
\n", "
int
\n", "\n", "
\n", " whether a word belongs to original text\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:04Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isq\n", "
\n", "
int
\n", "\n", "
\n", " whether a word is a numerical fraction, e.g. 1/4\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:05Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isref\n", "
\n", "
int
\n", "\n", "
\n", " whether a word belongs to the text of reference\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:05Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isremark\n", "
\n", "
int
\n", "\n", "
\n", " whether a word belongs to the text of editorial remarks\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:05Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isspecial\n", "
\n", "
int
\n", "\n", "
\n", " whether a word has special typography possibly with OCR mistakes as well\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "issub\n", "
\n", "
int
\n", "\n", "
\n", " whether a word has subscript typography possibly indicating the denominator of a fraction\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "issuper\n", "
\n", "
int
\n", "\n", "
\n", " whether a word has superscript typography possibly indicating the numerator of a fraction\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "isund\n", "
\n", "
str
\n", "\n", "
\n", " whether a word is underlined by typography\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer 1 or absent
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "mark\n", "
\n", "
int
\n", "\n", "
\n", " footnote mark (not necessarily the same as shown on the printed page\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
integer
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "month\n", "
\n", "
int
\n", "\n", "
\n", " month part of the date of the letter\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
numeral between 1 and 12 inclusive
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "n\n", "
\n", "
int
\n", "\n", "
\n", " number of a volume, letter, page, para, line, table\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "otype\n", "
\n", "
str
\n", "\n", "
\n", " \n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:06Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "page\n", "
\n", "
str
\n", "\n", "
\n", " number of the first page of this letter in this volume\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:07Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
numeral (at most 4 digits)
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "place\n", "
\n", "
str
\n", "\n", "
\n", " place from where the letter was sent\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:07Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "punc\n", "
\n", "
str
\n", "\n", "
\n", " punctuation and/or whitespace following a wordup to the next word\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:07Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "puncn\n", "
\n", "
str
\n", "\n", "
\n", " punctuation and/or whitespace following a word,up to the next word, footnote text only\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:09Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "punco\n", "
\n", "
str
\n", "\n", "
\n", " punctuation and/or whitespace following a word,up to the next word, original text only\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:09Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "puncr\n", "
\n", "
str
\n", "\n", "
\n", " punctuation and/or whitespace following a word,up to the next word, remark text only\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:11Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "rawdate\n", "
\n", "
str
\n", "\n", "
\n", " the date the letter was sent\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:12Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
informal Dutch date notation
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "row\n", "
\n", "
int
\n", "\n", "
\n", " row number of a row of column in a table\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:12Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "seq\n", "
\n", "
str
\n", "\n", "
\n", " ('sequence number of this letter among the letters of the same author in this volume',)\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:12Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
roman numeral (capitalized)
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "status\n", "
\n", "
str
\n", "\n", "
\n", " status of the letter, e.g. secret, copy\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:12Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
keyword
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "title\n", "
\n", "
str
\n", "\n", "
\n", " title of the letter\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:12Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
comma-separated values
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "trans\n", "
\n", "
str
\n", "\n", "
\n", " transcription of a word\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:12Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "transn\n", "
\n", "
str
\n", "\n", "
\n", " transcription of a word, only for footnote text\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:14Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "transo\n", "
\n", "
str
\n", "\n", "
\n", " transcription of a word, only for original text\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:14Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "transr\n", "
\n", "
str
\n", "\n", "
\n", " transcription of a word, only for remark text\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:16Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "vol\n", "
\n", "
int
\n", "\n", "
\n", " volume number\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:17Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
positive integer
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "weblink\n", "
\n", "
str
\n", "\n", "
\n", " the page-specific part of web links for page nodes\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:17Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "x\n", "
\n", "
int
\n", "\n", "
\n", " column offset of a column in a row in a table\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:17Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "year\n", "
\n", "
int
\n", "\n", "
\n", " year part of the date of the letter\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:17Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
numeral between 1600 and 1800
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "note\n", "
\n", "
none
\n", "\n", "
\n", " edge between a word and the footnotes associated with it\n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:17Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
format:
\n", "
no values
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n", "oslots\n", "
\n", "
none
\n", "\n", "
\n", " \n", "
\n", "\n", "
\n", "
converters:
\n", "
Sophie Arnoult, Jesse de Does (TEI), Dirk Roorda (Text-Fabric)
\n", "
\n", "\n", "
\n", "
dateWritten:
\n", "
2022-05-04T12:42:17Z
\n", "
\n", "\n", "
\n", "
descriptionTf:
\n", "
Original text, editorial text and footnotes form three different types of text
\n", "
\n", "\n", "
\n", "
editor:
\n", "
Dr. W. Ph. Coolhaas, Dr. J. van Goor, Dr. J.E. Schooneveld-Oosterling, Dr. H.K. s'Jacob
\n", "
\n", "\n", "
\n", "
institute:
\n", "
KNAW/HuygensING
\n", "
\n", "\n", "
\n", "
language:
\n", "
nld
\n", "
\n", "\n", "
\n", "
name:
\n", "
Generale Missiven
\n", "
\n", "\n", "
\n", "
period:
\n", "
1610-1761
\n", "
\n", "\n", "
\n", "
project:
\n", "
CLARIAH WP6 Use Case 1
\n", "
\n", "\n", "
\n", "
published:
\n", "
Martinus Nijhoff (1960-1985), Bureau der Rijkscommissie voor Vaderlandse Geschiedenis (1988), Instituut voor Nederlandse Geschiedenis (1997, 2004-2007)
\n", "
\n", "\n", "
\n", "
researcher:
\n", "
Lodewijk Petram
\n", "
\n", "\n", "
\n", "
sourceFormat:
\n", "
TEI
\n", "
\n", "\n", "
\n", "
title:
\n", "
Generale Missieven van Gouverneurs-Generaal en Raden aan Heren XVII der Verenigde Oostindische Compagnie
\n", "
\n", "\n", "
\n", "
writtenBy:
\n", "
Text-Fabric
\n", "
\n", "\n", "
\n", "
\n", "\n", "
\n", "\n", "
\n", "
\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "\n", "\n" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "
Text-Fabric API: names N F E L T S C TF directly usable

" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A = use(\n", " f\"CLARIAH/wp6-missieven\",\n", " hoist=globals(),\n", " mod=(\n", " f\"CLARIAH/wp6-missieven/exercises/entities/tf\",\n", " f\"CLARIAH/wp6-missieven/exercises/numerics/tf\",\n", " ),\n", " version=VERSION,\n", " silent=False,\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Above you see a new sections in the feature list that you can expand to see\n", "which features that module contributed.\n", "\n", "Now, suppose did not know much about these feature, then we would like to do a few basic checks.\n", "\n", "A good start it to do inspect a frequency list of the values of the new features,\n", "and then to perform a query looking for the nodes that have these features.\n", "\n", "We do that for the entity features and for the number feature." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Entities" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "(('T11', 6),\n", " ('T2', 5),\n", " ('T13', 3),\n", " ('T16', 3),\n", " ('T8', 3),\n", " ('T9', 3),\n", " ('T10', 2),\n", " ('T15', 2),\n", " ('T17', 2),\n", " ('T3', 2),\n", " ('T5', 2),\n", " ('T1', 1),\n", " ('T12', 1),\n", " ('T4', 1),\n", " ('T6', 1),\n", " ('T7', 1))" ] }, "execution_count": 19, "metadata": {}, "output_type": "execute_result" } ], "source": [ "F.entityId.freqList()" ] }, { "cell_type": "code", "execution_count": 20, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "(('Person', 18), ('GPE', 15), ('Organization', 5))" ] }, "execution_count": 20, "metadata": {}, "output_type": "execute_result" } ], "source": [ "F.entityKind.freqList()" ] }, { "cell_type": "code", "execution_count": 21, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "(('Ternate', 5), ('Amboina', 2))" ] }, "execution_count": 21, "metadata": {}, "output_type": "execute_result" } ], "source": [ "F.entityComment.freqList()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let's query all words that have an entity notation:" ] }, { "cell_type": "code", "execution_count": 22, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 4.40s 23 results\n" ] } ], "source": [ "query = \"\"\"\n", "word entityId entityKind* entityComment*\n", "\"\"\"\n", "results = A.search(query)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Here we query all word where the `entityId` is present.\n", "We also mention the `entityKind` and `entityComment` features, but with a `*` behind them.\n", "That is a criterion that is always True, so these mentions do not alter the result list.\n", "But now these features do occur in the query, and when we show results, these features will be shown." ] }, { "cell_type": "code", "execution_count": 23, "metadata": {}, "outputs": [ { "data": { "text/html": [ "

line 1" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T11entityKind=Person
Op
den
12
deser
is
een
jonge
slave
van
een
orancay
entityId=T11entityKind=Person
Orangkaja,
hier
aanduiding
voor
een
Bandanees
hoofd
of
aanzienlijke.
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 2" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T11entityKind=Person
bij
entityId=T11entityKind=Person
nacht
comen
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 3" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T13entityKind=Person
jongen
een
groot
arancay
entityId=T12entityKind=Person
van
Nera,
entityId=T1entityKind=Person
broeder
van
den
sabandaer
entityId=T13entityKind=Person
Sjahbandar,
uit
het
Perzisch
overgenomen
woord,
in
Zuidoost-
Azië
gebruikt
voor
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 4" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T11entityKind=Person
tref
f
ten
ende
’t
hooft
ons
hier
in
’t
casteel
gebracht,
den
voorsz.
orancaye
entityId=T11entityKind=Person
was
door
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 5" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T15entityKind=GPE
met
den
onsen
gesproken,
die
hem
mede
geroemt
hadde,
2
van
onse
Hollanders
entityId=T15entityKind=GPE
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 6" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T16entityKind=Person
in
den
moort
van
den
admirael
Verhoeven
entityId=T16entityKind=Person
Admiraal
Pieter
Willemsz.
Verhoeff
kwam
23
november
1608
met
zijn
vloot
voor
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 7" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityComment=AmboinaentityId=T17entityKind=GPE
Ambojna
entityComment=AmboinaentityId=T17entityKind=GPE
ende
van
daer
naer
Ternnate
entityComment=TernateentityId=T2entityKind=GPE
ende
soo
den
Coninck
entityId=T3entityKind=Person
van
entityId=T3entityKind=Person
Spagnien
entityId=T4entityKind=GPE
tusschen
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 8" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityComment=TernateentityId=T2entityKind=GPE
de
Heeren
entityId=T5entityKind=Organization
Staeten
entityId=T5entityKind=Organization
den
treves
geobserveert
werde,
metten
Coninck
entityId=T6entityKind=Person
van
ditto
entityComment=TernateentityId=T2entityKind=GPE
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityComment=TernateentityId=T2entityKind=GPE
plaetse
entityComment=TernateentityId=T2entityKind=GPE
te
contracteeren
om
met
sijn
hulpe
dese
plaetse
te
ocuperen
ende
hem
daer
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 10" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T8entityKind=Organization
mede
Coninck
entityId=T7entityKind=Person
van
te
maeken
onder
protexie
van
E
Mogende
Heeren
entityId=T8entityKind=Organization
Staeten,
entityId=T8entityKind=Organization
doch
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 11" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T9entityKind=GPE
Dit
volck
van
Banda
entityId=T9entityKind=GPE
is
superbe,
moordadich,
wel
versien
van
waepenen,
van
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line 12" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
entityId=T10entityKind=GPE
de
onsen
voor
desen
ende
van
de
Engelsche
entityId=T10entityKind=GPE
gecomen,
dan
weynich
couraege
omme
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.show(results, condensed=True)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "**Observation**\n", "\n", "It's not only words that have entity features, also the lines themselves have gotten such annotations.\n", "\n", "It turns out that it is not very useful to annotate *lines* with entities this way.\n", "It would be better to annotate them with the number of entities they contain.\n", "That is our feedback to the creator of these annotations, and because we know the GitHub repo that they are from,\n", "we can file an [issue](https://github.com/annotation/tutorials/issues/3)!" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Numerics" ] }, { "cell_type": "code", "execution_count": 24, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "((121, 6),\n", " (361, 5),\n", " (421, 4),\n", " (101, 3),\n", " (141, 3),\n", " (161, 3),\n", " (185, 3),\n", " (261, 3),\n", " (131, 2),\n", " (151, 2),\n", " (240, 2),\n", " (360, 2),\n", " (621, 2),\n", " (1241, 2),\n", " (1441, 2),\n", " (111, 1),\n", " (171, 1),\n", " (181, 1),\n", " (191, 1),\n", " (201, 1),\n", " (241, 1),\n", " (250, 1),\n", " (281, 1),\n", " (291, 1),\n", " (331, 1),\n", " (371, 1),\n", " (480, 1),\n", " (501, 1),\n", " (541, 1),\n", " (561, 1),\n", " (631, 1),\n", " (660, 1),\n", " (670, 1),\n", " (701, 1),\n", " (721, 1),\n", " (731, 1),\n", " (761, 1),\n", " (814, 1),\n", " (901, 1),\n", " (911, 1),\n", " (1101, 1),\n", " (1151, 1),\n", " (1321, 1),\n", " (1371, 1),\n", " (1661, 1),\n", " (1741, 1),\n", " (2000, 1),\n", " (2501, 1),\n", " (2921, 1),\n", " (2981, 1),\n", " (2991, 1),\n", " (3191, 1),\n", " (3231, 1),\n", " (3501, 1),\n", " (4021, 1),\n", " (4041, 1),\n", " (4061, 1),\n", " (5021, 1),\n", " (6121, 1),\n", " (8191, 1),\n", " (8421, 1),\n", " (8541, 1),\n", " (9581, 1),\n", " (9961, 1),\n", " (10791, 1),\n", " (10921, 1),\n", " (11911, 1),\n", " (15301, 1),\n", " (15361, 1),\n", " (20621, 1),\n", " (32091, 1),\n", " (49141, 1),\n", " (52141, 1),\n", " (65931, 1),\n", " (72771, 1),\n", " (75621, 1),\n", " (77841, 1),\n", " (94001, 1),\n", " (98771, 1),\n", " (128181, 1),\n", " (167081, 1),\n", " (925981, 1),\n", " (977381, 1),\n", " (1213781, 1),\n", " (2228491, 1))" ] }, "execution_count": 24, "metadata": {}, "output_type": "execute_result" } ], "source": [ "F.number.freqList()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We see that the values that we have generated before." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let's show the original and the number side by side." ] }, { "cell_type": "code", "execution_count": 25, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 1.87s 114 results\n" ] } ], "source": [ "results = A.search(\n", " \"\"\"\n", "word number transo*\n", "\"\"\"\n", ")" ] }, { "cell_type": "code", "execution_count": 26, "metadata": {}, "outputs": [ { "data": { "text/html": [ "

result 1" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
5
transo=5
jan.
transo=jan
1613
transo=1613
sloot
transo=sloot
hij
transo=hij
een
transo=een
contract
transo=contract
met
transo=met
den
transo=den
vorst
transo=vorst
van
transo=van
Buton
transo=Buton
Corpus
transo=Corpus
I,
transo=I
p.
transo=p
115J,
number=1151transo=115J
20
transo=20
april
transo=april
d.
transo=d
a.
transo=a
v.
transo=v
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 2" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
schuylt (
transo=schuylt
alles
transo=alles
tot
transo=tot
laste
transo=laste
van
transo=van
de
transo=de
Comp.
transo=Comp
e
transo=e
loopende),
transo=loopende
deminueert
transo=deminueert
d’advance
transo=d’advance
van
transo=van
60
transo=60
è70
number=670transo=è70
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 3" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
Ticco,
transo=Ticco
samen
transo=samen
9400J
number=94001transo=9400J
realen,
transo=realen
op
transo=op
Atchin
transo=Atchin
te
transo=te
nemen,
transo=nemen
met
transo=met
waerschouwingh
transo=waerschouwingh
in
transo=in
toecomende
transo=toecomende
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 4" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
van
transo=van
juweelen
transo=juweelen
ende
transo=ende
coopmanschappen,
transo=coopmanschappen
sijnde
transo=sijnde
per
transo=per
reste
transo=reste
3343
transo=3343
tayl
transo=tayl
13J
number=131transo=13J
maes
transo=maes
ofte
transo=ofte
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 5" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
row
cell
Eerstelijck:
transo=Eerstelijck
cell
129323/
transo=129323
16
transo=16
cell
realen
transo=realen
voor
transo=voor
intrest
transo=intrest
van
transo=van
14J
number=141transo=14J
maenden
transo=maenden
a
transo=a
percento
transo=percento
van
transo=van
r.
transo=r
a
transo=a
35678,
transo=35678
dat
transo=dat
monteert
transo=monteert
’t
transo=’t
cargasoen,
transo=cargasoen
bestaende
transo=bestaende
uyt
transo=uyt
lijnwaten
transo=lijnwaten
ende
transo=ende
diamanten,
transo=diamanten
den
transo=den
6en
transo=6en
mey
transo=mey
anno
transo=anno
’38
transo=’38
in
transo=in
den
transo=den
Corsoer
transo=Corsoer
gescheept,
transo=gescheept
tot
transo=tot
21
transo=21
augusti
transo=augusti
1639
transo=1639
gemelte
transo=gemelte
Corsoer
transo=Corsoer
van
transo=van
Macassar
transo=Macassar
over
transo=over
Bantam
transo=Bantam
tot
transo=tot
Masilipatnam
transo=Masilipatnam
salvo
transo=salvo
is
transo=is
aengelandt ;
transo=aengelandt
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 6" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
row
cell
ten
transo=ten
tweeden:
transo=tweeden
cell
2062J
number=20621transo=2062J
cell
realen
transo=realen
voor
transo=voor
intrest
transo=intrest
van
transo=van
16
transo=16
J
transo=J
maenden
transo=maenden
over
transo=over
’t
transo=’t
capitael,
transo=capitael
dat
transo=dat
den
transo=den
Corsoer
transo=Corsoer
gecost
transo=gecost
soude
transo=soude
hebben;
transo=hebben
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 7" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
vercocht,
transo=vercocht
dat
transo=dat
comt
transo=comt
’t
transo=’t
te
transo=te
wesen
transo=wesen
12J
number=121transo=12J
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 8" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
Noopende
transo=Noopende
d’affgedrongen
transo=d’affgedrongen
4006
transo=4006
tomannen
transo=tomannen
12J
number=121transo=12J
mamoudy
transo=mamoudy
is
transo=is
weynigh
transo=weynigh
off
transo=off
geen
transo=geen
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 9" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
12818J,
number=128181transo=12818J
die
transo=die
nu
transo=nu
mede
transo=mede
overgaen.
transo=overgaen
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

result 10" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" }, { "data": { "text/html": [ "

line
Van
transo=Van
Masilipatnam
transo=Masilipatnam
wert
transo=wert
ons
transo=ons
gesonden
transo=gesonden
124J
number=1241transo=124J
onzen
transo=onzen
cleen
transo=cleen
besarsteen,
transo=besarsteen
costende
transo=costende
" ], "text/plain": [ "" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "A.show(results, start=1, end=10)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# All together!\n", "\n", "If more researchers have shared data modules, you can draw them all in.\n", "\n", "Then you can design queries that use features from all these different sources.\n", "\n", "In that way, you build your own research on top of the work of others." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Hover over the features to see where they come from, and you'll see they come from your local GitHub repo." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# For real\n", "\n", "See the [next tutorial in this series](entities.ipynb) how you can\n", "draw in and make use additional features produced by a serious algorithm to detect\n", "named entities." ] }, { "cell_type": "markdown", "metadata": { "jp-MarkdownHeadingCollapsed": true, "tags": [] }, "source": [ "---\n", "\n", "# Contents\n", "\n", "* **[start](start.ipynb)** start computing with this corpus\n", "* **[search](search.ipynb)** turbo charge your hand-coding with search templates\n", "* **[compute](compute.ipynb)** sink down a level and compute it yourself\n", "* **[exportExcel](exportExcel.ipynb)** make tailor-made spreadsheets out of your results\n", "* **[annotate](annotate.ipynb)** export text, annotate with BRAT, import annotations\n", "* **share** draw in other people's data and let them use yours\n", "* **[entities](entities.ipynb)** use results of third-party NER (named entity recognition)\n", "* **[porting](porting.ipynb)** port features made against an older version to a newer version\n", "* **[volumes](volumes.ipynb)** work with selected volumes only\n", "\n", "CC-BY Dirk Roorda" ] } ], "metadata": { "jupytext": { "encoding": "# -*- coding: utf-8 -*-" }, "kernelspec": { "display_name": "Python 3 (ipykernel)", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.10.7" }, "widgets": { "application/vnd.jupyter.widget-state+json": { "state": {}, "version_major": 2, "version_minor": 0 } } }, "nbformat": 4, "nbformat_minor": 4 }