Page Content - Ground Truth and Storage
The timestamp has to be in UTC (Coordinated Universal Time) and not local time.
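As an illustration of the UTC requirement, here is a minimal sketch that converts a time-zone-aware timestamp to UTC before it is written. The function name `utc_timestamp` and the ISO 8601 output format with a trailing "Z" are assumptions made for this example, not something this description prescribes.

```python
from datetime import datetime, timedelta, timezone

def utc_timestamp(moment=None):
    """Return a UTC timestamp string (assumed format: ISO 8601 with 'Z')."""
    if moment is None:
        moment = datetime.now(timezone.utc)
    if moment.tzinfo is None:
        # A naive datetime carries no zone, so it cannot be converted safely.
        raise ValueError("naive datetime: attach a time zone first")
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")

# A local time two hours ahead of UTC is shifted back before being stored.
local = datetime(2024, 3, 1, 14, 30, 0, tzinfo=timezone(timedelta(hours=2)))
print(utc_timestamp(local))
```

Rejecting naive datetimes is a design choice of this sketch: silently treating them as local time is a common way for local timestamps to end up in the file.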
Transkribus specific metadata for this transcript
Alternative document page images (e.g. black-and-white)
Unassigned regions are considered to be in the (virtual) default layer which is to be treated as below any other layers.
For generic use
Page type
Pure text is represented as a text region. This includes
drop capitals, but practically ornate text may be
considered as a graphic.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
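The stated range can be read as the half-open interval (-180, 180] in degrees. Under that assumption, the following sketch wraps an arbitrary angle into the range; `normalize_skew` is a hypothetical helper name.

```python
def normalize_skew(angle):
    """Wrap an angle in degrees into (-180, 180].

    Assumption: the documented range -179.999,180 is meant as the
    half-open interval (-180, 180].
    """
    wrapped = angle % 360.0      # now in [0, 360)
    if wrapped > 180.0:
        wrapped -= 360.0         # move (180, 360) down to (-180, 0)
    return wrapped

print(normalize_skew(190.0))   # a 190 degree clockwise turn equals 170 degrees anti-clockwise
print(normalize_skew(-180.0))  # the excluded lower bound maps to the included upper bound
```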
The nature of the text in the region
The degree of space in points between the lines of
text (line spacing)
The direction in which text in a region should be
read (within lines)
The angle the baseline of text within a region has to be rotated (relative to the rectangle encapsulating the region) in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
Defines whether a region of text is indented or not
Text align
The primary language used in the region
The secondary language used in the region
The primary script used in the region
The secondary script used in the region
Point list with format "x1,y1 x2,y2 ..."
Multiple connected points that mark the baseline
of the glyphs from left to right
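The point list format "x1,y1 x2,y2 ..." can be read and written with a few lines of code. The sketch below assumes integer pixel coordinates; `parse_points` and `format_points` are hypothetical helper names.

```python
def parse_points(points):
    """Parse 'x1,y1 x2,y2 ...' into a list of (x, y) tuples.

    Assumption: coordinates are integers (pixel positions).
    """
    result = []
    for pair in points.split():          # pairs are separated by whitespace
        x_text, y_text = pair.split(",")  # each pair is 'x,y'
        result.append((int(x_text), int(y_text)))
    return result

def format_points(pts):
    """Serialise (x, y) tuples back into the point list format."""
    return " ".join(f"{x},{y}" for x, y in pts)

baseline = parse_points("10,120 250,118 480,121")
print(baseline)
print(format_points(baseline))
```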
Overrides primaryLanguage attribute of parent text
region
Overrides the production attribute of the parent
text region
For generic use
Overrides primaryLanguage attribute of parent line
and/or text region
Overrides the production attribute of the parent
text line and/or text region.
For generic use
Overrides the production attribute of the parent
word / text line / text region.
For generic use
Text in a "simple" form (ASCII or extended ASCII
as mostly used for typing). I.e. no use of
special characters for ligatures (should be
stored as two separate characters) etc.
Correct encoding of the original, always using
the corresponding Unicode code point. I.e.
ligatures have to be represented as one
character etc.
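To make the difference between the two text forms concrete, the sketch below derives a "simple" form from a Unicode string by applying NFKC compatibility normalisation, which splits ligature code points into separate characters. Using NFKC for this is an assumption of the example: it also rewrites other compatibility characters and does not guarantee ASCII output.

```python
import unicodedata

def to_plain_text(unicode_text):
    """Approximate the 'simple' form of a Unicode transcription.

    Assumption: NFKC normalisation is an acceptable stand-in for
    'no special characters for ligatures'.
    """
    return unicodedata.normalize("NFKC", unicode_text)

original = "\ufb01sh"            # starts with the single code point U+FB01 (fi ligature)
plain = to_plain_text(original)
print(len(original), len(plain))  # the ligature becomes two separate characters
```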
OCR confidence value (between 0 and 1)
An image is considered to be more intricate and complex
than a graphic. These can be photos or drawings.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The colour bit depth required for the region
The background colour of the region
Specifies whether the region also contains
text
A line drawing is a single colour illustration without
solid areas.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The pen (foreground) colour of the region
The background colour of the region
Specifies whether the region also contains
text
Regions containing simple graphics, such as a company
logo, should be marked as graphic regions.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The type of graphic in the region
An approximation of the number of colours
used in the region
Specifies whether the region also contains
text.
The new table region type - a table is basically a collection of cells
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The number of rows present in the table
The number of columns present in the table
The colour of the lines used in the region
The background colour of the region
Specifies the presence of line separators
Specifies whether the region also contains
text
A single cell of a table
The row index of this cell
The column index of this cell
The number of rows this cell spans
The number of columns this cell spans
Left border visibility
Right border visibility
Top border visibility
Bottom border visibility
Determines whether this cell is a label cell
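The row index, column index and the two span values together determine which grid positions a cell occupies. A minimal sketch, with the hypothetical helper `covered_positions`; it works the same whether indices start at 0 or 1, which is not stated here.

```python
def covered_positions(row, col, row_span=1, col_span=1):
    """List the (row, column) grid positions occupied by one table cell."""
    return [
        (r, c)
        for r in range(row, row + row_span)
        for c in range(col, col + col_span)
    ]

# A cell at row 1, column 2 that spans two rows and two columns.
print(covered_positions(1, 2, row_span=2, col_span=2))
```

Comparing the covered positions of all cells is one way to detect overlapping cells or gaps in a table.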
Regions containing charts or graphs of any type should
be marked as chart regions.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The type of chart in the region
An approximation of the number of colours
used in the region
The background colour of the region
Specifies whether the region also contains
text
Separators are lines that lie between columns and
paragraphs and can be used to logically separate
different articles from each other.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The colour of the separator
Regions containing equations and mathematical symbols
should be marked as maths regions.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The background colour of the region
Regions containing chemical formulas.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The background colour of the region
Regions containing musical notations.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The background colour of the region
Regions containing advertisements.
The angle the rectangle encapsulating a region has to be rotated in clockwise direction in order to correct the present skew (negative values indicate anti-clockwise rotation).
Range: -179.999,180
The background colour of the region
Noise regions are regions where no real data lies, only
false data created by artifacts on the document or
scanner noise.
To be used if the region type cannot be ascertained.
Determines the effective area on the paper of a printed page. Its size is equal for all pages of a book (exceptions: title page, multi-page pictures).
It contains all living elements (except marginals) like body type, footnotes, headings, running titles.
It does not contain the page number (if not part of the running title), marginals, signature mark, preview words.
Definition of the reading order within the page. To express a reading order between elements they have to be included in an OrderedGroup. Groups may contain further groups.
Numbered region
Position (order number) of this item within the current hierarchy level.
Indexed group containing ordered elements
Position (order number) of this item within the
current hierarchy level.
Indexed group containing unordered elements
Position (order number) of this item within the
current hierarchy level.
Numbered group (contains ordered elements)
Numbered group (contains unordered elements)
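The nesting of ordered and unordered groups can be flattened into a single sequence of region references. The sketch below uses plain dictionaries with hypothetical keys (`kind`, `index`, `ref`, `children`) in place of the XML elements. It keeps the members of an unordered group in their listed order, which is an assumption, since no order is defined for them.

```python
def flatten(node):
    """Return region references in reading order for a nested group."""
    if node["kind"] == "region":
        return [node["ref"]]
    children = node["children"]
    if node["kind"] == "ordered":
        # Members of an ordered group are sorted by their position index.
        children = sorted(children, key=lambda child: child["index"])
    result = []
    for child in children:   # unordered groups: listed order (assumption)
        result.extend(flatten(child))
    return result

reading_order = {
    "kind": "ordered",
    "children": [
        {"kind": "region", "index": 1, "ref": "r2"},
        {"kind": "region", "index": 0, "ref": "r1"},
        {"kind": "unordered", "index": 2, "children": [
            {"kind": "region", "ref": "r3"},
            {"kind": "region", "ref": "r4"},
        ]},
    ],
}
print(flatten(reading_order))
```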
Border of the actual page (if the scanned image contains parts not belonging to the page).
Can be used to express the z-index of overlapping
regions. An element with a greater z-index is always in
front of another element with lower z-index.
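A back-to-front drawing order follows directly from the z-index rule, with unassigned regions placed in the default layer below all others. In this sketch, layers are given as hypothetical `(z_index, region_ids)` pairs.

```python
def back_to_front(region_ids, layers):
    """Order regions from rearmost to frontmost.

    layers: list of (z_index, [region ids]) pairs (hypothetical structure).
    Regions in no layer fall into the default layer, below all others.
    """
    z_of = {}
    for z_index, members in layers:
        for region_id in members:
            z_of[region_id] = z_index
    # Unassigned regions sort first (False < True), then by ascending z-index.
    return sorted(region_ids, key=lambda rid: (rid in z_of, z_of.get(rid, 0)))

print(back_to_front(["r1", "r2", "r3"], [(2, ["r1"]), (1, ["r3"])]))
```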
Point list with format "x1,y1 x2,y2 ..."
4 indexes (i.e. positive integer values) separated by whitespaces
A single point
Container for one-to-one relations between layout
objects (for example: DropCap - paragraph, caption -
image)
One-to-one relation between two layout objects. Use 'link'
for loose relations and 'join' for strong relations
(for instance, where something is fragmented).
Examples for 'link': caption - image; floating -
paragraph; paragraph - paragraph (when a paragraph is
split across columns and the last word of the first
paragraph DOES NOT continue in the second paragraph);
drop-cap - paragraph (when the drop-cap is a whole word).
Examples for 'join': word - word (separated word at the
end of a line); drop-cap - paragraph (when the drop-cap
is not a whole word); paragraph - paragraph (when a
paragraph is split across columns and the last word of
the first paragraph DOES continue in the second
paragraph).
For generic use
Text production type
Monospace (fixed-pitch, non-proportional) or
proportional font
For instance: Arial, Times New Roman. Add more
information if necessary (e.g. blackletter,
antiqua).
Serif or sans-serif typeface
The size of the characters in points
The degree of space (in points) between the
characters in a string of text
Background colour
Specifies whether the colour of the text appears
reversed against a background colour
For generic use
TEI-like tags, i.e. metadata information available to their corresponding element (i.e. a TextRegion, TextLine or Word),
such as abbreviations, person names, gaps etc. Most tags correspond to a certain text range in their element, which is
specified using the index and offset attributes.
Sub properties of this tag. E.g. when specifying a tag "abbrev" for tagging an abbreviation in the text,
a sub-property could be the property "expansion" for the expansion of the abbreviation.
Sub-property nesting should not exceed one child level, to prevent unnecessary complexity of the data structure.
name of the tag - this could be pre-specified tags like
"abbrev", "gap" etc. or custom-defined names
a custom value element for the tag
If this tag refers to some text in the corresponding element (i.e. in a TextRegion, TextLine or Word),
this attribute specifies the 0 based starting index of the referenced text
If this tag refers to some text in the corresponding element (i.e. in a TextRegion, TextLine or Word),
this attribute specifies the length of the referenced text
If this tag refers to some text in the corresponding element (i.e. in a TextRegion, TextLine or Word),
this attribute specifies if it is a continuation from the previous element
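A tag's 0-based starting index and length select a substring of the element's text. The sketch below shows that lookup; the parameter names `start` and `length` are placeholders, not the schema's attribute names.

```python
def tagged_text(text, start, length):
    """Return the part of an element's text that a tag refers to.

    start:  0-based index of the first referenced character
    length: number of referenced characters
    (both parameter names are placeholders for this sketch)
    """
    if start < 0 or length < 0 or start + length > len(text):
        raise ValueError("tag range lies outside the element text")
    return text[start:start + length]

line = "Dr. Smith arrived"
print(tagged_text(line, 0, 3))   # e.g. the range of an "abbrev" tag
print(tagged_text(line, 4, 5))   # e.g. the range of a person-name tag
```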
A property is a key/value pair storing information about the element it is in.
Properties are inherited by child elements and can be overridden there.
The inheritance hierarchy is: Page -> Region -> TextLine -> Word -> Glyph
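The inheritance rule amounts to merging key/value pairs from the outermost element down to the innermost one, with inner values taking precedence. A minimal sketch, with each level's properties held in a plain dictionary (an assumption of this example):

```python
HIERARCHY = ["Page", "Region", "TextLine", "Word", "Glyph"]

def effective_properties(levels):
    """Merge properties along the hierarchy, outermost level first.

    levels: one dict per level, ordered as in HIERARCHY; a later
    (deeper) level overrides keys inherited from an earlier one.
    """
    merged = {}
    for properties in levels:
        merged.update(properties)
    return merged

page = {"lang": "de", "style": "normal"}
region = {"layout": "float"}
text_line = {"style": "italic"}       # overrides the page-level style
print(effective_properties([page, region, text_line]))
```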
key of the property - this could be pre-specified keys like
"lang", "layout", "year_from", "year_to", "style", "weight"
or meta data from the automatic process
"editor", "editordate"
or user defined properties
"numbering"
value to the corresponding key:
for lang: "de", "en" (two-letter ISO 639-1 codes; ISO 639-2 uses three-letter codes such as "deu", "eng")
for year_(from|to): number
for style: italic, normal
for weight: bold, normal
for layout: float, table, ...
for numbering: 1,2,...
for editor: "user name", "htr_process"
...
Property names should be restricted to alphanumeric characters, underscores, dashes and dots.
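The naming restriction can be checked with a regular expression. The sketch below assumes that "alphanumeric" means ASCII letters and digits and that empty names are not allowed; neither point is spelled out above.

```python
import re

# Assumption: ASCII letters and digits, plus underscore, dash and dot.
_PROPERTY_NAME = re.compile(r"[A-Za-z0-9_.-]+")

def is_valid_property_name(name):
    """Check a property name against the recommended character set."""
    return _PROPERTY_NAME.fullmatch(name) is not None

for candidate in ["year_from", "htr.model-id", "my key", ""]:
    print(repr(candidate), is_valid_property_name(candidate))
```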
the link reference
the link type
the link description