terms4FAIRskills describes the competencies, skills and knowledge associated with making and keeping data FAIR.\nThis terminology applies to a variety of use cases, including: assisting with the creation and assessment of stewardship curricula; facilitating the annotation, discovery and evaluation of FAIR-enabling materials \(e.g. training\) and resources; enabling the formalisation of job descriptions and CVs with recognised, structured competencies.\nIt is intended to be of use to trainers who teach FAIR data skills, researchers who wish to identify skill gaps in their teams and managers who need to recruit individuals to relevant roles.
terms4FAIRskills (T4FS)
terms4FAIRskills by the terms4FAIRskills developers is licensed under CC BY 4.0. You are free to share (copy and redistribute the material in any medium or format) and adapt (remix, transform, and build upon the material) for any purpose, even commercially. for any purpose, even commercially. The licensor cannot revoke these freedoms as long as you follow the license terms. You must give appropriate credit (by using the original ontology IRI for the whole ontology and original term IRIs for individual terms), provide a link to the license, and indicate if any changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
2025-02-09
definition
The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions.
2012-04-05:
Barry Smith
The official OBI definition, explaining the meaning of a class or property: 'Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions' is terrible.
Can you fix to something like:
A statement of necessary and sufficient conditions explaining the meaning of an expression referring to a class or property.
Alan Ruttenberg
Your proposed definition is a reasonable candidate, except that it is very common that necessary and sufficient conditions are not given. Mostly they are necessary, occasionally they are necessary and sufficient or just sufficient. Often they use terms that are not themselves defined and so they effectively can't be evaluated by those criteria.
On the specifics of the proposed definition:
We don't have definitions of 'meaning' or 'expression' or 'property'. For 'reference' in the intended sense I think we use the term 'denotation'. For 'expression', I think we you mean symbol, or identifier. For 'meaning' it differs for class and property. For class we want documentation that let's the intended reader determine whether an entity is instance of the class, or not. For property we want documentation that let's the intended reader determine, given a pair of potential relata, whether the assertion that the relation holds is true. The 'intended reader' part suggests that we also specify who, we expect, would be able to understand the definition, and also generalizes over human and computer reader to include textual and logical definition.
Personally, I am more comfortable weakening definition to documentation, with instructions as to what is desirable.
We also have the outstanding issue of how to aim different definitions to different audiences. A clinical audience reading chebi wants a different sort of definition documentation/definition from a chemistry trained audience, and similarly there is a need for a definition that is adequate for an ontologist to work with.
PERSON:Daniel Schober
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
https://orcid.org/0000-0002-7702-4495
Allyson Lister 8.10.2019: Although the IAO definition has been included here, we have made a curation decision to allow non-Aristotelian definitions.
definition
The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions.
IAO
term editor
Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people
20110707, MC: label update to term editor and definition modified accordingly. See https://github.com/information-artifact-ontology/IAO/issues/115.
PERSON:Daniel Schober
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
term editor
Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people
IAO
alternative label
A label for a class or property that can be used to refer to the class or property instead of the preferred rdfs:label. Alternative labels should be used to indicate community- or context-specific labels, abbreviations, shorthand forms and the like.
An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent)
OBO Operations committee
PERSON:Daniel Schober
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
Consider re-defing to: An alternative name for a class or property which can mean the same thing as the preferred name (semantically equivalent, narrow, broad or related).
alternative label
alternative term
An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent)
IAO
definition source
Formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007
PERSON:Daniel Schober
Discussion on obo-discuss mailing-list, see http://bit.ly/hgm99w
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
definition source
Formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007
IAO
term tracker item
the URI for an OBI Terms ticket at sourceforge, such as https://sourceforge.net/p/obi/obi-terms/772/
An IRI or similar locator for a request or discussion of an ontology term.
Person: Jie Zheng, Chris Stoeckert, Alan Ruttenberg
Person: Jie Zheng, Chris Stoeckert, Alan Ruttenberg
The 'tracker item' can associate a tracker with a specific ontology term.
term tracker item
Peter McQuilton
2019-10-17T13:38:16.742124Z
alternative definition
Peter McQuilton
2019-10-17T13:39:13.568975Z
alternative definition source
created by
creation date
This document is about information artifacts and their representations
A (currently) primitive relation that relates an information artifact to an entity.
7/6/2009 Alan Ruttenberg. Following discussion with Jonathan Rees, and introduction of "mentions" relation. Weaken the is_about relationship to be primitive.
We will try to build it back up by elaborating the various subproperties that are more precisely defined.
Some currently missing phenomena that should be considered "about" are predications - "The only person who knows the answer is sitting beside me" , Allegory, Satire, and other literary forms that can be topical without explicitly mentioning the topic.
person:Alan Ruttenberg
Smith, Ceusters, Ruttenberg, 2000 years of philosophy
is about
inheres in
this fragility is a characteristic of this vase
this red color is a characteristic of this apple
a relation between a specifically dependent continuant (the characteristic) and any other entity (the bearer), in which the characteristic depends on the bearer for its existence.
inheres_in
Note that this relation was previously called "inheres in", but was changed to be called "characteristic of" because BFO2 uses "inheres in" in a more restricted fashion. This relation differs from BFO2:inheres_in in two respects: (1) it does not impose a range constraint, and thus it allows qualities of processes, as well as of information entities, whereas BFO2 restricts inheres_in to only apply to independent continuants (2) it is declared functional, i.e. something can only be a characteristic of one thing.
characteristic of
bearer of
this apple is bearer of this red color
this vase is bearer of this fragility
Inverse of characteristic_of
A bearer can have many dependents, and its dependents can exist for different periods of time, but none of its dependents can exist when the bearer does not exist.
bearer_of
is bearer of
has characteristic
this catalysis function is a function of this enzyme
a relation between a function and an independent continuant (the bearer), in which the function specifically depends on the bearer for its existence
A function inheres in its bearer at all times for which the function exists, however the function need not be realized at all the times that the function exists.
function_of
is function of
This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020.
function of
this red color is a quality of this apple
a relation between a quality and an independent continuant (the bearer), in which the quality specifically depends on the bearer for its existence
A quality inheres in its bearer at all times for which the quality exists.
is quality of
quality_of
This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020.
quality of
this investigator role is a role of this person
a relation between a role and an independent continuant (the bearer), in which the role specifically depends on the bearer for its existence
A role inheres in its bearer at all times for which the role exists, however the role need not be realized at all the times that the role exists.
is role of
role_of
This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020.
role of
this enzyme has function this catalysis function (more colloquially: this enzyme has this catalysis function)
a relation between an independent continuant (the bearer) and a function, in which the function specifically depends on the bearer for its existence
A bearer can have many functions, and its functions can exist for different periods of time, but none of its functions can exist when the bearer does not exist. A function need not be realized at all the times that the function exists.
has_function
has function
this apple has quality this red color
a relation between an independent continuant (the bearer) and a quality, in which the quality specifically depends on the bearer for its existence
A bearer can have many qualities, and its qualities can exist for different periods of time, but none of its qualities can exist when the bearer does not exist.
has_quality
has quality
this person has role this investigator role (more colloquially: this person has this role of investigator)
a relation between an independent continuant (the bearer) and a role, in which the role specifically depends on the bearer for its existence
A bearer can have many roles, and its roles can exist for different periods of time, but none of its roles can exist when the bearer does not exist. A role need not be realized at all the times that the role exists.
has_role
has role
a relation between an independent continuant (the bearer) and a disposition, in which the disposition specifically depends on the bearer for its existence
has disposition
This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020.
disposition of
Describes how a learning medium is intended to confer a competence or capability regarding a particular data stewardship activity, e.g. a presentation confering competency in metadata creation.
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
2020-10-01T21:36:11.080721Z
confers competency about
Describes how a learning medium is intended to confer a competence or capability regarding a particular data stewardship activity, e.g. a presentation confering competency in metadata creation.
https://orcid.org/0000-0002-7702-4495
Describes how a learning medium is intended to confer knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation conferring competency in metadata.
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
2020-12-06 00:00:00
confers knowledge about
Describes how a learning medium is intended to confer knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation conferring competency in metadata.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
requires/improves personal attribute
Describes how a learning medium confers practical skills regarding a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop conferring a practical skill in repository access.
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
2020-12-06 00:00:00
confers practical skill about
Describes how a learning medium confers practical skills regarding a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop conferring a practical skill in repository access.
https://orcid.org/0000-0002-7702-4495
Describes how an expertise level or role (e.g. "data steward") is associated with a data stewardship activity (e.g. ‘workflow set-up and management’) to indicate whether someone has an awareness of the area, or an ability to do it, or expert knowledge of it.
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
Definition added 8.2.22
has/wants competency in
Describes how an expertise level or role (e.g. "data steward") is associated with a data stewardship activity (e.g. ‘workflow set-up and management’) to indicate whether someone has an awareness of the area, or an ability to do it, or expert knowledge of it.
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
has/wants knowledge about
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
supports implementation of
Desires/has a competence or capability acquired or applied in a specific context, e.g. producing a research output or deploying a service. A skill may be specified in a ‘skills user story’. A badge or certificate may provide evidence that a skill has been acquired, and a publication, personal profile, portfolio or CV may provide evidence that a skill has been applied.
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
has/wants practical skill about
Desires/has a competence or capability acquired or applied in a specific context, e.g. producing a research output or deploying a service. A skill may be specified in a ‘skills user story’. A badge or certificate may provide evidence that a skill has been acquired, and a publication, personal profile, portfolio or CV may provide evidence that a skill has been applied.
FAIR4S
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
2021-03-24T15:33:43.476917Z
has aptitude for
Peter McQuilton
https://orcid.org/0000-0002-7702-4495
Peter McQuilton
2021-03-22T15:25:13.443153Z
contributes to the implementation of
The learning media that creates a competence or capability.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0002-7702-4495
Allyson Lister 13.9.22: This class is created as the inverse of confers/requires competency about to allow consistent reasoning and access to the Data stewardship guidelines for Data stewardship activities. It needs to be revisited by the entire team to check the model.
competency gained through
The learning media that creates a competence or capability.
https://orcid.org/0000-0002-7702-4495
Describes how a learning medium requires competency in a particular data stewardship activity in order for that learning medium to perform its function, e.g. a presentation requiring competency in metadata creation prior to engaging with that learning medium.
requires competency about
Describes how a learning medium requires competency in a particular data stewardship activity in order for that learning medium to perform its function, e.g. a presentation requiring competency in metadata creation prior to engaging with that learning medium.
https://orcid.org/0000-0002-7702-4495
Describes how a learning medium requires knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation requiring knowledge of metadata.
requires knowledge about
Describes how a learning medium requires knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation requiring knowledge of metadata.
https://orcid.org/0000-0002-7702-4495
Describes how a learning medium requires practical skills of a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop requiring a practical skill in repository access.
requires practical skill about
Describes how a learning medium requires practical skills of a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop requiring a practical skill in repository access.
https://orcid.org/0000-0002-7702-4495
Describes how an evaluation indicator may confer compliance with a particular guideline or set of guidelines
https://orcid.org/0000-0002-7702-4495
https://github.com/terms4fairskills/FAIRterminology/issues/22
confers compliance with
Describes how an evaluation indicator may confer compliance with a particular guideline or set of guidelines
https://orcid.org/0000-0002-7702-4495
A learning medium may demonstrate how to meet the requirements for a particular evaluation indicator(s).
https://orcid.org/0000-0002-7702-4495
https://github.com/terms4fairskills/FAIRterminology/issues/22
demonstrates how to fulfil the requirements for
A learning medium may demonstrate how to meet the requirements for a particular evaluation indicator(s).
https://orcid.org/0000-0002-7702-4495
reference URL
entity
Entity
Julius Caesar
Verdi’s Requiem
the Second World War
your body mass index
BFO 2 Reference: In all areas of empirical inquiry we encounter general terms of two sorts. First are general terms which refer to universals or types:animaltuberculosissurgical procedurediseaseSecond, are general terms used to refer to groups of entities which instantiate a given universal but do not correspond to the extension of any subuniversal of that universal because there is nothing intrinsic to the entities in question by virtue of which they – and only they – are counted as belonging to the given group. Examples are: animal purchased by the Emperortuberculosis diagnosed on a Wednesdaysurgical procedure performed on a patient from Stockholmperson identified as candidate for clinical trial #2056-555person who is signatory of Form 656-PPVpainting by Leonardo da VinciSuch terms, which represent what are called ‘specializations’ in [81
Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf
An entity is anything that exists or has existed or will exist. (axiom label in BFO2 Reference: [001-001])
entity
Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf
per discussion with Barry Smith
An entity is anything that exists or has existed or will exist. (axiom label in BFO2 Reference: [001-001])
continuant
Continuant
An entity that exists in full at any time in which it exists at all, persists through time while maintaining its identity and has no temporal parts.
BFO 2 Reference: Continuant entities are entities which can be sliced to yield parts only along the spatial dimension, yielding for example the parts of your table which we call its legs, its top, its nails. ‘My desk stretches from the window to the door. It has spatial parts, and can be sliced (in space) in two. With respect to time, however, a thing is a continuant.’ [60, p. 240
Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example, in an expansion involving bringing in some of Ceuster's other portions of reality, questions are raised as to whether universals are continuants
A continuant is an entity that persists, endures, or continues to exist through time while maintaining its identity. (axiom label in BFO2 Reference: [008-002])
if b is a continuant and if, for some t, c has_continuant_part b at t, then c is a continuant. (axiom label in BFO2 Reference: [126-001])
if b is a continuant and if, for some t, cis continuant_part of b at t, then c is a continuant. (axiom label in BFO2 Reference: [009-002])
if b is a material entity, then there is some temporal interval (referred to below as a one-dimensional temporal region) during which b exists. (axiom label in BFO2 Reference: [011-002])
(forall (x y) (if (and (Continuant x) (exists (t) (continuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [009-002]
(forall (x y) (if (and (Continuant x) (exists (t) (hasContinuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [126-001]
(forall (x) (if (Continuant x) (Entity x))) // axiom label in BFO2 CLIF: [008-002]
(forall (x) (if (Material Entity x) (exists (t) (and (TemporalRegion t) (existsAt x t))))) // axiom label in BFO2 CLIF: [011-002]
continuant
Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. For example, in an expansion involving bringing in some of Ceuster's other portions of reality, questions are raised as to whether universals are continuants
A continuant is an entity that persists, endures, or continues to exist through time while maintaining its identity. (axiom label in BFO2 Reference: [008-002])
if b is a continuant and if, for some t, c has_continuant_part b at t, then c is a continuant. (axiom label in BFO2 Reference: [126-001])
if b is a continuant and if, for some t, cis continuant_part of b at t, then c is a continuant. (axiom label in BFO2 Reference: [009-002])
if b is a material entity, then there is some temporal interval (referred to below as a one-dimensional temporal region) during which b exists. (axiom label in BFO2 Reference: [011-002])
(forall (x y) (if (and (Continuant x) (exists (t) (continuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [009-002]
(forall (x y) (if (and (Continuant x) (exists (t) (hasContinuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [126-001]
(forall (x) (if (Continuant x) (Entity x))) // axiom label in BFO2 CLIF: [008-002]
(forall (x) (if (Material Entity x) (exists (t) (and (TemporalRegion t) (existsAt x t))))) // axiom label in BFO2 CLIF: [011-002]
occurrent
Occurrent
An entity that has temporal parts and that happens, unfolds or develops through time.
BFO 2 Reference: every occurrent that is not a temporal or spatiotemporal region is s-dependent on some independent continuant that is not a spatial region
BFO 2 Reference: s-dependence obtains between every process and its participants in the sense that, as a matter of necessity, this process could not have existed unless these or those participants existed also. A process may have a succession of participants at different phases of its unfolding. Thus there may be different players on the field at different times during the course of a football game; but the process which is the entire game s-depends_on all of these players nonetheless. Some temporal parts of this process will s-depend_on on only some of the players.
Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. An example would be the sum of a process and the process boundary of another process.
Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. We write 'spr[e]' and `spl[e]' respectively for the spread and spell of e, omitting mention of the frame.
An occurrent is an entity that unfolds itself in time or it is the instantaneous boundary of such an entity (for example a beginning or an ending) or it is a temporal or spatiotemporal region which such an entity occupies_temporal_region or occupies_spatiotemporal_region. (axiom label in BFO2 Reference: [077-002])
Every occurrent occupies_spatiotemporal_region some spatiotemporal region. (axiom label in BFO2 Reference: [108-001])
b is an occurrent entity iff b is an entity that has temporal parts. (axiom label in BFO2 Reference: [079-001])
(forall (x) (if (Occurrent x) (exists (r) (and (SpatioTemporalRegion r) (occupiesSpatioTemporalRegion x r))))) // axiom label in BFO2 CLIF: [108-001]
(forall (x) (iff (Occurrent x) (and (Entity x) (exists (y) (temporalPartOf y x))))) // axiom label in BFO2 CLIF: [079-001]
occurent
occurrent
Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. An example would be the sum of a process and the process boundary of another process.
per discussion with Barry Smith
Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. We write 'spr[e]' and `spl[e]' respectively for the spread and spell of e, omitting mention of the frame.
An occurrent is an entity that unfolds itself in time or it is the instantaneous boundary of such an entity (for example a beginning or an ending) or it is a temporal or spatiotemporal region which such an entity occupies_temporal_region or occupies_spatiotemporal_region. (axiom label in BFO2 Reference: [077-002])
Every occurrent occupies_spatiotemporal_region some spatiotemporal region. (axiom label in BFO2 Reference: [108-001])
b is an occurrent entity iff b is an entity that has temporal parts. (axiom label in BFO2 Reference: [079-001])
(forall (x) (if (Occurrent x) (exists (r) (and (SpatioTemporalRegion r) (occupiesSpatioTemporalRegion x r))))) // axiom label in BFO2 CLIF: [108-001]
(forall (x) (iff (Occurrent x) (and (Entity x) (exists (y) (temporalPartOf y x))))) // axiom label in BFO2 CLIF: [079-001]
ic
IndependentContinuant
a chair
a heart
a leg
a molecule
a spatial region
an atom
an orchestra.
an organism
the bottom right portion of a human torso
the interior of your mouth
b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002])
For any independent continuant b and any time t there is some spatial region r such that b is located_in r at t. (axiom label in BFO2 Reference: [134-001])
For every independent continuant b and time t during the region of time spanned by its life, there are entities which s-depends_on b during t. (axiom label in BFO2 Reference: [018-002])
(forall (x t) (if (IndependentContinuant x) (exists (r) (and (SpatialRegion r) (locatedInAt x r t))))) // axiom label in BFO2 CLIF: [134-001]
(forall (x t) (if (and (IndependentContinuant x) (existsAt x t)) (exists (y) (and (Entity y) (specificallyDependsOnAt y x t))))) // axiom label in BFO2 CLIF: [018-002]
(iff (IndependentContinuant a) (and (Continuant a) (not (exists (b t) (specificallyDependsOnAt a b t))))) // axiom label in BFO2 CLIF: [017-002]
independent continuant
b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002])
For any independent continuant b and any time t there is some spatial region r such that b is located_in r at t. (axiom label in BFO2 Reference: [134-001])
For every independent continuant b and time t during the region of time spanned by its life, there are entities which s-depends_on b during t. (axiom label in BFO2 Reference: [018-002])
(forall (x t) (if (IndependentContinuant x) (exists (r) (and (SpatialRegion r) (locatedInAt x r t))))) // axiom label in BFO2 CLIF: [134-001]
(forall (x t) (if (and (IndependentContinuant x) (existsAt x t)) (exists (y) (and (Entity y) (specificallyDependsOnAt y x t))))) // axiom label in BFO2 CLIF: [018-002]
(iff (IndependentContinuant a) (and (Continuant a) (not (exists (b t) (specificallyDependsOnAt a b t))))) // axiom label in BFO2 CLIF: [017-002]
process
Process
a process of cell-division, \ a beating of the heart
a process of meiosis
a process of sleeping
the course of a disease
the flight of a bird
the life of an organism
your process of aging.
p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. (axiom label in BFO2 Reference: [083-003])
BFO 2 Reference: The realm of occurrents is less pervasively marked by the presence of natural units than is the case in the realm of independent continuants. Thus there is here no counterpart of ‘object’. In BFO 1.0 ‘process’ served as such a counterpart. In BFO 2.0 ‘process’ is, rather, the occurrent counterpart of ‘material entity’. Those natural – as contrasted with engineered, which here means: deliberately executed – units which do exist in the realm of occurrents are typically either parasitic on the existence of natural units on the continuant side, or they are fiat in nature. Thus we can count lives; we can count football games; we can count chemical reactions performed in experiments or in chemical manufacturing. We cannot count the processes taking place, for instance, in an episode of insect mating behavior.Even where natural units are identifiable, for example cycles in a cyclical process such as the beating of a heart or an organism’s sleep/wake cycle, the processes in question form a sequence with no discontinuities (temporal gaps) of the sort that we find for instance where billiard balls or zebrafish or planets are separated by clear spatial gaps. Lives of organisms are process units, but they too unfold in a continuous series from other, prior processes such as fertilization, and they unfold in turn in continuous series of post-life processes such as post-mortem decay. Clear examples of boundaries of processes are almost always of the fiat sort (midnight, a time of death as declared in an operating theater or on a death certificate, the initiation of a state of war)
(iff (Process a) (and (Occurrent a) (exists (b) (properTemporalPartOf b a)) (exists (c t) (and (MaterialEntity c) (specificallyDependsOnAt a c t))))) // axiom label in BFO2 CLIF: [083-003]
process
p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. (axiom label in BFO2 Reference: [083-003])
(iff (Process a) (and (Occurrent a) (exists (b) (properTemporalPartOf b a)) (exists (c t) (and (MaterialEntity c) (specificallyDependsOnAt a c t))))) // axiom label in BFO2 CLIF: [083-003]
disposition
Disposition
an atom of element X has the disposition to decay to an atom of element Y
certain people have a predisposition to colon cancer
children are innately disposed to categorize objects in certain ways.
the cell wall is disposed to filter chemicals in endocytosis and exocytosis
BFO 2 Reference: Dispositions exist along a strength continuum. Weaker forms of disposition are realized in only a fraction of triggering cases. These forms occur in a significant number of cases of a similar type.
b is a disposition means: b is a realizable entity & b’s bearer is some material entity & b is such that if it ceases to exist, then its bearer is physically changed, & b’s realization occurs when and because this bearer is in some special physical circumstances, & this realization occurs in virtue of the bearer’s physical make-up. (axiom label in BFO2 Reference: [062-002])
If b is a realizable entity then for all t at which b exists, b s-depends_on some material entity at t. (axiom label in BFO2 Reference: [063-002])
(forall (x t) (if (and (RealizableEntity x) (existsAt x t)) (exists (y) (and (MaterialEntity y) (specificallyDepends x y t))))) // axiom label in BFO2 CLIF: [063-002]
(forall (x) (if (Disposition x) (and (RealizableEntity x) (exists (y) (and (MaterialEntity y) (bearerOfAt x y t)))))) // axiom label in BFO2 CLIF: [062-002]
disposition
b is a disposition means: b is a realizable entity & b’s bearer is some material entity & b is such that if it ceases to exist, then its bearer is physically changed, & b’s realization occurs when and because this bearer is in some special physical circumstances, & this realization occurs in virtue of the bearer’s physical make-up. (axiom label in BFO2 Reference: [062-002])
If b is a realizable entity then for all t at which b exists, b s-depends_on some material entity at t. (axiom label in BFO2 Reference: [063-002])
(forall (x t) (if (and (RealizableEntity x) (existsAt x t)) (exists (y) (and (MaterialEntity y) (specificallyDepends x y t))))) // axiom label in BFO2 CLIF: [063-002]
(forall (x) (if (Disposition x) (and (RealizableEntity x) (exists (y) (and (MaterialEntity y) (bearerOfAt x y t)))))) // axiom label in BFO2 CLIF: [062-002]
realizable
RealizableEntity
the disposition of this piece of metal to conduct electricity.
the disposition of your blood to coagulate
the function of your reproductive organs
the role of being a doctor
the role of this boundary to delineate where Utah and Colorado meet
A specifically dependent continuant that inheres in continuant entities and are not exhibited in full at every time in which it inheres in an entity or group of entities. The exhibition or actualization of a realizable entity is a particular manifestation, functioning or process that occurs under certain circumstances.
To say that b is a realizable entity is to say that b is a specifically dependent continuant that inheres in some independent continuant which is not a spatial region and is of a type instances of which are realized in processes of a correlated type. (axiom label in BFO2 Reference: [058-002])
All realizable dependent continuants have independent continuants that are not spatial regions as their bearers. (axiom label in BFO2 Reference: [060-002])
(forall (x t) (if (RealizableEntity x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (bearerOfAt y x t))))) // axiom label in BFO2 CLIF: [060-002]
(forall (x) (if (RealizableEntity x) (and (SpecificallyDependentContinuant x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (inheresIn x y)))))) // axiom label in BFO2 CLIF: [058-002]
realizable entity
To say that b is a realizable entity is to say that b is a specifically dependent continuant that inheres in some independent continuant which is not a spatial region and is of a type instances of which are realized in processes of a correlated type. (axiom label in BFO2 Reference: [058-002])
All realizable dependent continuants have independent continuants that are not spatial regions as their bearers. (axiom label in BFO2 Reference: [060-002])
(forall (x t) (if (RealizableEntity x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (bearerOfAt y x t))))) // axiom label in BFO2 CLIF: [060-002]
(forall (x) (if (RealizableEntity x) (and (SpecificallyDependentContinuant x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (inheresIn x y)))))) // axiom label in BFO2 CLIF: [058-002]
quality
Quality
the ambient temperature of this portion of air
the color of a tomato
the length of the circumference of your waist
the mass of this piece of gold.
the shape of your nose
the shape of your nostril
a quality is a specifically dependent continuant that, in contrast to roles and dispositions, does not require any further process in order to be realized. (axiom label in BFO2 Reference: [055-001])
If an entity is a quality at any time that it exists, then it is a quality at every time that it exists. (axiom label in BFO2 Reference: [105-001])
(forall (x) (if (Quality x) (SpecificallyDependentContinuant x))) // axiom label in BFO2 CLIF: [055-001]
(forall (x) (if (exists (t) (and (existsAt x t) (Quality x))) (forall (t_1) (if (existsAt x t_1) (Quality x))))) // axiom label in BFO2 CLIF: [105-001]
quality
a quality is a specifically dependent continuant that, in contrast to roles and dispositions, does not require any further process in order to be realized. (axiom label in BFO2 Reference: [055-001])
If an entity is a quality at any time that it exists, then it is a quality at every time that it exists. (axiom label in BFO2 Reference: [105-001])
(forall (x) (if (Quality x) (SpecificallyDependentContinuant x))) // axiom label in BFO2 CLIF: [055-001]
(forall (x) (if (exists (t) (and (existsAt x t) (Quality x))) (forall (t_1) (if (existsAt x t_1) (Quality x))))) // axiom label in BFO2 CLIF: [105-001]
sdc
SpecificallyDependentContinuant
Reciprocal specifically dependent continuants: the function of this key to open this lock and the mutually dependent disposition of this lock: to be opened by this key
of one-sided specifically dependent continuants: the mass of this tomato
of relational dependent continuants (multiple bearers): John’s love for Mary, the ownership relation between John and this statue, the relation of authority between John and his subordinates.
the disposition of this fish to decay
the function of this heart: to pump blood
the mutual dependence of proton donors and acceptors in chemical reactions [79
the mutual dependence of the role predator and the role prey as played by two organisms in a given interaction
the pink color of a medium rare piece of grilled filet mignon at its center
the role of being a doctor
the shape of this hole.
the smell of this portion of mozzarella
b is a specifically dependent continuant = Def. b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003])
Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc.
(iff (SpecificallyDependentContinuant a) (and (Continuant a) (forall (t) (if (existsAt a t) (exists (b) (and (IndependentContinuant b) (not (SpatialRegion b)) (specificallyDependsOnAt a b t))))))) // axiom label in BFO2 CLIF: [050-003]
specifically dependent continuant
b is a specifically dependent continuant = Def. b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003])
Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilites. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc.
per discussion with Barry Smith
(iff (SpecificallyDependentContinuant a) (and (Continuant a) (forall (t) (if (existsAt a t) (exists (b) (and (IndependentContinuant b) (not (SpatialRegion b)) (specificallyDependsOnAt a b t))))))) // axiom label in BFO2 CLIF: [050-003]
role
Role
John’s role of husband to Mary is dependent on Mary’s role of wife to John, and both are dependent on the object aggregate comprising John and Mary as member parts joined together through the relational quality of being married.
the priest role
the role of a boundary to demarcate two neighboring administrative territories
the role of a building in serving as a military target
the role of a stone in marking a property boundary
the role of subject in a clinical trial
the student role
A realizable entity the manifestation of which brings about some result or end that is not essential to a continuant in virtue of the kind of thing that it is but that can be served or participated in by that kind of continuant in some kinds of natural, social or institutional contexts.
BFO 2 Reference: One major family of examples of non-rigid universals involves roles, and ontologies developed for corresponding administrative purposes may consist entirely of representatives of entities of this sort. Thus ‘professor’, defined as follows,b instance_of professor at t =Def. there is some c, c instance_of professor role & c inheres_in b at t.denotes a non-rigid universal and so also do ‘nurse’, ‘student’, ‘colonel’, ‘taxpayer’, and so forth. (These terms are all, in the jargon of philosophy, phase sortals.) By using role terms in definitions, we can create a BFO conformant treatment of such entities drawing on the fact that, while an instance of professor may be simultaneously an instance of trade union member, no instance of the type professor role is also (at any time) an instance of the type trade union member role (any more than any instance of the type color is at any time an instance of the type length).If an ontology of employment positions should be defined in terms of roles following the above pattern, this enables the ontology to do justice to the fact that individuals instantiate the corresponding universals – professor, sergeant, nurse – only during certain phases in their lives.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
b is a role means: b is a realizable entity & b exists because there is some single bearer that is in some special physical, social, or institutional set of circumstances in which this bearer does not have to be& b is not such that, if it ceases to exist, then the physical make-up of the bearer is thereby changed. (axiom label in BFO2 Reference: [061-001])
(forall (x) (if (Role x) (RealizableEntity x))) // axiom label in BFO2 CLIF: [061-001]
role
b is a role means: b is a realizable entity & b exists because there is some single bearer that is in some special physical, social, or institutional set of circumstances in which this bearer does not have to be& b is not such that, if it ceases to exist, then the physical make-up of the bearer is thereby changed. (axiom label in BFO2 Reference: [061-001])
(forall (x) (if (Role x) (RealizableEntity x))) // axiom label in BFO2 CLIF: [061-001]
gdc
GenericallyDependentContinuant
The entries in your database are patterns instantiated as quality instances in your hard drive. The database itself is an aggregate of such patterns. When you create the database you create a particular instance of the generically dependent continuant type database. Each entry in the database is an instance of the generically dependent continuant type IAO: information content entity.
the pdf file on your laptop, the pdf file that is a copy thereof on my laptop
the sequence of this protein molecule; the sequence that is a copy thereof in that protein molecule.
b is a generically dependent continuant = Def. b is a continuant that g-depends_on one or more other entities. (axiom label in BFO2 Reference: [074-001])
(iff (GenericallyDependentContinuant a) (and (Continuant a) (exists (b t) (genericallyDependsOnAt a b t)))) // axiom label in BFO2 CLIF: [074-001]
generically dependent continuant
b is a generically dependent continuant = Def. b is a continuant that g-depends_on one or more other entities. (axiom label in BFO2 Reference: [074-001])
(iff (GenericallyDependentContinuant a) (and (Continuant a) (exists (b t) (genericallyDependsOnAt a b t)))) // axiom label in BFO2 CLIF: [074-001]
function
Function
the function of a hammer to drive in nails
the function of a heart pacemaker to regulate the beating of a heart through electricity
the function of amylase in saliva to break down starch into sugar
BFO 2 Reference: In the past, we have distinguished two varieties of function, artifactual function and biological function. These are not asserted subtypes of BFO:function however, since the same function – for example: to pump, to transport – can exist both in artifacts and in biological entities. The asserted subtypes of function that would be needed in order to yield a separate monoheirarchy are not artifactual function, biological function, etc., but rather transporting function, pumping function, etc.
A function is a disposition that exists in virtue of the bearer’s physical make-up and this physical make-up is something the bearer possesses because it came into being, either through evolution (in the case of natural biological entities) or through intentional design (in the case of artifacts), in order to realize processes of a certain sort. (axiom label in BFO2 Reference: [064-001])
(forall (x) (if (Function x) (Disposition x))) // axiom label in BFO2 CLIF: [064-001]
function
A function is a disposition that exists in virtue of the bearer’s physical make-up and this physical make-up is something the bearer possesses because it came into being, either through evolution (in the case of natural biological entities) or through intentional design (in the case of artifacts), in order to realize processes of a certain sort. (axiom label in BFO2 Reference: [064-001])
(forall (x) (if (Function x) (Disposition x))) // axiom label in BFO2 CLIF: [064-001]
data item
Data items include counts of things, analyte concentrations, and statistical summaries.
An information content entity that is intended to be a truthful statement about something (modulo, e.g., measurement precision or other systematic errors) and is constructed/acquired by a method which reliably tends to produce (approximately) truthful statements.
2/2/2009 Alan and Bjoern discussing FACS run output data. This is a data item because it is about the cell population. Each element records an event and is typically further composed a set of measurment data items that record the fluorescent intensity stimulated by one of the lasers.
2009-03-16: data item deliberatly ambiguous: we merged data set and datum to be one entity, not knowing how to define singular versus plural. So data item is more general than datum.
2009-03-16: removed datum as alternative term as datum specifically refers to singular form, and is thus not an exact synonym.
2014-03-31: See discussion at http://odontomachus.wordpress.com/2014/03/30/aboutness-objects-propositions/
JAR: datum -- well, this will be very tricky to define, but maybe some
information-like stuff that might be put into a computer and that is
meant, by someone, to denote and/or to be interpreted by some
process... I would include lists, tables, sentences... I think I might
defer to Barry, or to Brian Cantwell Smith
JAR: A data item is an approximately justified approximately true approximate belief
PERSON: Alan Ruttenberg
PERSON: Chris Stoeckert
PERSON: Jonathan Rees
data
data item
information content entity
Examples of information content entites include journal articles, data, graphical layouts, and graphs.
A generically dependent continuant that is about some thing.
2014-03-10: The use of "thing" is intended to be general enough to include universals and configurations (see https://groups.google.com/d/msg/information-ontology/GBxvYZCk1oc/-L6B5fSBBTQJ).
information_content_entity 'is_encoded_in' some digital_entity in obi before split (040907). information_content_entity 'is_encoded_in' some physical_document in obi before split (040907).
Previous. An information content entity is a non-realizable information entity that 'is encoded in' some digital or physical entity.
PERSON: Chris Stoeckert
OBI_0000142
information content entity
curation status specification
The curation status of the term. The allowed values come from an enumerated list of predefined terms. See the specification of these instances for more detailed definitions of each enumerated value.
Better to represent curation as a process with parts and then relate labels to that process (in IAO meeting)
PERSON:Bill Bug
GROUP:OBI:<http://purl.obolibrary.org/obo/obi>
OBI_0000266
curation status specification
data about an ontology part
Data about an ontology part is a data item about a part of an ontology, for example a term
Person:Alan Ruttenberg
data about an ontology part
obsolescence reason specification
The reason for which a term has been deprecated. The allowed values come from an enumerated list of predefined terms. See the specification of these instances for more detailed definitions of each enumerated value.
The creation of this class has been inspired in part by Werner Ceusters' paper, Applying evolutionary terminology auditing to the Gene Ontology.
PERSON: Alan Ruttenberg
PERSON: Melanie Courtot
obsolescence reason specification
denotator type
The Basic Formal Ontology ontology makes a distinction between Universals and defined classes, where the formal are "natural kinds" and the latter arbitrary collections of entities.
A denotator type indicates how a term should be interpreted from an ontological perspective.
Alan Ruttenberg
Barry Smith, Werner Ceusters
denotator type
Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a pipe ( | ).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
pipe separated values
Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a pipe ( | ).
RDMT. https://terms.codata.org/rdmt/pipe-separated-values
Software preservation involves the collection and long-term storage of software for archiving as well as maintaining availability and accessibility.
Kristina Hettne
Victoria Dominguez Del Angel
Yann Le Franc
https://orcid.org/0000-0002-7702-4495
AL 8.2.22: Refactored "software review and preservation" to "software preservation", as the review process is not within the remit of terms4FAIRskills. Added definition and source.
software preservation
Software preservation involves the collection and long-term storage of software for archiving as well as maintaining availability and accessibility.
https://orcid.org/0000-0002-7702-4495 8.2.22
In accessing a repository one uses a client (application) to discover relevant digital objects within a repository, and then retrieve a copy of a desired digital object.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
repository access
In accessing a repository one uses a client (application) to discover relevant digital objects within a repository, and then retrieve a copy of a desired digital object.
Definition from now-deprecated CASRAI Glossary term: repository-access
Include FAIR and open research in the strategic framework for the organization and set objectives and timeframe.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
develop fair and open research vision
The activity of developing an open research strategy and vision.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
develop open research strategy and vision
The activity of developing an open research strategy and vision.
PMQ 3.2022
Statement of an organisation’s processes for the management of a specified set of data assets.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data management policy
Statement of an organisation’s processes for the management of a specified set of data assets.
RDMT. https://terms.codata.org/rdmt/data-management-policy
Understand how the governing principles of research integrity and FAIR overlap
Angus Whyte
Celia van Gelder
understand research code of conduct
To be able to choose the appropriate reporting guideline/checklist for your data, based on community-adopted standards.
https://orcid.org/0000-0003-2687-1982
choosing the appropriate reporting guideline for your data
To be able to choose the appropriate reporting guideline/checklist for your data, based on community-adopted standards.
[PMQ]
A specific deed, action, function or sphere of action in relation with the role of data stewardship
Yann Le Franc
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Set of actions carried out during data stewardship processes
data stewardship activity
A series of potentially destructive or irrevocable changes to a piece of data or a file. Common munging operations include removing punctuation or html tags, data parsing, filtering, and transformation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data munging
A series of potentially destructive or irrevocable changes to a piece of data or a file. Common munging operations include removing punctuation or html tags, data parsing, filtering, and transformation.
Definition from now-deprecated CASRAI Glossary term: data-munging
https://orcid.org/0000-0003-2687-1982
interoperability of digital assets
Bin for Skills related to Resource management
leightonlc
skills for resource management
An activity within archiving in which specific items of data are maintained over time so that they can still be accessed and understood through changes in technology.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Conservation
preservation
An activity within archiving in which specific items of data are maintained over time so that they can still be accessed and understood through changes in technology.
RDMT. https://terms.codata.org/rdmt/preservation
Data Categorization
Data Classification
https://orcid.org/0000-0003-2687-1982
2021-02-17T22:30:31.531624Z
data categorisation
Curation process on a data object by which it receives a persistent identifier (PID) from a trusted registration authority. Registration must be accompanied by the step(s) to create and submit metadata describing the object to the registry.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data registration
Curation process on a data object by which it receives a persistent identifier (PID) from a trusted registration authority. Registration must be accompanied by the step(s) to create and submit metadata describing the object to the registry.
RDMT. https://terms.codata.org/rdmt/data-registration
The capability to communicate, execute programs, or transfer data among various functional units in a useful and meaningful manner that requires the user to have little or no knowledge of the unique characteristics of those units. Foundational, syntactic, and semantic interoperability are the three necessary aspects of interoperability.
v
interoperability
The capability to communicate, execute programs, or transfer data among various functional units in a useful and meaningful manner that requires the user to have little or no knowledge of the unique characteristics of those units. Foundational, syntactic, and semantic interoperability are the three necessary aspects of interoperability.
Definition from now-deprecated CASRAI Glossary term: interoperability
Exposing data is the activity of exposing your data to collaborators, the public, or other interested parties. A data producer makes the data accessible to external users in a machine- and/or human-readable way.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 15.3.22: Was "expose your data", which was not in the style of other term labels. Also updated definition, but might need revisiting later for a more formal definition.
exposing data
Exposing data is the activity of exposing your data to collaborators, the public, or other interested parties. A data producer makes the data accessible to external users in a machine- and/or human-readable way.
[PMQ 3.2022, AL 15.3.22]
Bin for Aptitudes related to Workflow technologies management.
leightonlc
aptitudes for workflow technologies management
A standard that is widely accepted and used, but lacks formal approval by a recognized standards developing organization (e.g., the QWERTY keyboard).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
de facto standard
A standard that is widely accepted and used, but lacks formal approval by a recognized standards developing organization (e.g., the QWERTY keyboard).
Definition from now-deprecated CASRAI Glossary term: de-facto-standard
Object, event or phenomenon about which data are stored in a database and which has intermediate representation in a Data Model.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data entity
Object, event or phenomenon about which data are stored in a database and which has intermediate representation in a Data Model.
RDMT. https://terms.codata.org/rdmt/data-entity
Data that are being received, processed and stored at the time of their occurrence with only small delays. Examples include: stock quotes, manufacturing statistics, Web server loads, data warehouse activity and sensor feeds to data collectors. Real-time data are often used for navigation or tracking. Real-time data are data streams that are typically generated by sensors and received via direct networking connections.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
real-time data
Data that are being received, processed and stored at the time of their occurrence with only small delays. Examples include: stock quotes, manufacturing statistics, Web server loads, data warehouse activity and sensor feeds to data collectors. Real-time data are often used for navigation or tracking. Real-time data are data streams that are typically generated by sensors and received via direct networking connections.
RDMT. https://terms.codata.org/rdmt/real-time-data
Detect and analysis security risk of storage periodically, and minimize the impact of the risks detected
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
storage security risk assessment and mitigation
The ability to get access to a computer or a network from a remote distance. Access may be through an Internet service provider (ISP) or through a dedicated line between a computer or a remote local area network and the central or main corporate local area network. A dedicated line is more expensive and less flexible but offers faster data rates.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
remote access
The ability to get access to a computer or a network from a remote distance. Access may be through an Internet service provider (ISP) or through a dedicated line between a computer or a remote local area network and the central or main corporate local area network. A dedicated line is more expensive and less flexible but offers faster data rates.
Definition from now-deprecated CASRAI Glossary term: remote-access
The activity of supervision of other people to ensure FAIR data practices.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
fair stewardship supervising
The activity of supervision of other people to ensure FAIR data practices.
PMQ
Bin for Skills needed for High performance computing management.
leightonlc
skills related to high performance computing management
Ability to select the appropriate FAIR metrics among the existing ones in relation to the type of digital object concerned.
Kristina Hettne
Victoria Dominguez Del Angel
Yann Le Franc
fair metrics selection skills
A collection of data items organized as a set of formally-described tables from which data can be accessed or reassembled in many different ways without having to reorganize the database tables. The standard user and application program interface to a relational database is the structured query language (SQL).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
relational database
A collection of data items organized as a set of formally-described tables from which data can be accessed or reassembled in many different ways without having to reorganize the database tables. The standard user and application program interface to a relational database is the structured query language (SQL).
Definition from now-deprecated CASRAI Glossary term: relational-database
Data that could not lead to the identification of a specific object of interest. These may be data that have been de-identified, or that could not lead to identifiable information in the first place.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 6.5.22: Modifed CASRAI definition to distinguish this term from 'non personally identifiable information.' Further work may determine whether or not both terms are required.
non identifiable data
Data that could not lead to the identification of a specific object of interest. These may be data that have been de-identified, or that could not lead to identifiable information in the first place.
Modified by AL on 6.5.22 from CASRAI. https://casrai.org/term/non-identifiable-data/
database developer
Managing and promoting the use of assets from their point of creation to ensure that they are fit for contemporary purpose and available for discovery and reuse. For dynamic datasets this may mean continuous enrichment or updating to keep them fit for purpose. Higher levels of curation will also involve links with annotation and with other published materials.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
curation
Managing and promoting the use of assets from their point of creation to ensure that they are fit for contemporary purpose and available for discovery and reuse. For dynamic datasets this may mean continuous enrichment or updating to keep them fit for purpose. Higher levels of curation will also involve links with annotation and with other published materials.
RDMT. https://terms.codata.org/rdmt/curation
https://orcid.org/0000-0003-2687-1982
A1. (meta)data are retrievable by their identifier using a standardised communications protocol
Control over time of data, computer code, software, and documents that allows for the ability to revert to a previous revision, which is critical for data traceability, tracking edits, and correcting mistakes. Version control generates a (changed) copy of a data object that is uniquely labelled with a version number. The intent is to track changes to a data object, by making versioned copies. Note that a version is different from a backup copy, which is typically a copy made at a specific point in time, or a replica.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Versioning, Revision control, Source control
version control
Control over time of data, computer code, software, and documents that allows for the ability to revert to a previous revision, which is critical for data traceability, tracking edits, and correcting mistakes. Version control generates a (changed) copy of a data object that is uniquely labelled with a version number. The intent is to track changes to a data object, by making versioned copies. Note that a version is different from a backup copy, which is typically a copy made at a specific point in time, or a replica.
RDMT. https://terms.codata.org/rdmt/version-control
Understand the FAIR and open research practices, and the research landscape / current data management practices in the organization.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
understanding fair and open research challenges in your organization
ethical application of patents, licenses
System that allows users to be granted access to a database under specified conditions.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data access protocol
System that allows users to be granted access to a database under specified conditions.
RDMT. https://terms.codata.org/rdmt/data-access-protocol
Generic concept referring to all kinds of procedures being executed on data at any point in the data lifecycle.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data processing
Generic concept referring to all kinds of procedures being executed on data at any point in the data lifecycle.
RDMT. https://terms.codata.org/rdmt/data-processing
Set of measures that are used by a repository to form aggregations of data objects (including collections and metadata) to describe the properties of data objects, to register PIDs, to build the PID records, to link between all components, and to set up the containers (in the form of the software stack) that are used to store all components.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data organization
Set of measures that are used by a repository to form aggregations of data objects (including collections and metadata) to describe the properties of data objects, to register PIDs, to build the PID records, to link between all components, and to set up the containers (in the form of the software stack) that are used to store all components.
RDMT. https://terms.codata.org/rdmt/data-organisation
Recovery and/or transformation and digitization of dark data and at-risk data so that they can be preserved, accessed, shared, and used. Data rescue also involves the addition of rich metadata to make the content understandable and more easily re-usable.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data rescue
Recovery and/or transformation and digitization of dark data and at-risk data so that they can be preserved, accessed, shared, and used. Data rescue also involves the addition of rich metadata to make the content understandable and more easily re-usable.
RDMT. https://terms.codata.org/rdmt/data-rescue
Anticipate possible implications of the research and making its outputs FAIR, reflecting on rmotivations and areas of uncertainty
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
understand societal impact of research
To be able to search using software with a GUI or terminal access. For example using BLAST or genomic search tools in the life sciences.
https://orcid.org/0000-0003-2687-1982
Algorithm searching
Programmatic search
batch search
searching via algorithms and software
To be able to search using software with a GUI or terminal access. For example using BLAST or genomic search tools in the life sciences.
[PMQ]
Evaluation of an organisation, system, group, project or product with respect to its data and processes around this, often in accordance with a standard, guide, or framework used to structure the work. This can involve assessing, describing, and classifying any data held. An audit can be carried out internally by those who have access to the data or participate in related processes regularly, or by an independent, external actor.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
audit
Evaluation of an organisation, system, group, project or product with respect to its data and processes around this, often in accordance with a standard, guide, or framework used to structure the work. This can involve assessing, describing, and classifying any data held. An audit can be carried out internally by those who have access to the data or participate in related processes regularly, or by an independent, external actor.
RDMT. https://terms.codata.org/rdmt/audit
https://orcid.org/0000-0003-2687-1982
2020-10-01T21:03:37.147424Z
wiki site
Bin for Skills needed for Authorization management.
leightonlc
skills related to authorisation management
The term storage management encompasses the technologies and processes organizations use to maximize or improve the performance of their data storage resources. It is a broad category that includes virtualization, replication, mirroring, security, compression, traffic analysis, process automation, storage provisioning and related techniques.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
storage management
The term storage management encompasses the technologies and processes organizations use to maximize or improve the performance of their data storage resources. It is a broad category that includes virtualization, replication, mirroring, security, compression, traffic analysis, process automation, storage provisioning and related techniques.
(webopedia.com/TERM/S/storage_management.html)[VDA]
Central repository for all or significant parts of the data that an organisation’s various business systems collect, containing harmonised, highly-structured, quality data integrated from multiple sources. Data warehousing emphasises the capture of data from diverse sources for useful analysis and access, but does not generally start from the point-of-view of the end user who may need access to specialised data marts. There are two approaches to data warehousing: The top-down approach spins off data marts for specific groups of users after the complete data warehouse has been created. The bottom-up approach builds the data marts first and then combines them into a single, all-encompassing data warehouse.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data warehouse
Central repository for all or significant parts of the data that an organisation’s various business systems collect, containing harmonised, highly-structured, quality data integrated from multiple sources. Data warehousing emphasises the capture of data from diverse sources for useful analysis and access, but does not generally start from the point-of-view of the end user who may need access to specialised data marts. There are two approaches to data warehousing: The top-down approach spins off data marts for specific groups of users after the complete data warehouse has been created. The bottom-up approach builds the data marts first and then combines them into a single, all-encompassing data warehouse.
RDMT. https://terms.codata.org/rdmt/data-warehouse
Serious problem caused by one or more ineffective data analysis processes.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data driven disaster
Serious problem caused by one or more ineffective data analysis processes.
RDMT. https://terms.codata.org/rdmt/data-driven-disaster
An identifier that uniquely distinguishes one set of data from all others.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data identifier
An identifier that uniquely distinguishes one set of data from all others.
Definition from now-deprecated CASRAI Glossary term: data-identifier
Protecting sensitive data from unauthorised access by encrypting the data and storing different portions of a file on different servers. An unauthorised person would need to know the locations of the servers containing the parts, be able to get access to each server, know what data to combine, and how to decrypt it. Data splitting can be made even more effective by periodically retrieving and recombining the parts, and then splitting the data in a different way among different servers, and using a different encryption key.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data splitting
Protecting sensitive data from unauthorised access by encrypting the data and storing different portions of a file on different servers. An unauthorised person would need to know the locations of the servers containing the parts, be able to get access to each server, know what data to combine, and how to decrypt it. Data splitting can be made even more effective by periodically retrieving and recombining the parts, and then splitting the data in a different way among different servers, and using a different encryption key.
RDMT. https://terms.codata.org/rdmt/data-splitting
Monitor the status of information access of different stakeholder groups. Evaluate new access request and authorize or decline it. Update the organizational information access overview.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
manage access
Digital entity properties that are generated by the data management system (e.g., creation time; owner; storage location; data retention period; the length of time a digital entity will be retained).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
system metadata
Digital entity properties that are generated by the data management system (e.g., creation time; owner; storage location; data retention period; the length of time a digital entity will be retained).
RDMT. https://terms.codata.org/rdmt/system-metadata
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
tactical/short-term planning
Electronic version of the paper record that doctors have traditionally maintained for their patients and which is typically only accessible within the facility or office that controls it.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
electronic medical record
Electronic version of the paper record that doctors have traditionally maintained for their patients and which is typically only accessible within the facility or office that controls it.
RDMT. https://terms.codata.org/rdmt/electronic-medical-record
Bin for Aptitudes related to Provenance information management.
leightonlc
aptitudes for provenance information management
Formalising and documenting existing processes and events. A first step in analysing a system of objects with which users interact is to identify each object and its relationship to other objects. This process is called data modelling and results in a picture of object relationships. Data modellers often use multiple models to view the same data and ensure that all processes, entities, relationships and data flows have been identified. There are several different approaches to data modelling, including: Conceptual Data Modelling (identifies the highest-level relationships between different entities); Enterprise Data Modelling (similar to conceptual data modelling, but addresses the unique requirements of a specific organisation); Logical Data Modelling (illustrates the specific entities, attributes and relationships involved in a business function. Serves as the basis for the creation of the physical data model); Physical Data Modelling (represents an application and database-specific implementation of a logical data model).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data modeling
Formalising and documenting existing processes and events. A first step in analysing a system of objects with which users interact is to identify each object and its relationship to other objects. This process is called data modelling and results in a picture of object relationships. Data modellers often use multiple models to view the same data and ensure that all processes, entities, relationships and data flows have been identified. There are several different approaches to data modelling, including: Conceptual Data Modelling (identifies the highest-level relationships between different entities); Enterprise Data Modelling (similar to conceptual data modelling, but addresses the unique requirements of a specific organisation); Logical Data Modelling (illustrates the specific entities, attributes and relationships involved in a business function. Serves as the basis for the creation of the physical data model); Physical Data Modelling (represents an application and database-specific implementation of a logical data model).
RDMT. https://terms.codata.org/rdmt/data-modelling
To be able to search and understand the results from dataset aggregrator sites. To understand the implications of the provenance of the data and how to integrate and analyse data with differing metadata.
https://orcid.org/0000-0003-2687-1982
Searching aggregator sites
use of aggregator sites
To be able to search and understand the results from dataset aggregrator sites. To understand the implications of the provenance of the data and how to integrate and analyse data with differing metadata.
[PMQ]
Implement the policies that govern the choice of metadata schema, reserved vocabularies, metadata organization in tables, and metadata properties (creation date, access control, ownership, etc.).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
manage metadata catalog
Implement the policies that govern the choice of metadata schema, reserved vocabularies, metadata organization in tables, and metadata properties (creation date, access control, ownership, etc.).
Definition from now-deprecated CASRAI Glossary term: manage-metadata-catalog
Manipulation of raw data to produce a single output.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data transformation
Manipulation of raw data to produce a single output.
RDMT. https://terms.codata.org/rdmt/data-transformation
An intellectual process of describing objects in accordance with accepted library principles, particularly those of subject and classification order.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
cataloguing
An intellectual process of describing objects in accordance with accepted library principles, particularly those of subject and classification order.
RDMT. https://terms.codata.org/rdmt/catalogue-verb
Iniitate and develop processes for crediting the contributions of researchers and professional groups towards making FAIR outputs
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
design and apply processes for attribution
Collection of datasets sharing the same product specification. A dataset series is a type of aggregation or collection with some “logical grouping” such as by a topic (specification) with the (product) unit being a dataset series.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
dataset series
Collection of datasets sharing the same product specification. A dataset series is a type of aggregation or collection with some “logical grouping” such as by a topic (specification) with the (product) unit being a dataset series.
RDMT. https://terms.codata.org/rdmt/dataset-series
Collection of descriptions of the data objects or items in a data model. After each data object or item is given a descriptive name, its relationship is described (or it becomes part of some structure that implicitly describes relationship), the type of data (such as text or image or binary value) is described, possible predefined values are listed, and a brief textual description is provided. This collection can be organised for reference into a data dictionary.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data dictionary
Collection of descriptions of the data objects or items in a data model. After each data object or item is given a descriptive name, its relationship is described (or it becomes part of some structure that implicitly describes relationship), the type of data (such as text or image or binary value) is described, possible predefined values are listed, and a brief textual description is provided. This collection can be organised for reference into a data dictionary.
RDMT. https://terms.codata.org/rdmt/data-dictionary
Process that creates a new dataset from an original source.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data selection
Process that creates a new dataset from an original source.
RDMT. https://terms.codata.org/rdmt/data-selection
Define data access policy based on project requirements. Provide access to autorised parties.
Celia van Gelder
Mateusz Kuzak
Yan Wang
apply data access policy
The ability to find and comprehend data produced by people other than yourself.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Data search
AL 15.3.22: tidied label from "discover other peoples data" to "Data discovery
data discovery
The ability to find and comprehend data produced by people other than yourself.
[PMQ]
Data that are delivered with all associated metadata, data dictionary, description of methods and instruments used to collect and process the data, and other supporting data (such as duplicate sample results, replicate analyses, percent recovery, etc.) with the purpose of providing the full context in which the data were created.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
documented data
Data that are delivered with all associated metadata, data dictionary, description of methods and instruments used to collect and process the data, and other supporting data (such as duplicate sample results, replicate analyses, percent recovery, etc.) with the purpose of providing the full context in which the data were created.
RDMT. https://terms.codata.org/rdmt/documented-data
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
conducting operations
Data that are changing frequently and at asynchronous moments.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
dynamic data
Data that are changing frequently and at asynchronous moments.
RDMT. https://terms.codata.org/rdmt/dynamic-data
Design covering a class of frameworks with the following characteristics: (1) it can be used to generate more specific models that still belong to the class and (2) it can be used to compare a concrete framework design to identify whether it belongs to the same class.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
reference model
Design covering a class of frameworks with the following characteristics: (1) it can be used to generate more specific models that still belong to the class and (2) it can be used to compare a concrete framework design to identify whether it belongs to the same class.
RDMT. https://terms.codata.org/rdmt/reference-model
https://orcid.org/0000-0003-2687-1982
2020-10-01T20:52:57.551905Z
online workbook
engaging in open innovation beyond academia
Data that have gone through a registration process and have been assigned an identifier metadata to aid in their search and retrieval.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
registered data
Data that have gone through a registration process and have been assigned an identifier metadata to aid in their search and retrieval.
RDMT. https://terms.codata.org/rdmt/registered-data
In the context of data and network security, assurance that information can only be accessed or modified by those authorised to do so. Measures taken to ensure integrity include controlling the physical environment of networked terminals and servers, restricting access to data, and maintaining rigorous authentication practices. Data integrity can also be threatened by environmental hazards, such as heat, dust, and electrical surges.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
integrity
In the context of data and network security, assurance that information can only be accessed or modified by those authorised to do so. Measures taken to ensure integrity include controlling the physical environment of networked terminals and servers, restricting access to data, and maintaining rigorous authentication practices. Data integrity can also be threatened by environmental hazards, such as heat, dust, and electrical surges.
RDMT. https://terms.codata.org/rdmt/integrity
Scholarship which is dependent upon digital methods, tools or resources. May include building a digital collection of information for further study and analysis; creating appropriate tools for collection-building; creating appropriate tools for the analysis and study of collections; using digital collections and analytical tools to generate new intellectual products; or creating authoring tools for these new intellectual products, either in traditional forms or in digital form.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digital scholarship
Scholarship which is dependent upon digital methods, tools or resources. May include building a digital collection of information for further study and analysis; creating appropriate tools for collection-building; creating appropriate tools for the analysis and study of collections; using digital collections and analytical tools to generate new intellectual products; or creating authoring tools for these new intellectual products, either in traditional forms or in digital form.
RDMT. https://terms.codata.org/rdmt/digital-scholarship
Bin for types of Knowledge related to Resource management
leightonlc
a knowledge for resource management
Bin for Aptitudes related to Resource management.
leightonlc
aptitudes for resource management
Bin for types of Knowledge related to Storage management.
leightonlc
knowledge of storage management
Know about the open access publishing procedure, journals' policies on OA publishing, project funder and institutions policy as well as the financial support on OA publishing
Celia van Gelder
Mateusz Kuzak
Yan Wang
understand open access publishing
OBSOLETE. The use of persistent identifiers or PIDs to uniquely and persistently identify an entity.
Nancy Hoebelheinrich
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://github.com/terms4fairskills/FAIRterminology/issues/21
This term is redundant with http://purl.obolibrary.org/obo/T4FS_0000388 and has been deprecated.
use of persistent, resolvable identifiers (pids)
true
OBSOLETE. The use of persistent identifiers or PIDs to uniquely and persistently identify an entity.
[PMQ 3.2022]
Process in which files are first parsed (assigned to appropriate fields in a record) and then translated to a common format. Data often lack consistency simply because there are many ways of saying the same thing. Standardising the record ensures that when a query is run for a particular field, accurate results will be returned.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://github.com/terms4fairskills/FAIRterminology/issues/32
record standardization
Process in which files are first parsed (assigned to appropriate fields in a record) and then translated to a common format. Data often lack consistency simply because there are many ways of saying the same thing. Standardising the record ensures that when a query is run for a particular field, accurate results will be returned.
RDMT. https://terms.codata.org/rdmt/record-standardisation
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-10-01T20:53:59.107153Z
online documentation
Catalogue containing metadata records that enables services to find data and services.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
metadata catalogue
Catalogue containing metadata records that enables services to find data and services.
RDMT. https://terms.codata.org/rdmt/metadata-catalogue
Learn about the types of patents and its legal implications. Understand the application procedure of patents.
Celia van Gelder
Mateusz Kuzak
Yan Wang
understand legal background of patents
Layout of a file in terms of how the data within the file are organised and encoded for storage.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
Research data format
data file format
Layout of a file in terms of how the data within the file are organised and encoded for storage.
RDMT. https://terms.codata.org/rdmt/data-file-format
Curated collection of metadata records describing datasets and their data elements. Curated and organised using a formal metadata schema appropriate to data and data sets (e.g. ReCollect etc).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data catalogue
Curated collection of metadata records describing datasets and their data elements. Curated and organised using a formal metadata schema appropriate to data and data sets (e.g. ReCollect etc).
RDMT. https://terms.codata.org/rdmt/data-catalogue
A type of workflow that includes active steps to curate data as an aid to on-going management of data through its lifecycle.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
curation workflow
A type of workflow that includes active steps to curate data as an aid to on-going management of data through its lifecycle.
Definition from now–deprecated CASRAI Glossary term: curation-workflow
https://orcid.org/0000-0003-2687-1982
A1.1 the protocol is open, free, and universally implementable
Planning for and controlling the present and future costs of the conservation of data, including technical storage, fixed costs, and staff resources.
Kristina Hettne
Leighton Christiansen
Simon Hodson
Victoria Dominguez Del Angel
preservation costs management
Planning for and controlling the present and future costs of the conservation of data, including technical storage, fixed costs, and staff resources.
Based on https://www.cessda.eu/About/Projects/Past-projects/CESSDA-SaW/WP4/Cost-Benefit-Advocacy-Toolkit/Costs-Factsheet [LLC]
The person who is tasked with delivering a project within the boundaries and framework established by the program manager. The project manager is and should be delivery and execution focused and is judged on the elements of time, cost, and scope of the project. The person responsible for ensuring that the Project Team completes the project. The Project Manager develops the Project Plan with the team and manages the teamís performance of project tasks. It is also the responsibility of the Project Manager to secure acceptance and approval of deliverables from the Project Sponsor and Stakeholders. The Project Manager is responsible for communication, including status reporting, risk management, escalation of issues that cannot be resolved in the team, and, in general, making sure the project is delivered in budget, on schedule, and within scope.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
project manager
The person who is tasked with delivering a project within the boundaries and framework established by the program manager. The project manager is and should be delivery and execution focused and is judged on the elements of time, cost, and scope of the project. The person responsible for ensuring that the Project Team completes the project. The Project Manager develops the Project Plan with the team and manages the teamís performance of project tasks. It is also the responsibility of the Project Manager to secure acceptance and approval of deliverables from the Project Sponsor and Stakeholders. The Project Manager is responsible for communication, including status reporting, risk management, escalation of issues that cannot be resolved in the team, and, in general, making sure the project is delivered in budget, on schedule, and within scope.
Definition from now-deprecated CASRAI Glossary term: project-manager
Celia van Gelder
Mateusz Kuzak
Yan Wang
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Release data
AL 9.3.22. Removed from 'FAIR4S defined activity' class to better integrate with the T4FS hierarchy. AL 5.5.22: removed "Publishing and archiving data" and "Publish and archive data" alternative terms, as the two actions are not the same.
publish data
https://orcid.org/0000-0002-7702-4495
Access management
AL 16.3.22: Changed 'information security and access management' such that access management went in as an alternative term. If required in future, we may wish to separate it out completely.
information security
All activities involved in the planning, collecting, processing, analysis and maintenance of data in the original research project. Among these activities are selecting a study design, constructing instruments for data collection, conducting data collection/creation, performing data editing/verification/validation, analysing data, backing up data versions and preparing and tagging metadata.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data production
All activities involved in the planning, collecting, processing, analysis and maintenance of data in the original research project. Among these activities are selecting a study design, constructing instruments for data collection, conducting data collection/creation, performing data editing/verification/validation, analysing data, backing up data versions and preparing and tagging metadata.
RDMT. https://terms.codata.org/rdmt/data-production
Data that have not been processed for meaningful use. Although raw data have the potential to become information, they require selective extraction, organisation, and sometimes analysis and formatting for presentation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Source data
raw data
Data that have not been processed for meaningful use. Although raw data have the potential to become information, they require selective extraction, organisation, and sometimes analysis and formatting for presentation.
RDMT. https://terms.codata.org/rdmt/raw-data
Testing conducted to evaluate whether systems or components pass data and control correctly to each other.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
interface testing
Testing conducted to evaluate whether systems or components pass data and control correctly to each other.
Definition from now-deprecated CASRAI Glossary term: interface-testing
Machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems. A digital object is composed of a structured sequence of bits/bytes. The bit sequence realising the object can be identified and accessed by a unique and persistent identifier or by use of referencing attributes describing its properties.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Digital entity
digital object
Machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems. A digital object is composed of a structured sequence of bits/bytes. The bit sequence realising the object can be identified and accessed by a unique and persistent identifier or by use of referencing attributes describing its properties.
RDMT. https://terms.codata.org/rdmt/digital-object
A broad term encompassing: (a) digital surrogates created as a result of converting analogue materials to digital form (digitisation); (b) born digital for which there has never been and is never intended to be an analogue equivalent; and, (c) digital records.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digital materials
A broad term encompassing: (a) digital surrogates created as a result of converting analogue materials to digital form (digitisation); (b) born digital for which there has never been and is never intended to be an analogue equivalent; and, (c) digital records.
RDMT. https://terms.codata.org/rdmt/digital-materials
Understand the needs and conditions of information access for different stakeholder groups. Identify the responsibility and rights for each stakeholder group on informaiton access. Build up a catalogue of risk profiles. For each risk profile, develope mitigation protocols.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
define access policy
Bin for Skills needed for Provenance information management.
leightonlc
2019-10-17T15:17:02.313138Z
skills related to provenance information management
Ability to access and download data from a repository.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
remote data access
Ability to access and download data from a repository.
RDMT. https://terms.codata.org/rdmt/remote-data-access
The act of interpreting an author's intended use of a word that has multiple meanings or spellings.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
disambiguation
The act of interpreting an author's intended use of a word that has multiple meanings or spellings.
Definition from now-deprecated CASRAI Glossary term: dissambuation
Ongoing processes to ensure adherence of data to both enterprise business rules (government department, university, industry, or agency), and to legal, regulatory and accreditation requirements. Includes five areas: controls, audit, legal compliance, regulatory compliance, and accreditation conformance.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data compliance
Ongoing processes to ensure adherence of data to both enterprise business rules (government department, university, industry, or agency), and to legal, regulatory and accreditation requirements. Includes five areas: controls, audit, legal compliance, regulatory compliance, and accreditation conformance.
RDMT. https://terms.codata.org/rdmt/data-compliance
The re-analysis of a dataset or combination of datasets outside of the original research purpose for which the dataset was created.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://github.com/terms4fairskills/FAIRterminology/issues/15
AL 2.3.23: Removed because of its high level of similarity with the 'R' FAIR principle.
re-use
true
The re-analysis of a dataset or combination of datasets outside of the original research purpose for which the dataset was created.
RDMT. https://terms.codata.org/rdmt/re-use
trainer/teacher
Bin for Aptitudes related to Identity management.
leightonlc
aptitudes for identity management
Know the security requirements on the organization regarding different types of information Know the current organizational policy, infrastructure and capacity on information security.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
understand information security challenges
Bin for Aptitudes related to Authorization management.
leightonlc
aptitudes for authorisation management
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
https://orcid.org/0000-0002-7702-4495
AL 17.3.22: Updated to "Assessment" to match the style of its sibling classes. AL 3.22: Was "govern and assess", which is a compound concept. Split into two classes, Assess and Governance. Original source: FAIR4S defined activity.
assessment
The World Wide Web Consortium’s Platform for Personal Privacy Project (P3P) offers specific recommendations for practices that will let users define and share personal information with Web sites that they agree to share it with. The P3P incorporates a number of industry proposals, including the Open Profiling Standard (OPS). Using software that adheres to the P3P recommendations, users will be able to create a personal profile, all or parts of which can be made accessible to a Web site as the user directs. A tool that will help a user decide whether to trust a given website with personal information is a Statement of Privacy Policy that a website can post.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
personal information privacy
The World Wide Web Consortium’s Platform for Personal Privacy Project (P3P) offers specific recommendations for practices that will let users define and share personal information with Web sites that they agree to share it with. The P3P incorporates a number of industry proposals, including the Open Profiling Standard (OPS). Using software that adheres to the P3P recommendations, users will be able to create a personal profile, all or parts of which can be made accessible to a Web site as the user directs. A tool that will help a user decide whether to trust a given website with personal information is a Statement of Privacy Policy that a website can post.
RDMT. https://terms.codata.org/rdmt/personal-information-privacy
Demonstrate ability to involve others and share responsibility for applying FAIR principles.
Angus Whyte
ability to solve fair implementation problems collaboratively
Demonstrate ability to involve others and share responsibility for applying FAIR principles.
[PMQ, 3.2022]
Bin for Skills needed for Cloud computing environment management.
leightonlc
2019-10-17T15:14:37.180683Z
skills related to cloud computing environment management
Managed process throughout the data lifecycle, by which data/data collections are cleansed, documented, standardised, formatted and inter-related. This includes versioning data, or forming a new collection from several data sources, annotating with metadata, adding codes to raw data (e.g., classifying a galaxy image with a galaxy type such as “spiral”). Higher levels of curation involve maintaining links with annotation and with other published materials. Thus a dataset may include a citation link to publication whose analysis was based on the data. The goal of curation is to manage and promote the use of data from its point of creation to ensure it is fit for contemporary purpose and available for discovery and re-use. For dynamic datasets this may mean continuous enrichment or updating to keep it fit for purpose. Special forms of curation may be available in data repositories. The data curation process itself must be documented as part of curation. Thus curation and provenance are highly related.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data curation
Managed process throughout the data lifecycle, by which data/data collections are cleansed, documented, standardised, formatted and inter-related. This includes versioning data, or forming a new collection from several data sources, annotating with metadata, adding codes to raw data (e.g., classifying a galaxy image with a galaxy type such as “spiral”). Higher levels of curation involve maintaining links with annotation and with other published materials. Thus a dataset may include a citation link to publication whose analysis was based on the data. The goal of curation is to manage and promote the use of data from its point of creation to ensure it is fit for contemporary purpose and available for discovery and re-use. For dynamic datasets this may mean continuous enrichment or updating to keep it fit for purpose. Special forms of curation may be available in data repositories. The data curation process itself must be documented as part of curation. Thus curation and provenance are highly related.
RDMT. https://terms.codata.org/rdmt/data-curation
Cite contributions to data work in published literature which draws upon that data.
Laura Molloy
https://orcid.org/0000-0003-2687-1982
cite contributions
Cite contributions to data work in published literature which draws upon that data.
[LM]
Ongoing organizational activities associated with supporting functional elements, as opposed to project elements. Operational management also includes support of products that the organization has created through project activity.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
operational management
Ongoing organizational activities associated with supporting functional elements, as opposed to project elements. Operational management also includes support of products that the organization has created through project activity.
Definition from now-deprecated CASRAI Glossary term: operational-management
Data that are complete, timely, accurate, consistent, relevant, reliable, traceable, cleaned, validated, and well documented.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
high quality data
Data that are complete, timely, accurate, consistent, relevant, reliable, traceable, cleaned, validated, and well documented.
RDMT. https://terms.codata.org/rdmt/high-quality-data
Understand what reuse is permitted according to the licenses or terms and conditions applicable to services, tools and their content.
Angus Whyte
how to make your data reuseable
https://orcid.org/0000-0003-2687-1982
I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation
Organisation or structure for a database. The activity of data modelling leads to a schema. (The plural form is schemata.) The term is used in discussing both relational databases and object-oriented databases. The term sometimes refers to a visualisation of a structure and sometimes to a formal text-oriented description.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
schema
schema (database)
Organisation or structure for a database. The activity of data modelling leads to a schema. (The plural form is schemata.) The term is used in discussing both relational databases and object-oriented databases. The term sometimes refers to a visualisation of a structure and sometimes to a formal text-oriented description.
RDMT. https://terms.codata.org/rdmt/schema-database
Service that provides a connection between a PID and its target object.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
pid service
Service that provides a connection between a PID and its target object.
RDMT. https://terms.codata.org/rdmt/pid-service
Set of conditions that allow data exchange from one information technology system to be received by another and does not require the ability for the receiving information technology system to interpret the data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
foundational interoperability
Set of conditions that allow data exchange from one information technology system to be received by another and does not require the ability for the receiving information technology system to interpret the data.
RDMT. https://terms.codata.org/rdmt/foundational-interoperability
The activities of data policies, data planning, data element standardization, information management control, data synchronization, data sharing, and database development, including practices and projects that acquire, control, protect, deliver and enhance the value of data and information.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data management
The activities of data policies, data planning, data element standardization, information management control, data synchronization, data sharing, and database development, including practices and projects that acquire, control, protect, deliver and enhance the value of data and information.
Definition from now-deprecated CASRAI Glossary term: data-management
Authorization management is concerned with people's access to different objects, most often to data or physical objects, such as land, buildings, rooms or infrastructure.
Kristina Hettne
Leighton Christiansen
Simon Hodson
Victoria Dominguez Del Angel
authorisation management
Authorization management is concerned with people's access to different objects, most often to data or physical objects, such as land, buildings, rooms or infrastructure.
https://managementmania.com/en/authorization-management
Bin for types of Knowledge related to Authorization management.
leightonlc
knowledge of authorisation management
The ability to search repositories and knowledge-bases.
https://orcid.org/0000-0003-2687-1982
Repository search
database search
searching databases
searching repositories
search repositories and knowledge-bases
The ability to search repositories and knowledge-bases.
[PMQ]
Ecosystem that includes software, infrastructure, consultants, integrators, partners, third parties and anything else in the specified environment that has a bearing on the other components.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
cloud ecosystem
Ecosystem that includes software, infrastructure, consultants, integrators, partners, third parties and anything else in the specified environment that has a bearing on the other components.
RDMT. https://terms.codata.org/rdmt/cloud-ecosystem
Learn about data licensing, the types of data licences and its legal implications. Understand the application procedure of data licences.
Celia van Gelder
Mateusz Kuzak
Yan Wang
understand legal background of licensing
A place or collection containing static records, documents, or other materials for long-term preservation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
archive
A place or collection containing static records, documents, or other materials for long-term preservation.
RDMT. https://terms.codata.org/rdmt/archive
An infrastructure component that provides reliable, long-term access to managed digital resources. It stores, manages, and curates digital objects and returns their bit streams when a request is issued. Trusted repositories undergo regular assessments according to a set of rules such as defined by Data Seal of Approval (DSA) or TRAC (ISO 16363). It is well understood that such an assessment has the potential of increasing trust from its depositors and users, but it will not be the only criterion for users. Repositories can be at different stages of assessments. However, it is evident that certain quality criteria need to be met to distinguish trusted repositories from all types of other entities that store data such as notebooks or lab servers.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
trusted digital repository
An infrastructure component that provides reliable, long-term access to managed digital resources. It stores, manages, and curates digital objects and returns their bit streams when a request is issued. Trusted repositories undergo regular assessments according to a set of rules such as defined by Data Seal of Approval (DSA) or TRAC (ISO 16363). It is well understood that such an assessment has the potential of increasing trust from its depositors and users, but it will not be the only criterion for users. Repositories can be at different stages of assessments. However, it is evident that certain quality criteria need to be met to distinguish trusted repositories from all types of other entities that store data such as notebooks or lab servers.
Definition from now-deprecated CASRAI Glossary term: trusted-digital-repository
Managing the physical aspects of data resources, including database design and integrity, backup and recovery, performance and tuning.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
database administration
Managing the physical aspects of data resources, including database design and integrity, backup and recovery, performance and tuning.
RDMT. https://terms.codata.org/rdmt/database-administration
Bin for types of Knowledge related to Identity management
leightonlc
knowledge of identity management
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
quality assessment
Specialised format for organising and storing data. General data structure types include the array, the file, the record, the table, the tree, and so on. Any data structure is designed to organise data to suit a specific purpose so that it can be accessed and worked with in appropriate ways. In computer programming, a data structure may be selected or designed to store data for the purpose of working on it with various algorithms.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
Data format
data structure
Specialised format for organising and storing data. General data structure types include the array, the file, the record, the table, the tree, and so on. Any data structure is designed to organise data to suit a specific purpose so that it can be accessed and worked with in appropriate ways. In computer programming, a data structure may be selected or designed to store data for the purpose of working on it with various algorithms.
RDMT. https://terms.codata.org/rdmt/data-structure
Information systems and technology infrastructure manager, expert, or technician.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
information technology specialist
Information systems and technology infrastructure manager, expert, or technician.
RDMT. https://terms.codata.org/rdmt/information-technology-specialist
A curation activity that ensures that data are properly selected, stored, and can be accessed, and for which logical and physical integrity are maintained over time, including security and authenticity.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
archiving
A curation activity that ensures that data are properly selected, stored, and can be accessed, and for which logical and physical integrity are maintained over time, including security and authenticity.
RDMT. https://terms.codata.org/rdmt/archive-verb
Bin for types of Knowledge related to Preservation costs management.
leightonlc
2019-10-17T15:17:32.816776Z
knowledge of preservation costs management
A phase of development where the product is tested in the real world by the intended audience. The experiences of the early users are forwarded back to the developers who make final changes before releasing the product.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
user acceptance testing
A phase of development where the product is tested in the real world by the intended audience. The experiences of the early users are forwarded back to the developers who make final changes before releasing the product.
Definition from now-deprecated CASRAI Glossary term: user-acceptance-testing/
Description of a digital object with a limited number of fields including at least a name and persistent identifier.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
minimal metadata
Description of a digital object with a limited number of fields including at least a name and persistent identifier.
RDMT. https://terms.codata.org/rdmt/minimal-metadata
Metadata exposure is the process of finding, identifying, selecting and acquiring/obtaining access to a metadata entity.
Nancy Hoebelheinrich
https://orcid.org/0000-0002-7702-4495
AL 22.3.22: Refactored from 'metadata creation and exposure'. Metadata creation has moved to the Curation hierarchy.
metadata exposure
Metadata exposure is the process of finding, identifying, selecting and acquiring/obtaining access to a metadata entity.
AL 22.3.22, and see also FRBR User Tasks at: https://sites.google.com/site/metadatastandards/chapter-6/6-3-frbr-user-tasks.
Process of destroying data stored on tapes, hard disks and other forms of electronic media so that it is completely unreadable and cannot be accessed or used.
https://orcid.org/0000-0002-5214-4466
data destruction
Process of destroying data stored on tapes, hard disks and other forms of electronic media so that it is completely unreadable and cannot be accessed or used.
RDMT. https://terms.codata.org/rdmt/data-destruction
The provision of training materials and events in and around good data stewardship.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
fair stewardship training
The provision of training materials and events in and around good data stewardship.
PMQ
The ability to have an innovative approach to research by creating new or modified current concepts, theories, approaches and/or solutions.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
creativity
The ability to have an innovative approach to research by creating new or modified current concepts, theories, approaches and/or solutions.
Definition from now-deprecated CASRAI Glossary term: creativity
Field or column in a database table. It is an abbreviation for ‘physical data attribute’ which is a single data element related to a data object, such as a table in a database. The database schema associates one or more attributes with each database entity (i.e. table).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data table attribute
data table attribute (field)
Field or column in a database table. It is an abbreviation for ‘physical data attribute’ which is a single data element related to a data object, such as a table in a database. The database schema associates one or more attributes with each database entity (i.e. table).
RDMT. https://terms.codata.org/rdmt/data-table-attribute-field
A standard developed through the cooperation of all parties who have an interest in participating in its development and/or use.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
Community standard
consensus standard
A standard developed through the cooperation of all parties who have an interest in participating in its development and/or use.
Definition from now-deprecated CASRAI Glossary term: consensus-standard
Identity Management (IdM), also called Identity and Access Management (IAM) refers to a framework of policies and technologies for ensuring that the proper user in an organisation has the appropriate access to technology resources systems fall under the overarching umbrella of IT security. IdM systems not only identify, but authenticate and authorize individuals who will be using IT resources, but also the hardware and applications users need to access.
Kristina Hettne
Leighton Christiansen
Simon Hodson
Victoria Dominguez Del Angel
IAM
IdM
Identity and Access Management
identity management
Identity Management (IdM), also called Identity and Access Management (IAM) refers to a framework of policies and technologies for ensuring that the proper user in an organisation has the appropriate access to technology resources systems fall under the overarching umbrella of IT security. IdM systems not only identify, but authenticate and authorize individuals who will be using IT resources, but also the hardware and applications users need to access.
https://en.wikipedia.org/wiki/Identity_management
https://orcid.org/0000-0003-2687-1982
reuse of digital assets
Explore the potential application domain/sociental implication of the project work and the communities of users/partners.
Celia van Gelder
Mateusz Kuzak
Yan Wang
investigate open innovation opportunities
The process of restoring data that have been lost, accidentally deleted, corrupted or made inaccessible for any reason. The data recovery process may vary, depending on the circumstances of the data loss, the data recovery software used to create backups, and backup target media. In some cases, end users may be able to restore lost...
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data recovery
The process of restoring data that have been lost, accidentally deleted, corrupted or made inaccessible for any reason. The data recovery process may vary, depending on the circumstances of the data loss, the data recovery software used to create backups, and backup target media. In some cases, end users may be able to restore lost...
Definition from now-deprecated CASRAI Glossary term: data-quality-review
https://orcid.org/0000-0003-2687-1982
R1. (meta)data are richly described with a plurality of accurate and relevant attributes
To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate repository for a dataset. This may, for example, involve discovering the mark-up of the repository, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness of a repository based on human-entered questionnaires or FAIRsharing resource metadata.
https://orcid.org/0000-0003-2687-1982
FAIR data submission
Repository FAIRness
fair evaluation of repositories for data deposition
To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate repository for a dataset. This may, for example, involve discovering the mark-up of the repository, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness of a repository based on human-entered questionnaires or FAIRsharing resource metadata.
[PMQ]
A Regional standard is one that applies across a multi-nation region. Most countries have their own national standards-making bodies, which in turn may also group together to make regional standards. For example, national standards bodies in Europe are also members of the European Committee for Standardization (CEN) as well as members of ISO. The use of such standards may be voluntary, or they may be referenced in regulation (therefore mandatory).
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 9.3.22: Removed reference to CASRAI (https://casrai.org/term/regional-standard/), as the definition was unsuitable.
regional standard
A Regional standard is one that applies across a multi-nation region. Most countries have their own national standards-making bodies, which in turn may also group together to make regional standards. For example, national standards bodies in Europe are also members of the European Committee for Standardization (CEN) as well as members of ISO. The use of such standards may be voluntary, or they may be referenced in regulation (therefore mandatory).
Adapted by AL from ISO. https://www.iso.org/sites/ConsumersStandards/1_standards.html#section1_2
The process of setting up environments for workflow technologies.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
workflow technologies management
The process of setting up environments for workflow technologies.
KH
Data steward is an umbrella term for numerous support roles that involve the creation, management and usage of research data. A data steward facilitates the quality, integrity and access to (meta)data in a manner that is consistent with the appropriate laws and institutional policies, ensuring professional treatment of data throughout all stages of the research project.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
data steward
Data steward is an umbrella term for numerous support roles that involve the creation, management and usage of research data. A data steward facilitates the quality, integrity and access to (meta)data in a manner that is consistent with the appropriate laws and institutional policies, ensuring professional treatment of data throughout all stages of the research project.
The Turing Way. https://book.the-turing-way.org/collaboration/research-infrastructure-roles/data-steward.html
Bin for Skills needed for Preservation costs management.
leightonlc
2019-10-17T15:17:32.816013Z
skills related to preservation costs management
https://orcid.org/0000-0003-2687-1982
2021-02-17T22:10:11.837519Z
repository certification schemes
Database containing information about trusted repositories that are provided by repository managers and are useful for human and machine users. These registries do not contain information about all metadata descriptions of digital objects, nor do they offer a list of PIDs of all stored digital objects. They do offer information based on standardised types on how to retrieve such information (e.g., the port under which OAI-PMH can be accessed to offer metadata). A registry requires the assignment of a permanent, unique and unambiguous identifier to each item.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Data registry
registry
Database containing information about trusted repositories that are provided by repository managers and are useful for human and machine users. These registries do not contain information about all metadata descriptions of digital objects, nor do they offer a list of PIDs of all stored digital objects. They do offer information based on standardised types on how to retrieve such information (e.g., the port under which OAI-PMH can be accessed to offer metadata). A registry requires the assignment of a permanent, unique and unambiguous identifier to each item.
RDMT. https://terms.codata.org/rdmt/registry
To clearly communicate the existence of contributions from different individuals / projects to the compilation of a given dataset, in such a way that can be consistently cited.
Laura Molloy
attribution
credit
recognise and acknowledge contributions
To clearly communicate the existence of contributions from different individuals / projects to the compilation of a given dataset, in such a way that can be consistently cited.
[LM]
Data where relationships/connections between them are available to allow easy data access.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
linked open data
Data where relationships/connections between them are available to allow easy data access.
RDMT. https://terms.codata.org/rdmt/linked-open-data
Research data format is a generic term encompassing the concept of a standardised format for research data.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 9.3.2022: The original CASRAI definition is unsuitable (https://casrai.org/term/research-data-format), therefore I have changed the definition to something more generic.
research data format
Research data format is a generic term encompassing the concept of a standardised format for research data.
AL, 9.3.22
Activities and policies that ensure that the benefits to society of research outweigh any risks, from both an ethical and legal perspective.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
research governance
Activities and policies that ensure that the benefits to society of research outweigh any risks, from both an ethical and legal perspective.
RDMT. https://terms.codata.org/rdmt/research-governance
Ability of computer systems to transmit data with unambiguous, shared meaning. Semantic interoperability is a requirement to enable machine computable logic, inferencing, knowledge discovery, and data federation between information systems. Semantic interoperability is achieved when the information transferred has, in its communicated form, all of the meaning required for the receiving system to interpret it correctly, even when the algorithms used by the receiving system are unknown to the sending system. Syntactic interoperability is a prerequisite to semantic interoperability. Semantic interoperability ensures that the precise format and meaning of exchanged data and information is preserved and understood throughout exchanges between parties; in other words, what is sent is what is understood.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
semantic interoperability
Ability of computer systems to transmit data with unambiguous, shared meaning. Semantic interoperability is a requirement to enable machine computable logic, inferencing, knowledge discovery, and data federation between information systems. Semantic interoperability is achieved when the information transferred has, in its communicated form, all of the meaning required for the receiving system to interpret it correctly, even when the algorithms used by the receiving system are unknown to the sending system. Syntactic interoperability is a prerequisite to semantic interoperability. Semantic interoperability ensures that the precise format and meaning of exchanged data and information is preserved and understood throughout exchanges between parties; in other words, what is sent is what is understood.
RDMT. https://terms.codata.org/rdmt/semantic-interoperability
A non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
international chemical identifier
A non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations.
RDMT. https://terms.codata.org/rdmt/international-chemical-identifier
Activities and processes in a digital environment that lead to the publication of research data, associated metadata and accompanying documentation and software code on the Web. In contrast to interim or final published products, workflows are the means to curate, document, and review, and thus ensure and enhance the value of the published product. Workflows can involve both humans and machines and often humans are supported by technology as they perform steps in the workflow. Similar workflows may vary in the details depending on the research discipline, data publishing product and/or the host institution of the workflow.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
research data publication workflow
Activities and processes in a digital environment that lead to the publication of research data, associated metadata and accompanying documentation and software code on the Web. In contrast to interim or final published products, workflows are the means to curate, document, and review, and thus ensure and enhance the value of the published product. Workflows can involve both humans and machines and often humans are supported by technology as they perform steps in the workflow. Similar workflows may vary in the details depending on the research discipline, data publishing product and/or the host institution of the workflow.
RDMT. https://terms.codata.org/rdmt/research-data-publication-workflow
Degree to which all required measurements are known. Values may be designated as “missing” in order not to have empty cells, or missing values may be replaced with default or interpolated values. In the case of default or interpolated values, these must be flagged as such to distinguish them from actual measurements or observations. Missing, default, or interpolated values do not imply that the dataset has been made complete.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data completeness
Degree to which all required measurements are known. Values may be designated as “missing” in order not to have empty cells, or missing values may be replaced with default or interpolated values. In the case of default or interpolated values, these must be flagged as such to distinguish them from actual measurements or observations. Missing, default, or interpolated values do not imply that the dataset has been made complete.
RDMT. https://terms.codata.org/rdmt/data-completeness
Data linkage where the resulting product has been de-identified.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
privacy-preserving data linkage
Data linkage where the resulting product has been de-identified.
RDMT. https://terms.codata.org/rdmt/privacy-preserving-data-linkage
Celia van Gelder
Mateusz Kuzak
Yan Wang
https://orcid.org/0000-0002-7702-4495
AL 22.3.22: refactored 'open access publishing and self-archiving' to be two separate concepts ('open access publishing', and 'self-archiving') according to T4FS best practices. Self-archiving did not have to be created because we already had self-archive, which was then moved to the Archiving hierarchy. 'publish open access' was also removed in favour of this term as they were too similar to have without the potential for confusion by users.
open access publishing
Prepare the data in preferred types and the data documentation. Choose the data repository and data license, deposit the data in the repository.
Celia van Gelder
Mateusz Kuzak
Yan Wang
https://orcid.org/0000-0002-7702-4495
self-archiving
Person with broad knowledge of information management disciplines and who provides guidance and support to program and staff functions on all aspects of managing the information resource.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
information management advisor
Person with broad knowledge of information management disciplines and who provides guidance and support to program and staff functions on all aspects of managing the information resource.
RDMT. https://terms.codata.org/rdmt/information-management-advisor
Statement describing how research data will be managed throughout a specified research project's life cycle - during and after the active phase of the research project - including terms regarding archiving and potential preservation of the data in a data repository. The DMP is considered to be a 'living' document, i.e. one which can be updated when necessary.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
DMP
data management plan
Statement describing how research data will be managed throughout a specified research project's life cycle - during and after the active phase of the research project - including terms regarding archiving and potential preservation of the data in a data repository. The DMP is considered to be a 'living' document, i.e. one which can be updated when necessary.
RDMT. https://terms.codata.org/rdmt/data-management-plan
https://orcid.org/0000-0003-2687-1982
F2. data are described with rich metadata
https://orcid.org/0000-0003-2687-1982
Data license
R1.1 (meta)data are released with a clear and accessible data usage license
Sequence of digitally encoded, coherent signals used to send or receive a representation of information content as transmitted.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data stream
Sequence of digitally encoded, coherent signals used to send or receive a representation of information content as transmitted.
RDMT. https://terms.codata.org/rdmt/data-stream
Librarian who manages the sharing and publishing of datasets as openly as possible and as closed as necessary, and the management and curation of repositories required to achieve this. Broad role requirements include support for sharing and publishing datasets, finding, accessing, interoperating and re-using these datasets, reviewing and supporting Data Management Plans and training delivery.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data librarian
Librarian who manages the sharing and publishing of datasets as openly as possible and as closed as necessary, and the management and curation of repositories required to achieve this. Broad role requirements include support for sharing and publishing datasets, finding, accessing, interoperating and re-using these datasets, reviewing and supporting Data Management Plans and training delivery.
RDMT. https://terms.codata.org/rdmt/data-librarian
The process of developing, communicating, implementing, monitoring, and assuring the policies, procedures, organizational structures, and practices associated with a given program.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
program governance
The process of developing, communicating, implementing, monitoring, and assuring the policies, procedures, organizational structures, and practices associated with a given program.
Definition from now-deprecated CASRAI Glossary term: program-governance
Bin for types of Knowledge related to Workflow technologies management.
leightonlc
2019-10-17T15:16:10.561925Z
knowledge of workflow technologies management
Obtain an overview of information access status for different stakeholder groups. Check this overview against the organizational information access policy and risk catalogue, identify the type of information security risks based on the mismatch between the current status and policy. Choose and implement the right protocal to mitigate the risk.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
access risk assessment and mitigation
Data that can be used: delivered in a form that meets the needs of different end-user audiences, is ready for the tasks that the end-user needs to accomplish, and that has been adapted to the end-user’s needs. Usable data have been cleaned, structured, are in machine readable format, fully documented, and ready for analysis and interpretation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
usable data
Data that can be used: delivered in a form that meets the needs of different end-user audiences, is ready for the tasks that the end-user needs to accomplish, and that has been adapted to the end-user’s needs. Usable data have been cleaned, structured, are in machine readable format, fully documented, and ready for analysis and interpretation.
RDMT. https://terms.codata.org/rdmt/usable-data
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-10-01T20:52:43.543314Z
presentation slides
Configure secure storage and monitor its usage
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
implement secure storage
Entire period of time that research data exists. This lifecycle describes the flow of research data starting from planning, collecting, processing, analysing, preserving, sharing and finally reusing the research data. Research data often have a longer lifespan than the research project.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data lifecycle
research data lifecycle
Entire period of time that research data exists. This lifecycle describes the flow of research data starting from planning, collecting, processing, analysing, preserving, sharing and finally reusing the research data. Research data often have a longer lifespan than the research project.
RDMT. https://terms.codata.org/rdmt/research-data-lifecycle
Kristina Hettne
Victoria Dominguez Del Angel
Yann Le Franc
format and media migration
Repository for persistently storing collections of data, such as a database, a file system or a directory. The data stored can be of any type that can be rendered in digital format and placed in electronic media.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data store
Repository for persistently storing collections of data, such as a database, a file system or a directory. The data stored can be of any type that can be rendered in digital format and placed in electronic media.
RDMT. https://terms.codata.org/rdmt/data-store
Initiate and develop processes to ensure outputs are made FAIR consistently with research integrity principles, and with ethical oversight.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
design processes for research integrity
CoreTrustSeal offers to any interested data repository a core level certification based on the DSA–WDS Core Trustworthy Data Repositories Requirements catalogue and procedures. This universal catalogue of requirements reflects the core characteristics of trustworthy data repositories and is the culmination of a cooperative effort between DSA and WDS under the umbrella of the Research Data Alliance to merge their data repositories certifications.
https://orcid.org/0000-0003-2687-1982
CTS
https://orcid.org/0000-0003-2687-1982
2021-02-17T22:12:12.996581Z
core trust seal
CoreTrustSeal offers to any interested data repository a core level certification based on the DSA–WDS Core Trustworthy Data Repositories Requirements catalogue and procedures. This universal catalogue of requirements reflects the core characteristics of trustworthy data repositories and is the culmination of a cooperative effort between DSA and WDS under the umbrella of the Research Data Alliance to merge their data repositories certifications.
https://www.coretrustseal.org/about/
Defines how to manage a project. It will always be the same, regardless of the project lifecycle being employed.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
project management lifecycle
Defines how to manage a project. It will always be the same, regardless of the project lifecycle being employed.
Definition from now-deprecated CASRAI Glossary term: project-management-lifecycle
Type of metadata that conveys information needed to link a data object to its original source.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
authenticity metadata
Type of metadata that conveys information needed to link a data object to its original source.
RDMT. https://terms.codata.org/rdmt/authenticity-metadata
researcher
Bin for Skills needed for Service level management.
leightonlc
2019-10-17T15:18:53.731967Z
skills related to service level management
Data that relate to a living individual who can be identified from those data or those data plus other information which is in the possession of, or is likely to come into the possession of, the data controller, and includes any expression of opinion about the individual and any indication of the intentions of the data controller or any other person in respect of the individual. Any information that can be used to distinguish one person from another and can be used for de-anonymising anonymous data can be considered personally identifiable data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Personal data
personally identifiable information
Data that relate to a living individual who can be identified from those data or those data plus other information which is in the possession of, or is likely to come into the possession of, the data controller, and includes any expression of opinion about the individual and any indication of the intentions of the data controller or any other person in respect of the individual. Any information that can be used to distinguish one person from another and can be used for de-anonymising anonymous data can be considered personally identifiable data.
RDMT. https://terms.codata.org/rdmt/personally-identifiable-information
Understand the current storage security status and the preferred status, as well as the barriers between them.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
understand storage security challanges
Organising data into tables so that the results of using the database are always unambiguous and as intended. Normalisation is typically a refinement process after the initial exercise of identifying the data objects that should be in the database, identifying their relationships, and defining the tables required and the columns within each table. First normal form (1NF) is the “basic” level of normalisation: Data and information are contained in two-dimensional tables with rows and columns. Each column corresponds to a sub-object or an attribute of the object represented by the entire table. Each row represents a unique instance of that sub-object or attribute and must be different in some way from any other row (that is, no duplicate rows are possible). All entries in any column must be of the same kind. For example, in the column labelled “Date,” only dates are permitted. In Second normal form (2NF), the tables are in first normal form and, in addition, each column in a table that is not a determiner of the contents of another column must itself be a function of the other columns in the table. At the second normal form, modifications are still possible because a change to one row in a table may affect data that refers to this information from another table. In Third normal form (3NF), the tables are in second normal form and, in addition, there is no transitive functional dependency. For example, if A is functionally dependent on B, and B is functionally dependent on C, then C is transitively dependent on A via B. In Domain/key normal form (DKNF), a key uniquely identifies each row in a table. A domain is the set of permissible values for an attribute. By enforcing key and domain restrictions, the database is assured of being freed from modification anomalies. DKNF is the normalisation level that most designers aim to achieve.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
normalization
Organising data into tables so that the results of using the database are always unambiguous and as intended. Normalisation is typically a refinement process after the initial exercise of identifying the data objects that should be in the database, identifying their relationships, and defining the tables required and the columns within each table. First normal form (1NF) is the “basic” level of normalisation: Data and information are contained in two-dimensional tables with rows and columns. Each column corresponds to a sub-object or an attribute of the object represented by the entire table. Each row represents a unique instance of that sub-object or attribute and must be different in some way from any other row (that is, no duplicate rows are possible). All entries in any column must be of the same kind. For example, in the column labelled “Date,” only dates are permitted. In Second normal form (2NF), the tables are in first normal form and, in addition, each column in a table that is not a determiner of the contents of another column must itself be a function of the other columns in the table. At the second normal form, modifications are still possible because a change to one row in a table may affect data that refers to this information from another table. In Third normal form (3NF), the tables are in second normal form and, in addition, there is no transitive functional dependency. For example, if A is functionally dependent on B, and B is functionally dependent on C, then C is transitively dependent on A via B. In Domain/key normal form (DKNF), a key uniquely identifies each row in a table. A domain is the set of permissible values for an attribute. By enforcing key and domain restrictions, the database is assured of being freed from modification anomalies. DKNF is the normalisation level that most designers aim to achieve.
RDMT. https://terms.codata.org/rdmt/normalisation
Evaluation is a decision about significance, value, or quality of something, based on careful study of its good and bad features.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 17.3.22: We may wish to make this an alternative term of Assessment.
evaluation
Evaluation is a decision about significance, value, or quality of something, based on careful study of its good and bad features.
Definition from now-deprecated CASRAI Glossary term: evaluation
Type of data element that expresses a proposition that binds one or more property values to some data entity.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data item
Type of data element that expresses a proposition that binds one or more property values to some data entity.
RDMT. https://terms.codata.org/rdmt/data-item
Duty and practice of ensuring that personal or sensitive information only flows from one entity to another according to legislated or otherwise broadly accepted norms and policies. This can be done by either restricting access to the data or certain variables in the data, and/or by protecting personal or sensitive information using an anonymisation method.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
confidentiality
Duty and practice of ensuring that personal or sensitive information only flows from one entity to another according to legislated or otherwise broadly accepted norms and policies. This can be done by either restricting access to the data or certain variables in the data, and/or by protecting personal or sensitive information using an anonymisation method.
RDMT. https://terms.codata.org/rdmt/confidentiality
Bin for types of Knowledge needed for Data management costs management.
leightonlc
knowledge of data management costs management
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
logistical support
Bin for Aptitudes related to Data management costs management.
leightonlc
aptitudes for data management costs management
The process of acquiring data from some source. For example, data may be acquired by download from a repository, transfer from a data logger, data capture, etc.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data capture
data download
data reception
AL 15.3.22: Removed "data and metadata capture" as was too similar to this term. Added alternative term "data capture".
data acquisition
The process of acquiring data from some source. For example, data may be acquired by download from a repository, transfer from a data logger, data capture, etc.
RDMT. https://terms.codata.org/rdmt/data-acquisition
Planning for and controlling expenditures related to the data management lifecycle, including, but not limited to, labor and infrastructure expenses for data collection; data documentation; data storage; data access and security; data preservation; data sharing; and data disposition.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
data management costs management
Planning for and controlling expenditures related to the data management lifecycle, including, but not limited to, labor and infrastructure expenses for data collection; data documentation; data storage; data access and security; data preservation; data sharing; and data disposition.
Written by LLC, based on list at https://www.uu.nl/en/research/research-data-management/guides/costs-of-data-management
A Text file is a kind of computer file that is structured as a sequence of lines of electronic text. A text file exists stored as data within a computer file system.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 9.3.22: The CASRAI term (https://casrai.org/term/text-file) had an unsuitable definition, and therefore was removed.
text file
A Text file is a kind of computer file that is structured as a sequence of lines of electronic text. A text file exists stored as data within a computer file system.
AL, Adapted from Wikipedia. https://en.wikipedia.org/wiki/Text_file
Data and code that are commented so that humans can understand what they represent, their design, and their purpose.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
human-readable format
Data and code that are commented so that humans can understand what they represent, their design, and their purpose.
RDMT. https://terms.codata.org/rdmt/human-readable-format
Checks that data are valid, sensible, reasonable, clean, usable, and secure before they are processed. Provides well-defined guarantees for fitness, accuracy, and consistency for any of various kinds of user input into an application or automated system. Failures or omissions in data validation can lead to data corruption, security vulnerability. Improperly validated data can cause computer code processing the data to crash, generate error messages, behave in an unanticipated manner, or generate incorrect results that may be difficult or impossible to detect.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data validation
Checks that data are valid, sensible, reasonable, clean, usable, and secure before they are processed. Provides well-defined guarantees for fitness, accuracy, and consistency for any of various kinds of user input into an application or automated system. Failures or omissions in data validation can lead to data corruption, security vulnerability. Improperly validated data can cause computer code processing the data to crash, generate error messages, behave in an unanticipated manner, or generate incorrect results that may be difficult or impossible to detect.
RDMT. https://terms.codata.org/rdmt/data-validation
Type of record (and organisation) that stores an instance of an executable/understandable PID. The content of a PID record distinguishes a registered digital or data object from other digital objects. A PID record is a type of record that includes property information that characterises the digital object it is identifying. Important parts of a PID record are location and checksum. However there is a large variation in usage. In some data models the PID is simply used as a unique label with an empty record. A PID record has a lifecycle including creation, publication, curation and destruction.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
pid record
Type of record (and organisation) that stores an instance of an executable/understandable PID. The content of a PID record distinguishes a registered digital or data object from other digital objects. A PID record is a type of record that includes property information that characterises the digital object it is identifying. Important parts of a PID record are location and checksum. However there is a large variation in usage. In some data models the PID is simply used as a unique label with an empty record. A PID record has a lifecycle including creation, publication, curation and destruction.
RDMT. https://terms.codata.org/rdmt/pid-record
Take proactive approach to ensure outputs are made FAIR consistently with research integrity principles.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
appreciate importance of research integrity
A type of metadata that indicates how compound objects are put together.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
structural metadata
structural metadata (information)
A type of metadata that indicates how compound objects are put together.
RDMT. https://terms.codata.org/rdmt/structural-metadata-information
Data that could not lead to the identification of a specific individual, to distinguishing one person from another, or to personally identifiable information. These may be data that have been de-identified, or that could not lead to personally identifiable information in the first place.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
non personally identifiable information
Data that could not lead to the identification of a specific individual, to distinguishing one person from another, or to personally identifiable information. These may be data that have been de-identified, or that could not lead to personally identifiable information in the first place.
Definition from now-deprecated CASRAI Glossary term: non-personally-identifiable-information
Process of creating digital files by scanning or otherwise converting analogue materials. The resulting digital copy, or digital surrogate, would then be classed as digital material and then subject to the same broad challenges involved in preserving access to it, as born-digital materials.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digitisation
Process of creating digital files by scanning or otherwise converting analogue materials. The resulting digital copy, or digital surrogate, would then be classed as digital material and then subject to the same broad challenges involved in preserving access to it, as born-digital materials.
RDMT. https://terms.codata.org/rdmt/digitisation
Release of research data, associated metadata, accompanying documentation, and software code (in cases where the raw data have been processed or manipulated) for re-use and analysis in such a manner that they can be discovered on the Web and referred to in a unique and persistent way. Data publication occurs via dedicated data repositories and/or (data) journals which ensure that the published research objects are findable, accessible, interoperable and re-usable.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data publication
Release of research data, associated metadata, accompanying documentation, and software code (in cases where the raw data have been processed or manipulated) for re-use and analysis in such a manner that they can be discovered on the Web and referred to in a unique and persistent way. Data publication occurs via dedicated data repositories and/or (data) journals which ensure that the published research objects are findable, accessible, interoperable and re-usable.
RDMT. https://terms.codata.org/rdmt/data-publication
https://orcid.org/0000-0003-2687-1982
findability of digital assets
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 9.3.22: renamed to prevent compound term. Was 'Influencing/community building'
community building
Standalone dataset that can be cited in a similar manner to other research outputs. The dataset appears in a data repository, data paper or project website, and has a Persistent Identifier. Most current referencing systems provide a format for citing datasets.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
citable data
Standalone dataset that can be cited in a similar manner to other research outputs. The dataset appears in a data repository, data paper or project website, and has a Persistent Identifier. Most current referencing systems provide a format for citing datasets.
RDMT. https://terms.codata.org/rdmt/citable-data
Voluminous amount of structured, semi-structured and/or unstructured data that have the potential to be mined for information, primarily characterised by big volume, extensive variety, high velocity (creation and use), and/or variability that together require a scalable architecture for efficient data storage, manipulation, and analysis. The definition is evolving and can vary by sector, depending on what kind of software tools are commonly available and what sizes of datasets are common in a particular discipline. With those caveats, big data in many sectors today will range from a few dozen terabytes to multiple petabytes.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
big data
Voluminous amount of structured, semi-structured and/or unstructured data that have the potential to be mined for information, primarily characterised by big volume, extensive variety, high velocity (creation and use), and/or variability that together require a scalable architecture for efficient data storage, manipulation, and analysis. The definition is evolving and can vary by sector, depending on what kind of software tools are commonly available and what sizes of datasets are common in a particular discipline. With those caveats, big data in many sectors today will range from a few dozen terabytes to multiple petabytes.
RDMT. https://terms.codata.org/rdmt/big-data
Ability to apply knowledge about FAIR metrics and assessing FAIRness using FAIR assessment tools.
Kristina Hettne
Victoria Dominguez Del Angel
Yann Le Franc
fairness assessment
Service providing long-term, permanent care and accessibility for digital objects with research value. The standard for such repositories is the Open Archival Information System reference model.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data archive
Service providing long-term, permanent care and accessibility for digital objects with research value. The standard for such repositories is the Open Archival Information System reference model.
RDMT. https://terms.codata.org/rdmt/data-archive
Information describing the technical processes used to produce, or required to use a digital object.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
technical metadata
Information describing the technical processes used to produce, or required to use a digital object.
RDMT. https://terms.codata.org/rdmt/technical-metadata
In a form that can be used and understood by a computer.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 5.12.22: Deprecated because of its similarity with http://purl.obolibrary.org/obo/T4FS_0000551. We don't need to be going to the level of granularity that the concept of machine readability needs to be modelled as well as machine readable format.
machine readable
true
In a form that can be used and understood by a computer.
RDMT. https://terms.codata.org/rdmt/machine-readable
Combination of business processes, policies and technologies that allows organisations to provide secure access to confidential data. Integrated access management software is used by enterprises to control the flow of sensitive data in and out of a network.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
integrated access management
Combination of business processes, policies and technologies that allows organisations to provide secure access to confidential data. Integrated access management software is used by enterprises to control the flow of sensitive data in and out of a network.
RDMT. https://terms.codata.org/rdmt/integrated-access-management
Meaningless data, including any data that cannot be understood and interpreted correctly by machines such as unstructured text; any data that has been received, stored, or changed in such a manner that it cannot be read or used by the program that originally created it.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
noisy data
Meaningless data, including any data that cannot be understood and interpreted correctly by machines such as unstructured text; any data that has been received, stored, or changed in such a manner that it cannot be read or used by the program that originally created it.
RDMT. https://terms.codata.org/rdmt/noisy-data
Activities in and around the provision of training, mentorship and teaching opportunities for good data management and the implementation of FAIR practices.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
fair training
Activities in and around the provision of training, mentorship and teaching opportunities for good data management and the implementation of FAIR practices.
PMQ
A set of instructions that direct a computer to do a specific task.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
software
A set of instructions that direct a computer to do a specific task.
Definition from now-deprecated CASRAI Glossary term: software
Conversion of multiple datasets to a single common format and structure.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
https://github.com/terms4fairskills/FAIRterminology/issues/32
data standardization
Conversion of multiple datasets to a single common format and structure.
RDMT. https://terms.codata.org/rdmt/data-standardisation
Data that are accessible, machine-readable, usable, intelligible, and freely shared. Open data can be freely used, re-used, built on, and redistributed by anyone – subject only, at most, to the requirement to attribute and sharealike.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
open data
Data that are accessible, machine-readable, usable, intelligible, and freely shared. Open data can be freely used, re-used, built on, and redistributed by anyone – subject only, at most, to the requirement to attribute and sharealike.
RDMT. https://terms.codata.org/rdmt/open-data
The output of a data curation activity. Such data has generally already been cleaned, standardised, documented. Additional metadata relevant to the data object has also been added, via either manual or automatic methods.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
AL 9.5.22: The orginal label ("data curation") matched a term within the Data stewardship activity hierarchy. Changing the class label to "Curated data" better fit the model.
curated data
The output of a data curation activity. Such data has generally already been cleaned, standardised, documented. Additional metadata relevant to the data object has also been added, via either manual or automatic methods.
AL 9.5.22
Bin for Skills needed for Funding acquisition management.
leightonlc
skills related to funding acquisition management
128-bit number used to guarantee unique identity for objects on the internet over time.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
UUID, GUID
universally unique identifier
128-bit number used to guarantee unique identity for objects on the internet over time.
RDMT. https://terms.codata.org/rdmt/universally-unique-identifier
The process of confirming the identity of a principal entity.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
authentication
The process of confirming the identity of a principal entity.
Definition from now-deprecated CASRAI Glossary: authentication
Large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualised, dynamically-scalable, managed computing power, storage, platforms and services are delivered on demand to external customers over the Internet. Key features are that: it is a specialised distributed computing paradigm; it is massively scalable; it can be encapsulated as an abstract entity that delivers different levels of services to customers outside the Cloud; it is driven by economies of scale; and the services can be dynamically configured (via virtualisation or other approaches) and delivered on demand.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
cloud computing
Large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualised, dynamically-scalable, managed computing power, storage, platforms and services are delivered on demand to external customers over the Internet. Key features are that: it is a specialised distributed computing paradigm; it is massively scalable; it can be encapsulated as an abstract entity that delivers different levels of services to customers outside the Cloud; it is driven by economies of scale; and the services can be dynamically configured (via virtualisation or other approaches) and delivered on demand.
RDMT. https://terms.codata.org/rdmt/cloud-computing
Access to digital materials either for a defined period of time while use is predicted but which does not extend beyond the foreseeable future and/or until it becomes inaccessible because of changes in technology.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
short-term preservation
Access to digital materials either for a defined period of time while use is predicted but which does not extend beyond the foreseeable future and/or until it becomes inaccessible because of changes in technology.
RDMT. https://terms.codata.org/rdmt/short-term-preservation
Standard that is used in multiple nations and whose development process is open to representatives from all countries.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
international standard
Standard that is used in multiple nations and whose development process is open to representatives from all countries.
RDMT. https://terms.codata.org/rdmt/international-standard
Satisfaction of the requirements of a specific standard(s) and/or specification(s). Conformance is used with respect to voluntary standards and specifications, whereas compliance is used with respect to mandatory standards and regulations.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
conformance
Satisfaction of the requirements of a specific standard(s) and/or specification(s). Conformance is used with respect to voluntary standards and specifications, whereas compliance is used with respect to mandatory standards and regulations.
RDMT. https://terms.codata.org/rdmt/conformance
Individual or organisation responsible for the IT infrastructure providing and protecting data in conformance with the policies and practices prescribed by data governance.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
Data manager
Technical data steward
data custodian
Individual or organisation responsible for the IT infrastructure providing and protecting data in conformance with the policies and practices prescribed by data governance.
RDMT. https://terms.codata.org/rdmt/data-custodian
Bin for Aptitudes related to High performance computing management.
leightonlc
aptitudes for high performance computing management
Implement the policies that govern the arrangement, naming, descriptive metadata, provenance metadata, representation metadata, administrative metadata, access controls, retention, disposition, integrity, and replication of digital objects.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
manage datasets in a repository
Implement the policies that govern the arrangement, naming, descriptive metadata, provenance metadata, representation metadata, administrative metadata, access controls, retention, disposition, integrity, and replication of digital objects.
Definition from now-deprecated CASRAI Glossary term: manage-datasets-in-a-repository
Removing noise from data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data de-noising
Removing noise from data.
RDMT. https://terms.codata.org/rdmt/data-de-noising
Provides the relationship and process context for working together to ensure outcomes are achieved.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
governance and accountability model
Provides the relationship and process context for working together to ensure outcomes are achieved.
Definition from now-deprecated CASRAI Glossary term: governance-and-accountability-model
Continued availability and ongoing usability of a digital resource, retaining all qualities of authenticity, accuracy and functionality deemed to be essential for the purposes the digital material was created and/or acquired for. Users who have access can retrieve, understand, manipulate, and store copies.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
access
Continued availability and ongoing usability of a digital resource, retaining all qualities of authenticity, accuracy and functionality deemed to be essential for the purposes the digital material was created and/or acquired for. Users who have access can retrieve, understand, manipulate, and store copies.
RDMT. https://terms.codata.org/rdmt/access
Find the people with a good understand of FAIR and open research challenges in your organiation, and create a work agenda.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
develop fair and open research strategy
Choose the proper type of patent for the project data and apply for the patent chosen.
Celia van Gelder
Mateusz Kuzak
Yan Wang
patent application
Type of data provenance indication that adds metadata to identify data collections. The organisation doing the collection management is stated in the metadata along with the provenance of collection management events such as source of data acquisition, conservation, and movement.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
collection management identification
Type of data provenance indication that adds metadata to identify data collections. The organisation doing the collection management is stated in the metadata along with the provenance of collection management events such as source of data acquisition, conservation, and movement.
RDMT. https://terms.codata.org/rdmt/collection-management-identification
Activity/Process whereby digital objects are evaluated with the relevant FAIR metrics and assessment tools.
Kristina Hettne
Victoria Dominguez Del Angel
Yann Le Franc
assess fairness
Activity/Process whereby digital objects are evaluated with the relevant FAIR metrics and assessment tools.
[YLF, VDA, KH]
Bin for types of Knowledge related to Change management.
leightonlc
knowledge of change management
Process of obtaining, importing, and processing data for later use or storage in a database. This process often involves altering individual files by editing their content and/or formatting them to fit into a larger document. An effective data ingestion methodology begins by validating the individual files, then prioritises the sources for optimum processing, and finally validates the results. When numerous data sources exist in diverse formats (the sources may number in the hundreds and the formats in the dozens), maintaining reasonable speed and efficiency can become a major challenge. To that end, several vendors offer programs tailored to the task of data ingestion in specific applications or environments.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data ingestion
Process of obtaining, importing, and processing data for later use or storage in a database. This process often involves altering individual files by editing their content and/or formatting them to fit into a larger document. An effective data ingestion methodology begins by validating the individual files, then prioritises the sources for optimum processing, and finally validates the results. When numerous data sources exist in diverse formats (the sources may number in the hundreds and the formats in the dozens), maintaining reasonable speed and efficiency can become a major challenge. To that end, several vendors offer programs tailored to the task of data ingestion in specific applications or environments.
RDMT. https://terms.codata.org/rdmt/data-ingestion
Manually or semi-automatically converting or mapping data from one form into another format that allows for more convenient consumption of the data with the help of semi-automated tools. Gathering and organising disparate data from different sources, often collected by many different investigators. Activities include developing and supporting search tools that utilise standardised metadata, harmonising the coding of data for specific variables, engineering new methods of combining data. with the help of semi-automated tools. The result of data wrangling is repurposed data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data wrangling
Manually or semi-automatically converting or mapping data from one form into another format that allows for more convenient consumption of the data with the help of semi-automated tools. Gathering and organising disparate data from different sources, often collected by many different investigators. Activities include developing and supporting search tools that utilise standardised metadata, harmonising the coding of data for specific variables, engineering new methods of combining data. with the help of semi-automated tools. The result of data wrangling is repurposed data.
RDMT. https://terms.codata.org/rdmt/data-wrangling
Configuration of staff, services and tools assembled to support data management across the research lifecycle and to provide comprehensive coverage of the stages making up the data lifecycle. It can be organised locally and/or globally to support research data activities across the research lifecycle.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
research data management infrastructure
Configuration of staff, services and tools assembled to support data management across the research lifecycle and to provide comprehensive coverage of the stages making up the data lifecycle. It can be organised locally and/or globally to support research data activities across the research lifecycle.
RDMT. https://terms.codata.org/rdmt/research-data%20management-infrastructure
Appreciate the need for flexibility in applying FAIR criteria to make data as open as possible, as closed as necessary.
Angus Whyte
flexibility in relating fair criteria to openness
To be able to research and identify the appropriate taxonomy for your project. This may involve using resources such as FAIRsharing, which captures metadata on taxonomies and provides an assessment of their FAIRness.
https://orcid.org/0000-0003-2687-1982
Controlled Vocabulary identification
Ontology identification
Taxonomy identification
Thesaurus identification
choosing the appropriate terminology for your data
To be able to research and identify the appropriate taxonomy for your project. This may involve using resources such as FAIRsharing, which captures metadata on taxonomies and provides an assessment of their FAIRness.
[PMQ]
Data elements that have been organised into a consistent format and data structure within a defined data model such that the elements can be easily addressed, organised and accessed in various combinations to make better use of the information, such as in a relational database.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Structured information
structured data
Data elements that have been organised into a consistent format and data structure within a defined data model such that the elements can be easily addressed, organised and accessed in various combinations to make better use of the information, such as in a relational database.
RDMT. https://terms.codata.org/rdmt/structured-data
Know how data handling practices in the domain make FAIR criteria more or less easy to implement.
Angus Whyte
knowledge to contextualise fair principles to domain
Storage, access and preservation of data created or collected in the course of research. Research data management practices cover the entire lifecycle of the data, from planning the investigation to conducting it, and from backing up data as it is created and used to long term preservation of data deliverables after the research investigation has concluded. Specific activities and issues that fall within the category of data management include: File naming (the proper way to name computer files); data quality control and quality assurance; data access; data documentation (including levels of uncertainty); metadata creation and controlled vocabularies; data storage; data archiving and preservation; data sharing and reuse; data integrity; data security; data privacy; data rights; notebook protocols (lab or field) as required.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
research data management
Storage, access and preservation of data created or collected in the course of research. Research data management practices cover the entire lifecycle of the data, from planning the investigation to conducting it, and from backing up data as it is created and used to long term preservation of data deliverables after the research investigation has concluded. Specific activities and issues that fall within the category of data management include: File naming (the proper way to name computer files); data quality control and quality assurance; data access; data documentation (including levels of uncertainty); metadata creation and controlled vocabularies; data storage; data archiving and preservation; data sharing and reuse; data integrity; data security; data privacy; data rights; notebook protocols (lab or field) as required.
RDMT. https://terms.codata.org/rdmt/research-data-management
Deposit research output (software, data and publication together with documentation) in previousy selected repositories.
Celia van Gelder
Mateusz Kuzak
Yan Wang
publish output in a repository
To be able to identify and process metadata over the internet, sourced from a variety of metadata providers and schemata.
https://orcid.org/0000-0003-2687-1982
Metadata processing
Metadata search
metadata search via metadata providers
To be able to identify and process metadata over the internet, sourced from a variety of metadata providers and schemata.
[PMQ]
Information for a data object that includes: the person who deposited the data object in the repository, the source of the data object, the date when the object was deposited, and authenticity information needed to link the data object to its original source.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
record provenance information
Information for a data object that includes: the person who deposited the data object in the repository, the source of the data object, the date when the object was deposited, and authenticity information needed to link the data object to its original source.
RDMT. https://terms.codata.org/rdmt/record-provenance-information
Process of citing a dataset in a similar manner to other research outputs. The dataset must be a standalone output that appears in a data repository, data paper or project website, and has a Persistent Identifier. Most current referencing systems provide a format for citing datasets.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data citation
Process of citing a dataset in a similar manner to other research outputs. The dataset must be a standalone output that appears in a data repository, data paper or project website, and has a Persistent Identifier. Most current referencing systems provide a format for citing datasets.
RDMT. https://terms.codata.org/rdmt/data-citation
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-10-01T20:54:08.535183Z
book
The capacity to influence stakeholders and the direction of research activities; the ability to shape others' understanding in ways that capture interest, inform and gain support; and, the capacity to influence the actions and opinions of others.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
intellectual leadership
The capacity to influence stakeholders and the direction of research activities; the ability to shape others' understanding in ways that capture interest, inform and gain support; and, the capacity to influence the actions and opinions of others.
Definition from now-deprecated CASRAI Glossary term: intellectual-leadership/
In the context of a researcher's activities, innovation is the development of modified or novel approaches, theories, concepts, ideas or solutions. Innovation is one of four valued outcomes.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
innovation
In the context of a researcher's activities, innovation is the development of modified or novel approaches, theories, concepts, ideas or solutions. Innovation is one of four valued outcomes.
Definition from now-deprecated CASRAI Glossary term: innovation
High-performance computing (HPC) is a process for using of super computers and parallel processing techniques for solving complex computational problems. HPC technology focuses on developing parallel processing algorithms and systems by incorporating both administration and parallel computational techniques.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
HPC management
high performance computing management
High-performance computing (HPC) is a process for using of super computers and parallel processing techniques for solving complex computational problems. HPC technology focuses on developing parallel processing algorithms and systems by incorporating both administration and parallel computational techniques.
(modified from wikipedia)[VDA]
A type of historical information or metadata about the origin, location or the source of something, or the history of the ownership or location of an object or resource including digital objects. For example, information about the Principal Investigator who recorded the data, and the information concerning its storage, handling, and migration.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
provenance
A type of historical information or metadata about the origin, location or the source of something, or the history of the ownership or location of an object or resource including digital objects. For example, information about the Principal Investigator who recorded the data, and the information concerning its storage, handling, and migration.
RDMT. https://terms.codata.org/rdmt/provenance
A record created digitally in the day-to-day business of the organisation and assigned formal status by the organisation. Examples include: word processing documents, emails, databases, or intranet web pages.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Electronic record
AL 8.2.22: Updated label from the CASRAI "Digital" to "Digital record" to better match the definition and intent of the term, and to provide clarity.
digital record
A record created digitally in the day-to-day business of the organisation and assigned formal status by the organisation. Examples include: word processing documents, emails, databases, or intranet web pages.
Definition from now-deprecated CASRAI Glossary term: digital
Processes and procedures designed to ensure that the results of laboratory analysis are consistent, comparable, accurate and within specified limits of precision.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
analytical quality control
Processes and procedures designed to ensure that the results of laboratory analysis are consistent, comparable, accurate and within specified limits of precision.
Definition from now-deprecated CASRAI Glossary term: analytical-quality-control
Research data in digital form. It may have been originally created in digital form, or it may have been converted from paper, or other non-digital form to a digital representation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digital research data
Research data in digital form. It may have been originally created in digital form, or it may have been converted from paper, or other non-digital form to a digital representation.
RDMT. https://terms.codata.org/rdmt/digital-research-data
In the context of a researcher’s activities, Managing research is the processes related to the planning, organizing, setting objectives, controlling and evaluating of RDA activities and their associated human and financial resources. It includes the provision of leadership to, and assessment of, other scientists, engineers, technologists, and/or other staff. Managing research is one of the three research contexts in which a researcher is expected to conduct his/her activities. Managing research is distinct from the position-based role of a research manager.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
managing research
In the context of a researcher’s activities, Managing research is the processes related to the planning, organizing, setting objectives, controlling and evaluating of RDA activities and their associated human and financial resources. It includes the provision of leadership to, and assessment of, other scientists, engineers, technologists, and/or other staff. Managing research is one of the three research contexts in which a researcher is expected to conduct his/her activities. Managing research is distinct from the position-based role of a research manager.
Definition from now-deprecated CASRAI Glossary term: managing-research
To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate publishing platform for a dataset. This may, for example, involve discovering the mark-up of the platform, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRnessbased on human-entered questionnaires or FAIRsharing resource metadata.
https://orcid.org/0000-0003-2687-1982
evaluation of how to publish your data
To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate publishing platform for a dataset. This may, for example, involve discovering the mark-up of the platform, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRnessbased on human-entered questionnaires or FAIRsharing resource metadata.
[PMQ]
Numbers used by the National Center for Biotechnology Information (NCBI) that are unique and citable.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
accession number
Numbers used by the National Center for Biotechnology Information (NCBI) that are unique and citable.
Definition from now-deprecated CASRAI Glossary term: accession-number
Series of managed activities necessary to ensure continued access to digital materials for as long as necessary. All of the actions required to maintain access to digital materials beyond the limits of media failure or technological change. Those materials may be records created during the day-to-day business of an organisation; born-digital materials created for a specific purpose (such as teaching resources); or the products of digitisation projects. This definition specifically excludes the potential use of digital technology to preserve the original artefacts through digitisation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digital preservation
Series of managed activities necessary to ensure continued access to digital materials for as long as necessary. All of the actions required to maintain access to digital materials beyond the limits of media failure or technological change. Those materials may be records created during the day-to-day business of an organisation; born-digital materials created for a specific purpose (such as teaching resources); or the products of digitisation projects. This definition specifically excludes the potential use of digital technology to preserve the original artefacts through digitisation.
RDMT. https://terms.codata.org/rdmt/digital-preservation Also drawn from https://www.dpconline.org/digipres/what-is-digipres
The practice of initiating, planning, executing, controlling, and closing the work of a team in relation to FAIR data stewardship.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
project management activities
The practice of initiating, planning, executing, controlling, and closing the work of a team in relation to FAIR data stewardship.
PMQ
Bin for types of Knowledge related to Funding acquisition management.
leightonlc
knowledge of funding acquisition management
Monitor the authorised parties' access to data and keep it up to date.
Celia van Gelder
Mateusz Kuzak
Yan Wang
manage access control
The practice of initiating, planning, executing, controlling, and closing the work of a team to achieve specific goals and meet specific success criteria at the specified time.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
PM
project management
The practice of initiating, planning, executing, controlling, and closing the work of a team to achieve specific goals and meet specific success criteria at the specified time.
KH
Established protocol of an organisation for retaining information for operational or regulatory compliance needs. The objectives of a data retention policy are to keep important information for future use or reference, to organise information so it can be searched and accessed at a later date, and to dispose of information that is no longer needed. A data retention policy must consider both the value of data over time, and regulations to which the data may be subject.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data retention policy
Established protocol of an organisation for retaining information for operational or regulatory compliance needs. The objectives of a data retention policy are to keep important information for future use or reference, to organise information so it can be searched and accessed at a later date, and to dispose of information that is no longer needed. A data retention policy must consider both the value of data over time, and regulations to which the data may be subject.
RDMT. https://terms.codata.org/rdmt/data-retention-policy
Object describing the context of the data, including provenance, description, structural, and administrative information.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data representation
Object describing the context of the data, including provenance, description, structural, and administrative information.
RDMT. https://terms.codata.org/rdmt/data-representation
The set of metadata describing a specific dataset.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
metadata dataset
The set of metadata describing a specific dataset.
Definition from now-deprecated CASRAI Glossary term: metadata-dataset
Continued access to digital materials, or at least to the information contained in them, indefinitely.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
long-term preservation
Continued access to digital materials, or at least to the information contained in them, indefinitely.
RDMT. https://terms.codata.org/rdmt/long-term-preservation
Research metadata format is a generic term encompassing the concept of a standardised format for research metadata.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 9.3.2022: The original CASRAI definition is unsuitable (https://casrai.org/term/research-metadata-format), therefore I have changed the definition to something more generic
research metadata format
Research metadata format is a generic term encompassing the concept of a standardised format for research metadata.
AL, 9.3.2022
Physical or geographic location of an organisation’s data or information. Data residency also refers to the legal or regulatory requirements imposed on data based on the country or region in which it resides. Cloud computing, which allows organisations to deliver hosted services over the Internet, can create data residency concerns.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data residency
Physical or geographic location of an organisation’s data or information. Data residency also refers to the legal or regulatory requirements imposed on data based on the country or region in which it resides. Cloud computing, which allows organisations to deliver hosted services over the Internet, can create data residency concerns.
RDMT. https://terms.codata.org/rdmt/data-residency
Information governance, is the overall strategy for information at an organization. Information governance balances the risk that information presents with the value that information provides. Information governance helps with legal compliance, operational transparency, and reducing expenditures associated with legal discovery.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
CASRAI is the source for the term, with the definition provided separately.
information governance
Information governance, is the overall strategy for information at an organization. Information governance balances the risk that information presents with the value that information provides. Information governance helps with legal compliance, operational transparency, and reducing expenditures associated with legal discovery.
https://en.wikipedia.org/wiki/Information_governance, accessed 8.2.22
Unit of data for which the definition, identification, representation (term used to represent it), and permissible values are specified by means of a set of attributes.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data element
Unit of data for which the definition, identification, representation (term used to represent it), and permissible values are specified by means of a set of attributes.
RDMT. https://terms.codata.org/rdmt/data-element
Set of high-level principles that establish a guiding framework for data-related issues. A data policy may contain subsidiary sections that define approaches to strategic aspects such as data access; data protection; data management, custodianship or stewardship; data preservation; relevant legal matters; data acquisition; data ethics and other data-related issues. Distinct from a data management policy which is the subset of data policy addressing the management of a specified set of data assets.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data policy
Set of high-level principles that establish a guiding framework for data-related issues. A data policy may contain subsidiary sections that define approaches to strategic aspects such as data access; data protection; data management, custodianship or stewardship; data preservation; relevant legal matters; data acquisition; data ethics and other data-related issues. Distinct from a data management policy which is the subset of data policy addressing the management of a specified set of data assets.
RDMT. https://terms.codata.org/rdmt/data-policy
Older data that can no longer be accessed or processed easily because they are stored in obsolete formats or systems.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
legacy data
Older data that can no longer be accessed or processed easily because they are stored in obsolete formats or systems.
RDMT. https://terms.codata.org/rdmt/legacy-data
List of standardised terminology, words, or phrases, used for indexing or content analysis and information retrieval, usually in a defined information domain.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
controlled vocabulary
List of standardised terminology, words, or phrases, used for indexing or content analysis and information retrieval, usually in a defined information domain.
RDMT. https://terms.codata.org/rdmt/controlled-vocabulary
Assurance that information can only be accessed or modified by those authorised to do so.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data integrity
data integrity (access)
Assurance that information can only be accessed or modified by those authorised to do so.
RDMT. https://terms.codata.org/rdmt/data-integrity-access
Single data element related to a PID and part of its record content.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
pid attribute
Single data element related to a PID and part of its record content.
RDMT. https://terms.codata.org/rdmt/pid-attribute
Bin for Aptitudes related to Change management.
leightonlc
aptitudes for change management
Continued access to digital materials beyond changes in technology for a defined period of time but not indefinitely.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
medium-term preservation
Continued access to digital materials beyond changes in technology for a defined period of time but not indefinitely.
RDMT. https://terms.codata.org/rdmt/medium-term-preservation
Describes the processes and tasks that must be completed to produce a product or service. Different project lifecycles exist for specific products and services. (For example, the lifecycle followed to build a house is very different from the lifecycle followed to develop a software package.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
project lifecycle
Describes the processes and tasks that must be completed to produce a product or service. Different project lifecycles exist for specific products and services. (For example, the lifecycle followed to build a house is very different from the lifecycle followed to develop a software package.
Definition from now-deprecated CASRAI Glossary term: project-lifecycle
Know how the acceptability of research data FAIRness depends on the research community norms e.g. concepts of data and methods for deriving valid knowledge.
Angus Whyte
knowledge to relate fair data assessment to research community norms
Data traceability follows the lifecycle of data to track all access and changes to the data. It helps demonstrate transparency, compliance and adherence to regulations. Data traceability, along with data compliance, can be considered part of a data audit process. Data traceability is fundamental to reproducible research.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data traceability
Data traceability follows the lifecycle of data to track all access and changes to the data. It helps demonstrate transparency, compliance and adherence to regulations. Data traceability, along with data compliance, can be considered part of a data audit process. Data traceability is fundamental to reproducible research.
RDMT. https://terms.codata.org/rdmt/data-traceability
https://orcid.org/0000-0003-2687-1982
2020-10-02T14:23:22.115275Z
maintaining persistent identifiers
Choose the proper license considering types of project data and clearly indicate (apply) the license chosen in the project work.
Celia van Gelder
Mateusz Kuzak
Yan Wang
select and apply license
Indicates how the different components within a system are linked to fulfill the tasks. Relations are thus defined by the services they are making use of and by the interface specifications.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
relations
Indicates how the different components within a system are linked to fulfill the tasks. Relations are thus defined by the services they are making use of and by the interface specifications.
Definition from now-deprecated CASRAI Glossary term: relations
Organised collection of data or objects in a computational format, that are generated or collected by researchers in the course of their investigations, regardless of their form or method, that form the object on which researchers test a hypothesis. This includes the full range of data: raw, unprocessed datasets, proprietary generated and processed data and secondary data obtained from third parties. The presentation of the data in the application is enabled through metadata.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
dataset
Organised collection of data or objects in a computational format, that are generated or collected by researchers in the course of their investigations, regardless of their form or method, that form the object on which researchers test a hypothesis. This includes the full range of data: raw, unprocessed datasets, proprietary generated and processed data and secondary data obtained from third parties. The presentation of the data in the application is enabled through metadata.
RDMT. https://terms.codata.org/rdmt/dataset
Bin for Aptitudes related to Service level management.
leightonlc
aptitudes for service level management
Text format derived from Standard Generalized Markup Language or ‘SGML’ (ISO 8879). Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
XML
extensible markup language
Text format derived from Standard Generalized Markup Language or ‘SGML’ (ISO 8879). Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere.
RDMT. https://terms.codata.org/rdmt/extensible-markup-language
Bin for Skills needed for Change management.
leightonlc
skills related to change management
Techniques that produce synthesised knowledge from organised information. Process of inspecting, cleaning, transforming, and modelling data with the goal of highlighting useful information suggesting conclusions, and supporting decision making. Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names, in different business, science, and social science domains.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data analysis
Techniques that produce synthesised knowledge from organised information. Process of inspecting, cleaning, transforming, and modelling data with the goal of highlighting useful information suggesting conclusions, and supporting decision making. Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names, in different business, science, and social science domains.
RDMT. https://terms.codata.org/rdmt/data-analysis
Resource management is the efficient and effective development of an organization's resources when they are needed. Such resources may include the financial resources, inventory, human skills, production resources, or information technology (IT) and natural resources.
Kristina Hettne
Leighton Christiansen
Simon Hodson
Victoria Dominguez Del Angel
resource management
Resource management is the efficient and effective development of an organization's resources when they are needed. Such resources may include the financial resources, inventory, human skills, production resources, or information technology (IT) and natural resources.
[LLC, https://en.wikipedia.org/wiki/Resource_management]
Summarising the main characteristics of a dataset using visualisation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data exploration
Summarising the main characteristics of a dataset using visualisation.
RDMT. https://terms.codata.org/rdmt/data-exploration
Bin for Aptitudes related to Storage management.
leightonlc
aptitudes for storage management
Management and provision of good data stewardship practice.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Advise and enable
Data Stewardship and FAIR mentorship activity
data management and open research
AL 9.3.22. Removed from 'FAIR4S defined activity' class to better integrate with the T4FS hierarchy.
advise and mentor
Management and provision of good data stewardship practice.
PMQ
Bin for Skills needed for Workflow technologies management.
leightonlc
2019-10-17T15:15:55.86138Z
skills related to workflow technologies management
Those layers that sit between base technology (a computer science concern) and discipline-specific science. Value-added systems and services that can be widely shared across scientific domains, both supporting and enabling large increases in multi- and interdisciplinary science while reducing duplication of effort and resources (including hardware, software, personnel, services and organisations).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digital infrastructure
Those layers that sit between base technology (a computer science concern) and discipline-specific science. Value-added systems and services that can be widely shared across scientific domains, both supporting and enabling large increases in multi- and interdisciplinary science while reducing duplication of effort and resources (including hardware, software, personnel, services and organisations).
RDMT. https://terms.codata.org/rdmt/digital-infrastructure
Bin for Skills needed for Data management cost management.
leightonlc
skills related to data management cost management
Consists of at least one PID resolver, a name schema and a defined mechanism for issuing PIDs that conform to the name schema.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
pid system
Consists of at least one PID resolver, a name schema and a defined mechanism for issuing PIDs that conform to the name schema.
RDMT. https://terms.codata.org/rdmt/pid-system
https://orcid.org/0000-0003-2687-1982
I2. (meta)data use vocabularies that follow fair principles
Research approach that explains aspects of one discipline in terms of another (e.g., the physics of music; the politics of literature).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
cross-disciplinary
Research approach that explains aspects of one discipline in terms of another (e.g., the physics of music; the politics of literature).
RDMT. https://terms.codata.org/rdmt/cross-disciplinary
Data that are tagged with particular metadata that can be used to derive relationships between data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
semantic data
Data that are tagged with particular metadata that can be used to derive relationships between data.
RDMT. https://terms.codata.org/rdmt/semantic-data
For a single identifier, the class of entity it refers to. For a PID system, the typical class of entities it is intended to be used for, such as digital objects, physical objects, bodies, actors.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
pid domain
For a single identifier, the class of entity it refers to. For a PID system, the typical class of entities it is intended to be used for, such as digital objects, physical objects, bodies, actors.
RDMT. https://terms.codata.org/rdmt/pid-domain
Select data handling approaches likely to make data as FAIR as possible, considering what the data is about and its purpose in the research.
Angus Whyte
knowledge to choose fair data handling approaches appropriate to the research phenomena
The activity of recording provenance for data and software. Knowledge about provenance vocabularies, for example PROV-O and models provenance such as nanopublications.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
provenance information management
The activity of recording provenance for data and software. Knowledge about provenance vocabularies, for example PROV-O and models provenance such as nanopublications.
KH
Results that can be replicated using the documented data, code, and methods employed by the author or provider without the need for any additional information or needing to communicate with the author or provider.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
reproducible research
Results that can be replicated using the documented data, code, and methods employed by the author or provider without the need for any additional information or needing to communicate with the author or provider.
RDMT. https://terms.codata.org/rdmt/reproducible-research
https://orcid.org/0000-0003-2687-1982
A1.2 the protocol allows for an authentication and authorisation procedure, where necessary
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data stewardship technical concept
Data collection that has been normalised by some established criteria to allow for effective data management. Examples include: data files that belong to a certain experiment, all files that are created by one specific simulation, all files that belong to a specific observation (same day, same place, etc.).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
canonical data collection
Data collection that has been normalised by some established criteria to allow for effective data management. Examples include: data files that belong to a certain experiment, all files that are created by one specific simulation, all files that belong to a specific observation (same day, same place, etc.).
RDMT. https://terms.codata.org/rdmt/canonical-data-collection
Ability to clean up existing source code and version control systems, get Unique ID for the code, choose the appropriate code repository/registry [YLF, VDA, KH]
Kristina Hettne
Victoria Dominguez Del Angel
Yann Le Franc
https://orcid.org/0000-0002-7702-4495
archival documentation of software
Data harmonization is the process of making data from different sources comparable.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://github.com/terms4fairskills/FAIRterminology/issues/17
data harmonization
Data harmonization is the process of making data from different sources comparable.
Modified from CASRAI. https://casrai.org/term/data-harmonization/
Repository of data designed to serve a particular community of knowledge workers. A data mart contains harmonised, highly-structured, quality data integrated from multiple sources (which is also a characteristic of the data warehouse). It's also optimised to support well-known, predefined and repeatable analytic queries, rather than ad-hoc analysis.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data mart
Repository of data designed to serve a particular community of knowledge workers. A data mart contains harmonised, highly-structured, quality data integrated from multiple sources (which is also a characteristic of the data warehouse). It's also optimised to support well-known, predefined and repeatable analytic queries, rather than ad-hoc analysis.
RDMT. https://terms.codata.org/rdmt/data-mart
Type of access entity that contains the services and functions which make the data object holdings and their information content and related services visible to data consumers.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
access workflow
Type of access entity that contains the services and functions which make the data object holdings and their information content and related services visible to data consumers.
RDMT. https://terms.codata.org/rdmt/access-workflow
Bin for types of Knowledge related to Service level management.
leightonlc
2019-10-17T15:18:53.73274Z
knowledge of service level management
Model that specifies the structure or schema of a dataset. The model provides a documented description of the data and thus is an instance of metadata. It is a logical, relational data model showing an organised dataset as a collection of tables with entity, attributes and relations.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data model
Model that specifies the structure or schema of a dataset. The model provides a documented description of the data and thus is an instance of metadata. It is a logical, relational data model showing an organised dataset as a collection of tables with entity, attributes and relations.
RDMT. https://terms.codata.org/rdmt/data-model
Be aware of the potential conflicts between security and usability. Understand the importance of having information security policy and keeping a balance between the information security and usability. Willing to establish both organization and technical information security measurements.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
appreciate security and usability trade-offs
Set of agreed-upon and documented guidelines, specifications, accepted practices, technical requirements, or terminologies that have been prepared by a standards developing organisation or group, and published in accordance with established procedures. These can be mandatory or voluntary and are distinct from Acts, regulations, and codes, although standards can be referenced in those legal instruments.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
standard
Set of agreed-upon and documented guidelines, specifications, accepted practices, technical requirements, or terminologies that have been prepared by a standards developing organisation or group, and published in accordance with established procedures. These can be mandatory or voluntary and are distinct from Acts, regulations, and codes, although standards can be referenced in those legal instruments.
RDMT. https://terms.codata.org/rdmt/standard
The application of a comprehensive knowledge of a discipline or disciplines to the development of expertise and the generation of new knowledge through research, and the planning and presentation of courses of study for undergraduates and graduates in universities.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
university teaching
The application of a comprehensive knowledge of a discipline or disciplines to the development of expertise and the generation of new knowledge through research, and the planning and presentation of courses of study for undergraduates and graduates in universities.
Definition from now-deprecated CASRAI Glossary term: university-teaching
Transfer of data between storage types, formats, information technologies, or computer systems. A data migration project is usually undertaken to replace or upgrade servers or storage equipment, for a website consolidation, to conduct server maintenance or to relocate a data centre.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data migration
Transfer of data between storage types, formats, information technologies, or computer systems. A data migration project is usually undertaken to replace or upgrade servers or storage equipment, for a website consolidation, to conduct server maintenance or to relocate a data centre.
RDMT. https://terms.codata.org/rdmt/data-migration
Process of detecting and correcting corrupt or inaccurate records from a dataset. Data cleaning is a continuous process that requires corrective actions throughout the data lifecycle. Data cleaning involves identifying, replacing, modifying or deleting incomplete, incorrect, inaccurate, inconsistent, irrelevant, and improperly formatted data. Typically, the process involves updating, correcting, standardising, and de-duplicating records to create a single view of the data, even if they are stored in multiple disparate systems.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data cleaning
Process of detecting and correcting corrupt or inaccurate records from a dataset. Data cleaning is a continuous process that requires corrective actions throughout the data lifecycle. Data cleaning involves identifying, replacing, modifying or deleting incomplete, incorrect, inaccurate, inconsistent, irrelevant, and improperly formatted data. Typically, the process involves updating, correcting, standardising, and de-duplicating records to create a single view of the data, even if they are stored in multiple disparate systems.
RDMT. https://terms.codata.org/rdmt/data-cleaning
https://orcid.org/0000-0003-2687-1982
R1.2 (meta)data are associated with detailed provenance
Bin for Skills needed for Identity management.
leightonlc
skills related to identity management
The use of persistent identifiers or PIDs to uniquely and persistently identify an entity.
Nancy Hoebelheinrich
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-10-02T14:22:51.569762Z
using persistent identifiers
The use of persistent identifiers or PIDs to uniquely and persistently identify an entity.
[PMQ 3.2022]
https://orcid.org/0000-0002-7702-4495
database curator
AL 15.3.22: Updated label to 'data curator' to encompass curation beyond just in databases.
data curator
Learn about innovation partners' interests and concerns, analyze them and position the project work/outcome properly. Reach out to innovation partners with proactive and pragmatic innovation plan which is in compliance with FAIR principles.
Celia van Gelder
Mateusz Kuzak
Yan Wang
engage innovation partners
https://orcid.org/0000-0003-2687-1982
R1.3 (meta)data meet domain-relevant community standards
Process of reducing the amount or size of stored data. This may be achieved by eliminating redundant copies of data files, deduplicating data files by removing redundant records, or by compressing the data files.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data reduction
Process of reducing the amount or size of stored data. This may be achieved by eliminating redundant copies of data files, deduplicating data files by removing redundant records, or by compressing the data files.
RDMT. https://terms.codata.org/rdmt/data-reduction
https://orcid.org/0000-0003-2687-1982
F4. (meta)data are registered or indexed in a searchable resource
Registry for data types supporting their standardisation, uniqueness and discoverability. Data types range from complex digital objects to simple categories that occur in digital objects. An additional functionality may be to link data types to executable data processing functions.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data type registry
Registry for data types supporting their standardisation, uniqueness and discoverability. Data types range from complex digital objects to simple categories that occur in digital objects. An additional functionality may be to link data types to executable data processing functions.
RDMT. https://terms.codata.org/rdmt/data-type-registry
Collection of data that is organised according to a conceptual structure/model describing the characteristics of these data and the relationships among their corresponding entities, supporting one or more application areas. A database allows its contents to be easily accessed, managed and updated. The type of database used depends on the requirements of the study. A common type is the relational database, where data are related to each other in a systematic manner so that they can be reorganised and accessed in a number of different ways. A database may house one or many datasets.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
database
Collection of data that is organised according to a conceptual structure/model describing the characteristics of these data and the relationships among their corresponding entities, supporting one or more application areas. A database allows its contents to be easily accessed, managed and updated. The type of database used depends on the requirements of the study. A common type is the relational database, where data are related to each other in a systematic manner so that they can be reorganised and accessed in a number of different ways. A database may house one or many datasets.
RDMT. https://terms.codata.org/rdmt/database
Recognize the added value of FAIR and open research, encourage researchers to practise FAIR and open research.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
appreciation of fair and open research
Demonstrate willingness to engage with new ways of applying FAIR principles.
Angus Whyte
ability to learn new techniques for fair implementation
Bin for Aptitudes related to Funding acquisition management.
leightonlc
aptitudes for funding acquisition management
To be able to choose the correct data model and/or exchange format for your data, based on the repository where you plan to deposit your data.
https://orcid.org/0000-0003-2687-1982
Which data model to use
Which format to use
choosing the appropriate model or format for your data
To be able to choose the correct data model and/or exchange format for your data, based on the repository where you plan to deposit your data.
[PMQ]
Information concerning the creation, attribution, or version history of managed data. Provenance metadata indicates the relationship between two versions of data objects and is generated whenever a new version of a dataset is created. Provenance information is gathered along the data lifecycle as part of curation processes. A finer level of provenance metadata would be concerned only with data flowing between various stores such as curated databases and managed repositories. Provenance metadata is designed to allow queries over the relationship between versions, and includes either or both fine-grained and coarse-grained provenance data. Different applications may store different provenance data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
provenance metadata
Information concerning the creation, attribution, or version history of managed data. Provenance metadata indicates the relationship between two versions of data objects and is generated whenever a new version of a dataset is created. Provenance information is gathered along the data lifecycle as part of curation processes. A finer level of provenance metadata would be concerned only with data flowing between various stores such as curated databases and managed repositories. Provenance metadata is designed to allow queries over the relationship between versions, and includes either or both fine-grained and coarse-grained provenance data. Different applications may store different provenance data.
RDMT. https://terms.codata.org/rdmt/provenance-metadata
State when data are available in a timely manner in the place and form as needed by the user.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data availability
State when data are available in a timely manner in the place and form as needed by the user.
RDMT. https://terms.codata.org/rdmt/data-availability
Activity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into “error-free” or “erroneous” data. Data review consists of both error detection and data analysis, and can be carried out in manual or automated mode.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data review
Activity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into “error-free” or “erroneous” data. Data review consists of both error detection and data analysis, and can be carried out in manual or automated mode.
RDMT. https://terms.codata.org/rdmt/data-review
HughShanahan
2019-10-18T08:56:32.673701Z
aptitudes associated with using fair tools and services
Documents actions that have been undertaken to preserve a digital resource such as migrations and checks sum calculations.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
preservation metadata
Documents actions that have been undertaken to preserve a digital resource such as migrations and checks sum calculations.
RDMT. https://terms.codata.org/rdmt/preservation-metadata
Change management is a broad subject and can be applied to all different types of organizational change. The most common change drivers include: technological evolution, process reviews, crisis, and consumer habit changes; pressure from new business entrants, acquisitions, mergers, and organizational restructuring.
Kristina Hettne
Leighton Christiansen
Simon Hodson
Victoria Dominguez Del Angel
CM
change management
Change management is a broad subject and can be applied to all different types of organizational change. The most common change drivers include: technological evolution, process reviews, crisis, and consumer habit changes; pressure from new business entrants, acquisitions, mergers, and organizational restructuring.
https://www.hucmi.com/en/hcmbok/
Type of repository where the original copy of data was stored and probably a data identifier registered.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
original repository
Type of repository where the original copy of data was stored and probably a data identifier registered.
RDMT. https://terms.codata.org/rdmt/original-repository
Techniques designed to make the risk of identifying a particular individual in a dataset negligible, whilst retaining the re-usability of the dataset. The purpose is to protect the privacy of the individual and comply with legislation, whilst enabling data sharing. Methods include removing direct and indirect identifiers such as names, addresses, social insurance numbers, or dates of birth, or using obfuscation methods such as encryption, hashing, generalisation, pseudonymisation, and perturbation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
de-identification
Techniques designed to make the risk of identifying a particular individual in a dataset negligible, whilst retaining the re-usability of the dataset. The purpose is to protect the privacy of the individual and comply with legislation, whilst enabling data sharing. Methods include removing direct and indirect identifiers such as names, addresses, social insurance numbers, or dates of birth, or using obfuscation methods such as encryption, hashing, generalisation, pseudonymisation, and perturbation.
RDMT. https://terms.codata.org/rdmt/de-identification
Program delivery managers and support function managers, at all levels in an institution who are accountable for the direct delivery and support of programs and services within their domain of business responsibility.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
manager
Program delivery managers and support function managers, at all levels in an institution who are accountable for the direct delivery and support of programs and services within their domain of business responsibility.
Definition from now-deprecated CASRAI Glossary term: manager
Organizational leadership is: (a) The ability to attract, assess, mobilize and focus energies and talent to work towards a shared purpose aligned with the mandate of the organization; (b) The ability to change culture, processes and priorities within the organization; and, (c) The ability to mentor.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
organizational leadership
Organizational leadership is: (a) The ability to attract, assess, mobilize and focus energies and talent to work towards a shared purpose aligned with the mandate of the organization; (b) The ability to change culture, processes and priorities within the organization; and, (c) The ability to mentor.
Definition from now-deprecated CASRAI Glossary term: organizational-leadership
Data mining practice in which large volumes of data are analysed seeking any possible relationships between data. The traditional scientific method, in contrast, begins with a hypothesis and follows with an examination of the data. Data dredging often circumvents traditional data mining techniques and may lead to premature conclusions. Uncovered patterns may be presented as statistically significant without any specific hypothesis as to the underlying causality.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
Data fishing
data dredging
Data mining practice in which large volumes of data are analysed seeking any possible relationships between data. The traditional scientific method, in contrast, begins with a hypothesis and follows with an examination of the data. Data dredging often circumvents traditional data mining techniques and may lead to premature conclusions. Uncovered patterns may be presented as statistically significant without any specific hypothesis as to the underlying causality.
RDMT. https://terms.codata.org/rdmt/data-dredging
Data that have not been organised into a format and identifiable data structure that makes them easy to access and process. These data can often be searched as long as they are digital, but they are difficult to use for computer analyses.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Unstructured information
unstructured data
Data that have not been organised into a format and identifiable data structure that makes them easy to access and process. These data can often be searched as long as they are digital, but they are difficult to use for computer analyses.
RDMT. https://terms.codata.org/rdmt/unstructured-data
https://orcid.org/0000-0003-2687-1982
2020-11-16T22:18:14.764875Z
presentation slides with interactive exercises
https://orcid.org/0000-0003-2687-1982
2020-10-02T14:23:05.600474Z
creating persistent identifiers
The process of bringing together from two or more different sources, data that relate to the same individual, family, place or event).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
Linkage
data linkage
The process of bringing together from two or more different sources, data that relate to the same individual, family, place or event).
RDMT. https://terms.codata.org/rdmt/data-linkage
A type of collection that describes, and points to features of another collection.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
catalogue
A type of collection that describes, and points to features of another collection.
RDMT. https://terms.codata.org/rdmt/catalogue
The consideration of available monetary resources for a specific time period to serve a specific purpose.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
funding acquisition management
The consideration of available monetary resources for a specific time period to serve a specific purpose.
KH
To be able to use programmatic methods to access a resource's API to query and extract an appropriate subset of data.
https://orcid.org/0000-0003-2687-1982
API access
Machine actionable search
machine access via api
To be able to use programmatic methods to access a resource's API to query and extract an appropriate subset of data.
PMQ
Continuum of data structure that includes unstructured data, semi-structured data, and structured data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data structure continuum
Continuum of data structure that includes unstructured data, semi-structured data, and structured data.
RDMT. https://terms.codata.org/rdmt/data-structure-continuum
Facts, measurements, recordings, records, or observations about the world, collected by researchers, that are yet to be processed/interpreted/analysed. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data
Facts, measurements, recordings, records, or observations about the world, collected by researchers, that are yet to be processed/interpreted/analysed. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records.
RDMT. https://terms.codata.org/rdmt/data
Data that have not been organised into a specialised repository, such as a database, but that nevertheless have associated information, such as metadata, that makes them more amenable to processing than raw data. Semi-structured data lie somewhere between structured and unstructured data. They are not organised in a complex manner that makes sophisticated access and analysis possible. However, they may have information associated with them, such as metadata tagging that allows elements contained to be addressed.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
semi-structured data
Data that have not been organised into a specialised repository, such as a database, but that nevertheless have associated information, such as metadata, that makes them more amenable to processing than raw data. Semi-structured data lie somewhere between structured and unstructured data. They are not organised in a complex manner that makes sophisticated access and analysis possible. However, they may have information associated with them, such as metadata tagging that allows elements contained to be addressed.
RDMT. https://terms.codata.org/rdmt/semi-structured-data
https://orcid.org/0000-0003-2687-1982
2020-10-01T20:53:37.24517Z
book chapter
A scientist who conducts activities in: (1) Research, development and analysis (RDA); (2) Managing research; and, (3) Representation and client services.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
research scientist
A scientist who conducts activities in: (1) Research, development and analysis (RDA); (2) Managing research; and, (3) Representation and client services.
Definition from now-deprecated CASRAI Glossary term: research-scientist
Type of digital Persistent Identifier (PID) issued by the International DOI Foundation. This permanent digital identifier is associated with an object that permits the object to be referenced reliably even if its location and metadata undergo change over time.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
DOI
digital object identifier
Type of digital Persistent Identifier (PID) issued by the International DOI Foundation. This permanent digital identifier is associated with an object that permits the object to be referenced reliably even if its location and metadata undergo change over time.
RDMT. https://terms.codata.org/rdmt/digital-object-identifier
https://orcid.org/0000-0003-2687-1982
F3. metadata clearly and explicitly include the identifier of the data they describe
Expert in one or more of the information management disciplines that support the effective and efficient management of information.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
information management specialist
Expert in one or more of the information management disciplines that support the effective and efficient management of information.
RDMT. https://terms.codata.org/rdmt/information-management-specialist
https://orcid.org/0000-0003-2687-1982
A2. metadata are accessible, even when the data are no longer available
Identify and engage in dialogue with stakeholders affected by the research, or by making its outputs FAIR.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
stakeholder engagement on societal impact
The Principal Investigator (P.I.) is a researcher who has a research leadership role and is the point of contact for a project or partnership that applies the scientific method, historical method, or other research methodology for the advancement of knowledge resulting in independent, objective, high quality, traceable, and reproducible results. The P.I. has primary responsibility for the intellectual direction and integrity of the research or research-related activity, including data production, findings and results, and ensures ethical conduct in all aspects of the research process including but not limited to the treatment of human and animal subjects, conflicts of interest, data acquisition, sharing and ownership, publication practices, responsible authorship, and collaborative research and reporting. While various tasks may be delegated to team members, some of whom may have greater expertise in specific areas, the P.I. is familiar with the various technical and scientific aspects of a project and how they fit together, is able to identify and remediate gaps, and ensure communication within the team and with users of the research data and results.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
principal investigator
The Principal Investigator (P.I.) is a researcher who has a research leadership role and is the point of contact for a project or partnership that applies the scientific method, historical method, or other research methodology for the advancement of knowledge resulting in independent, objective, high quality, traceable, and reproducible results. The P.I. has primary responsibility for the intellectual direction and integrity of the research or research-related activity, including data production, findings and results, and ensures ethical conduct in all aspects of the research process including but not limited to the treatment of human and animal subjects, conflicts of interest, data acquisition, sharing and ownership, publication practices, responsible authorship, and collaborative research and reporting. While various tasks may be delegated to team members, some of whom may have greater expertise in specific areas, the P.I. is familiar with the various technical and scientific aspects of a project and how they fit together, is able to identify and remediate gaps, and ensure communication within the team and with users of the research data and results.
Definition from now-deprecated CASRAI Glossary term: principal-investigator
The person responsible for creating the organizational environment culture by providing clear direction and circumstances that allow people to be successful. The program manager is judged on the elements time, cost, and scope, cumulatively for all the projects and operations within the program. Program management decisions are both tactical and strategic in nature. The strategy aspects of these decisions must consider multidimensional impacts beyond the near-term delivery dates of the project. In addition to delivery and execution, the program manager has to also be concerned with the overall health and effectiveness of the program over the long term.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
program manager
The person responsible for creating the organizational environment culture by providing clear direction and circumstances that allow people to be successful. The program manager is judged on the elements time, cost, and scope, cumulatively for all the projects and operations within the program. Program management decisions are both tactical and strategic in nature. The strategy aspects of these decisions must consider multidimensional impacts beyond the near-term delivery dates of the project. In addition to delivery and execution, the program manager has to also be concerned with the overall health and effectiveness of the program over the long term.
Definition from now-deprecated CASRAI Glossary term: program-manager
URL that points to an intermediate resolution service. The PURL resolution service associates the PURL with the actual URL and returns that URL to the client.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
PURL
persistent uniform resource locator
URL that points to an intermediate resolution service. The PURL resolution service associates the PURL with the actual URL and returns that URL to the client.
RDMT. https://terms.codata.org/rdmt/persistent-uniform-resource-locator
Widely used metadata element set, formally titled ISO 15836-1:2017, Information and documentation — The Dublin Core metadata element set — Part 1: Core elements.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
dublin core
Widely used metadata element set, formally titled ISO 15836-1:2017, Information and documentation — The Dublin Core metadata element set — Part 1: Core elements.
RDMT. https://terms.codata.org/rdmt/dublin-core
Yann Le Franc
ylefranc
2021-03-17T16:34:14.943152Z
data stewardship guideline
The FAIR principles are guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data. The principles refer to three types of entities: data (or any digital object), metadata (information about that digital object), and infrastructure.
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-10-02T14:56:15.031154Z
fair principle
The FAIR principles are guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data. The principles refer to three types of entities: data (or any digital object), metadata (information about that digital object), and infrastructure.
https://www.go-fair.org/fair-principles/
Bin for Aptitudes related to Cloud computing environment management.
leightonlc
aptitudes for cloud computing environment management
New datasets obtained by combining data appropriately from a variety of existing files, generating new data products that did not previously exist. Repurposed data result from data wrangling.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
repurposed data
New datasets obtained by combining data appropriately from a variety of existing files, generating new data products that did not previously exist. Repurposed data result from data wrangling.
RDMT. https://terms.codata.org/rdmt/repurposed-data
Definition of the structure and the legal elements and attributes of an XML document.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
document type definition
Definition of the structure and the legal elements and attributes of an XML document.
RDMT. https://terms.codata.org/rdmt/document-type-definition
https://orcid.org/0000-0003-2687-1982
I3. (meta)data include qualified references to other (meta)data
Long-lasting digital reference to an object that gives information about that object regardless of what happens to that object. Developed to address link rot, a persistent identifier can be resolved to provide an appropriate representation of an object whether that object changes its online location or goes offline.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
PID
persistent identifier
Long-lasting digital reference to an object that gives information about that object regardless of what happens to that object. Developed to address link rot, a persistent identifier can be resolved to provide an appropriate representation of an object whether that object changes its online location or goes offline.
RDMT. https://terms.codata.org/rdmt/persistent-identifier
A person who is studying or has expert knowledge of one or more of the natural or physical sciences.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
scientist
A person who is studying or has expert knowledge of one or more of the natural or physical sciences.
Definition from now-deprecated CASRAI Glossary term: scientist
1. The act of bringing together smaller components into a single system that functions as one. 2. In the context of information technology: The end result of a process that aims to stitch together different, often disparate, subsystems so that the data contained in each becomes part of a larger, more comprehensive system that, ideally, quickly and easily shares data when needed. This often requires that organizations build a customized architecture or structure of applications to combine new or existing hardware, software and other communications.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
integration
1. The act of bringing together smaller components into a single system that functions as one. 2. In the context of information technology: The end result of a process that aims to stitch together different, often disparate, subsystems so that the data contained in each becomes part of a larger, more comprehensive system that, ideally, quickly and easily shares data when needed. This often requires that organizations build a customized architecture or structure of applications to combine new or existing hardware, software and other communications.
Definition from now-deprecated CASRAI Glossary term: integration
Bin for types of Knowledge related to High performance computing management.
leightonlc
knowledge of high performance computing management
The act of mentoring around FAIR data stewardship.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
fair mentoring
The act of mentoring around FAIR data stewardship.
PMQ
Exercise of authority, control and shared decision making (planning, monitoring and enforcement) over the management of data assets. Overall management of the availability, usability, integrity, and security of the data employed in an organisation. A sound data governance program includes a governing body or council, a defined set of procedures, and a plan to execute those procedures.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data governance
Exercise of authority, control and shared decision making (planning, monitoring and enforcement) over the management of data assets. Overall management of the availability, usability, integrity, and security of the data employed in an organisation. A sound data governance program includes a governing body or council, a defined set of procedures, and a plan to execute those procedures.
RDMT. https://terms.codata.org/rdmt/data-governance
Scheme used for identification of resources (including people and organisations) and the sharing of data across domains, enterprises, and applications. XRI TC will define a Uniform Resource Identifier (URI) scheme and a corresponding Uniform Resource Namespace (URN).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
extensible resource identifier
Scheme used for identification of resources (including people and organisations) and the sharing of data across domains, enterprises, and applications. XRI TC will define a Uniform Resource Identifier (URI) scheme and a corresponding Uniform Resource Namespace (URN).
RDMT. https://terms.codata.org/rdmt/extensible-resource-identifier
In the context of a researcher's activities, impact is the consequence of the research and new knowledge on the advancement of the specialty. Science-based policies, regulations, services and technology transfers are some examples of ways target results can be achieved and impact demonstrated.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
impact
In the context of a researcher's activities, impact is the consequence of the research and new knowledge on the advancement of the specialty. Science-based policies, regulations, services and technology transfers are some examples of ways target results can be achieved and impact demonstrated.
Definition from now-deprecated CASRAI Glossary term: impact
A list used to grant permission matched against credentials.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
access control list
A list used to grant permission matched against credentials.
Definition from now-deprecated CASRAI Glossary term: access-control-list.
Learn about data copyright laws, licennsing and other legal aspects of data access. Understand the application of those laws at the project level.
Celia van Gelder
Mateusz Kuzak
Yan Wang
understand data ownership and access policies
Meeting/conference organisation is a project management activity that encompasses all of the steps required to run a meeting or conference.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
meeting/conference organisation
It can be considered as a negotiated agreement between the customer and vendor which are acceptable to both parties with respect to costs and expectations in order to support the business process.
Kristina Hettne
Leighton Christiansen
Simon Hodson
Victoria Dominguez Del Angel
SLM
service level management
It can be considered as a negotiated agreement between the customer and vendor which are acceptable to both parties with respect to costs and expectations in order to support the business process.
https://www.techopedia.com/definition/13848/service-level-management-slm [LLC]
From an official perspective, a national standard is adopted by a national standards body (e.g., Standards Council of Canada, American National Standards Institute, British Standards Institution) and made available to the public. Practically speaking, however, a national standard is any standard that is widely used and recognized within a country. In this context, even government standards, such as those issued by the Occupational Safety and Health Administration (OSHA), can be considered national standards.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
national standard
From an official perspective, a national standard is adopted by a national standards body (e.g., Standards Council of Canada, American National Standards Institute, British Standards Institution) and made available to the public. Practically speaking, however, a national standard is any standard that is widely used and recognized within a country. In this context, even government standards, such as those issued by the Occupational Safety and Health Administration (OSHA), can be considered national standards.
Definition from now-deprecated CASRAI Glossary term: national-standard
A governing culture that holds that the public has the right to access the documents and proceedings of government to allow for greater openness, accountability, and engagement.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
open government
A governing culture that holds that the public has the right to access the documents and proceedings of government to allow for greater openness, accountability, and engagement.
Definition from now-deprecated CASRAI Glossary term: open-government
access control and management
Manage the assessment, implementation and monitoring of secure storage protocols.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
storage security management
1. Exercising authority to provide direction and to undertake, coordinate, and regulate activities in support of achieving this direction and desired outcomes. Governance can be thought of as the role of an organization's board of directors or its equivalent that is focused on defining that organization's purpose and the development of the strategies, objectives, values, and policies that frame how that purpose will be pursued. It includes the development of such things as mission statements, statements of organizational objectives and values, logic models, organizational performance metrics, risk management frameworks, policies and guidelines for financial and operational matters, stakeholder relations, etc.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
governance
1. Exercising authority to provide direction and to undertake, coordinate, and regulate activities in support of achieving this direction and desired outcomes. Governance can be thought of as the role of an organization's board of directors or its equivalent that is focused on defining that organization's purpose and the development of the strategies, objectives, values, and policies that frame how that purpose will be pursued. It includes the development of such things as mission statements, statements of organizational objectives and values, logic models, organizational performance metrics, risk management frameworks, policies and guidelines for financial and operational matters, stakeholder relations, etc.
Definition from now-deprecated CASRAI Glossary term: governance
Low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
open archives initiative protocol for metadata harvesting
Low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP.
RDMT. https://terms.codata.org/rdmt/open-archives-initiative-protocol-for-metadata-harvesting
A series of computer instructions written in some human readable computer language, usually stored in a text file.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
computer code
A series of computer instructions written in some human readable computer language, usually stored in a text file.
Definition from now-deprecated CASRAI Glossary term: computer-code
Defines the structure or format of data exchange and is achieved through tools such as XML or SQL standards.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
syntactic interoperability
Defines the structure or format of data exchange and is achieved through tools such as XML or SQL standards.
RDMT. https://terms.codata.org/rdmt/syntactic-interoperability
Bin for Aptitudes related to Preservation costs management.
leightonlc
aptitudes for preservation costs management
https://orcid.org/0000-0003-2687-1982
accessibility of digital assets
The activity of bringing computer system resources, especially data storage and computing power on demand via Internet.
Kristina Hettne
Simon Hodson
Victoria Dominguez Del Angel
cloud computing environment management
The activity of bringing computer system resources, especially data storage and computing power on demand via Internet.
(modified from wikipedia) [VDA]
Set of documents that has a scientific meaning. A corpus can be produced by an individual researcher's activity (including its archival materials) or from a laboratory's research, a field campaign, a survey, or any other discrete research activity.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
corpus
Set of documents that has a scientific meaning. A corpus can be produced by an individual researcher's activity (including its archival materials) or from a laboratory's research, a field campaign, a survey, or any other discrete research activity.
RDMT. https://terms.codata.org/rdmt/corpus
Understand how to use FAIR and open research tools and services.
Hugh Shanahan
the use of fair and open research tools or services
Practice of making data available for checking, reproducing or reusing. The mechanisms available for achieving this are: making data available on request, as supplementary information to journal articles, or as published datasets in data repositories.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Data dissemination
Data posting
data sharing
Practice of making data available for checking, reproducing or reusing. The mechanisms available for achieving this are: making data available on request, as supplementary information to journal articles, or as published datasets in data repositories.
RDMT. https://terms.codata.org/rdmt/data-sharing
Understand how the rewards for effort depend on crediting the contributions of researchers and professional groups towards making FAIR outputs.
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
appreciate importance of crediting research contributions
The process of resolving a reference to useful information by using a globally available system.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
reference resolution
The process of resolving a reference to useful information by using a globally available system.
Definition from now-deprecated CASRAI Glossary term: reference-resolution
The person who manages or coordinates resources, personnel, facilities, and operating funds-allocations in an organization conducting research, development and analysis (RDA) in the natural and physical sciences. A research manager determines the nature, priority objectives and the resources committed to their achievement within and across the organizations, and evaluates program outputs in relation to organizational objectives and policies. A research manager provides scientific advice on the direction, conduct and management of these programs. A research manager does not personally conduct research development and analysis (RDA), control and coordinate projects, or control and coordinate contracted RDA.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
research manager
The person who manages or coordinates resources, personnel, facilities, and operating funds-allocations in an organization conducting research, development and analysis (RDA) in the natural and physical sciences. A research manager determines the nature, priority objectives and the resources committed to their achievement within and across the organizations, and evaluates program outputs in relation to organizational objectives and policies. A research manager provides scientific advice on the direction, conduct and management of these programs. A research manager does not personally conduct research development and analysis (RDA), control and coordinate projects, or control and coordinate contracted RDA.
Definition from now-deprecated CASRAI Glossary term: research-manager
Combining diverse datasets from disparate sources into one unified dataset or database. Data are accessed and extracted, moved, validated, cleaned, transformed and loaded.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data integration
Combining diverse datasets from disparate sources into one unified dataset or database. Data are accessed and extracted, moved, validated, cleaned, transformed and loaded.
RDMT. https://terms.codata.org/rdmt/data-integration
Written methods, instructions, and tools that, when applied in different data collection contexts produce data that are ready to be harmonised or integrated without further manipulation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL, LM 22.1.25: Refined the definition of this term to apply specifically to data stewardship tasks, and to align with RDMT.
Standard operating procedure for the collection of harmonised or integrated data
standard operating procedure
Written methods, instructions, and tools that, when applied in different data collection contexts produce data that are ready to be harmonised or integrated without further manipulation.
RDMT. https://terms.codata.org/rdmt/standard-operating-procedure-harmonised-integrated-data
Physical or digital storage location that can house, preserve, manage, and provide access to many types of digital and physical materials in a variety of formats. Materials in online repositories are curated to enable search, discovery, and reuse. There must be sufficient control for the physical and digital material to be authentic, reliable, accessible and usable on a continuing basis.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
repository
Physical or digital storage location that can house, preserve, manage, and provide access to many types of digital and physical materials in a variety of formats. Materials in online repositories are curated to enable search, discovery, and reuse. There must be sufficient control for the physical and digital material to be authentic, reliable, accessible and usable on a continuing basis.
RDMT. https://terms.codata.org/rdmt/repository
Understand the concepts underpinning FAIR criteria.
Angus Whyte
knowledge of theories underlying fair implementation
Know the purpose of the public use, choose the proper subjects for the audience and describe the project work for the purpose/audience chosen.
Celia van Gelder
Mateusz Kuzak
Yan Wang
document in lay terms
https://orcid.org/0000-0003-2687-1982
2020-10-02T14:54:27.168274Z
understanding persistent identifiers
Collection of interrelated data often with controlled redundancy, organised according to a scheme to serve one or more applications; the data are stored so that they can be used by several programs without concern for data structures or organisation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data upload database
Collection of interrelated data often with controlled redundancy, organised according to a scheme to serve one or more applications; the data are stored so that they can be used by several programs without concern for data structures or organisation.
RDMT. https://terms.codata.org/rdmt/data-upload-database
A string of characters used to identify or name a resource on the Internet. Such identification enables interaction with representations of the resource over a network, typically the World Wide Web, using specific protocols. MIT data management and publishing
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
uniform resource identifier
A string of characters used to identify or name a resource on the Internet. Such identification enables interaction with representations of the resource over a network, typically the World Wide Web, using specific protocols. MIT data management and publishing
Definition from now-deprecated CASRAI Glossary term: uniform-resource-identifier
Know how to search for and identify FAIR services or tools that fit project needs.
Angus Whyte
how to find fair research data tools/services (catalogues)
A framework whose primary purpose is to enable information sharing and reuse via the standard description and discovery of common data and the promotion of uniform data management practices.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data reference model
A framework whose primary purpose is to enable information sharing and reuse via the standard description and discovery of common data and the promotion of uniform data management practices.
Definition from now-deprecated CASRAI Glossary: data-reference-model
A process by which a scholarly work (such as a paper or a research proposal) is checked by a group of experts in the same field to make sure it meets the necessary standards before it is published or accepted.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
peer review
A process by which a scholarly work (such as a paper or a research proposal) is checked by a group of experts in the same field to make sure it meets the necessary standards before it is published or accepted.
Definition from now-deprecated CASRAI Glossary term: peer-review
Bin for types of Knowledge related to Provenance information management.
leightonlc
2019-10-17T15:17:02.313897Z
knowledge of provenance information management
(noun) Sometimes called a row, a group of fields (sometimes called columns) within a table that are relevant to a specific entity. Multiple records are contained in a file or dataset.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Data record
record
(noun) Sometimes called a row, a group of fields (sometimes called columns) within a table that are relevant to a specific entity. Multiple records are contained in a file or dataset.
RDMT. https://terms.codata.org/rdmt/record
Reverse engineering process in which de-identified data are cross-referenced with other data sources to re-identify the personally identifiable information. This could occur if a de-identification process had not been not successfully performed, or had not been undertaken in the first place.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
de-anonymization
Reverse engineering process in which de-identified data are cross-referenced with other data sources to re-identify the personally identifiable information. This could occur if a de-identification process had not been not successfully performed, or had not been undertaken in the first place.
RDMT. https://terms.codata.org/rdmt/de-anonymisation
HughShanahan
2019-10-18T08:45:02.980527Z
application of fair tools and services
Requires compliance because of a government statute or regulation, an organization internal policy, or contractual requirement. Failure to comply with a mandatory standard usually carries a sanction, such as civil or criminal penalties, or loss of employment.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
mandatory standard
Requires compliance because of a government statute or regulation, an organization internal policy, or contractual requirement. Failure to comply with a mandatory standard usually carries a sanction, such as civil or criminal penalties, or loss of employment.
Definition from now-deprecated CASRAI Glossary term: mandatory-standard
Approach to governance that values decisions that can be backed up with data that can be verified. The success of the data-driven approach is reliant upon the quality of the data gathered and the effectiveness of its analysis and interpretation. Errors can creep into data analytics processes at any stage of the endeavour and serious issues can result when they do.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
DDDM
data driven decision management
Approach to governance that values decisions that can be backed up with data that can be verified. The success of the data-driven approach is reliant upon the quality of the data gathered and the effectiveness of its analysis and interpretation. Errors can creep into data analytics processes at any stage of the endeavour and serious issues can result when they do.
RDMT. https://terms.codata.org/rdmt/data-driven-decision-management
Single, well-defined version of all the data entities in an organisational ecosystem. Encompasses all the data in every system of record within a particular organisation. A well-maintained, current golden record should be a fundamental element of the master data management policy for every enterprise. The word “golden” is sometimes used in information technology to express the importance of some type of source. In the context of virtualization, for example, a golden image is a template for a virtual machine, virtual desktop, servers, or hard disk drive.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
golden record
Single, well-defined version of all the data entities in an organisational ecosystem. Encompasses all the data in every system of record within a particular organisation. A well-maintained, current golden record should be a fundamental element of the master data management policy for every enterprise. The word “golden” is sometimes used in information technology to express the importance of some type of source. In the context of virtualization, for example, a golden image is a template for a virtual machine, virtual desktop, servers, or hard disk drive.
RDMT. https://terms.codata.org/rdmt/golden-record
Reliability and application efficiency of data. Perception or assessment of a dataset's fitness to serve its purpose in a given context. Aspects of data quality include: Accuracy, Completeness, Update status, Relevance, Consistency across data sources, Reliability, Appropriate presentation, Accessibility. Data quality is affected by the way data are entered, stored and managed. Maintaining data quality requires going through the data periodically and scrubbing it. Typically this involves updating, standardising, and de-duplicating records to create a single view of the data, even if it is stored in multiple disparate systems.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data quality
Reliability and application efficiency of data. Perception or assessment of a dataset's fitness to serve its purpose in a given context. Aspects of data quality include: Accuracy, Completeness, Update status, Relevance, Consistency across data sources, Reliability, Appropriate presentation, Accessibility. Data quality is affected by the way data are entered, stored and managed. Maintaining data quality requires going through the data periodically and scrubbing it. Typically this involves updating, standardising, and de-duplicating records to create a single view of the data, even if it is stored in multiple disparate systems.
RDMT. https://terms.codata.org/rdmt/data-quality
HughShanahan
2019-10-18T08:38:59.038306Z
using fair and open research tools or services
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-10-01T20:53:15.691824Z
webinar
Data in the form of digital materials.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digital data
Data in the form of digital materials.
RDMT. https://terms.codata.org/rdmt/digital-data
https://orcid.org/0000-0003-2687-1982
F1. (meta)data are assigned a globally unique and persistent identifier
An organization's process of defining its strategy or direction in the context of FAIR project management activities, both in the context of current knowledge and unknown factors in the future.
Philippe Rocca-Serra
Susanna Sansone
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
strategic/long-term planning
An organization's process of defining its strategy or direction in the context of FAIR project management activities, both in the context of current knowledge and unknown factors in the future.
AL, and https://en.wikipedia.org/wiki/Strategic_planning [17.10.19]
Collective processes conducted to ensure the cleanliness of data. Data are considered clean when they are relatively error-free.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data hygiene
Collective processes conducted to ensure the cleanliness of data. Data are considered clean when they are relatively error-free.
RDMT. https://terms.codata.org/rdmt/data-hygiene
Statistical analysis and assessment of the quality of data values within a dataset for consistency, uniqueness and logic. The data profiling process cannot identify inaccurate data; it can only identify rule violations and anomalies. The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. It can also be used to provide metrics to assess data quality and help determine whether or not metadata accurately describes the source data. Profiling tools evaluate the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across datasets.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data profiling
Statistical analysis and assessment of the quality of data values within a dataset for consistency, uniqueness and logic. The data profiling process cannot identify inaccurate data; it can only identify rule violations and anomalies. The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. It can also be used to provide metrics to assess data quality and help determine whether or not metadata accurately describes the source data. Profiling tools evaluate the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across datasets.
RDMT. https://terms.codata.org/rdmt/data-profiling
Set of metadata elements and their values that describe an object. Metadata elements in the record may derive from a metadata profile or standard, and may include different types of metadata (descriptive, administrative, etc.). A metadata record is typically stored within a metadata catalogue or repository.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
metadata record
Set of metadata elements and their values that describe an object. Metadata elements in the record may derive from a metadata profile or standard, and may include different types of metadata (descriptive, administrative, etc.). A metadata record is typically stored within a metadata catalogue or repository.
RDMT. https://terms.codata.org/rdmt/metadata-record
Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
research data
Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
RDMT. https://terms.codata.org/rdmt/research-data
The medium through which learning has been provided. For example, through a course, slides, video presentation, online documentation, wiki pages or others.
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-09-22T13:19:57.012225Z
learning medium
data manager
Compilation of core electronic health data submitted by various healthcare providers and organisations, accessible by numerous authorised parties from a number of points of care, possibly even from different jurisdictions. Electronic health records typically include: contact information, information about visits to health care professionals, allergies, insurance information, family history, immunisation status, information about any conditions or diseases, a list of medications, records of hospitalisation, information about any surgeries or procedures performed.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Digital medical record
electronic health record
Compilation of core electronic health data submitted by various healthcare providers and organisations, accessible by numerous authorised parties from a number of points of care, possibly even from different jurisdictions. Electronic health records typically include: contact information, information about visits to health care professionals, allergies, insurance information, family history, immunisation status, information about any conditions or diseases, a list of medications, records of hospitalisation, information about any surgeries or procedures performed.
RDMT. https://terms.codata.org/rdmt/electronic-health-record
Techniques used to deal with parameters having different units and scales.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Data rescaling
data scaling
Techniques used to deal with parameters having different units and scales.
RDMT. https://terms.codata.org/rdmt/data-scaling
Definitions of the access relationships between the following metadata: data object name, a user name (or user group, or user role), and access permission(s). The information can be stored as metadata information associated with each data object. The information can be generated dynamically by applying the access controls of the collection that organises the data objects (if a collection sticky bit is turned on).
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
access controls
Definitions of the access relationships between the following metadata: data object name, a user name (or user group, or user role), and access permission(s). The information can be stored as metadata information associated with each data object. The information can be generated dynamically by applying the access controls of the collection that organises the data objects (if a collection sticky bit is turned on).
RDMT. https://terms.codata.org/rdmt/access-controls
Monitoring the risk to privacy posed by data requests from researchers, and the practices of data custodians in providing data (information governance) to ensure that confidentiality is protected. Such governance requires specialised knowledge of technology, law, and statistical methods.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
privacy governance
Monitoring the risk to privacy posed by data requests from researchers, and the practices of data custodians in providing data (information governance) to ensure that confidentiality is protected. Such governance requires specialised knowledge of technology, law, and statistical methods.
RDMT. https://terms.codata.org/rdmt/privacy-governance
Angus Whyte
Celia van Gelder
Mateusz Kuzak
Yan Wang
research integrity, attribution, impact awareness
Bin for types of Knowledge related to Cloud computing environment management.
leightonlc
knowledge of cloud computing environment management
Data about data. It is data (or information) that defines and describes the characteristics of other data. It is used to improve the understanding and use of the data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Data documentation
metadata
Data about data. It is data (or information) that defines and describes the characteristics of other data. It is used to improve the understanding and use of the data.
RDMT. https://terms.codata.org/rdmt/metadata
Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a comma from the next column’s value and each row starts a new line. Along with pipe- and tab-separated values, comma-separated values are a specific example of a record-oriented data structure (usually of fixed dimension) with fields separated by an agreed delimiter.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
CSV
Maybe remove as shouldn't have formats here.
comma separated values
Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a comma from the next column’s value and each row starts a new line. Along with pipe- and tab-separated values, comma-separated values are a specific example of a record-oriented data structure (usually of fixed dimension) with fields separated by an agreed delimiter.
RDMT. https://terms.codata.org/rdmt/comma-separated-values
https://orcid.org/0000-0003-2687-1982
identifier
Responsible for executing tasks and producing deliverables as outlined in the Project Plan and directed by the Project Manager, at whatever level of effort or participation has been defined for them.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
project team member
Responsible for executing tasks and producing deliverables as outlined in the Project Plan and directed by the Project Manager, at whatever level of effort or participation has been defined for them.
Definition from now-deprecated CASRAI Glossary term: project-team-member
Metadata that describe a dataset or resource in such a way that people can discover and identify it. Contains information that aids with findability such as information (metadata elements) on the creator(s), affiliation(s), title, abstract, keywords, persistent identifier, related publications, etc.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
https://github.com/terms4fairskills/FAIRterminology/issues/13
descriptive metadata
Metadata that describe a dataset or resource in such a way that people can discover and identify it. Contains information that aids with findability such as information (metadata elements) on the creator(s), affiliation(s), title, abstract, keywords, persistent identifier, related publications, etc.
RDMT. https://terms.codata.org/rdmt/descriptive-metadata
https://orcid.org/0000-0003-2687-1982
https://orcid.org/0000-0003-2687-1982
2020-10-01T21:03:55.425579Z
github repository
Analysing multivariate datasets using pattern recognition or other knowledge discovery techniques to identify potentially unknown and potentially meaningful data content, relationships, classification or trends. Data mining parameters include: Association (looking for patterns where one event is connected to another event); Sequence or path analysis (looking for patterns where one event leads to another later event); Classification (looking for new patterns); Clustering (finding and visually documenting groups of facts not previously known); Forecasting, or predictive analytics (discovering patterns in data that can lead to reasonable predictions about the future.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
data mining
Analysing multivariate datasets using pattern recognition or other knowledge discovery techniques to identify potentially unknown and potentially meaningful data content, relationships, classification or trends. Data mining parameters include: Association (looking for patterns where one event is connected to another event); Sequence or path analysis (looking for patterns where one event leads to another later event); Classification (looking for new patterns); Clustering (finding and visually documenting groups of facts not previously known); Forecasting, or predictive analytics (discovering patterns in data that can lead to reasonable predictions about the future.
RDMT. https://terms.codata.org/rdmt/data-mining
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
Data preservation
https://orcid.org/0000-0003-2687-1982
2021-02-17T22:18:22.641817Z
data archiving
Machine processable specifications which define the structure and syntax of metadata specifications in a formal schema language.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 5.12.22 Moved from 'findability of digital assets' as it is definitely not a particular data stewardship guideline.
encoding schema
Machine processable specifications which define the structure and syntax of metadata specifications in a formal schema language.
RDMT. https://terms.codata.org/rdmt/encoding-schema
In the context of computing, the process of backup and ongoing maintenance as opposed to strategies for long-term digital preservation.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
digital archiving
digital archiving (computing)
In the context of computing, the process of backup and ongoing maintenance as opposed to strategies for long-term digital preservation.
RDMT. https://terms.codata.org/rdmt/digital-archiving-computing
The process of resolving a PID to a useful state of information about a digital object by using a globally available system.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
pid resolution
The process of resolving a PID to a useful state of information about a digital object by using a globally available system.
RDMT. https://terms.codata.org/rdmt/pid-resolution
Bin for Skills needed for Storage management.
leightonlc
skills related to storage management
Management of a national, discipline or institutional repository of published datasets. Provision of infrastructure, curation, policy and training that govern the organisation, control, and properties of the repository such as: required file formats, access control restrictions, integrity, replication, retention, disposal, etc.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
data repository management
Management of a national, discipline or institutional repository of published datasets. Provision of infrastructure, curation, policy and training that govern the organisation, control, and properties of the repository such as: required file formats, access control restrictions, integrity, replication, retention, disposal, etc.
RDMT. https://terms.codata.org/rdmt/data-repository-management
A document creation and management specification that builds content reuse into the authoring process.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
darwin information typing architecture
A document creation and management specification that builds content reuse into the authoring process.
Definition from now-deprecated CASRAI Glossary term: darwin-information-typing-architecture/
The discovery of meaningful multidimensional patterns in data.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
analytics
The discovery of meaningful multidimensional patterns in data.
Definition from now-deprecated CASRAI Glossary term: analytics
The personal attributes necessary to perform a task.
https://orcid.org/0000-0003-2687-1982
Aptitude is the innate trait or talent that a person brings to a task or situation. It is the quality of being able to do something. An aptitude is not something that can be learned or developed unless it is there to begin with.
Laura Molloy, Celia van Gelder:https://docs.google.com/presentation/d/12oNBFix39ZtsLAR4tkqESBXxMU5rVg3cCKoEDG2oK-M/edit#slide=id.p5
data stewardship soft skill
Ability to apply the FAIR principles i.e. describe the data with community metadata standard for machine and human, align with existing semantic models (ontologies, controlled vocabularies, ...), get a persistent ID, attribute licence and credit for data creators, legal aspect related to data (ownership, confidentiality, ethics, ...) add data provenance.
Kristina Hettne
Victoria Dominguez Del Angel
Yann Le Franc
https://orcid.org/0000-0002-7702-4495
archival documentation of data
Ability to apply the FAIR principles i.e. describe the data with community metadata standard for machine and human, align with existing semantic models (ontologies, controlled vocabularies, ...), get a persistent ID, attribute licence and credit for data creators, legal aspect related to data (ownership, confidentiality, ethics, ...) add data provenance.
[YLF, VDA, KH]
A machine-readable format is a structured format that can be processed by a computer. Such formats can either be intended solely for machine processing (e.g. XML or RDF), or may be both human and machine accessible via appropriate markup (e.g. HTML).
https://orcid.org/0000-0002-7702-4495
https://orcid.org/0000-0003-2687-1982
AL 5.12.22: Merged with now-obsolete 'machine readable' (http://purl.obolibrary.org/obo/T4FS_0000256), as we do not need that level of granularity. AL 6.5.22: The original CASRAI term (https://casrai.org/term/machine-readable-format/) has an identical source definition with 'Digital materials', and therefore has been removed.
machine-readable format
A machine-readable format is a structured format that can be processed by a computer. Such formats can either be intended solely for machine processing (e.g. XML or RDF), or may be both human and machine accessible via appropriate markup (e.g. HTML).
Modified by AL from Wikipedia, https://en.wikipedia.org/wiki/Machine-readable_data, accessed 6.5.22
Any information obtained by a person on the understanding that they will not disclose it to others, or obtained in circumstances where it is expected that they will not disclose it.
https://orcid.org/0000-0002-5214-4466
https://orcid.org/0000-0003-2687-1982
confidential information
Any information obtained by a person on the understanding that they will not disclose it to others, or obtained in circumstances where it is expected that they will not disclose it.
Definition from now-deprecated CASRAI Glossary term: confidential-information
The FAIR Cookbook’s recipes are a combination of guidance, technical, hands-on, background and review types to cover the operation steps of FAIR data management, and are classified according to the audience types, to serve all those involved in the data management life cycle. The FAIR Cookbook is for anyone working in the Life Sciences who needs guidance on applying the FAIR Principles in practice.
https://orcid.org/0000-0002-7702-4495
fair cookbook recipe
The FAIR Cookbook’s recipes are a combination of guidance, technical, hands-on, background and review types to cover the operation steps of FAIR data management, and are classified according to the audience types, to serve all those involved in the data management life cycle. The FAIR Cookbook is for anyone working in the Life Sciences who needs guidance on applying the FAIR Principles in practice.
https://faircookbook.elixir-europe.org/content/recipes/introduction/FAIR-cookbook-audience.html
Evaluation indicators are designed to enable the measurement of the degree of compliance against a guideline using a set of criteria. The goals of such indicators are varied, but a good example would be to improve the re-usability of the digital asset being measured for increased discoverability, interoperability and overall machine actionability.
https://github.com/terms4fairskills/FAIRterminology/issues/22
evaluation indicator
Evaluation indicators are designed to enable the measurement of the degree of compliance against a guideline using a set of criteria. The goals of such indicators are varied, but a good example would be to improve the re-usability of the digital asset being measured for increased discoverability, interoperability and overall machine actionability.
Definition by https://orcid.org/0000-0002-7702-4495 and drawn in part from https://fairplus.github.io/Data-Maturity/docs/Indicators
The fairplus dataset maturity model indicators are used within the context of the model, which is intended as a comprehensive reference model for state-of-FAIRness improvement in research datasets. Based on the FAIR guiding principles, the DSM model defines and classifies requirements that constitute an incremental path towards improving FAIRness level for a given research dataset.
https://orcid.org/0000-0002-7702-4495
fairplus DSM model indicator
fairplus dataset maturity model indicator
The fairplus dataset maturity model indicators are used within the context of the model, which is intended as a comprehensive reference model for state-of-FAIRness improvement in research datasets. Based on the FAIR guiding principles, the DSM model defines and classifies requirements that constitute an incremental path towards improving FAIRness level for a given research dataset.
https://fairplus.github.io/Data-Maturity/
The fairplus DSM content-related indicators relate to what is reported in the Dataset (data) & the Dataset Descriptor (metadata).
https://orcid.org/0000-0002-7702-4495
fairplus DSM content-related indicator
The fairplus DSM content-related indicators relate to what is reported in the Dataset (data) & the Dataset Descriptor (metadata).
https://fairplus.github.io/Data-Maturity/
The fairplus DSM representation and format indicators relate to how the data object & metadata object are represented and formatted.
https://orcid.org/0000-0002-7702-4495
AL 5.12.22: Please note that the 'and' in this label is a direct representation of the fairplus DSM model, and not a ontological design choice.
fairplus DSM representation and format indicator
The fairplus DSM representation and format indicators relate to how the data object & metadata object are represented and formatted.
https://fairplus.github.io/Data-Maturity/
The fairplus DSM hosting-environment capabilities indicator relate to the capabilities of the hosting environment that enables and supports the use of FAIR data
https://orcid.org/0000-0002-7702-4495
fairplus DSM hosting-environment capabilities indicator
The fairplus DSM hosting-environment capabilities indicator relate to the capabilities of the hosting environment that enables and supports the use of FAIR data
https://fairplus.github.io/Data-Maturity/
No representation of Data purposed for FAIR sharing is available
https://orcid.org/0000-0002-7702-4495
DSM-0-R2
No representation of Data purposed for FAIR sharing is available
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Structured and/or Unstructured Data are organised into Dataset(s) created for the purpose of FAIR sharing
https://orcid.org/0000-0002-7702-4495
DSM-1-R2
Structured and/or Unstructured Data are organised into Dataset(s) created for the purpose of FAIR sharing
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Project collected Data are organized into structured Dataset(s) and conform to a locally defined Dataset Model
https://orcid.org/0000-0002-7702-4495
DSM-2-R2
Project collected Data are organized into structured Dataset(s) and conform to a locally defined Dataset Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Structured Data are represented as Datasets and conform to relevant Standard Dataset Model(s) for FAIR sharing
https://orcid.org/0000-0002-7702-4495
DSM-3-R2
Structured Data are represented as Datasets and conform to relevant Standard Dataset Model(s) for FAIR sharing
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset(s) content is semantically represented using Linked Data Represetations conforming to a Semantic Data Model
https://orcid.org/0000-0002-7702-4495
DSM-4-R2
Dataset(s) content is semantically represented using Linked Data Represetations conforming to a Semantic Data Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset(s) are granularly represented and managed at the Data Element Level (e.g. ISO 11179 MDR standard)
https://orcid.org/0000-0002-7702-4495
DSM-5-R2
Dataset(s) are granularly represented and managed at the Data Element Level (e.g. ISO 11179 MDR standard)
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Metadata is NOT formally represented in a structured Dataset Descriptor
https://orcid.org/0000-0002-7702-4495
DSM-0-R3
Dataset Metadata is NOT formally represented in a structured Dataset Descriptor
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Metadata is formally represented in the form of an Identifiable Dataset Descriptor
https://orcid.org/0000-0002-7702-4495
DSM-1-R0
Dataset Metadata is formally represented in the form of an Identifiable Dataset Descriptor
https://fairplus.github.io/Data-Maturity/docs/Indicators/
A representation of the Dataset Descriptor conforming to a relevant General Purpose Metadata Schema is available
https://orcid.org/0000-0002-7702-4495
DSM-1-R3
A representation of the Dataset Descriptor conforming to a relevant General Purpose Metadata Schema is available
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor(s) conforms to or extends a Standard Generic Dataset Descriptor Model to describe and represent structural metadata of Dataset(s)
https://orcid.org/0000-0002-7702-4495
DSM-2-R3
Dataset Descriptor(s) conforms to or extends a Standard Generic Dataset Descriptor Model to describe and represent structural metadata of Dataset(s)
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor(s) use community-defined or domain-specific metadata standard
https://orcid.org/0000-0002-7702-4495
DSM-3-R3
Dataset Descriptor(s) use community-defined or domain-specific metadata standard
https://fairplus.github.io/Data-Maturity/docs/Indicators/
A Semantic Data Model (Metadata) used for data harmonisation across Datasets is formally defined and represented using Linked Data Representations
https://orcid.org/0000-0002-7702-4495
DSM-4-R3
A Semantic Data Model (Metadata) used for data harmonisation across Datasets is formally defined and represented using Linked Data Representations
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Common Data Elements and their value sets are defined and registered in a managed Metadata Registry
https://orcid.org/0000-0002-7702-4495
DSM-5-R3
Common Data Elements and their value sets are defined and registered in a managed Metadata Registry
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Contextual Metadata is NOT formally represented in any form
https://orcid.org/0000-0002-7702-4495
DSM-0-R1
Contextual Metadata is NOT formally represented in any form
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Contextual Metadata is reported at summary level and represented in the Dataset Descriptor
https://orcid.org/0000-0002-7702-4495
DSM-1-R1
Contextual Metadata is reported at summary level and represented in the Dataset Descriptor
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Contextual Metadata is formally represented in the form of a locally defined Domain Model
https://orcid.org/0000-0002-7702-4495
DSM-2-R1
Contextual Metadata is formally represented in the form of a locally defined Domain Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Contextual Metadata is formally represented and conforms to a standard defined Domain Model if available
https://orcid.org/0000-0002-7702-4495
DSM-3-R1
Contextual Metadata is formally represented and conforms to a standard defined Domain Model if available
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Contextual Metadata is formally represented by a defined set Common Data Elements
https://orcid.org/0000-0002-7702-4495
DSM-4-R1
Contextual Metadata is formally represented by a defined set Common Data Elements
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Domain entities are represented by Managed Master Data Objects conforming to a Master Data Model used for data consolidation
https://orcid.org/0000-0002-7702-4495
DSM-5-R1
Domain entities are represented by Managed Master Data Objects conforming to a Master Data Model used for data consolidation
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor (metadata) is NOT available in a Machine Readable Format
https://orcid.org/0000-0002-7702-4495
DSM-0-R4
Dataset Descriptor (metadata) is NOT available in a Machine Readable Format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor is available in Machine Readable Format
https://orcid.org/0000-0002-7702-4495
DSM-1-R4
Dataset Descriptor is available in Machine Readable Format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Contextual Metadata represented in the form of a Domain Model is available in a Human Readable Format
https://orcid.org/0000-0002-7702-4495
DSM-2-R4
Contextual Metadata represented in the form of a Domain Model is available in a Human Readable Format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
A formal documentation of the adopted Standard Dataset Model is available in a Machine Readable Format
https://orcid.org/0000-0002-7702-4495
DSM-3-R4
A formal documentation of the adopted Standard Dataset Model is available in a Machine Readable Format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
A Semantic Data Model (Metadata) describing the data is represented in a Machine Readable and Machine Interptretable format
https://orcid.org/0000-0002-7702-4495
DSM-4-R4
A Semantic Data Model (Metadata) describing the data is represented in a Machine Readable and Machine Interptretable format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset(s) are NOT available in a Machine Readable Format
https://orcid.org/0000-0002-7702-4495
DSM-0-R5
Dataset(s) are NOT available in a Machine Readable Format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset(s) available in Machine Readable Format
https://orcid.org/0000-0002-7702-4495
DSM-1-R5
Dataset(s) available in Machine Readable Format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
If applicable, Dataset(s) available in non-proprietary Machine Readable Format relevant to the adopted standard Dataset Model
https://orcid.org/0000-0002-7702-4495
DSM-3-R5
If applicable, Dataset(s) available in non-proprietary Machine Readable Format relevant to the adopted standard Dataset Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Datasets are available in a Machine Readable and Machine Interpretable format
https://orcid.org/0000-0002-7702-4495
DSM-4-R5
Datasets are available in a Machine Readable and Machine Interpretable format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
If applicable, license information and/or permitted use and accessability to parts of the dataset is formally represented and encoded in a Machine Readable Format
https://orcid.org/0000-0002-7702-4495
DSM-4-R6
If applicable, license information and/or permitted use and accessability to parts of the dataset is formally represented and encoded in a Machine Readable Format
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset(s) are NOT Identifiable via Unique Identifiers
https://orcid.org/0000-0002-7702-4495
DSM-0-C0
Dataset(s) are NOT Identifiable via Unique Identifiers
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Each Dataset purposed for FAIR sharing is assigned a unique identifier
https://orcid.org/0000-0002-7702-4495
DSM-1-C0
Each Dataset purposed for FAIR sharing is assigned a unique identifier
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Where applicable, data is structured in the Dataset according to the Tidy Data Principles
https://orcid.org/0000-0002-7702-4495
DSM-2-C2
Where applicable, data is structured in the Dataset according to the Tidy Data Principles
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Where applicable, Dataset(s) scope and content are reported in compliance with relevant community-defined Data Reporting Guidelines
https://orcid.org/0000-0002-7702-4495
DSM-3-C2
Where applicable, Dataset(s) scope and content are reported in compliance with relevant community-defined Data Reporting Guidelines
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset(s) content is harmonised against a designed-for-purpose Semantic Data Model
https://orcid.org/0000-0002-7702-4495
DSM-4-C2
Dataset(s) content is harmonised against a designed-for-purpose Semantic Data Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset(s) include Reference Fields that enable joining related datasets
https://orcid.org/0000-0002-7702-4495
DSM-2-C3
Dataset(s) include Reference Fields that enable joining related datasets
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Where applicable, Dataset Field Names use standard controlled terms as recommended by the adopted Standard
https://orcid.org/0000-0002-7702-4495
DSM-3-C3
Where applicable, Dataset Field Names use standard controlled terms as recommended by the adopted Standard
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Key Dataset Fields are mapped to Common Data Elements as defined by the Semantic Data Model
https://orcid.org/0000-0002-7702-4495
DSM-4-C3
Key Dataset Fields are mapped to Common Data Elements as defined by the Semantic Data Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Fields are linked and harmonized against enterprise managed Metadata Elements (e.g. MDR registered Data Elements)
https://orcid.org/0000-0002-7702-4495
DSM-5-C3
Dataset Fields are linked and harmonized against enterprise managed Metadata Elements (e.g. MDR registered Data Elements)
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Where applicable, Dataset Field Values are standardized against a locally defined Data Dictionary within and across related Datasets
https://orcid.org/0000-0002-7702-4495
DSM-2-C4
Where applicable, Dataset Field Values are standardized against a locally defined Data Dictionary within and across related Datasets
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Where applicable, Dataset Field Values are standardised against domain-specific Controlled Terminologies and/or Ontology Terms
https://orcid.org/0000-0002-7702-4495
DSM-3-C4
Where applicable, Dataset Field Values are standardised against domain-specific Controlled Terminologies and/or Ontology Terms
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Values for key Domain Entities reported in the Dataset(s) are standardised and assigned unique Standard Identifiers
https://orcid.org/0000-0002-7702-4495
DSM-4-C4
Values for key Domain Entities reported in the Dataset(s) are standardised and assigned unique Standard Identifiers
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Field values are controlled and managed via enterprise managed Reference and Master Data
https://orcid.org/0000-0002-7702-4495
DSM-5-C4
Dataset Field values are controlled and managed via enterprise managed Reference and Master Data
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Study/Project-Level metadata is NOT reported
https://orcid.org/0000-0002-7702-4495
DSM-0-C1
Study/Project-Level metadata is NOT reported
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor includes Descriptive Study/Project-Level summary information
https://orcid.org/0000-0002-7702-4495
DSM-1-C1
Dataset Descriptor includes Descriptive Study/Project-Level summary information
https://fairplus.github.io/Data-Maturity/docs/Indicators/
A locally defined Domain Model contains concepts that describes the overall project/study design, the relationships between the Datasets, the key entities reported within the Datasets and the relationships between them.
https://orcid.org/0000-0002-7702-4495
DSM-2-C1
A locally defined Domain Model contains concepts that describes the overall project/study design, the relationships between the Datasets, the key entities reported within the Datasets and the relationships between them.
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Where applicable, study-level / experimental metadata is reported in compliance with relevant Minimum Information Reporting Guidelines
https://orcid.org/0000-0002-7702-4495
DSM-3-C1
Where applicable, study-level / experimental metadata is reported in compliance with relevant Minimum Information Reporting Guidelines
https://fairplus.github.io/Data-Maturity/docs/Indicators/
A Semantic Data Model includes study design Data Elements and the relationships between them
https://orcid.org/0000-0002-7702-4495
DSM-4-C1
A Semantic Data Model includes study design Data Elements and the relationships between them
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor does NOT include a reference to the Dataset it describes
https://orcid.org/0000-0002-7702-4495
DSM-0-C2
Dataset Descriptor does NOT include a reference to the Dataset it describes
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor includes Identifying & Descriptive Dataset-Level metadata
https://orcid.org/0000-0002-7702-4495
DSM-1-C2
Dataset Descriptor includes Identifying & Descriptive Dataset-Level metadata
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor contains access information for the Dataset
https://orcid.org/0000-0002-7702-4495
DSM-1-C3
Dataset Descriptor contains access information for the Dataset
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor includes reference to related Datasets and if applicable the relevant joining Dataset Fields
https://orcid.org/0000-0002-7702-4495
DSM-2-C5
Dataset Descriptor includes reference to related Datasets and if applicable the relevant joining Dataset Fields
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor references a standard license under which the dataset can be re-used.
https://orcid.org/0000-0002-7702-4495
DSM-3-C7
Dataset Descriptor references a standard license under which the dataset can be re-used.
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor includes Field-level Metadata as prescribed by a locally defined Dataset Model
https://orcid.org/0000-0002-7702-4495
DSM-2-C6
Dataset Descriptor includes Field-level Metadata as prescribed by a locally defined Dataset Model
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor includes standard-compliant Field-level Metadata as prescribed by the adopted standard Dataset Model.
https://orcid.org/0000-0002-7702-4495
DSM-3-C6
Dataset Descriptor includes standard-compliant Field-level Metadata as prescribed by the adopted standard Dataset Model.
https://fairplus.github.io/Data-Maturity/docs/Indicators/
The Semantic Data Model includes a pre-defined set of Common Data Elements reported within the Datasets and the relationships between them
https://orcid.org/0000-0002-7702-4495
DSM-4-C5
The Semantic Data Model includes a pre-defined set of Common Data Elements reported within the Datasets and the relationships between them
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset Descriptor includes Value-level Metadata or if applicable includes a reference to a locally defined Data Dictionary
https://orcid.org/0000-0002-7702-4495
DSM-2-C7
Dataset Descriptor includes Value-level Metadata or if applicable includes a reference to a locally defined Data Dictionary
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Value Level Metadata includes Resolvable Identifiers for Controlled and/or Standard Terms reported in the Dataset
https://orcid.org/0000-0002-7702-4495
DSM-3-C5
Value Level Metadata includes Resolvable Identifiers for Controlled and/or Standard Terms reported in the Dataset
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Data or metadata is hosted in non-accessible storage (e.g., personal desktop, local file system or archive)
https://orcid.org/0000-0002-7702-4495
DSM-0-H1
Data or metadata is hosted in non-accessible storage (e.g., personal desktop, local file system or archive)
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Metadata hosting environment stores and maintains an identifiable Dataset Descriptor for each identifiable Dataset
https://orcid.org/0000-0002-7702-4495
DSM-1-H1
Metadata hosting environment stores and maintains an identifiable Dataset Descriptor for each identifiable Dataset
https://fairplus.github.io/Data-Maturity/docs/Indicators/
The Data hosting environment's Persistence Model is aligned with a locally defined Domain Model to enable interpretation of Datasets
https://orcid.org/0000-0002-7702-4495
DSM-2-H1
The Data hosting environment's Persistence Model is aligned with a locally defined Domain Model to enable interpretation of Datasets
https://fairplus.github.io/Data-Maturity/docs/Indicators/
The Data hosting environment's Persistence Model is aligned with a standard Dataset model or compliant with relevant Minimum Information Reporting Guidelines
https://orcid.org/0000-0002-7702-4495
DSM-3-H1
The Data hosting environment's Persistence Model is aligned with a standard Dataset model or compliant with relevant Minimum Information Reporting Guidelines
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Data Hosting environment stores data in a relevant linked data store (e.g., Triple Store or Graph Databaase)
https://orcid.org/0000-0002-7702-4495
DSM-4-H1
Data Hosting environment stores data in a relevant linked data store (e.g., Triple Store or Graph Databaase)
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Data or metadata hosted in an accessible resource but with no retrieval capability
https://orcid.org/0000-0002-7702-4495
DSM-0-H2
Data or metadata hosted in an accessible resource but with no retrieval capability
https://fairplus.github.io/Data-Maturity/docs/Indicators/
The Dataset and its Descriptor are indexed and retrievable (in the same or separate hosting environments) via unique and persistent identifiers
https://orcid.org/0000-0002-7702-4495
DSM-1-H2
The Dataset and its Descriptor are indexed and retrievable (in the same or separate hosting environments) via unique and persistent identifiers
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Retrieval of the Dataset and the Dataset Descriptor utilises a standardized communication protocol that is open, free and universally implementable
https://orcid.org/0000-0002-7702-4495
DSM-1-H3
Retrieval of the Dataset and the Dataset Descriptor utilises a standardized communication protocol that is open, free and universally implementable
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Metadata hosting environment provides programmatic access and retrieval (API) for the Dataset Descriptor
https://orcid.org/0000-0002-7702-4495
DSM-2-H2
Metadata hosting environment provides programmatic access and retrieval (API) for the Dataset Descriptor
https://fairplus.github.io/Data-Maturity/docs/Indicators/
For each dataset, the hosting environment maintains a globally unique, persistent and resolvable identifier for access and retrieval
https://orcid.org/0000-0002-7702-4495
DSM-3-H2
For each dataset, the hosting environment maintains a globally unique, persistent and resolvable identifier for access and retrieval
https://fairplus.github.io/Data-Maturity/docs/Indicators/
If applicable, Dataset hosting environment offers dataset-level authentication and authorisation capabilities
https://orcid.org/0000-0002-7702-4495
DSM-3-H4
If applicable, Dataset hosting environment offers dataset-level authentication and authorisation capabilities
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Data Hosting Environment provides semantic querying capability
https://orcid.org/0000-0002-7702-4495
DSM-4-H2
Data Hosting Environment provides semantic querying capability
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Dataset's Metadata is NOT searchable via keywords or elements within the Descriptor
https://orcid.org/0000-0002-7702-4495
DSM-0-H3
Dataset's Metadata is NOT searchable via keywords or elements within the Descriptor
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Metadata hosting environment offers the capability to browse and search contents of the Dataset Descriptor
https://orcid.org/0000-0002-7702-4495
DSM-1-H4
Metadata hosting environment offers the capability to browse and search contents of the Dataset Descriptor
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Data hosting environment offers the capability to browse and search related Datasets
https://orcid.org/0000-0002-7702-4495
DSM-2-H3
Data hosting environment offers the capability to browse and search related Datasets
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Data Hosting environment utilises controlled terms and/or ontology terms to search within Dataset content.
https://orcid.org/0000-0002-7702-4495
DSM-3-H3
Data Hosting environment utilises controlled terms and/or ontology terms to search within Dataset content.
https://fairplus.github.io/Data-Maturity/docs/Indicators/
Data Hosting Environment provides semantic querying capability
https://orcid.org/0000-0002-7702-4495
DSM-4-H2
Data Hosting Environment provides semantic querying capability
https://fairplus.github.io/Data-Maturity/docs/Indicators/
https://orcid.org/0000-0002-7702-4495
AL 15.3.22: Added for FAIR Cookbook integration.
data scientist
https://orcid.org/0000-0002-7702-4495
AL 15.3.22: Added for FAIR Cookbook integration.
ontologist
https://orcid.org/0000-0002-7702-4495
AL 15.3.22: Added for FAIR Cookbook integration.
terminology manager
https://orcid.org/0000-0002-7702-4495
AL 16.3.22 Added as part of FAIR cookbook integration.
software engineer
https://orcid.org/0000-0002-7702-4495
AL 16.3.22: Added as part of FAIR Cookbook integration.
system administrator
https://orcid.org/0000-0002-7702-4495
AL 16.3.22: Added as part of FAIR Cookbook integration. Useful for such roles to be able to understand the cost of e.g. making things FAIR (and how to deal with it), but also the benefits.
procurement officer
Metadata creation concerns the creation of data that provides information about characteristics, aspects or context of other data entities such as a dataset or other digital or analog object.
https://orcid.org/0000-0002-7702-4495
metadata creation
Metadata creation concerns the creation of data that provides information about characteristics, aspects or context of other data entities such as a dataset or other digital or analog object.
AL 22.3.22, and see also FRBR User Tasks at: https://sites.google.com/site/metadatastandards/chapter-6/6-3-frbr-user-tasks.
Obsolete Class
example to be eventually removed
example to be eventually removed
failed exploratory term
The term was used in an attempt to structure part of the ontology but in retrospect failed to do a good job
Person:Alan Ruttenberg
failed exploratory term
metadata complete
Class has all its metadata, but is either not guaranteed to be in its final location in the asserted IS_A hierarchy or refers to another class that is not complete.
metadata complete
organizational term
Term created to ease viewing/sort terms for development purpose, and will not be included in a release
organizational term
ready for release
Class has undergone final review, is ready for use, and will be included in the next release. Any class lacking "ready_for_release" should be considered likely to change place in hierarchy, have its definition refined, or be obsoleted in the next release. Those classes deemed "ready_for_release" will also derived from a chain of ancestor classes that are also "ready_for_release."
ready for release
metadata incomplete
Class is being worked on; however, the metadata (including definition) are not complete or sufficiently clear to the branch editors.
metadata incomplete
uncurated
Nothing done yet beyond assigning a unique class ID and proposing a preferred term.
uncurated
pending final vetting
All definitions, placement in the asserted IS_A hierarchy and required minimal metadata are complete. The class is awaiting a final review by someone other than the term editor.
pending final vetting
placeholder removed
placeholder removed
terms merged
An editor note should explain what were the merged terms and the reason for the merge.
terms merged
term imported
This is to be used when the original term has been replaced by a term imported from an other ontology. An editor note should indicate what is the URI of the new term to use.
term imported
term split
This is to be used when a term has been split in two or more new terms. An editor note should indicate the reason for the split and indicate the URIs of the new terms created.
term split
universal
Hard to give a definition for. Intuitively a "natural kind" rather than a collection of any old things, which a class is able to be, formally. At the meta level, universals are defined as positives, are disjoint with their siblings, have single asserted parents.
Alan Ruttenberg
A Formal Theory of Substances, Qualities, and Universals, http://ontology.buffalo.edu/bfo/SQU.pdf
universal
defined class
A defined class is a class that is defined by a set of logically necessary and sufficient conditions but is not a universal
"definitions", in some readings, always are given by necessary and sufficient conditions. So one must be careful (and this is difficult sometimes) to distinguish between defined classes and universal.
Alan Ruttenberg
defined class
named class expression
A named class expression is a logical expression that is given a name. The name can be used in place of the expression.
named class expressions are used in order to have more concise logical definition but their extensions may not be interesting classes on their own. In languages such as OWL, with no provisions for macros, these show up as actuall classes. Tools may with to not show them as such, and to replace uses of the macros with their expansions
Alan Ruttenberg
named class expression
to be replaced with external ontology term
Terms with this status should eventually replaced with a term from another ontology.
Alan Ruttenberg
group:OBI
to be replaced with external ontology term
requires discussion
A term that is metadata complete, has been reviewed, and problems have been identified that require discussion before release. Such a term requires editor note(s) to identify the outstanding issues.
Alan Ruttenberg
group:OBI
requires discussion
The term was added to the ontology on the assumption it was in scope, but it turned out later that it was not.
This obsolesence reason should be used conservatively. Typical valid examples are: un-necessary grouping classes in disease ontologies, a phenotype term added on the assumption it was a disease.
https://github.com/information-artifact-ontology/ontology-metadata/issues/77
https://orcid.org/0000-0001-5208-3432
out of scope