terms4FAIRskills describes the competencies, skills and knowledge associated with making and keeping data FAIR.
This terminology applies to a variety of use cases, including: assisting with the creation and assessment of stewardship curricula; facilitating the annotation, discovery and evaluation of FAIR-enabling materials (e.g. training) and resources; enabling the formalisation of job descriptions and CVs with recognised, structured competencies.
It is intended to be of use to trainers who teach FAIR data skills, researchers who wish to identify skill gaps in their teams and managers who need to recruit individuals to relevant roles.
terms4FAIRskills (T4FS)
terms4FAIRskills by the terms4FAIRskills developers is licensed under CC BY 4.0. You are free to share (copy and redistribute the material in any medium or format) and adapt (remix, transform, and build upon the material) for any purpose, even commercially. The licensor cannot revoke these freedoms as long as you follow the license terms. You must give appropriate credit (by using the original ontology IRI for the whole ontology and original term IRIs for individual terms), provide a link to the license, and indicate if any changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
2025-02-09
definition The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions. 2012-04-05: Barry Smith The official OBI definition, explaining the meaning of a class or property: 'Shall be Aristotelian, formalized and normalized. Can be augmented with colloquial definitions' is terrible. Can you fix to something like: A statement of necessary and sufficient conditions explaining the meaning of an expression referring to a class or property. 
Alan Ruttenberg Your proposed definition is a reasonable candidate, except that it is very common that necessary and sufficient conditions are not given. Mostly they are necessary, occasionally they are necessary and sufficient or just sufficient. Often they use terms that are not themselves defined and so they effectively can't be evaluated by those criteria. On the specifics of the proposed definition: We don't have definitions of 'meaning' or 'expression' or 'property'. For 'reference' in the intended sense I think we use the term 'denotation'. For 'expression', I think you mean symbol, or identifier. For 'meaning' it differs for class and property. For class we want documentation that lets the intended reader determine whether an entity is an instance of the class, or not. For property we want documentation that lets the intended reader determine, given a pair of potential relata, whether the assertion that the relation holds is true. The 'intended reader' part suggests that we also specify who, we expect, would be able to understand the definition, and also generalizes over human and computer reader to include textual and logical definition. Personally, I am more comfortable weakening definition to documentation, with instructions as to what is desirable. We also have the outstanding issue of how to aim different definitions to different audiences. A clinical audience reading ChEBI wants a different sort of definition documentation/definition from a chemistry-trained audience, and similarly there is a need for a definition that is adequate for an ontologist to work with. PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> https://orcid.org/0000-0002-7702-4495 Allyson Lister 8.10.2019: Although the IAO definition has been included here, we have made a curation decision to allow non-Aristotelian definitions. definition The official definition, explaining the meaning of a class or property. Shall be Aristotelian, formalized and normalized. 
Can be augmented with colloquial definitions. IAO term editor Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people 20110707, MC: label update to term editor and definition modified accordingly. See https://github.com/information-artifact-ontology/IAO/issues/115. PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> Peter McQuilton https://orcid.org/0000-0002-7702-4495 term editor Name of editor entering the term in the file. The term editor is a point of contact for information regarding the term. The term editor may be, but is not always, the author of the definition, which may have been worked upon by several people IAO alternative label A label for a class or property that can be used to refer to the class or property instead of the preferred rdfs:label. Alternative labels should be used to indicate community- or context-specific labels, abbreviations, shorthand forms and the like. An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent) OBO Operations committee PERSON:Daniel Schober GROUP:OBI:<http://purl.obolibrary.org/obo/obi> Consider re-defining to: An alternative name for a class or property which can mean the same thing as the preferred name (semantically equivalent, narrow, broad or related). alternative label alternative term An alternative name for a class or property which means the same thing as the preferred name (semantically equivalent) IAO definition source Formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. 
EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007 PERSON:Daniel Schober Discussion on obo-discuss mailing-list, see http://bit.ly/hgm99w GROUP:OBI:<http://purl.obolibrary.org/obo/obi> Peter McQuilton https://orcid.org/0000-0002-7702-4495 definition source Formal citation, e.g. identifier in external database to indicate / attribute source(s) for the definition. Free text indicate / attribute source(s) for the definition. EXAMPLE: Author Name, URI, MeSH Term C04, PUBMED ID, Wiki uri on 31.01.2007 IAO term tracker item the URI for an OBI Terms ticket at sourceforge, such as https://sourceforge.net/p/obi/obi-terms/772/ An IRI or similar locator for a request or discussion of an ontology term. Person: Jie Zheng, Chris Stoeckert, Alan Ruttenberg Person: Jie Zheng, Chris Stoeckert, Alan Ruttenberg The 'tracker item' can associate a tracker with a specific ontology term. term tracker item Peter McQuilton 2019-10-17T13:38:16.742124Z alternative definition Peter McQuilton 2019-10-17T13:39:13.568975Z alternative definition source created by creation date This document is about information artifacts and their representations A (currently) primitive relation that relates an information artifact to an entity. 7/6/2009 Alan Ruttenberg. Following discussion with Jonathan Rees, and introduction of "mentions" relation. Weaken the is_about relationship to be primitive. We will try to build it back up by elaborating the various subproperties that are more precisely defined. Some currently missing phenomena that should be considered "about" are predications - "The only person who knows the answer is sitting beside me" , Allegory, Satire, and other literary forms that can be topical without explicitly mentioning the topic. 
person:Alan Ruttenberg Smith, Ceusters, Ruttenberg, 2000 years of philosophy is about inheres in this fragility is a characteristic of this vase this red color is a characteristic of this apple a relation between a specifically dependent continuant (the characteristic) and any other entity (the bearer), in which the characteristic depends on the bearer for its existence. inheres_in Note that this relation was previously called "inheres in", but was changed to be called "characteristic of" because BFO2 uses "inheres in" in a more restricted fashion. This relation differs from BFO2:inheres_in in two respects: (1) it does not impose a range constraint, and thus it allows qualities of processes, as well as of information entities, whereas BFO2 restricts inheres_in to only apply to independent continuants (2) it is declared functional, i.e. something can only be a characteristic of one thing. characteristic of bearer of this apple is bearer of this red color this vase is bearer of this fragility Inverse of characteristic_of A bearer can have many dependents, and its dependents can exist for different periods of time, but none of its dependents can exist when the bearer does not exist. bearer_of is bearer of has characteristic this catalysis function is a function of this enzyme a relation between a function and an independent continuant (the bearer), in which the function specifically depends on the bearer for its existence A function inheres in its bearer at all times for which the function exists, however the function need not be realized at all the times that the function exists. function_of is function of This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. 
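The notes above state that "characteristic of" is declared functional and that "bearer of" is its inverse. A minimal Python sketch of what those two declarations imply is given here. The assertion list and both function names are hypothetical illustrations, not part of the ontology, and the check assumes that different names denote different individuals (an OWL reasoner would otherwise infer that the two bearers are the same individual instead of reporting a conflict).

```python
from collections import defaultdict

# Hypothetical (characteristic, bearer) assertions for "characteristic of",
# reusing the examples given in the annotations.
assertions = [
    ("this red color", "this apple"),
    ("this fragility", "this vase"),
]


def functional_violations(pairs):
    """Return characteristics asserted to be characteristic of more than one bearer."""
    bearers = defaultdict(set)
    for characteristic, bearer in pairs:
        bearers[characteristic].add(bearer)
    return {c: bs for c, bs in bearers.items() if len(bs) > 1}


def bearer_of(pairs):
    """Derive the inverse relation: map each bearer to the set of its characteristics."""
    inverse = defaultdict(set)
    for characteristic, bearer in pairs:
        inverse[bearer].add(characteristic)
    return dict(inverse)
```

A bearer may appear with many characteristics in the result of `bearer_of`, whereas any characteristic listed by `functional_violations` would conflict with the functional declaration under the stated assumption.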
function of this red color is a quality of this apple a relation between a quality and an independent continuant (the bearer), in which the quality specifically depends on the bearer for its existence A quality inheres in its bearer at all times for which the quality exists. is quality of quality_of This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. quality of this investigator role is a role of this person a relation between a role and an independent continuant (the bearer), in which the role specifically depends on the bearer for its existence A role inheres in its bearer at all times for which the role exists, however the role need not be realized at all the times that the role exists. is role of role_of This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. role of this enzyme has function this catalysis function (more colloquially: this enzyme has this catalysis function) a relation between an independent continuant (the bearer) and a function, in which the function specifically depends on the bearer for its existence A bearer can have many functions, and its functions can exist for different periods of time, but none of its functions can exist when the bearer does not exist. A function need not be realized at all the times that the function exists. 
has_function has function this apple has quality this red color a relation between an independent continuant (the bearer) and a quality, in which the quality specifically depends on the bearer for its existence A bearer can have many qualities, and its qualities can exist for different periods of time, but none of its qualities can exist when the bearer does not exist. has_quality has quality this person has role this investigator role (more colloquially: this person has this role of investigator) a relation between an independent continuant (the bearer) and a role, in which the role specifically depends on the bearer for its existence A bearer can have many roles, and its roles can exist for different periods of time, but none of its roles can exist when the bearer does not exist. A role need not be realized at all the times that the role exists. has_role has role a relation between an independent continuant (the bearer) and a disposition, in which the disposition specifically depends on the bearer for its existence has disposition This relation is modeled after the BFO relation of the same name which was in BFO2, but is used in a more restricted sense - specifically, we model this relation as functional (inherited from characteristic-of). Note that this relation is now removed from BFO2020. disposition of Describes how a learning medium is intended to confer a competence or capability regarding a particular data stewardship activity, e.g. a presentation conferring competency in metadata creation. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-10-01T21:36:11.080721Z confers competency about Describes how a learning medium is intended to confer a competence or capability regarding a particular data stewardship activity, e.g. a presentation conferring competency in metadata creation. 
https://orcid.org/0000-0002-7702-4495 Describes how a learning medium is intended to confer knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation conferring competency in metadata. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-12-06 00:00:00 confers knowledge about Describes how a learning medium is intended to confer knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation conferring competency in metadata. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0002-7702-4495 Peter McQuilton requires/improves personal attribute Describes how a learning medium confers practical skills regarding a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop conferring a practical skill in repository access. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2020-12-06 00:00:00 confers practical skill about Describes how a learning medium confers practical skills regarding a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop conferring a practical skill in repository access. https://orcid.org/0000-0002-7702-4495 Describes how an expertise level or role (e.g. "data steward") is associated with a data stewardship activity (e.g. ‘workflow set-up and management’) to indicate whether someone has an awareness of the area, or an ability to do it, or expert knowledge of it. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Definition added 8.2.22 has/wants competency in Describes how an expertise level or role (e.g. "data steward") is associated with a data stewardship activity (e.g. ‘workflow set-up and management’) to indicate whether someone has an awareness of the area, or an ability to do it, or expert knowledge of it. 
https://orcid.org/0000-0002-7702-4495 Peter McQuilton https://orcid.org/0000-0002-7702-4495 has/wants knowledge about https://orcid.org/0000-0002-7702-4495 Peter McQuilton supports implementation of Desires/has a competence or capability acquired or applied in a specific context, e.g. producing a research output or deploying a service. A skill may be specified in a ‘skills user story’. A badge or certificate may provide evidence that a skill has been acquired, and a publication, personal profile, portfolio or CV may provide evidence that a skill has been applied. Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton has/wants practical skill about Desires/has a competence or capability acquired or applied in a specific context, e.g. producing a research output or deploying a service. A skill may be specified in a ‘skills user story’. A badge or certificate may provide evidence that a skill has been acquired, and a publication, personal profile, portfolio or CV may provide evidence that a skill has been applied. FAIR4S Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2021-03-24T15:33:43.476917Z has aptitude for Peter McQuilton https://orcid.org/0000-0002-7702-4495 Peter McQuilton 2021-03-22T15:25:13.443153Z contributes to the implementation of The learning media that creates a competence or capability. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0002-7702-4495 Allyson Lister 13.9.22: This class is created as the inverse of confers/requires competency about to allow consistent reasoning and access to the Data stewardship guidelines for Data stewardship activities. It needs to be revisited by the entire team to check the model. competency gained through The learning media that creates a competence or capability. https://orcid.org/0000-0002-7702-4495 Describes how a learning medium requires competency in a particular data stewardship activity in order for that learning medium to perform its function, e.g. 
a presentation requiring competency in metadata creation prior to engaging with that learning medium. requires competency about Describes how a learning medium requires competency in a particular data stewardship activity in order for that learning medium to perform its function, e.g. a presentation requiring competency in metadata creation prior to engaging with that learning medium. https://orcid.org/0000-0002-7702-4495 Describes how a learning medium requires knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation requiring knowledge of metadata. requires knowledge about Describes how a learning medium requires knowledge of a particular data stewardship technical concept in order for that learning medium to perform its function, e.g. a presentation requiring knowledge of metadata. https://orcid.org/0000-0002-7702-4495 Describes how a learning medium requires practical skills of a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop requiring a practical skill in repository access. requires practical skill about Describes how a learning medium requires practical skills of a particular data stewardship technical concept in order for that learning medium to perform its function e.g. a workshop requiring a practical skill in repository access. https://orcid.org/0000-0002-7702-4495 Describes how an evaluation indicator may confer compliance with a particular guideline or set of guidelines https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/22 confers compliance with Describes how an evaluation indicator may confer compliance with a particular guideline or set of guidelines https://orcid.org/0000-0002-7702-4495 A learning medium may demonstrate how to meet the requirements for a particular evaluation indicator(s). 
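The "requires competency about" and "confers competency about" relations defined above can support the skill-gap use case mentioned in the ontology description. The sketch below is only an illustration under stated assumptions: the learning-medium names, the dictionary layout and the helper function are hypothetical, and plain strings stand in for term IRIs. The activity names are taken from the examples in the definitions.

```python
# Hypothetical learning media annotated with two of the relations defined above.
# Keys and medium names are illustrative only; real annotations would use term IRIs.
learning_media = {
    "introductory metadata presentation": {
        "requires competency about": set(),
        "confers competency about": {"metadata creation"},
    },
    "workflow management workshop": {
        "requires competency about": {"metadata creation"},
        "confers competency about": {"workflow set-up and management"},
    },
}


def missing_prerequisites(medium, competencies):
    """Return the activities a learner still lacks before engaging with a medium."""
    required = learning_media[medium]["requires competency about"]
    return required - set(competencies)
```

A trainer could call `missing_prerequisites("workflow management workshop", learner_competencies)` and, if the result is non-empty, look for media whose "confers competency about" set covers the gap.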
https://orcid.org/0000-0002-7702-4495 https://github.com/terms4fairskills/FAIRterminology/issues/22 demonstrates how to fulfil the requirements for A learning medium may demonstrate how to meet the requirements for a particular evaluation indicator(s). https://orcid.org/0000-0002-7702-4495 reference URL entity Entity Julius Caesar Verdi’s Requiem the Second World War your body mass index BFO 2 Reference: In all areas of empirical inquiry we encounter general terms of two sorts. First are general terms which refer to universals or types: animal; tuberculosis; surgical procedure; disease. Second are general terms used to refer to groups of entities which instantiate a given universal but do not correspond to the extension of any subuniversal of that universal because there is nothing intrinsic to the entities in question by virtue of which they – and only they – are counted as belonging to the given group. Examples are: animal purchased by the Emperor; tuberculosis diagnosed on a Wednesday; surgical procedure performed on a patient from Stockholm; person identified as candidate for clinical trial #2056-555; person who is signatory of Form 656-PPV; painting by Leonardo da Vinci. Such terms, which represent what are called ‘specializations’ in [81 Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf An entity is anything that exists or has existed or will exist. (axiom label in BFO2 Reference: [001-001]) entity Entity doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. 
For example Werner Ceusters 'portions of reality' include 4 sorts, entities (as BFO construes them), universals, configurations, and relations. It is an open question as to whether entities as construed in BFO will at some point also include these other portions of reality. See, for example, 'How to track absolutely everything' at http://www.referent-tracking.com/_RTU/papers/CeustersICbookRevised.pdf per discussion with Barry Smith An entity is anything that exists or has existed or will exist. (axiom label in BFO2 Reference: [001-001]) continuant Continuant An entity that exists in full at any time in which it exists at all, persists through time while maintaining its identity and has no temporal parts. BFO 2 Reference: Continuant entities are entities which can be sliced to yield parts only along the spatial dimension, yielding for example the parts of your table which we call its legs, its top, its nails. ‘My desk stretches from the window to the door. It has spatial parts, and can be sliced (in space) in two. With respect to time, however, a thing is a continuant.’ [60, p. 240 Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. For example, in an expansion involving bringing in some of Ceusters' other portions of reality, questions are raised as to whether universals are continuants A continuant is an entity that persists, endures, or continues to exist through time while maintaining its identity. (axiom label in BFO2 Reference: [008-002]) if b is a continuant and if, for some t, c has_continuant_part b at t, then c is a continuant. (axiom label in BFO2 Reference: [126-001]) if b is a continuant and if, for some t, c is continuant_part of b at t, then c is a continuant. (axiom label in BFO2 Reference: [009-002]) if b is a material entity, then there is some temporal interval (referred to below as a one-dimensional temporal region) during which b exists. 
(axiom label in BFO2 Reference: [011-002]) (forall (x y) (if (and (Continuant x) (exists (t) (continuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [009-002] (forall (x y) (if (and (Continuant x) (exists (t) (hasContinuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [126-001] (forall (x) (if (Continuant x) (Entity x))) // axiom label in BFO2 CLIF: [008-002] (forall (x) (if (MaterialEntity x) (exists (t) (and (TemporalRegion t) (existsAt x t))))) // axiom label in BFO2 CLIF: [011-002] continuant Continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. For example, in an expansion involving bringing in some of Ceusters' other portions of reality, questions are raised as to whether universals are continuants A continuant is an entity that persists, endures, or continues to exist through time while maintaining its identity. (axiom label in BFO2 Reference: [008-002]) if b is a continuant and if, for some t, c has_continuant_part b at t, then c is a continuant. (axiom label in BFO2 Reference: [126-001]) if b is a continuant and if, for some t, c is continuant_part of b at t, then c is a continuant. (axiom label in BFO2 Reference: [009-002]) if b is a material entity, then there is some temporal interval (referred to below as a one-dimensional temporal region) during which b exists. 
(axiom label in BFO2 Reference: [011-002]) (forall (x y) (if (and (Continuant x) (exists (t) (continuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [009-002] (forall (x y) (if (and (Continuant x) (exists (t) (hasContinuantPartOfAt y x t))) (Continuant y))) // axiom label in BFO2 CLIF: [126-001] (forall (x) (if (Continuant x) (Entity x))) // axiom label in BFO2 CLIF: [008-002] (forall (x) (if (MaterialEntity x) (exists (t) (and (TemporalRegion t) (existsAt x t))))) // axiom label in BFO2 CLIF: [011-002] occurrent Occurrent An entity that has temporal parts and that happens, unfolds or develops through time. BFO 2 Reference: every occurrent that is not a temporal or spatiotemporal region is s-dependent on some independent continuant that is not a spatial region BFO 2 Reference: s-dependence obtains between every process and its participants in the sense that, as a matter of necessity, this process could not have existed unless these or those participants existed also. A process may have a succession of participants at different phases of its unfolding. Thus there may be different players on the field at different times during the course of a football game; but the process which is the entire game s-depends_on all of these players nonetheless. Some temporal parts of this process will s-depend_on only some of the players. Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. An example would be the sum of a process and the process boundary of another process. Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). 
The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. We write 'spr[e]' and 'spl[e]' respectively for the spread and spell of e, omitting mention of the frame. An occurrent is an entity that unfolds itself in time or it is the instantaneous boundary of such an entity (for example a beginning or an ending) or it is a temporal or spatiotemporal region which such an entity occupies_temporal_region or occupies_spatiotemporal_region. (axiom label in BFO2 Reference: [077-002]) Every occurrent occupies_spatiotemporal_region some spatiotemporal region. (axiom label in BFO2 Reference: [108-001]) b is an occurrent entity iff b is an entity that has temporal parts. (axiom label in BFO2 Reference: [079-001]) (forall (x) (if (Occurrent x) (exists (r) (and (SpatioTemporalRegion r) (occupiesSpatioTemporalRegion x r))))) // axiom label in BFO2 CLIF: [108-001] (forall (x) (iff (Occurrent x) (and (Entity x) (exists (y) (temporalPartOf y x))))) // axiom label in BFO2 CLIF: [079-001] occurent occurrent Occurrent doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. An example would be the sum of a process and the process boundary of another process. per discussion with Barry Smith Simons uses different terminology for relations of occurrents to regions: Denote the spatio-temporal location of a given occurrent e by 'spn[e]' and call this region its span. We may say an occurrent is at its span, in any larger region, and covers any smaller region. Now suppose we have fixed a frame of reference so that we can speak not merely of spatio-temporal but also of spatial regions (places) and temporal regions (times). The spread of an occurrent, (relative to a frame of reference) is the space it exactly occupies, and its spell is likewise the time it exactly occupies. 
We write 'spr[e]' and `spl[e]' respectively for the spread and spell of e, omitting mention of the frame. An occurrent is an entity that unfolds itself in time or it is the instantaneous boundary of such an entity (for example a beginning or an ending) or it is a temporal or spatiotemporal region which such an entity occupies_temporal_region or occupies_spatiotemporal_region. (axiom label in BFO2 Reference: [077-002]) Every occurrent occupies_spatiotemporal_region some spatiotemporal region. (axiom label in BFO2 Reference: [108-001]) b is an occurrent entity iff b is an entity that has temporal parts. (axiom label in BFO2 Reference: [079-001]) (forall (x) (if (Occurrent x) (exists (r) (and (SpatioTemporalRegion r) (occupiesSpatioTemporalRegion x r))))) // axiom label in BFO2 CLIF: [108-001] (forall (x) (iff (Occurrent x) (and (Entity x) (exists (y) (temporalPartOf y x))))) // axiom label in BFO2 CLIF: [079-001] ic IndependentContinuant a chair a heart a leg a molecule a spatial region an atom an orchestra. an organism the bottom right portion of a human torso the interior of your mouth b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002]) For any independent continuant b and any time t there is some spatial region r such that b is located_in r at t. (axiom label in BFO2 Reference: [134-001]) For every independent continuant b and time t during the region of time spanned by its life, there are entities which s-depends_on b during t. 
(axiom label in BFO2 Reference: [018-002]) (forall (x t) (if (IndependentContinuant x) (exists (r) (and (SpatialRegion r) (locatedInAt x r t))))) // axiom label in BFO2 CLIF: [134-001] (forall (x t) (if (and (IndependentContinuant x) (existsAt x t)) (exists (y) (and (Entity y) (specificallyDependsOnAt y x t))))) // axiom label in BFO2 CLIF: [018-002] (iff (IndependentContinuant a) (and (Continuant a) (not (exists (b t) (specificallyDependsOnAt a b t))))) // axiom label in BFO2 CLIF: [017-002] independent continuant b is an independent continuant = Def. b is a continuant which is such that there is no c and no t such that b s-depends_on c at t. (axiom label in BFO2 Reference: [017-002]) For any independent continuant b and any time t there is some spatial region r such that b is located_in r at t. (axiom label in BFO2 Reference: [134-001]) For every independent continuant b and time t during the region of time spanned by its life, there are entities which s-depends_on b during t. (axiom label in BFO2 Reference: [018-002]) (forall (x t) (if (IndependentContinuant x) (exists (r) (and (SpatialRegion r) (locatedInAt x r t))))) // axiom label in BFO2 CLIF: [134-001] (forall (x t) (if (and (IndependentContinuant x) (existsAt x t)) (exists (y) (and (Entity y) (specificallyDependsOnAt y x t))))) // axiom label in BFO2 CLIF: [018-002] (iff (IndependentContinuant a) (and (Continuant a) (not (exists (b t) (specificallyDependsOnAt a b t))))) // axiom label in BFO2 CLIF: [017-002] process Process a process of cell-division, a beating of the heart a process of meiosis a process of sleeping the course of a disease the flight of a bird the life of an organism your process of aging. p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. 
(axiom label in BFO2 Reference: [083-003]) BFO 2 Reference: The realm of occurrents is less pervasively marked by the presence of natural units than is the case in the realm of independent continuants. Thus there is here no counterpart of ‘object’. In BFO 1.0 ‘process’ served as such a counterpart. In BFO 2.0 ‘process’ is, rather, the occurrent counterpart of ‘material entity’. Those natural – as contrasted with engineered, which here means: deliberately executed – units which do exist in the realm of occurrents are typically either parasitic on the existence of natural units on the continuant side, or they are fiat in nature. Thus we can count lives; we can count football games; we can count chemical reactions performed in experiments or in chemical manufacturing. We cannot count the processes taking place, for instance, in an episode of insect mating behavior. Even where natural units are identifiable, for example cycles in a cyclical process such as the beating of a heart or an organism’s sleep/wake cycle, the processes in question form a sequence with no discontinuities (temporal gaps) of the sort that we find for instance where billiard balls or zebrafish or planets are separated by clear spatial gaps. Lives of organisms are process units, but they too unfold in a continuous series from other, prior processes such as fertilization, and they unfold in turn in continuous series of post-life processes such as post-mortem decay. Clear examples of boundaries of processes are almost always of the fiat sort (midnight, a time of death as declared in an operating theater or on a death certificate, the initiation of a state of war) (iff (Process a) (and (Occurrent a) (exists (b) (properTemporalPartOf b a)) (exists (c t) (and (MaterialEntity c) (specificallyDependsOnAt a c t))))) // axiom label in BFO2 CLIF: [083-003] process p is a process = Def. p is an occurrent that has temporal proper parts and for some time t, p s-depends_on some material entity at t. 
(axiom label in BFO2 Reference: [083-003]) (iff (Process a) (and (Occurrent a) (exists (b) (properTemporalPartOf b a)) (exists (c t) (and (MaterialEntity c) (specificallyDependsOnAt a c t))))) // axiom label in BFO2 CLIF: [083-003] disposition Disposition an atom of element X has the disposition to decay to an atom of element Y certain people have a predisposition to colon cancer children are innately disposed to categorize objects in certain ways. the cell wall is disposed to filter chemicals in endocytosis and exocytosis BFO 2 Reference: Dispositions exist along a strength continuum. Weaker forms of disposition are realized in only a fraction of triggering cases. These forms occur in a significant number of cases of a similar type. b is a disposition means: b is a realizable entity & b’s bearer is some material entity & b is such that if it ceases to exist, then its bearer is physically changed, & b’s realization occurs when and because this bearer is in some special physical circumstances, & this realization occurs in virtue of the bearer’s physical make-up. (axiom label in BFO2 Reference: [062-002]) If b is a realizable entity then for all t at which b exists, b s-depends_on some material entity at t. (axiom label in BFO2 Reference: [063-002]) (forall (x t) (if (and (RealizableEntity x) (existsAt x t)) (exists (y) (and (MaterialEntity y) (specificallyDepends x y t))))) // axiom label in BFO2 CLIF: [063-002] (forall (x) (if (Disposition x) (and (RealizableEntity x) (exists (y) (and (MaterialEntity y) (bearerOfAt x y t)))))) // axiom label in BFO2 CLIF: [062-002] disposition b is a disposition means: b is a realizable entity & b’s bearer is some material entity & b is such that if it ceases to exist, then its bearer is physically changed, & b’s realization occurs when and because this bearer is in some special physical circumstances, & this realization occurs in virtue of the bearer’s physical make-up. 
(axiom label in BFO2 Reference: [062-002]) If b is a realizable entity then for all t at which b exists, b s-depends_on some material entity at t. (axiom label in BFO2 Reference: [063-002]) (forall (x t) (if (and (RealizableEntity x) (existsAt x t)) (exists (y) (and (MaterialEntity y) (specificallyDepends x y t))))) // axiom label in BFO2 CLIF: [063-002] (forall (x) (if (Disposition x) (and (RealizableEntity x) (exists (y) (and (MaterialEntity y) (bearerOfAt x y t)))))) // axiom label in BFO2 CLIF: [062-002] realizable RealizableEntity the disposition of this piece of metal to conduct electricity. the disposition of your blood to coagulate the function of your reproductive organs the role of being a doctor the role of this boundary to delineate where Utah and Colorado meet A specifically dependent continuant that inheres in continuant entities and are not exhibited in full at every time in which it inheres in an entity or group of entities. The exhibition or actualization of a realizable entity is a particular manifestation, functioning or process that occurs under certain circumstances. To say that b is a realizable entity is to say that b is a specifically dependent continuant that inheres in some independent continuant which is not a spatial region and is of a type instances of which are realized in processes of a correlated type. (axiom label in BFO2 Reference: [058-002]) All realizable dependent continuants have independent continuants that are not spatial regions as their bearers. 
(axiom label in BFO2 Reference: [060-002]) (forall (x t) (if (RealizableEntity x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (bearerOfAt y x t))))) // axiom label in BFO2 CLIF: [060-002] (forall (x) (if (RealizableEntity x) (and (SpecificallyDependentContinuant x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (inheresIn x y)))))) // axiom label in BFO2 CLIF: [058-002] realizable entity To say that b is a realizable entity is to say that b is a specifically dependent continuant that inheres in some independent continuant which is not a spatial region and is of a type instances of which are realized in processes of a correlated type. (axiom label in BFO2 Reference: [058-002]) All realizable dependent continuants have independent continuants that are not spatial regions as their bearers. (axiom label in BFO2 Reference: [060-002]) (forall (x t) (if (RealizableEntity x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (bearerOfAt y x t))))) // axiom label in BFO2 CLIF: [060-002] (forall (x) (if (RealizableEntity x) (and (SpecificallyDependentContinuant x) (exists (y) (and (IndependentContinuant y) (not (SpatialRegion y)) (inheresIn x y)))))) // axiom label in BFO2 CLIF: [058-002] quality Quality the ambient temperature of this portion of air the color of a tomato the length of the circumference of your waist the mass of this piece of gold. the shape of your nose the shape of your nostril a quality is a specifically dependent continuant that, in contrast to roles and dispositions, does not require any further process in order to be realized. (axiom label in BFO2 Reference: [055-001]) If an entity is a quality at any time that it exists, then it is a quality at every time that it exists. 
(axiom label in BFO2 Reference: [105-001]) (forall (x) (if (Quality x) (SpecificallyDependentContinuant x))) // axiom label in BFO2 CLIF: [055-001] (forall (x) (if (exists (t) (and (existsAt x t) (Quality x))) (forall (t_1) (if (existsAt x t_1) (Quality x))))) // axiom label in BFO2 CLIF: [105-001] quality a quality is a specifically dependent continuant that, in contrast to roles and dispositions, does not require any further process in order to be realized. (axiom label in BFO2 Reference: [055-001]) If an entity is a quality at any time that it exists, then it is a quality at every time that it exists. (axiom label in BFO2 Reference: [105-001]) (forall (x) (if (Quality x) (SpecificallyDependentContinuant x))) // axiom label in BFO2 CLIF: [055-001] (forall (x) (if (exists (t) (and (existsAt x t) (Quality x))) (forall (t_1) (if (existsAt x t_1) (Quality x))))) // axiom label in BFO2 CLIF: [105-001] sdc SpecificallyDependentContinuant Reciprocal specifically dependent continuants: the function of this key to open this lock and the mutually dependent disposition of this lock: to be opened by this key of one-sided specifically dependent continuants: the mass of this tomato of relational dependent continuants (multiple bearers): John’s love for Mary, the ownership relation between John and this statue, the relation of authority between John and his subordinates. the disposition of this fish to decay the function of this heart: to pump blood the mutual dependence of proton donors and acceptors in chemical reactions the mutual dependence of the role predator and the role prey as played by two organisms in a given interaction the pink color of a medium rare piece of grilled filet mignon at its center the role of being a doctor the shape of this hole. the smell of this portion of mozzarella b is a specifically dependent continuant = Def. 
b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003]) Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc. (iff (SpecificallyDependentContinuant a) (and (Continuant a) (forall (t) (if (existsAt a t) (exists (b) (and (IndependentContinuant b) (not (SpatialRegion b)) (specificallyDependsOnAt a b t))))))) // axiom label in BFO2 CLIF: [050-003] specifically dependent continuant b is a specifically dependent continuant = Def. b is a continuant & there is some independent continuant c which is not a spatial region and which is such that b s-depends_on c at every time t during the course of b’s existence. (axiom label in BFO2 Reference: [050-003]) Specifically dependent continuant doesn't have a closure axiom because the subclasses don't necessarily exhaust all possibilities. We're not sure what else will develop here, but for example there are questions such as what are promises, obligation, etc. per discussion with Barry Smith (iff (SpecificallyDependentContinuant a) (and (Continuant a) (forall (t) (if (existsAt a t) (exists (b) (and (IndependentContinuant b) (not (SpatialRegion b)) (specificallyDependsOnAt a b t))))))) // axiom label in BFO2 CLIF: [050-003] role Role John’s role of husband to Mary is dependent on Mary’s role of wife to John, and both are dependent on the object aggregate comprising John and Mary as member parts joined together through the relational quality of being married. 
the priest role the role of a boundary to demarcate two neighboring administrative territories the role of a building in serving as a military target the role of a stone in marking a property boundary the role of subject in a clinical trial the student role A realizable entity the manifestation of which brings about some result or end that is not essential to a continuant in virtue of the kind of thing that it is but that can be served or participated in by that kind of continuant in some kinds of natural, social or institutional contexts. BFO 2 Reference: One major family of examples of non-rigid universals involves roles, and ontologies developed for corresponding administrative purposes may consist entirely of representatives of entities of this sort. Thus ‘professor’, defined as follows, b instance_of professor at t =Def. there is some c, c instance_of professor role & c inheres_in b at t. denotes a non-rigid universal and so also do ‘nurse’, ‘student’, ‘colonel’, ‘taxpayer’, and so forth. (These terms are all, in the jargon of philosophy, phase sortals.) By using role terms in definitions, we can create a BFO conformant treatment of such entities drawing on the fact that, while an instance of professor may be simultaneously an instance of trade union member, no instance of the type professor role is also (at any time) an instance of the type trade union member role (any more than any instance of the type color is at any time an instance of the type length). If an ontology of employment positions should be defined in terms of roles following the above pattern, this enables the ontology to do justice to the fact that individuals instantiate the corresponding universals – professor, sergeant, nurse – only during certain phases in their lives. 
https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 b is a role means: b is a realizable entity & b exists because there is some single bearer that is in some special physical, social, or institutional set of circumstances in which this bearer does not have to be & b is not such that, if it ceases to exist, then the physical make-up of the bearer is thereby changed. (axiom label in BFO2 Reference: [061-001]) (forall (x) (if (Role x) (RealizableEntity x))) // axiom label in BFO2 CLIF: [061-001] role b is a role means: b is a realizable entity & b exists because there is some single bearer that is in some special physical, social, or institutional set of circumstances in which this bearer does not have to be & b is not such that, if it ceases to exist, then the physical make-up of the bearer is thereby changed. (axiom label in BFO2 Reference: [061-001]) (forall (x) (if (Role x) (RealizableEntity x))) // axiom label in BFO2 CLIF: [061-001] gdc GenericallyDependentContinuant The entries in your database are patterns instantiated as quality instances in your hard drive. The database itself is an aggregate of such patterns. When you create the database you create a particular instance of the generically dependent continuant type database. Each entry in the database is an instance of the generically dependent continuant type IAO: information content entity. the pdf file on your laptop, the pdf file that is a copy thereof on my laptop the sequence of this protein molecule; the sequence that is a copy thereof in that protein molecule. b is a generically dependent continuant = Def. b is a continuant that g-depends_on one or more other entities. (axiom label in BFO2 Reference: [074-001]) (iff (GenericallyDependentContinuant a) (and (Continuant a) (exists (b t) (genericallyDependsOnAt a b t)))) // axiom label in BFO2 CLIF: [074-001] generically dependent continuant b is a generically dependent continuant = Def. 
b is a continuant that g-depends_on one or more other entities. (axiom label in BFO2 Reference: [074-001]) (iff (GenericallyDependentContinuant a) (and (Continuant a) (exists (b t) (genericallyDependsOnAt a b t)))) // axiom label in BFO2 CLIF: [074-001] function Function the function of a hammer to drive in nails the function of a heart pacemaker to regulate the beating of a heart through electricity the function of amylase in saliva to break down starch into sugar BFO 2 Reference: In the past, we have distinguished two varieties of function, artifactual function and biological function. These are not asserted subtypes of BFO:function however, since the same function – for example: to pump, to transport – can exist both in artifacts and in biological entities. The asserted subtypes of function that would be needed in order to yield a separate monohierarchy are not artifactual function, biological function, etc., but rather transporting function, pumping function, etc. A function is a disposition that exists in virtue of the bearer’s physical make-up and this physical make-up is something the bearer possesses because it came into being, either through evolution (in the case of natural biological entities) or through intentional design (in the case of artifacts), in order to realize processes of a certain sort. (axiom label in BFO2 Reference: [064-001]) (forall (x) (if (Function x) (Disposition x))) // axiom label in BFO2 CLIF: [064-001] function A function is a disposition that exists in virtue of the bearer’s physical make-up and this physical make-up is something the bearer possesses because it came into being, either through evolution (in the case of natural biological entities) or through intentional design (in the case of artifacts), in order to realize processes of a certain sort. 
(axiom label in BFO2 Reference: [064-001]) (forall (x) (if (Function x) (Disposition x))) // axiom label in BFO2 CLIF: [064-001] data item Data items include counts of things, analyte concentrations, and statistical summaries. An information content entity that is intended to be a truthful statement about something (modulo, e.g., measurement precision or other systematic errors) and is constructed/acquired by a method which reliably tends to produce (approximately) truthful statements. 2/2/2009 Alan and Bjoern discussing FACS run output data. This is a data item because it is about the cell population. Each element records an event and is typically further composed of a set of measurement data items that record the fluorescent intensity stimulated by one of the lasers. 2009-03-16: data item deliberately ambiguous: we merged data set and datum to be one entity, not knowing how to define singular versus plural. So data item is more general than datum. 2009-03-16: removed datum as alternative term as datum specifically refers to singular form, and is thus not an exact synonym. 2014-03-31: See discussion at http://odontomachus.wordpress.com/2014/03/30/aboutness-objects-propositions/ JAR: datum -- well, this will be very tricky to define, but maybe some information-like stuff that might be put into a computer and that is meant, by someone, to denote and/or to be interpreted by some process... I would include lists, tables, sentences... I think I might defer to Barry, or to Brian Cantwell Smith JAR: A data item is an approximately justified approximately true approximate belief PERSON: Alan Ruttenberg PERSON: Chris Stoeckert PERSON: Jonathan Rees data data item information content entity Examples of information content entities include journal articles, data, graphical layouts, and graphs. A generically dependent continuant that is about some thing. 
2014-03-10: The use of "thing" is intended to be general enough to include universals and configurations (see https://groups.google.com/d/msg/information-ontology/GBxvYZCk1oc/-L6B5fSBBTQJ). information_content_entity 'is_encoded_in' some digital_entity in obi before split (040907). information_content_entity 'is_encoded_in' some physical_document in obi before split (040907). Previous. An information content entity is a non-realizable information entity that 'is encoded in' some digital or physical entity. PERSON: Chris Stoeckert OBI_0000142 information content entity curation status specification The curation status of the term. The allowed values come from an enumerated list of predefined terms. See the specification of these instances for more detailed definitions of each enumerated value. Better to represent curation as a process with parts and then relate labels to that process (in IAO meeting) PERSON:Bill Bug GROUP:OBI:<http://purl.obolibrary.org/obo/obi> OBI_0000266 curation status specification data about an ontology part Data about an ontology part is a data item about a part of an ontology, for example a term Person:Alan Ruttenberg data about an ontology part obsolescence reason specification The reason for which a term has been deprecated. The allowed values come from an enumerated list of predefined terms. See the specification of these instances for more detailed definitions of each enumerated value. The creation of this class has been inspired in part by Werner Ceusters' paper, Applying evolutionary terminology auditing to the Gene Ontology. PERSON: Alan Ruttenberg PERSON: Melanie Courtot obsolescence reason specification denotator type The Basic Formal Ontology ontology makes a distinction between Universals and defined classes, where the former are "natural kinds" and the latter arbitrary collections of entities. A denotator type indicates how a term should be interpreted from an ontological perspective. 
Alan Ruttenberg Barry Smith, Werner Ceusters denotator type Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a pipe ( | ). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 pipe separated values Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a pipe ( | ). RDMT. https://terms.codata.org/rdmt/pipe-separated-values Software preservation involves the collection and long-term storage of software for archiving as well as maintaining availability and accessibility. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc https://orcid.org/0000-0002-7702-4495 AL 8.2.22: Refactored "software review and preservation" to "software preservation", as the review process is not within the remit of terms4FAIRskills. Added definition and source. software preservation Software preservation involves the collection and long-term storage of software for archiving as well as maintaining availability and accessibility. https://orcid.org/0000-0002-7702-4495 8.2.22 In accessing a repository one uses a client (application) to discover relevant digital objects within a repository, and then retrieve a copy of a desired digital object. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 repository access In accessing a repository one uses a client (application) to discover relevant digital objects within a repository, and then retrieve a copy of a desired digital object. Definition from now-deprecated CASRAI Glossary term: repository-access Include FAIR and open research in the strategic framework for the organization and set objectives and timeframe. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang develop fair and open research vision The activity of developing an open research strategy and vision. 
Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang develop open research strategy and vision The activity of developing an open research strategy and vision. PMQ 3.2022 Statement of an organisation’s processes for the management of a specified set of data assets. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data management policy Statement of an organisation’s processes for the management of a specified set of data assets. RDMT. https://terms.codata.org/rdmt/data-management-policy Understand how the governing principles of research integrity and FAIR overlap Angus Whyte Celia van Gelder understand research code of conduct To be able to choose the appropriate reporting guideline/checklist for your data, based on community-adopted standards. https://orcid.org/0000-0003-2687-1982 choosing the appropriate reporting guideline for your data To be able to choose the appropriate reporting guideline/checklist for your data, based on community-adopted standards. [PMQ] A specific deed, action, function or sphere of action in relation with the role of data stewardship Yann Le Franc https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Set of actions carried out during data stewardship processes data stewardship activity A series of potentially destructive or irrevocable changes to a piece of data or a file. Common munging operations include removing punctuation or html tags, data parsing, filtering, and transformation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data munging A series of potentially destructive or irrevocable changes to a piece of data or a file. Common munging operations include removing punctuation or html tags, data parsing, filtering, and transformation. 
Definition from now-deprecated CASRAI Glossary term: data-munging https://orcid.org/0000-0003-2687-1982 interoperability of digital assets Bin for Skills related to Resource management leightonlc skills for resource management An activity within archiving in which specific items of data are maintained over time so that they can still be accessed and understood through changes in technology. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Conservation preservation An activity within archiving in which specific items of data are maintained over time so that they can still be accessed and understood through changes in technology. RDMT. https://terms.codata.org/rdmt/preservation Data Categorization Data Classification https://orcid.org/0000-0003-2687-1982 2021-02-17T22:30:31.531624Z data categorisation Curation process on a data object by which it receives a persistent identifier (PID) from a trusted registration authority. Registration must be accompanied by the step(s) to create and submit metadata describing the object to the registry. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data registration Curation process on a data object by which it receives a persistent identifier (PID) from a trusted registration authority. Registration must be accompanied by the step(s) to create and submit metadata describing the object to the registry. RDMT. https://terms.codata.org/rdmt/data-registration The capability to communicate, execute programs, or transfer data among various functional units in a useful and meaningful manner that requires the user to have little or no knowledge of the unique characteristics of those units. Foundational, syntactic, and semantic interoperability are the three necessary aspects of interoperability. 
v interoperability The capability to communicate, execute programs, or transfer data among various functional units in a useful and meaningful manner that requires the user to have little or no knowledge of the unique characteristics of those units. Foundational, syntactic, and semantic interoperability are the three necessary aspects of interoperability. Definition from now-deprecated CASRAI Glossary term: interoperability Exposing data is the activity of exposing your data to collaborators, the public, or other interested parties. A data producer makes the data accessible to external users in a machine- and/or human-readable way. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 15.3.22: Was "expose your data", which was not in the style of other term labels. Also updated definition, but might need revisiting later for a more formal definition. exposing data Exposing data is the activity of exposing your data to collaborators, the public, or other interested parties. A data producer makes the data accessible to external users in a machine- and/or human-readable way. [PMQ 3.2022, AL 15.3.22] Bin for Aptitudes related to Workflow technologies management. leightonlc aptitudes for workflow technologies management A standard that is widely accepted and used, but lacks formal approval by a recognized standards developing organization (e.g., the QWERTY keyboard). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 de facto standard A standard that is widely accepted and used, but lacks formal approval by a recognized standards developing organization (e.g., the QWERTY keyboard). Definition from now-deprecated CASRAI Glossary term: de-facto-standard Object, event or phenomenon about which data are stored in a database and which has intermediate representation in a Data Model. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data entity Object, event or phenomenon about which data are stored in a database and which has intermediate representation in a Data Model. RDMT. https://terms.codata.org/rdmt/data-entity Data that are being received, processed and stored at the time of their occurrence with only small delays. Examples include: stock quotes, manufacturing statistics, Web server loads, data warehouse activity and sensor feeds to data collectors. Real-time data are often used for navigation or tracking. Real-time data are data streams that are typically generated by sensors and received via direct networking connections. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 real-time data Data that are being received, processed and stored at the time of their occurrence with only small delays. Examples include: stock quotes, manufacturing statistics, Web server loads, data warehouse activity and sensor feeds to data collectors. Real-time data are often used for navigation or tracking. Real-time data are data streams that are typically generated by sensors and received via direct networking connections. RDMT. https://terms.codata.org/rdmt/real-time-data Detect and analyse the security risks of storage periodically, and minimize the impact of the risks detected Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang storage security risk assessment and mitigation The ability to get access to a computer or a network from a remote distance. Access may be through an Internet service provider (ISP) or through a dedicated line between a computer or a remote local area network and the central or main corporate local area network. A dedicated line is more expensive and less flexible but offers faster data rates. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 remote access The ability to get access to a computer or a network from a remote distance. Access may be through an Internet service provider (ISP) or through a dedicated line between a computer or a remote local area network and the central or main corporate local area network. A dedicated line is more expensive and less flexible but offers faster data rates. Definition from now-deprecated CASRAI Glossary term: remote-access The activity of supervision of other people to ensure FAIR data practices. Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 fair stewardship supervising The activity of supervision of other people to ensure FAIR data practices. PMQ Bin for Skills needed for High performance computing management. leightonlc skills related to high performance computing management Ability to select the appropriate FAIR metrics among the existing ones in relation to the type of digital object concerned. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc fair metrics selection skills A collection of data items organized as a set of formally-described tables from which data can be accessed or reassembled in many different ways without having to reorganize the database tables. The standard user and application program interface to a relational database is the structured query language (SQL). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 relational database A collection of data items organized as a set of formally-described tables from which data can be accessed or reassembled in many different ways without having to reorganize the database tables. The standard user and application program interface to a relational database is the structured query language (SQL). 
Definition from now-deprecated CASRAI Glossary term: relational-database Data that could not lead to the identification of a specific object of interest. These may be data that have been de-identified, or that could not lead to identifiable information in the first place. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 6.5.22: Modified CASRAI definition to distinguish this term from 'non personally identifiable information.' Further work may determine whether or not both terms are required. non identifiable data Data that could not lead to the identification of a specific object of interest. These may be data that have been de-identified, or that could not lead to identifiable information in the first place. Modified by AL on 6.5.22 from CASRAI. https://casrai.org/term/non-identifiable-data/ database developer Managing and promoting the use of assets from their point of creation to ensure that they are fit for contemporary purpose and available for discovery and reuse. For dynamic datasets this may mean continuous enrichment or updating to keep them fit for purpose. Higher levels of curation will also involve links with annotation and with other published materials. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 curation Managing and promoting the use of assets from their point of creation to ensure that they are fit for contemporary purpose and available for discovery and reuse. For dynamic datasets this may mean continuous enrichment or updating to keep them fit for purpose. Higher levels of curation will also involve links with annotation and with other published materials. RDMT. https://terms.codata.org/rdmt/curation https://orcid.org/0000-0003-2687-1982 A1. 
(meta)data are retrievable by their identifier using a standardised communications protocol Control over time of data, computer code, software, and documents that allows for the ability to revert to a previous revision, which is critical for data traceability, tracking edits, and correcting mistakes. Version control generates a (changed) copy of a data object that is uniquely labelled with a version number. The intent is to track changes to a data object, by making versioned copies. Note that a version is different from a backup copy, which is typically a copy made at a specific point in time, or a replica. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Versioning, Revision control, Source control version control Control over time of data, computer code, software, and documents that allows for the ability to revert to a previous revision, which is critical for data traceability, tracking edits, and correcting mistakes. Version control generates a (changed) copy of a data object that is uniquely labelled with a version number. The intent is to track changes to a data object, by making versioned copies. Note that a version is different from a backup copy, which is typically a copy made at a specific point in time, or a replica. RDMT. https://terms.codata.org/rdmt/version-control Understand the FAIR and open research practices, and the research landscape / current data management practices in the organization. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understanding fair and open research challenges in your organization ethical application of patents, licenses System that allows users to be granted access to a database under specified conditions. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data access protocol System that allows users to be granted access to a database under specified conditions. RDMT. 
https://terms.codata.org/rdmt/data-access-protocol Generic concept referring to all kinds of procedures being executed on data at any point in the data lifecycle. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data processing Generic concept referring to all kinds of procedures being executed on data at any point in the data lifecycle. RDMT. https://terms.codata.org/rdmt/data-processing Set of measures that are used by a repository to form aggregations of data objects (including collections and metadata) to describe the properties of data objects, to register PIDs, to build the PID records, to link between all components, and to set up the containers (in the form of the software stack) that are used to store all components. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data organization Set of measures that are used by a repository to form aggregations of data objects (including collections and metadata) to describe the properties of data objects, to register PIDs, to build the PID records, to link between all components, and to set up the containers (in the form of the software stack) that are used to store all components. RDMT. https://terms.codata.org/rdmt/data-organisation Recovery and/or transformation and digitization of dark data and at-risk data so that they can be preserved, accessed, shared, and used. Data rescue also involves the addition of rich metadata to make the content understandable and more easily re-usable. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data rescue Recovery and/or transformation and digitization of dark data and at-risk data so that they can be preserved, accessed, shared, and used. Data rescue also involves the addition of rich metadata to make the content understandable and more easily re-usable. RDMT. 
https://terms.codata.org/rdmt/data-rescue Anticipate possible implications of the research and making its outputs FAIR, reflecting on motivations and areas of uncertainty Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understand societal impact of research To be able to search using software with a GUI or terminal access. For example using BLAST or genomic search tools in the life sciences. https://orcid.org/0000-0003-2687-1982 Algorithm searching Programmatic search batch search searching via algorithms and software To be able to search using software with a GUI or terminal access. For example using BLAST or genomic search tools in the life sciences. [PMQ] Evaluation of an organisation, system, group, project or product with respect to its data and processes around this, often in accordance with a standard, guide, or framework used to structure the work. This can involve assessing, describing, and classifying any data held. An audit can be carried out internally by those who have access to the data or participate in related processes regularly, or by an independent, external actor. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 audit Evaluation of an organisation, system, group, project or product with respect to its data and processes around this, often in accordance with a standard, guide, or framework used to structure the work. This can involve assessing, describing, and classifying any data held. An audit can be carried out internally by those who have access to the data or participate in related processes regularly, or by an independent, external actor. RDMT. https://terms.codata.org/rdmt/audit https://orcid.org/0000-0003-2687-1982 2020-10-01T21:03:37.147424Z wiki site Bin for Skills needed for Authorization management. 
leightonlc skills related to authorisation management The term storage management encompasses the technologies and processes organizations use to maximize or improve the performance of their data storage resources. It is a broad category that includes virtualization, replication, mirroring, security, compression, traffic analysis, process automation, storage provisioning and related techniques. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel storage management The term storage management encompasses the technologies and processes organizations use to maximize or improve the performance of their data storage resources. It is a broad category that includes virtualization, replication, mirroring, security, compression, traffic analysis, process automation, storage provisioning and related techniques. (webopedia.com/TERM/S/storage_management.html)[VDA] Central repository for all or significant parts of the data that an organisation’s various business systems collect, containing harmonised, highly-structured, quality data integrated from multiple sources. Data warehousing emphasises the capture of data from diverse sources for useful analysis and access, but does not generally start from the point-of-view of the end user who may need access to specialised data marts. There are two approaches to data warehousing: The top-down approach spins off data marts for specific groups of users after the complete data warehouse has been created. The bottom-up approach builds the data marts first and then combines them into a single, all-encompassing data warehouse. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data warehouse Central repository for all or significant parts of the data that an organisation’s various business systems collect, containing harmonised, highly-structured, quality data integrated from multiple sources. 
Data warehousing emphasises the capture of data from diverse sources for useful analysis and access, but does not generally start from the point-of-view of the end user who may need access to specialised data marts. There are two approaches to data warehousing: The top-down approach spins off data marts for specific groups of users after the complete data warehouse has been created. The bottom-up approach builds the data marts first and then combines them into a single, all-encompassing data warehouse. RDMT. https://terms.codata.org/rdmt/data-warehouse Serious problem caused by one or more ineffective data analysis processes. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data driven disaster Serious problem caused by one or more ineffective data analysis processes. RDMT. https://terms.codata.org/rdmt/data-driven-disaster An identifier that uniquely distinguishes one set of data from all others. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data identifier An identifier that uniquely distinguishes one set of data from all others. Definition from now-deprecated CASRAI Glossary term: data-identifier Protecting sensitive data from unauthorised access by encrypting the data and storing different portions of a file on different servers. An unauthorised person would need to know the locations of the servers containing the parts, be able to get access to each server, know what data to combine, and how to decrypt it. Data splitting can be made even more effective by periodically retrieving and recombining the parts, and then splitting the data in a different way among different servers, and using a different encryption key. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data splitting Protecting sensitive data from unauthorised access by encrypting the data and storing different portions of a file on different servers. 
An unauthorised person would need to know the locations of the servers containing the parts, be able to get access to each server, know what data to combine, and how to decrypt it. Data splitting can be made even more effective by periodically retrieving and recombining the parts, and then splitting the data in a different way among different servers, and using a different encryption key. RDMT. https://terms.codata.org/rdmt/data-splitting Monitor the status of information access of different stakeholder groups. Evaluate new access requests and authorize or decline them. Update the organizational information access overview. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang manage access Digital entity properties that are generated by the data management system (e.g., creation time; owner; storage location; data retention period; the length of time a digital entity will be retained). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 system metadata Digital entity properties that are generated by the data management system (e.g., creation time; owner; storage location; data retention period; the length of time a digital entity will be retained). RDMT. https://terms.codata.org/rdmt/system-metadata Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 tactical/short-term planning Electronic version of the paper record that doctors have traditionally maintained for their patients and which is typically only accessible within the facility or office that controls it. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 electronic medical record Electronic version of the paper record that doctors have traditionally maintained for their patients and which is typically only accessible within the facility or office that controls it. RDMT. 
https://terms.codata.org/rdmt/electronic-medical-record Bin for Aptitudes related to Provenance information management. leightonlc aptitudes for provenance information management Formalising and documenting existing processes and events. A first step in analysing a system of objects with which users interact is to identify each object and its relationship to other objects. This process is called data modelling and results in a picture of object relationships. Data modellers often use multiple models to view the same data and ensure that all processes, entities, relationships and data flows have been identified. There are several different approaches to data modelling, including: Conceptual Data Modelling (identifies the highest-level relationships between different entities); Enterprise Data Modelling (similar to conceptual data modelling, but addresses the unique requirements of a specific organisation); Logical Data Modelling (illustrates the specific entities, attributes and relationships involved in a business function. Serves as the basis for the creation of the physical data model); Physical Data Modelling (represents an application and database-specific implementation of a logical data model). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data modeling Formalising and documenting existing processes and events. A first step in analysing a system of objects with which users interact is to identify each object and its relationship to other objects. This process is called data modelling and results in a picture of object relationships. Data modellers often use multiple models to view the same data and ensure that all processes, entities, relationships and data flows have been identified. 
There are several different approaches to data modelling, including: Conceptual Data Modelling (identifies the highest-level relationships between different entities); Enterprise Data Modelling (similar to conceptual data modelling, but addresses the unique requirements of a specific organisation); Logical Data Modelling (illustrates the specific entities, attributes and relationships involved in a business function. Serves as the basis for the creation of the physical data model); Physical Data Modelling (represents an application and database-specific implementation of a logical data model). RDMT. https://terms.codata.org/rdmt/data-modelling To be able to search and understand the results from dataset aggregator sites. To understand the implications of the provenance of the data and how to integrate and analyse data with differing metadata. https://orcid.org/0000-0003-2687-1982 Searching aggregator sites use of aggregator sites To be able to search and understand the results from dataset aggregator sites. To understand the implications of the provenance of the data and how to integrate and analyse data with differing metadata. [PMQ] Implement the policies that govern the choice of metadata schema, reserved vocabularies, metadata organization in tables, and metadata properties (creation date, access control, ownership, etc.). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 manage metadata catalog Implement the policies that govern the choice of metadata schema, reserved vocabularies, metadata organization in tables, and metadata properties (creation date, access control, ownership, etc.). Definition from now-deprecated CASRAI Glossary term: manage-metadata-catalog Manipulation of raw data to produce a single output. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data transformation Manipulation of raw data to produce a single output. RDMT. https://terms.codata.org/rdmt/data-transformation An intellectual process of describing objects in accordance with accepted library principles, particularly those of subject and classification order. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 cataloguing An intellectual process of describing objects in accordance with accepted library principles, particularly those of subject and classification order. RDMT. https://terms.codata.org/rdmt/catalogue-verb Initiate and develop processes for crediting the contributions of researchers and professional groups towards making FAIR outputs Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang design and apply processes for attribution Collection of datasets sharing the same product specification. A dataset series is a type of aggregation or collection with some “logical grouping” such as by a topic (specification) with the (product) unit being a dataset series. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 dataset series Collection of datasets sharing the same product specification. A dataset series is a type of aggregation or collection with some “logical grouping” such as by a topic (specification) with the (product) unit being a dataset series. RDMT. https://terms.codata.org/rdmt/dataset-series Collection of descriptions of the data objects or items in a data model. After each data object or item is given a descriptive name, its relationship is described (or it becomes part of some structure that implicitly describes relationship), the type of data (such as text or image or binary value) is described, possible predefined values are listed, and a brief textual description is provided. This collection can be organised for reference into a data dictionary. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data dictionary Collection of descriptions of the data objects or items in a data model. After each data object or item is given a descriptive name, its relationship is described (or it becomes part of some structure that implicitly describes relationship), the type of data (such as text or image or binary value) is described, possible predefined values are listed, and a brief textual description is provided. This collection can be organised for reference into a data dictionary. RDMT. https://terms.codata.org/rdmt/data-dictionary Process that creates a new dataset from an original source. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data selection Process that creates a new dataset from an original source. RDMT. https://terms.codata.org/rdmt/data-selection Define data access policy based on project requirements. Provide access to authorised parties. Celia van Gelder Mateusz Kuzak Yan Wang apply data access policy The ability to find and comprehend data produced by people other than yourself. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Data search AL 15.3.22: tidied label from "discover other peoples data" to "Data discovery" data discovery The ability to find and comprehend data produced by people other than yourself. [PMQ] Data that are delivered with all associated metadata, data dictionary, description of methods and instruments used to collect and process the data, and other supporting data (such as duplicate sample results, replicate analyses, percent recovery, etc.) with the purpose of providing the full context in which the data were created. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 documented data Data that are delivered with all associated metadata, data dictionary, description of methods and instruments used to collect and process the data, and other supporting data (such as duplicate sample results, replicate analyses, percent recovery, etc.) with the purpose of providing the full context in which the data were created. RDMT. https://terms.codata.org/rdmt/documented-data Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 conducting operations Data that are changing frequently and at asynchronous moments. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 dynamic data Data that are changing frequently and at asynchronous moments. RDMT. https://terms.codata.org/rdmt/dynamic-data Design covering a class of frameworks with the following characteristics: (1) it can be used to generate more specific models that still belong to the class and (2) it can be used to compare a concrete framework design to identify whether it belongs to the same class. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 reference model Design covering a class of frameworks with the following characteristics: (1) it can be used to generate more specific models that still belong to the class and (2) it can be used to compare a concrete framework design to identify whether it belongs to the same class. RDMT. https://terms.codata.org/rdmt/reference-model https://orcid.org/0000-0003-2687-1982 2020-10-01T20:52:57.551905Z online workbook engaging in open innovation beyond academia Data that have gone through a registration process and have been assigned an identifier metadata to aid in their search and retrieval. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 registered data Data that have gone through a registration process and have been assigned an identifier metadata to aid in their search and retrieval. RDMT. https://terms.codata.org/rdmt/registered-data In the context of data and network security, assurance that information can only be accessed or modified by those authorised to do so. Measures taken to ensure integrity include controlling the physical environment of networked terminals and servers, restricting access to data, and maintaining rigorous authentication practices. Data integrity can also be threatened by environmental hazards, such as heat, dust, and electrical surges. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 integrity In the context of data and network security, assurance that information can only be accessed or modified by those authorised to do so. Measures taken to ensure integrity include controlling the physical environment of networked terminals and servers, restricting access to data, and maintaining rigorous authentication practices. Data integrity can also be threatened by environmental hazards, such as heat, dust, and electrical surges. RDMT. https://terms.codata.org/rdmt/integrity Scholarship which is dependent upon digital methods, tools or resources. May include building a digital collection of information for further study and analysis; creating appropriate tools for collection-building; creating appropriate tools for the analysis and study of collections; using digital collections and analytical tools to generate new intellectual products; or creating authoring tools for these new intellectual products, either in traditional forms or in digital form. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digital scholarship Scholarship which is dependent upon digital methods, tools or resources. May include building a digital collection of information for further study and analysis; creating appropriate tools for collection-building; creating appropriate tools for the analysis and study of collections; using digital collections and analytical tools to generate new intellectual products; or creating authoring tools for these new intellectual products, either in traditional forms or in digital form. RDMT. https://terms.codata.org/rdmt/digital-scholarship Bin for types of Knowledge related to Resource management leightonlc a knowledge for resource management Bin for Aptitudes related to Resource management. leightonlc aptitudes for resource management Bin for types of Knowledge related to Storage management. leightonlc knowledge of storage management Know about the open access publishing procedure, journals' policies on OA publishing, project funder and institutional policies, as well as the financial support for OA publishing Celia van Gelder Mateusz Kuzak Yan Wang understand open access publishing OBSOLETE. The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. Nancy Hoebelheinrich https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://github.com/terms4fairskills/FAIRterminology/issues/21 This term is redundant with http://purl.obolibrary.org/obo/T4FS_0000388 and has been deprecated. use of persistent, resolvable identifiers (pids) true OBSOLETE. The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. [PMQ 3.2022] Process in which files are first parsed (assigned to appropriate fields in a record) and then translated to a common format. Data often lack consistency simply because there are many ways of saying the same thing. 
Standardising the record ensures that when a query is run for a particular field, accurate results will be returned. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://github.com/terms4fairskills/FAIRterminology/issues/32 record standardization Process in which files are first parsed (assigned to appropriate fields in a record) and then translated to a common format. Data often lack consistency simply because there are many ways of saying the same thing. Standardising the record ensures that when a query is run for a particular field, accurate results will be returned. RDMT. https://terms.codata.org/rdmt/record-standardisation https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-10-01T20:53:59.107153Z online documentation Catalogue containing metadata records that enables services to find data and services. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 metadata catalogue Catalogue containing metadata records that enables services to find data and services. RDMT. https://terms.codata.org/rdmt/metadata-catalogue Learn about the types of patents and their legal implications. Understand the application procedure of patents. Celia van Gelder Mateusz Kuzak Yan Wang understand legal background of patents Layout of a file in terms of how the data within the file are organised and encoded for storage. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 Research data format data file format Layout of a file in terms of how the data within the file are organised and encoded for storage. RDMT. https://terms.codata.org/rdmt/data-file-format Curated collection of metadata records describing datasets and their data elements. Curated and organised using a formal metadata schema appropriate to data and data sets (e.g. ReCollect etc). 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data catalogue Curated collection of metadata records describing datasets and their data elements. Curated and organised using a formal metadata schema appropriate to data and data sets (e.g. ReCollect etc). RDMT. https://terms.codata.org/rdmt/data-catalogue A type of workflow that includes active steps to curate data as an aid to on-going management of data through its lifecycle. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 curation workflow A type of workflow that includes active steps to curate data as an aid to on-going management of data through its lifecycle. Definition from now-deprecated CASRAI Glossary term: curation-workflow https://orcid.org/0000-0003-2687-1982 A1.1 the protocol is open, free, and universally implementable Planning for and controlling the present and future costs of the conservation of data, including technical storage, fixed costs, and staff resources. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel preservation costs management Planning for and controlling the present and future costs of the conservation of data, including technical storage, fixed costs, and staff resources. Based on https://www.cessda.eu/About/Projects/Past-projects/CESSDA-SaW/WP4/Cost-Benefit-Advocacy-Toolkit/Costs-Factsheet [LLC] The person who is tasked with delivering a project within the boundaries and framework established by the program manager. The project manager is and should be delivery and execution focused and is judged on the elements of time, cost, and scope of the project. The person responsible for ensuring that the Project Team completes the project. The Project Manager develops the Project Plan with the team and manages the team's performance of project tasks. It is also the responsibility of the Project Manager to secure acceptance and approval of deliverables from the Project Sponsor and Stakeholders. 
The Project Manager is responsible for communication, including status reporting, risk management, escalation of issues that cannot be resolved in the team, and, in general, making sure the project is delivered in budget, on schedule, and within scope. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 project manager The person who is tasked with delivering a project within the boundaries and framework established by the program manager. The project manager is and should be delivery and execution focused and is judged on the elements of time, cost, and scope of the project. The person responsible for ensuring that the Project Team completes the project. The Project Manager develops the Project Plan with the team and manages the team's performance of project tasks. It is also the responsibility of the Project Manager to secure acceptance and approval of deliverables from the Project Sponsor and Stakeholders. The Project Manager is responsible for communication, including status reporting, risk management, escalation of issues that cannot be resolved in the team, and, in general, making sure the project is delivered in budget, on schedule, and within scope. Definition from now-deprecated CASRAI Glossary term: project-manager Celia van Gelder Mateusz Kuzak Yan Wang https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Release data AL 9.3.22. Removed from 'FAIR4S defined activity' class to better integrate with the T4FS hierarchy. AL 5.5.22: removed "Publishing and archiving data" and "Publish and archive data" alternative terms, as the two actions are not the same. publish data https://orcid.org/0000-0002-7702-4495 Access management AL 16.3.22: Changed 'information security and access management' such that access management went in as an alternative term. If required in future, we may wish to separate it out completely. 
information security All activities involved in the planning, collecting, processing, analysis and maintenance of data in the original research project. Among these activities are selecting a study design, constructing instruments for data collection, conducting data collection/creation, performing data editing/verification/validation, analysing data, backing up data versions and preparing and tagging metadata. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data production All activities involved in the planning, collecting, processing, analysis and maintenance of data in the original research project. Among these activities are selecting a study design, constructing instruments for data collection, conducting data collection/creation, performing data editing/verification/validation, analysing data, backing up data versions and preparing and tagging metadata. RDMT. https://terms.codata.org/rdmt/data-production Data that have not been processed for meaningful use. Although raw data have the potential to become information, they require selective extraction, organisation, and sometimes analysis and formatting for presentation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Source data raw data Data that have not been processed for meaningful use. Although raw data have the potential to become information, they require selective extraction, organisation, and sometimes analysis and formatting for presentation. RDMT. https://terms.codata.org/rdmt/raw-data Testing conducted to evaluate whether systems or components pass data and control correctly to each other. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 interface testing Testing conducted to evaluate whether systems or components pass data and control correctly to each other. 
Definition from now-deprecated CASRAI Glossary term: interface-testing Machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems. A digital object is composed of a structured sequence of bits/bytes. The bit sequence realising the object can be identified and accessed by a unique and persistent identifier or by use of referencing attributes describing its properties. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Digital entity digital object Machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems. A digital object is composed of a structured sequence of bits/bytes. The bit sequence realising the object can be identified and accessed by a unique and persistent identifier or by use of referencing attributes describing its properties. RDMT. https://terms.codata.org/rdmt/digital-object A broad term encompassing: (a) digital surrogates created as a result of converting analogue materials to digital form (digitisation); (b) born digital for which there has never been and is never intended to be an analogue equivalent; and, (c) digital records. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digital materials A broad term encompassing: (a) digital surrogates created as a result of converting analogue materials to digital form (digitisation); (b) born digital for which there has never been and is never intended to be an analogue equivalent; and, (c) digital records. RDMT. https://terms.codata.org/rdmt/digital-materials Understand the needs and conditions of information access for different stakeholder groups. 
Identify the responsibility and rights for each stakeholder group on information access. Build up a catalogue of risk profiles. For each risk profile, develop mitigation protocols. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang define access policy Bin for Skills needed for Provenance information management. leightonlc 2019-10-17T15:17:02.313138Z skills related to provenance information management Ability to access and download data from a repository. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 remote data access Ability to access and download data from a repository. RDMT. https://terms.codata.org/rdmt/remote-data-access The act of interpreting an author's intended use of a word that has multiple meanings or spellings. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 disambiguation The act of interpreting an author's intended use of a word that has multiple meanings or spellings. Definition from now-deprecated CASRAI Glossary term: dissambuation Ongoing processes to ensure adherence of data to both enterprise business rules (government department, university, industry, or agency), and to legal, regulatory and accreditation requirements. Includes five areas: controls, audit, legal compliance, regulatory compliance, and accreditation conformance. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data compliance Ongoing processes to ensure adherence of data to both enterprise business rules (government department, university, industry, or agency), and to legal, regulatory and accreditation requirements. Includes five areas: controls, audit, legal compliance, regulatory compliance, and accreditation conformance. RDMT. https://terms.codata.org/rdmt/data-compliance The re-analysis of a dataset or combination of datasets outside of the original research purpose for which the dataset was created.
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://github.com/terms4fairskills/FAIRterminology/issues/15 AL 2.3.23: Removed because of its high level of similarity with the 'R' FAIR principle. re-use true The re-analysis of a dataset or combination of datasets outside of the original research purpose for which the dataset was created. RDMT. https://terms.codata.org/rdmt/re-use trainer/teacher Bin for Aptitudes related to Identity management. leightonlc aptitudes for identity management Know the security requirements on the organization regarding different types of information. Know the current organizational policy, infrastructure and capacity on information security. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understand information security challenges Bin for Aptitudes related to Authorization management. leightonlc aptitudes for authorisation management Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang https://orcid.org/0000-0002-7702-4495 AL 17.3.22: Updated to "Assessment" to match the style of its sibling classes. AL 3.22: Was "govern and assess", which is a compound concept. Split into two classes, Assess and Governance. Original source: FAIR4S defined activity. assessment The World Wide Web Consortium’s Platform for Personal Privacy Project (P3P) offers specific recommendations for practices that will let users define and share personal information with Web sites that they agree to share it with. The P3P incorporates a number of industry proposals, including the Open Profiling Standard (OPS). Using software that adheres to the P3P recommendations, users will be able to create a personal profile, all or parts of which can be made accessible to a Web site as the user directs. A tool that will help a user decide whether to trust a given website with personal information is a Statement of Privacy Policy that a website can post.
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 personal information privacy The World Wide Web Consortium’s Platform for Personal Privacy Project (P3P) offers specific recommendations for practices that will let users define and share personal information with Web sites that they agree to share it with. The P3P incorporates a number of industry proposals, including the Open Profiling Standard (OPS). Using software that adheres to the P3P recommendations, users will be able to create a personal profile, all or parts of which can be made accessible to a Web site as the user directs. A tool that will help a user decide whether to trust a given website with personal information is a Statement of Privacy Policy that a website can post. RDMT. https://terms.codata.org/rdmt/personal-information-privacy Demonstrate ability to involve others and share responsibility for applying FAIR principles. Angus Whyte ability to solve fair implementation problems collaboratively Demonstrate ability to involve others and share responsibility for applying FAIR principles. [PMQ, 3.2022] Bin for Skills needed for Cloud computing environment management. leightonlc 2019-10-17T15:14:37.180683Z skills related to cloud computing environment management Managed process throughout the data lifecycle, by which data/data collections are cleansed, documented, standardised, formatted and inter-related. This includes versioning data, or forming a new collection from several data sources, annotating with metadata, adding codes to raw data (e.g., classifying a galaxy image with a galaxy type such as “spiral”). Higher levels of curation involve maintaining links with annotation and with other published materials. Thus a dataset may include a citation link to publication whose analysis was based on the data. 
The goal of curation is to manage and promote the use of data from its point of creation to ensure it is fit for contemporary purpose and available for discovery and re-use. For dynamic datasets this may mean continuous enrichment or updating to keep it fit for purpose. Special forms of curation may be available in data repositories. The data curation process itself must be documented as part of curation. Thus curation and provenance are highly related. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data curation Managed process throughout the data lifecycle, by which data/data collections are cleansed, documented, standardised, formatted and inter-related. This includes versioning data, or forming a new collection from several data sources, annotating with metadata, adding codes to raw data (e.g., classifying a galaxy image with a galaxy type such as “spiral”). Higher levels of curation involve maintaining links with annotation and with other published materials. Thus a dataset may include a citation link to publication whose analysis was based on the data. The goal of curation is to manage and promote the use of data from its point of creation to ensure it is fit for contemporary purpose and available for discovery and re-use. For dynamic datasets this may mean continuous enrichment or updating to keep it fit for purpose. Special forms of curation may be available in data repositories. The data curation process itself must be documented as part of curation. Thus curation and provenance are highly related. RDMT. https://terms.codata.org/rdmt/data-curation Cite contributions to data work in published literature which draws upon that data. Laura Molloy https://orcid.org/0000-0003-2687-1982 cite contributions Cite contributions to data work in published literature which draws upon that data. [LM] Ongoing organizational activities associated with supporting functional elements, as opposed to project elements. 
Operational management also includes support of products that the organization has created through project activity. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 operational management Ongoing organizational activities associated with supporting functional elements, as opposed to project elements. Operational management also includes support of products that the organization has created through project activity. Definition from now-deprecated CASRAI Glossary term: operational-management Data that are complete, timely, accurate, consistent, relevant, reliable, traceable, cleaned, validated, and well documented. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 high quality data Data that are complete, timely, accurate, consistent, relevant, reliable, traceable, cleaned, validated, and well documented. RDMT. https://terms.codata.org/rdmt/high-quality-data Understand what reuse is permitted according to the licenses or terms and conditions applicable to services, tools and their content. Angus Whyte how to make your data reuseable https://orcid.org/0000-0003-2687-1982 I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation Organisation or structure for a database. The activity of data modelling leads to a schema. (The plural form is schemata.) The term is used in discussing both relational databases and object-oriented databases. The term sometimes refers to a visualisation of a structure and sometimes to a formal text-oriented description. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 schema schema (database) Organisation or structure for a database. The activity of data modelling leads to a schema. (The plural form is schemata.) The term is used in discussing both relational databases and object-oriented databases. 
The term sometimes refers to a visualisation of a structure and sometimes to a formal text-oriented description. RDMT. https://terms.codata.org/rdmt/schema-database Service that provides a connection between a PID and its target object. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 pid service Service that provides a connection between a PID and its target object. RDMT. https://terms.codata.org/rdmt/pid-service Set of conditions that allow data exchange from one information technology system to be received by another and does not require the ability for the receiving information technology system to interpret the data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 foundational interoperability Set of conditions that allow data exchange from one information technology system to be received by another and does not require the ability for the receiving information technology system to interpret the data. RDMT. https://terms.codata.org/rdmt/foundational-interoperability The activities of data policies, data planning, data element standardization, information management control, data synchronization, data sharing, and database development, including practices and projects that acquire, control, protect, deliver and enhance the value of data and information. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data management The activities of data policies, data planning, data element standardization, information management control, data synchronization, data sharing, and database development, including practices and projects that acquire, control, protect, deliver and enhance the value of data and information. Definition from now-deprecated CASRAI Glossary term: data-management Authorization management is concerned with people's access to different objects, most often to data or physical objects, such as land, buildings, rooms or infrastructure. 
Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel authorisation management Authorization management is concerned with people's access to different objects, most often to data or physical objects, such as land, buildings, rooms or infrastructure. https://managementmania.com/en/authorization-management Bin for types of Knowledge related to Authorization management. leightonlc knowledge of authorisation management The ability to search repositories and knowledge-bases. https://orcid.org/0000-0003-2687-1982 Repository search database search searching databases searching repositories search repositories and knowledge-bases The ability to search repositories and knowledge-bases. [PMQ] Ecosystem that includes software, infrastructure, consultants, integrators, partners, third parties and anything else in the specified environment that has a bearing on the other components. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 cloud ecosystem Ecosystem that includes software, infrastructure, consultants, integrators, partners, third parties and anything else in the specified environment that has a bearing on the other components. RDMT. https://terms.codata.org/rdmt/cloud-ecosystem Learn about data licensing, the types of data licences and its legal implications. Understand the application procedure of data licences. Celia van Gelder Mateusz Kuzak Yan Wang understand legal background of licensing A place or collection containing static records, documents, or other materials for long-term preservation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 archive A place or collection containing static records, documents, or other materials for long-term preservation. RDMT. https://terms.codata.org/rdmt/archive An infrastructure component that provides reliable, long-term access to managed digital resources. 
It stores, manages, and curates digital objects and returns their bit streams when a request is issued. Trusted repositories undergo regular assessments according to a set of rules such as defined by Data Seal of Approval (DSA) or TRAC (ISO 16363). It is well understood that such an assessment has the potential of increasing trust from its depositors and users, but it will not be the only criterion for users. Repositories can be at different stages of assessments. However, it is evident that certain quality criteria need to be met to distinguish trusted repositories from all types of other entities that store data such as notebooks or lab servers. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 trusted digital repository An infrastructure component that provides reliable, long-term access to managed digital resources. It stores, manages, and curates digital objects and returns their bit streams when a request is issued. Trusted repositories undergo regular assessments according to a set of rules such as defined by Data Seal of Approval (DSA) or TRAC (ISO 16363). It is well understood that such an assessment has the potential of increasing trust from its depositors and users, but it will not be the only criterion for users. Repositories can be at different stages of assessments. However, it is evident that certain quality criteria need to be met to distinguish trusted repositories from all types of other entities that store data such as notebooks or lab servers. Definition from now-deprecated CASRAI Glossary term: trusted-digital-repository Managing the physical aspects of data resources, including database design and integrity, backup and recovery, performance and tuning. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 database administration Managing the physical aspects of data resources, including database design and integrity, backup and recovery, performance and tuning. RDMT. https://terms.codata.org/rdmt/database-administration Bin for types of Knowledge related to Identity management leightonlc knowledge of identity management Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang quality assessment Specialised format for organising and storing data. General data structure types include the array, the file, the record, the table, the tree, and so on. Any data structure is designed to organise data to suit a specific purpose so that it can be accessed and worked with in appropriate ways. In computer programming, a data structure may be selected or designed to store data for the purpose of working on it with various algorithms. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 Data format data structure Specialised format for organising and storing data. General data structure types include the array, the file, the record, the table, the tree, and so on. Any data structure is designed to organise data to suit a specific purpose so that it can be accessed and worked with in appropriate ways. In computer programming, a data structure may be selected or designed to store data for the purpose of working on it with various algorithms. RDMT. https://terms.codata.org/rdmt/data-structure Information systems and technology infrastructure manager, expert, or technician. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 information technology specialist Information systems and technology infrastructure manager, expert, or technician. RDMT. 
https://terms.codata.org/rdmt/information-technology-specialist A curation activity that ensures that data are properly selected, stored, and can be accessed, and for which logical and physical integrity are maintained over time, including security and authenticity. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 archiving A curation activity that ensures that data are properly selected, stored, and can be accessed, and for which logical and physical integrity are maintained over time, including security and authenticity. RDMT. https://terms.codata.org/rdmt/archive-verb Bin for types of Knowledge related to Preservation costs management. leightonlc 2019-10-17T15:17:32.816776Z knowledge of preservation costs management A phase of development where the product is tested in the real world by the intended audience. The experiences of the early users are forwarded back to the developers who make final changes before releasing the product. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 user acceptance testing A phase of development where the product is tested in the real world by the intended audience. The experiences of the early users are forwarded back to the developers who make final changes before releasing the product. Definition from now-deprecated CASRAI Glossary term: user-acceptance-testing/ Description of a digital object with a limited number of fields including at least a name and persistent identifier. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 minimal metadata Description of a digital object with a limited number of fields including at least a name and persistent identifier. RDMT. https://terms.codata.org/rdmt/minimal-metadata Metadata exposure is the process of finding, identifying, selecting and acquiring/obtaining access to a metadata entity. 
Nancy Hoebelheinrich https://orcid.org/0000-0002-7702-4495 AL 22.3.22: Refactored from 'metadata creation and exposure'. Metadata creation has moved to the Curation hierarchy. metadata exposure Metadata exposure is the process of finding, identifying, selecting and acquiring/obtaining access to a metadata entity. AL 22.3.22, and see also FRBR User Tasks at: https://sites.google.com/site/metadatastandards/chapter-6/6-3-frbr-user-tasks. Process of destroying data stored on tapes, hard disks and other forms of electronic media so that it is completely unreadable and cannot be accessed or used. https://orcid.org/0000-0002-5214-4466 data destruction Process of destroying data stored on tapes, hard disks and other forms of electronic media so that it is completely unreadable and cannot be accessed or used. RDMT. https://terms.codata.org/rdmt/data-destruction The provision of training materials and events in and around good data stewardship. Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 fair stewardship training The provision of training materials and events in and around good data stewardship. PMQ The ability to have an innovative approach to research by creating new or modified current concepts, theories, approaches and/or solutions. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 creativity The ability to have an innovative approach to research by creating new or modified current concepts, theories, approaches and/or solutions. Definition from now-deprecated CASRAI Glossary term: creativity Field or column in a database table. It is an abbreviation for ‘physical data attribute’ which is a single data element related to a data object, such as a table in a database. The database schema associates one or more attributes with each database entity (i.e. table). 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data table attribute data table attribute (field) Field or column in a database table. It is an abbreviation for ‘physical data attribute’ which is a single data element related to a data object, such as a table in a database. The database schema associates one or more attributes with each database entity (i.e. table). RDMT. https://terms.codata.org/rdmt/data-table-attribute-field A standard developed through the cooperation of all parties who have an interest in participating in its development and/or use. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 Community standard consensus standard A standard developed through the cooperation of all parties who have an interest in participating in its development and/or use. Definition from now-deprecated CASRAI Glossary term: consensus-standard Identity Management (IdM), also called Identity and Access Management (IAM), refers to a framework of policies and technologies for ensuring that the proper user in an organisation has the appropriate access to technology resources. IdM systems fall under the overarching umbrella of IT security. IdM systems identify, authenticate and authorize not only the individuals who will be using IT resources, but also the hardware and applications users need to access. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel IAM IdM Identity and Access Management identity management Identity Management (IdM), also called Identity and Access Management (IAM), refers to a framework of policies and technologies for ensuring that the proper user in an organisation has the appropriate access to technology resources. IdM systems fall under the overarching umbrella of IT security. IdM systems identify, authenticate and authorize not only the individuals who will be using IT resources, but also the hardware and applications users need to access.
https://en.wikipedia.org/wiki/Identity_management https://orcid.org/0000-0003-2687-1982 reuse of digital assets Explore the potential application domain/societal implication of the project work and the communities of users/partners. Celia van Gelder Mateusz Kuzak Yan Wang investigate open innovation opportunities The process of restoring data that have been lost, accidentally deleted, corrupted or made inaccessible for any reason. The data recovery process may vary, depending on the circumstances of the data loss, the data recovery software used to create backups, and backup target media. In some cases, end users may be able to restore lost... https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data recovery The process of restoring data that have been lost, accidentally deleted, corrupted or made inaccessible for any reason. The data recovery process may vary, depending on the circumstances of the data loss, the data recovery software used to create backups, and backup target media. In some cases, end users may be able to restore lost... Definition from now-deprecated CASRAI Glossary term: data-quality-review https://orcid.org/0000-0003-2687-1982 R1. (meta)data are richly described with a plurality of accurate and relevant attributes To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate repository for a dataset. This may, for example, involve discovering the mark-up of the repository, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness of a repository based on human-entered questionnaires or FAIRsharing resource metadata.
https://orcid.org/0000-0003-2687-1982 FAIR data submission Repository FAIRness fair evaluation of repositories for data deposition To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate repository for a dataset. This may, for example, involve discovering the mark-up of the repository, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness of a repository based on human-entered questionnaires or FAIRsharing resource metadata. [PMQ] A Regional standard is one that applies across a multi-nation region. Most countries have their own national standards-making bodies, which in turn may also group together to make regional standards. For example, national standards bodies in Europe are also members of the European Committee for Standardization (CEN) as well as members of ISO. The use of such standards may be voluntary, or they may be referenced in regulation (therefore mandatory). https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 9.3.22: Removed reference to CASRAI (https://casrai.org/term/regional-standard/), as the definition was unsuitable. regional standard A Regional standard is one that applies across a multi-nation region. Most countries have their own national standards-making bodies, which in turn may also group together to make regional standards. For example, national standards bodies in Europe are also members of the European Committee for Standardization (CEN) as well as members of ISO. The use of such standards may be voluntary, or they may be referenced in regulation (therefore mandatory). Adapted by AL from ISO. https://www.iso.org/sites/ConsumersStandards/1_standards.html#section1_2 The process of setting up environments for workflow technologies. 
Kristina Hettne Simon Hodson Victoria Dominguez Del Angel workflow technologies management The process of setting up environments for workflow technologies. KH Data steward is an umbrella term for numerous support roles that involve the creation, management and usage of research data. A data steward facilitates the quality, integrity and access to (meta)data in a manner that is consistent with the appropriate laws and institutional policies, ensuring professional treatment of data throughout all stages of the research project. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 data steward Data steward is an umbrella term for numerous support roles that involve the creation, management and usage of research data. A data steward facilitates the quality, integrity and access to (meta)data in a manner that is consistent with the appropriate laws and institutional policies, ensuring professional treatment of data throughout all stages of the research project. The Turing Way. https://book.the-turing-way.org/collaboration/research-infrastructure-roles/data-steward.html Bin for Skills needed for Preservation costs management. leightonlc 2019-10-17T15:17:32.816013Z skills related to preservation costs management https://orcid.org/0000-0003-2687-1982 2021-02-17T22:10:11.837519Z repository certification schemes Database containing information about trusted repositories that are provided by repository managers and are useful for human and machine users. These registries do not contain information about all metadata descriptions of digital objects, nor do they offer a list of PIDs of all stored digital objects. They do offer information based on standardised types on how to retrieve such information (e.g., the port under which OAI-PMH can be accessed to offer metadata). A registry requires the assignment of a permanent, unique and unambiguous identifier to each item. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Data registry registry Database containing information about trusted repositories that are provided by repository managers and are useful for human and machine users. These registries do not contain information about all metadata descriptions of digital objects, nor do they offer a list of PIDs of all stored digital objects. They do offer information based on standardised types on how to retrieve such information (e.g., the port under which OAI-PMH can be accessed to offer metadata). A registry requires the assignment of a permanent, unique and unambiguous identifier to each item. RDMT. https://terms.codata.org/rdmt/registry To clearly communicate the existence of contributions from different individuals / projects to the compilation of a given dataset, in such a way that they can be consistently cited. Laura Molloy attribution credit recognise and acknowledge contributions To clearly communicate the existence of contributions from different individuals / projects to the compilation of a given dataset, in such a way that they can be consistently cited. [LM] Data where relationships/connections between them are available to allow easy data access. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 linked open data Data where relationships/connections between them are available to allow easy data access. RDMT. https://terms.codata.org/rdmt/linked-open-data Research data format is a generic term encompassing the concept of a standardised format for research data. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 9.3.2022: The original CASRAI definition is unsuitable (https://casrai.org/term/research-data-format), therefore I have changed the definition to something more generic.
research data format Research data format is a generic term encompassing the concept of a standardised format for research data. AL, 9.3.22 Activities and policies that ensure that the benefits to society of research outweigh any risks, from both an ethical and legal perspective. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 research governance Activities and policies that ensure that the benefits to society of research outweigh any risks, from both an ethical and legal perspective. RDMT. https://terms.codata.org/rdmt/research-governance Ability of computer systems to transmit data with unambiguous, shared meaning. Semantic interoperability is a requirement to enable machine computable logic, inferencing, knowledge discovery, and data federation between information systems. Semantic interoperability is achieved when the information transferred has, in its communicated form, all of the meaning required for the receiving system to interpret it correctly, even when the algorithms used by the receiving system are unknown to the sending system. Syntactic interoperability is a prerequisite to semantic interoperability. Semantic interoperability ensures that the precise format and meaning of exchanged data and information is preserved and understood throughout exchanges between parties; in other words, what is sent is what is understood. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 semantic interoperability Ability of computer systems to transmit data with unambiguous, shared meaning. Semantic interoperability is a requirement to enable machine computable logic, inferencing, knowledge discovery, and data federation between information systems. 
Semantic interoperability is achieved when the information transferred has, in its communicated form, all of the meaning required for the receiving system to interpret it correctly, even when the algorithms used by the receiving system are unknown to the sending system. Syntactic interoperability is a prerequisite to semantic interoperability. Semantic interoperability ensures that the precise format and meaning of exchanged data and information is preserved and understood throughout exchanges between parties; in other words, what is sent is what is understood. RDMT. https://terms.codata.org/rdmt/semantic-interoperability A non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 international chemical identifier A non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations. RDMT. https://terms.codata.org/rdmt/international-chemical-identifier Activities and processes in a digital environment that lead to the publication of research data, associated metadata and accompanying documentation and software code on the Web. In contrast to interim or final published products, workflows are the means to curate, document, and review, and thus ensure and enhance the value of the published product. Workflows can involve both humans and machines and often humans are supported by technology as they perform steps in the workflow. Similar workflows may vary in the details depending on the research discipline, data publishing product and/or the host institution of the workflow. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 research data publication workflow Activities and processes in a digital environment that lead to the publication of research data, associated metadata and accompanying documentation and software code on the Web. In contrast to interim or final published products, workflows are the means to curate, document, and review, and thus ensure and enhance the value of the published product. Workflows can involve both humans and machines and often humans are supported by technology as they perform steps in the workflow. Similar workflows may vary in the details depending on the research discipline, data publishing product and/or the host institution of the workflow. RDMT. https://terms.codata.org/rdmt/research-data-publication-workflow Degree to which all required measurements are known. Values may be designated as “missing” in order not to have empty cells, or missing values may be replaced with default or interpolated values. In the case of default or interpolated values, these must be flagged as such to distinguish them from actual measurements or observations. Missing, default, or interpolated values do not imply that the dataset has been made complete. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data completeness Degree to which all required measurements are known. Values may be designated as “missing” in order not to have empty cells, or missing values may be replaced with default or interpolated values. In the case of default or interpolated values, these must be flagged as such to distinguish them from actual measurements or observations. Missing, default, or interpolated values do not imply that the dataset has been made complete. RDMT. https://terms.codata.org/rdmt/data-completeness Data linkage where the resulting product has been de-identified. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 privacy-preserving data linkage Data linkage where the resulting product has been de-identified. RDMT. https://terms.codata.org/rdmt/privacy-preserving-data-linkage Celia van Gelder Mateusz Kuzak Yan Wang https://orcid.org/0000-0002-7702-4495 AL 22.3.22: refactored 'open access publishing and self-archiving' to be two separate concepts ('open access publishing', and 'self-archiving') according to T4FS best practices. Self-archiving did not have to be created because we already had self-archive, which was then moved to the Archiving hierarchy. 'publish open access' was also removed in favour of this term as they were too similar to have without the potential for confusion by users. open access publishing Prepare the data in preferred types and the data documentation. Choose the data repository and data license, deposit the data in the repository. Celia van Gelder Mateusz Kuzak Yan Wang https://orcid.org/0000-0002-7702-4495 self-archiving Person with broad knowledge of information management disciplines and who provides guidance and support to program and staff functions on all aspects of managing the information resource. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 information management advisor Person with broad knowledge of information management disciplines and who provides guidance and support to program and staff functions on all aspects of managing the information resource. RDMT. https://terms.codata.org/rdmt/information-management-advisor Statement describing how research data will be managed throughout a specified research project's life cycle - during and after the active phase of the research project - including terms regarding archiving and potential preservation of the data in a data repository. The DMP is considered to be a 'living' document, i.e. one which can be updated when necessary. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 DMP data management plan Statement describing how research data will be managed throughout a specified research project's life cycle - during and after the active phase of the research project - including terms regarding archiving and potential preservation of the data in a data repository. The DMP is considered to be a 'living' document, i.e. one which can be updated when necessary. RDMT. https://terms.codata.org/rdmt/data-management-plan https://orcid.org/0000-0003-2687-1982 F2. data are described with rich metadata https://orcid.org/0000-0003-2687-1982 Data license R1.1 (meta)data are released with a clear and accessible data usage license Sequence of digitally encoded, coherent signals used to send or receive a representation of information content as transmitted. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data stream Sequence of digitally encoded, coherent signals used to send or receive a representation of information content as transmitted. RDMT. https://terms.codata.org/rdmt/data-stream Librarian who manages the sharing and publishing of datasets as openly as possible and as closed as necessary, and the management and curation of repositories required to achieve this. Broad role requirements include support for sharing and publishing datasets, finding, accessing, interoperating and re-using these datasets, reviewing and supporting Data Management Plans and training delivery. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data librarian Librarian who manages the sharing and publishing of datasets as openly as possible and as closed as necessary, and the management and curation of repositories required to achieve this. 
Broad role requirements include support for sharing and publishing datasets, finding, accessing, interoperating and re-using these datasets, reviewing and supporting Data Management Plans and training delivery. RDMT. https://terms.codata.org/rdmt/data-librarian The process of developing, communicating, implementing, monitoring, and assuring the policies, procedures, organizational structures, and practices associated with a given program. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 program governance The process of developing, communicating, implementing, monitoring, and assuring the policies, procedures, organizational structures, and practices associated with a given program. Definition from now-deprecated CASRAI Glossary term: program-governance Bin for types of Knowledge related to Workflow technologies management. leightonlc 2019-10-17T15:16:10.561925Z knowledge of workflow technologies management Obtain an overview of information access status for different stakeholder groups. Check this overview against the organizational information access policy and risk catalogue, and identify the type of information security risks based on the mismatch between the current status and policy. Choose and implement the right protocol to mitigate the risk. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang access risk assessment and mitigation Data that can be used: delivered in a form that meets the needs of different end-user audiences, is ready for the tasks that the end-user needs to accomplish, and that has been adapted to the end-user’s needs. Usable data have been cleaned, structured, are in machine readable format, fully documented, and ready for analysis and interpretation.
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 usable data Data that can be used: delivered in a form that meets the needs of different end-user audiences, is ready for the tasks that the end-user needs to accomplish, and that has been adapted to the end-user’s needs. Usable data have been cleaned, structured, are in machine readable format, fully documented, and ready for analysis and interpretation. RDMT. https://terms.codata.org/rdmt/usable-data https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-10-01T20:52:43.543314Z presentation slides Configure secure storage and monitor its usage Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang implement secure storage Entire period of time that research data exists. This lifecycle describes the flow of research data starting from planning, collecting, processing, analysing, preserving, sharing and finally reusing the research data. Research data often have a longer lifespan than the research project. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data lifecycle research data lifecycle Entire period of time that research data exists. This lifecycle describes the flow of research data starting from planning, collecting, processing, analysing, preserving, sharing and finally reusing the research data. Research data often have a longer lifespan than the research project. RDMT. https://terms.codata.org/rdmt/research-data-lifecycle Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc format and media migration Repository for persistently storing collections of data, such as a database, a file system or a directory. The data stored can be of any type that can be rendered in digital format and placed in electronic media. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data store Repository for persistently storing collections of data, such as a database, a file system or a directory. The data stored can be of any type that can be rendered in digital format and placed in electronic media. RDMT. https://terms.codata.org/rdmt/data-store Initiate and develop processes to ensure outputs are made FAIR consistently with research integrity principles, and with ethical oversight. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang design processes for research integrity CoreTrustSeal offers to any interested data repository a core level certification based on the DSA–WDS Core Trustworthy Data Repositories Requirements catalogue and procedures. This universal catalogue of requirements reflects the core characteristics of trustworthy data repositories and is the culmination of a cooperative effort between DSA and WDS under the umbrella of the Research Data Alliance to merge their data repositories certifications. https://orcid.org/0000-0003-2687-1982 CTS https://orcid.org/0000-0003-2687-1982 2021-02-17T22:12:12.996581Z core trust seal CoreTrustSeal offers to any interested data repository a core level certification based on the DSA–WDS Core Trustworthy Data Repositories Requirements catalogue and procedures. This universal catalogue of requirements reflects the core characteristics of trustworthy data repositories and is the culmination of a cooperative effort between DSA and WDS under the umbrella of the Research Data Alliance to merge their data repositories certifications. https://www.coretrustseal.org/about/ Defines how to manage a project. It will always be the same, regardless of the project lifecycle being employed. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 project management lifecycle Defines how to manage a project. 
It will always be the same, regardless of the project lifecycle being employed. Definition from now-deprecated CASRAI Glossary term: project-management-lifecycle Type of metadata that conveys information needed to link a data object to its original source. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 authenticity metadata Type of metadata that conveys information needed to link a data object to its original source. RDMT. https://terms.codata.org/rdmt/authenticity-metadata researcher Bin for Skills needed for Service level management. leightonlc 2019-10-17T15:18:53.731967Z skills related to service level management Data that relate to a living individual who can be identified from those data or those data plus other information which is in the possession of, or is likely to come into the possession of, the data controller, and includes any expression of opinion about the individual and any indication of the intentions of the data controller or any other person in respect of the individual. Any information that can be used to distinguish one person from another and can be used for de-anonymising anonymous data can be considered personally identifiable data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Personal data personally identifiable information Data that relate to a living individual who can be identified from those data or those data plus other information which is in the possession of, or is likely to come into the possession of, the data controller, and includes any expression of opinion about the individual and any indication of the intentions of the data controller or any other person in respect of the individual. Any information that can be used to distinguish one person from another and can be used for de-anonymising anonymous data can be considered personally identifiable data. RDMT. 
https://terms.codata.org/rdmt/personally-identifiable-information Understand the current storage security status and the preferred status, as well as the barriers between them. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang understand storage security challenges Organising data into tables so that the results of using the database are always unambiguous and as intended. Normalisation is typically a refinement process after the initial exercise of identifying the data objects that should be in the database, identifying their relationships, and defining the tables required and the columns within each table. First normal form (1NF) is the “basic” level of normalisation: Data and information are contained in two-dimensional tables with rows and columns. Each column corresponds to a sub-object or an attribute of the object represented by the entire table. Each row represents a unique instance of that sub-object or attribute and must be different in some way from any other row (that is, no duplicate rows are possible). All entries in any column must be of the same kind. For example, in the column labelled “Date,” only dates are permitted. In Second normal form (2NF), the tables are in first normal form and, in addition, each column in a table that is not a determiner of the contents of another column must itself be a function of the other columns in the table. At the second normal form, modification anomalies are still possible because a change to one row in a table may affect data that refers to this information from another table. In Third normal form (3NF), the tables are in second normal form and, in addition, there is no transitive functional dependency. For example, if A is functionally dependent on B, and B is functionally dependent on C, then A is transitively dependent on C via B. In Domain/key normal form (DKNF), a key uniquely identifies each row in a table. A domain is the set of permissible values for an attribute.
By enforcing key and domain restrictions, the database is assured of being freed from modification anomalies. DKNF is the normalisation level that most designers aim to achieve. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 normalization Organising data into tables so that the results of using the database are always unambiguous and as intended. Normalisation is typically a refinement process after the initial exercise of identifying the data objects that should be in the database, identifying their relationships, and defining the tables required and the columns within each table. First normal form (1NF) is the “basic” level of normalisation: Data and information are contained in two-dimensional tables with rows and columns. Each column corresponds to a sub-object or an attribute of the object represented by the entire table. Each row represents a unique instance of that sub-object or attribute and must be different in some way from any other row (that is, no duplicate rows are possible). All entries in any column must be of the same kind. For example, in the column labelled “Date,” only dates are permitted. In Second normal form (2NF), the tables are in first normal form and, in addition, each column in a table that is not a determiner of the contents of another column must itself be a function of the other columns in the table. At the second normal form, modification anomalies are still possible because a change to one row in a table may affect data that refers to this information from another table. In Third normal form (3NF), the tables are in second normal form and, in addition, there is no transitive functional dependency. For example, if A is functionally dependent on B, and B is functionally dependent on C, then A is transitively dependent on C via B. In Domain/key normal form (DKNF), a key uniquely identifies each row in a table. A domain is the set of permissible values for an attribute.
By enforcing key and domain restrictions, the database is assured of being freed from modification anomalies. DKNF is the normalisation level that most designers aim to achieve. RDMT. https://terms.codata.org/rdmt/normalisation Evaluation is a decision about significance, value, or quality of something, based on careful study of its good and bad features. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 17.3.22: We may wish to make this an alternative term of Assessment. evaluation Evaluation is a decision about significance, value, or quality of something, based on careful study of its good and bad features. Definition from now-deprecated CASRAI Glossary term: evaluation Type of data element that expresses a proposition that binds one or more property values to some data entity. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data item Type of data element that expresses a proposition that binds one or more property values to some data entity. RDMT. https://terms.codata.org/rdmt/data-item Duty and practice of ensuring that personal or sensitive information only flows from one entity to another according to legislated or otherwise broadly accepted norms and policies. This can be done by either restricting access to the data or certain variables in the data, and/or by protecting personal or sensitive information using an anonymisation method. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 confidentiality Duty and practice of ensuring that personal or sensitive information only flows from one entity to another according to legislated or otherwise broadly accepted norms and policies. This can be done by either restricting access to the data or certain variables in the data, and/or by protecting personal or sensitive information using an anonymisation method. RDMT. 
https://terms.codata.org/rdmt/confidentiality Bin for types of Knowledge needed for Data management costs management. leightonlc knowledge of data management costs management Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 logistical support Bin for Aptitudes related to Data management costs management. leightonlc aptitudes for data management costs management The process of acquiring data from some source. For example, data may be acquired by download from a repository, transfer from a data logger, data capture, etc. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data capture data download data reception AL 15.3.22: Removed "data and metadata capture" as was too similar to this term. Added alternative term "data capture". data acquisition The process of acquiring data from some source. For example, data may be acquired by download from a repository, transfer from a data logger, data capture, etc. RDMT. https://terms.codata.org/rdmt/data-acquisition Planning for and controlling expenditures related to the data management lifecycle, including, but not limited to, labor and infrastructure expenses for data collection; data documentation; data storage; data access and security; data preservation; data sharing; and data disposition. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel data management costs management Planning for and controlling expenditures related to the data management lifecycle, including, but not limited to, labor and infrastructure expenses for data collection; data documentation; data storage; data access and security; data preservation; data sharing; and data disposition. Written by LLC, based on list at https://www.uu.nl/en/research/research-data-management/guides/costs-of-data-management A Text file is a kind of computer file that is structured as a sequence of lines of electronic text. 
A text file is stored as data within a computer file system. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 9.3.22: The CASRAI term (https://casrai.org/term/text-file) had an unsuitable definition, and therefore was removed. text file A Text file is a kind of computer file that is structured as a sequence of lines of electronic text. A text file is stored as data within a computer file system. AL, Adapted from Wikipedia. https://en.wikipedia.org/wiki/Text_file Data and code that are commented so that humans can understand what they represent, their design, and their purpose. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 human-readable format Data and code that are commented so that humans can understand what they represent, their design, and their purpose. RDMT. https://terms.codata.org/rdmt/human-readable-format Checks that data are valid, sensible, reasonable, clean, usable, and secure before they are processed. Provides well-defined guarantees for fitness, accuracy, and consistency for any of various kinds of user input into an application or automated system. Failures or omissions in data validation can lead to data corruption or security vulnerabilities. Improperly validated data can cause computer code processing the data to crash, generate error messages, behave in an unanticipated manner, or generate incorrect results that may be difficult or impossible to detect. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data validation Checks that data are valid, sensible, reasonable, clean, usable, and secure before they are processed. Provides well-defined guarantees for fitness, accuracy, and consistency for any of various kinds of user input into an application or automated system. Failures or omissions in data validation can lead to data corruption or security vulnerabilities.
Improperly validated data can cause computer code processing the data to crash, generate error messages, behave in an unanticipated manner, or generate incorrect results that may be difficult or impossible to detect. RDMT. https://terms.codata.org/rdmt/data-validation Type of record (and organisation) that stores an instance of an executable/understandable PID. The content of a PID record distinguishes a registered digital or data object from other digital objects. A PID record is a type of record that includes property information that characterises the digital object it is identifying. Important parts of a PID record are location and checksum. However there is a large variation in usage. In some data models the PID is simply used as a unique label with an empty record. A PID record has a lifecycle including creation, publication, curation and destruction. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 pid record Type of record (and organisation) that stores an instance of an executable/understandable PID. The content of a PID record distinguishes a registered digital or data object from other digital objects. A PID record is a type of record that includes property information that characterises the digital object it is identifying. Important parts of a PID record are location and checksum. However there is a large variation in usage. In some data models the PID is simply used as a unique label with an empty record. A PID record has a lifecycle including creation, publication, curation and destruction. RDMT. https://terms.codata.org/rdmt/pid-record Take proactive approach to ensure outputs are made FAIR consistently with research integrity principles. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciate importance of research integrity A type of metadata that indicates how compound objects are put together. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 structural metadata structural metadata (information) A type of metadata that indicates how compound objects are put together. RDMT. https://terms.codata.org/rdmt/structural-metadata-information Data that could not lead to the identification of a specific individual, to distinguishing one person from another, or to personally identifiable information. These may be data that have been de-identified, or that could not lead to personally identifiable information in the first place. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 non personally identifiable information Data that could not lead to the identification of a specific individual, to distinguishing one person from another, or to personally identifiable information. These may be data that have been de-identified, or that could not lead to personally identifiable information in the first place. Definition from now-deprecated CASRAI Glossary term: non-personally-identifiable-information Process of creating digital files by scanning or otherwise converting analogue materials. The resulting digital copy, or digital surrogate, would then be classed as digital material and then subject to the same broad challenges involved in preserving access to it, as born-digital materials. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digitisation Process of creating digital files by scanning or otherwise converting analogue materials. The resulting digital copy, or digital surrogate, would then be classed as digital material and then subject to the same broad challenges involved in preserving access to it, as born-digital materials. RDMT. 
https://terms.codata.org/rdmt/digitisation Release of research data, associated metadata, accompanying documentation, and software code (in cases where the raw data have been processed or manipulated) for re-use and analysis in such a manner that they can be discovered on the Web and referred to in a unique and persistent way. Data publication occurs via dedicated data repositories and/or (data) journals which ensure that the published research objects are findable, accessible, interoperable and re-usable. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data publication Release of research data, associated metadata, accompanying documentation, and software code (in cases where the raw data have been processed or manipulated) for re-use and analysis in such a manner that they can be discovered on the Web and referred to in a unique and persistent way. Data publication occurs via dedicated data repositories and/or (data) journals which ensure that the published research objects are findable, accessible, interoperable and re-usable. RDMT. https://terms.codata.org/rdmt/data-publication https://orcid.org/0000-0003-2687-1982 findability of digital assets Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 9.3.22: renamed to prevent compound term. Was 'Influencing/community building' community building Standalone dataset that can be cited in a similar manner to other research outputs. The dataset appears in a data repository, data paper or project website, and has a Persistent Identifier. Most current referencing systems provide a format for citing datasets. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 citable data Standalone dataset that can be cited in a similar manner to other research outputs. The dataset appears in a data repository, data paper or project website, and has a Persistent Identifier. 
Most current referencing systems provide a format for citing datasets. RDMT. https://terms.codata.org/rdmt/citable-data Voluminous amount of structured, semi-structured and/or unstructured data that have the potential to be mined for information, primarily characterised by big volume, extensive variety, high velocity (creation and use), and/or variability that together require a scalable architecture for efficient data storage, manipulation, and analysis. The definition is evolving and can vary by sector, depending on what kind of software tools are commonly available and what sizes of datasets are common in a particular discipline. With those caveats, big data in many sectors today will range from a few dozen terabytes to multiple petabytes. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 big data Voluminous amount of structured, semi-structured and/or unstructured data that have the potential to be mined for information, primarily characterised by big volume, extensive variety, high velocity (creation and use), and/or variability that together require a scalable architecture for efficient data storage, manipulation, and analysis. The definition is evolving and can vary by sector, depending on what kind of software tools are commonly available and what sizes of datasets are common in a particular discipline. With those caveats, big data in many sectors today will range from a few dozen terabytes to multiple petabytes. RDMT. https://terms.codata.org/rdmt/big-data Ability to apply knowledge about FAIR metrics and assessing FAIRness using FAIR assessment tools. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc fairness assessment Service providing long-term, permanent care and accessibility for digital objects with research value. The standard for such repositories is the Open Archival Information System reference model. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data archive Service providing long-term, permanent care and accessibility for digital objects with research value. The standard for such repositories is the Open Archival Information System reference model. RDMT. https://terms.codata.org/rdmt/data-archive Information describing the technical processes used to produce, or required to use a digital object. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 technical metadata Information describing the technical processes used to produce, or required to use a digital object. RDMT. https://terms.codata.org/rdmt/technical-metadata In a form that can be used and understood by a computer. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 5.12.22: Deprecated because of its similarity with http://purl.obolibrary.org/obo/T4FS_0000551. We don't need to be going to the level of granularity that the concept of machine readability needs to be modelled as well as machine readable format. machine readable true In a form that can be used and understood by a computer. RDMT. https://terms.codata.org/rdmt/machine-readable Combination of business processes, policies and technologies that allows organisations to provide secure access to confidential data. Integrated access management software is used by enterprises to control the flow of sensitive data in and out of a network. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 integrated access management Combination of business processes, policies and technologies that allows organisations to provide secure access to confidential data. Integrated access management software is used by enterprises to control the flow of sensitive data in and out of a network. RDMT. 
https://terms.codata.org/rdmt/integrated-access-management Meaningless data, including any data that cannot be understood and interpreted correctly by machines such as unstructured text; any data that has been received, stored, or changed in such a manner that it cannot be read or used by the program that originally created it. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 noisy data Meaningless data, including any data that cannot be understood and interpreted correctly by machines such as unstructured text; any data that has been received, stored, or changed in such a manner that it cannot be read or used by the program that originally created it. RDMT. https://terms.codata.org/rdmt/noisy-data Activities in and around the provision of training, mentorship and teaching opportunities for good data management and the implementation of FAIR practices. Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 fair training Activities in and around the provision of training, mentorship and teaching opportunities for good data management and the implementation of FAIR practices. PMQ A set of instructions that direct a computer to do a specific task. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 software A set of instructions that direct a computer to do a specific task. Definition from now-deprecated CASRAI Glossary term: software Conversion of multiple datasets to a single common format and structure. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 https://github.com/terms4fairskills/FAIRterminology/issues/32 data standardization Conversion of multiple datasets to a single common format and structure. RDMT. https://terms.codata.org/rdmt/data-standardisation Data that are accessible, machine-readable, usable, intelligible, and freely shared. 
Open data can be freely used, re-used, built on, and redistributed by anyone – subject only, at most, to the requirement to attribute and sharealike. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 open data Data that are accessible, machine-readable, usable, intelligible, and freely shared. Open data can be freely used, re-used, built on, and redistributed by anyone – subject only, at most, to the requirement to attribute and sharealike. RDMT. https://terms.codata.org/rdmt/open-data The output of a data curation activity. Such data has generally already been cleaned, standardised, documented. Additional metadata relevant to the data object has also been added, via either manual or automatic methods. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 AL 9.5.22: The original label ("data curation") matched a term within the Data stewardship activity hierarchy. Changing the class label to "Curated data" better fit the model. curated data The output of a data curation activity. Such data has generally already been cleaned, standardised, documented. Additional metadata relevant to the data object has also been added, via either manual or automatic methods. AL 9.5.22 Bin for Skills needed for Funding acquisition management. leightonlc skills related to funding acquisition management 128-bit number used to guarantee unique identity for objects on the internet over time. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 UUID, GUID universally unique identifier 128-bit number used to guarantee unique identity for objects on the internet over time. RDMT. https://terms.codata.org/rdmt/universally-unique-identifier The process of confirming the identity of a principal entity. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 authentication The process of confirming the identity of a principal entity. Definition from now-deprecated CASRAI Glossary: authentication Large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualised, dynamically-scalable, managed computing power, storage, platforms and services are delivered on demand to external customers over the Internet. Key features are that: it is a specialised distributed computing paradigm; it is massively scalable; it can be encapsulated as an abstract entity that delivers different levels of services to customers outside the Cloud; it is driven by economies of scale; and the services can be dynamically configured (via virtualisation or other approaches) and delivered on demand. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 cloud computing Large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualised, dynamically-scalable, managed computing power, storage, platforms and services are delivered on demand to external customers over the Internet. Key features are that: it is a specialised distributed computing paradigm; it is massively scalable; it can be encapsulated as an abstract entity that delivers different levels of services to customers outside the Cloud; it is driven by economies of scale; and the services can be dynamically configured (via virtualisation or other approaches) and delivered on demand. RDMT. https://terms.codata.org/rdmt/cloud-computing Access to digital materials either for a defined period of time while use is predicted but which does not extend beyond the foreseeable future and/or until it becomes inaccessible because of changes in technology. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 short-term preservation Access to digital materials either for a defined period of time while use is predicted but which does not extend beyond the foreseeable future and/or until it becomes inaccessible because of changes in technology. RDMT. https://terms.codata.org/rdmt/short-term-preservation Standard that is used in multiple nations and whose development process is open to representatives from all countries. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 international standard Standard that is used in multiple nations and whose development process is open to representatives from all countries. RDMT. https://terms.codata.org/rdmt/international-standard Satisfaction of the requirements of a specific standard(s) and/or specification(s). Conformance is used with respect to voluntary standards and specifications, whereas compliance is used with respect to mandatory standards and regulations. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 conformance Satisfaction of the requirements of a specific standard(s) and/or specification(s). Conformance is used with respect to voluntary standards and specifications, whereas compliance is used with respect to mandatory standards and regulations. RDMT. https://terms.codata.org/rdmt/conformance Individual or organisation responsible for the IT infrastructure providing and protecting data in conformance with the policies and practices prescribed by data governance. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 Data manager Technical data steward data custodian Individual or organisation responsible for the IT infrastructure providing and protecting data in conformance with the policies and practices prescribed by data governance. RDMT. 
https://terms.codata.org/rdmt/data-custodian Bin for Aptitudes related to High performance computing management. leightonlc aptitudes for high performance computing management Implement the policies that govern the arrangement, naming, descriptive metadata, provenance metadata, representation metadata, administrative metadata, access controls, retention, disposition, integrity, and replication of digital objects. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 manage datasets in a repository Implement the policies that govern the arrangement, naming, descriptive metadata, provenance metadata, representation metadata, administrative metadata, access controls, retention, disposition, integrity, and replication of digital objects. Definition from now-deprecated CASRAI Glossary term: manage-datasets-in-a-repository Removing noise from data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data de-noising Removing noise from data. RDMT. https://terms.codata.org/rdmt/data-de-noising Provides the relationship and process context for working together to ensure outcomes are achieved. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 governance and accountability model Provides the relationship and process context for working together to ensure outcomes are achieved. Definition from now-deprecated CASRAI Glossary term: governance-and-accountability-model Continued availability and ongoing usability of a digital resource, retaining all qualities of authenticity, accuracy and functionality deemed to be essential for the purposes the digital material was created and/or acquired for. Users who have access can retrieve, understand, manipulate, and store copies. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 access Continued availability and ongoing usability of a digital resource, retaining all qualities of authenticity, accuracy and functionality deemed to be essential for the purposes the digital material was created and/or acquired for. Users who have access can retrieve, understand, manipulate, and store copies. RDMT. https://terms.codata.org/rdmt/access Find the people with a good understanding of FAIR and open research challenges in your organisation, and create a work agenda. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang develop fair and open research strategy Choose the proper type of patent for the project data and apply for the patent chosen. Celia van Gelder Mateusz Kuzak Yan Wang patent application Type of data provenance indication that adds metadata to identify data collections. The organisation doing the collection management is stated in the metadata along with the provenance of collection management events such as source of data acquisition, conservation, and movement. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 collection management identification Type of data provenance indication that adds metadata to identify data collections. The organisation doing the collection management is stated in the metadata along with the provenance of collection management events such as source of data acquisition, conservation, and movement. RDMT. https://terms.codata.org/rdmt/collection-management-identification Activity/Process whereby digital objects are evaluated with the relevant FAIR metrics and assessment tools. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc assess fairness Activity/Process whereby digital objects are evaluated with the relevant FAIR metrics and assessment tools. [YLF, VDA, KH] Bin for types of Knowledge related to Change management. 
leightonlc knowledge of change management Process of obtaining, importing, and processing data for later use or storage in a database. This process often involves altering individual files by editing their content and/or formatting them to fit into a larger document. An effective data ingestion methodology begins by validating the individual files, then prioritises the sources for optimum processing, and finally validates the results. When numerous data sources exist in diverse formats (the sources may number in the hundreds and the formats in the dozens), maintaining reasonable speed and efficiency can become a major challenge. To that end, several vendors offer programs tailored to the task of data ingestion in specific applications or environments. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data ingestion Process of obtaining, importing, and processing data for later use or storage in a database. This process often involves altering individual files by editing their content and/or formatting them to fit into a larger document. An effective data ingestion methodology begins by validating the individual files, then prioritises the sources for optimum processing, and finally validates the results. When numerous data sources exist in diverse formats (the sources may number in the hundreds and the formats in the dozens), maintaining reasonable speed and efficiency can become a major challenge. To that end, several vendors offer programs tailored to the task of data ingestion in specific applications or environments. RDMT. https://terms.codata.org/rdmt/data-ingestion Manually or semi-automatically converting or mapping data from one form into another format that allows for more convenient consumption of the data with the help of semi-automated tools. Gathering and organising disparate data from different sources, often collected by many different investigators. 
Activities include developing and supporting search tools that utilise standardised metadata, harmonising the coding of data for specific variables, and engineering new methods of combining data with the help of semi-automated tools. The result of data wrangling is repurposed data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data wrangling Manually or semi-automatically converting or mapping data from one form into another format that allows for more convenient consumption of the data with the help of semi-automated tools. Gathering and organising disparate data from different sources, often collected by many different investigators. Activities include developing and supporting search tools that utilise standardised metadata, harmonising the coding of data for specific variables, and engineering new methods of combining data with the help of semi-automated tools. The result of data wrangling is repurposed data. RDMT. https://terms.codata.org/rdmt/data-wrangling Configuration of staff, services and tools assembled to support data management across the research lifecycle and to provide comprehensive coverage of the stages making up the data lifecycle. It can be organised locally and/or globally to support research data activities across the research lifecycle. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 research data management infrastructure Configuration of staff, services and tools assembled to support data management across the research lifecycle and to provide comprehensive coverage of the stages making up the data lifecycle. It can be organised locally and/or globally to support research data activities across the research lifecycle. RDMT. https://terms.codata.org/rdmt/research-data%20management-infrastructure Appreciate the need for flexibility in applying FAIR criteria to make data as open as possible, as closed as necessary. 
Angus Whyte flexibility in relating fair criteria to openness To be able to research and identify the appropriate taxonomy for your project. This may involve using resources such as FAIRsharing, which captures metadata on taxonomies and provides an assessment of their FAIRness. https://orcid.org/0000-0003-2687-1982 Controlled Vocabulary identification Ontology identification Taxonomy identification Thesaurus identification choosing the appropriate terminology for your data To be able to research and identify the appropriate taxonomy for your project. This may involve using resources such as FAIRsharing, which captures metadata on taxonomies and provides an assessment of their FAIRness. [PMQ] Data elements that have been organised into a consistent format and data structure within a defined data model such that the elements can be easily addressed, organised and accessed in various combinations to make better use of the information, such as in a relational database. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Structured information structured data Data elements that have been organised into a consistent format and data structure within a defined data model such that the elements can be easily addressed, organised and accessed in various combinations to make better use of the information, such as in a relational database. RDMT. https://terms.codata.org/rdmt/structured-data Know how data handling practices in the domain make FAIR criteria more or less easy to implement. Angus Whyte knowledge to contextualise fair principles to domain Storage, access and preservation of data created or collected in the course of research. Research data management practices cover the entire lifecycle of the data, from planning the investigation to conducting it, and from backing up data as it is created and used to long term preservation of data deliverables after the research investigation has concluded. 
Specific activities and issues that fall within the category of data management include: File naming (the proper way to name computer files); data quality control and quality assurance; data access; data documentation (including levels of uncertainty); metadata creation and controlled vocabularies; data storage; data archiving and preservation; data sharing and reuse; data integrity; data security; data privacy; data rights; notebook protocols (lab or field) as required. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 research data management Storage, access and preservation of data created or collected in the course of research. Research data management practices cover the entire lifecycle of the data, from planning the investigation to conducting it, and from backing up data as it is created and used to long term preservation of data deliverables after the research investigation has concluded. Specific activities and issues that fall within the category of data management include: File naming (the proper way to name computer files); data quality control and quality assurance; data access; data documentation (including levels of uncertainty); metadata creation and controlled vocabularies; data storage; data archiving and preservation; data sharing and reuse; data integrity; data security; data privacy; data rights; notebook protocols (lab or field) as required. RDMT. https://terms.codata.org/rdmt/research-data-management Deposit research output (software, data and publication together with documentation) in previously selected repositories. Celia van Gelder Mateusz Kuzak Yan Wang publish output in a repository To be able to identify and process metadata over the internet, sourced from a variety of metadata providers and schemata. 
https://orcid.org/0000-0003-2687-1982 Metadata processing Metadata search metadata search via metadata providers To be able to identify and process metadata over the internet, sourced from a variety of metadata providers and schemata. [PMQ] Information for a data object that includes: the person who deposited the data object in the repository, the source of the data object, the date when the object was deposited, and authenticity information needed to link the data object to its original source. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 record provenance information Information for a data object that includes: the person who deposited the data object in the repository, the source of the data object, the date when the object was deposited, and authenticity information needed to link the data object to its original source. RDMT. https://terms.codata.org/rdmt/record-provenance-information Process of citing a dataset in a similar manner to other research outputs. The dataset must be a standalone output that appears in a data repository, data paper or project website, and has a Persistent Identifier. Most current referencing systems provide a format for citing datasets. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data citation Process of citing a dataset in a similar manner to other research outputs. The dataset must be a standalone output that appears in a data repository, data paper or project website, and has a Persistent Identifier. Most current referencing systems provide a format for citing datasets. RDMT. 
https://terms.codata.org/rdmt/data-citation https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-10-01T20:54:08.535183Z book The capacity to influence stakeholders and the direction of research activities; the ability to shape others' understanding in ways that capture interest, inform and gain support; and, the capacity to influence the actions and opinions of others. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 intellectual leadership The capacity to influence stakeholders and the direction of research activities; the ability to shape others' understanding in ways that capture interest, inform and gain support; and, the capacity to influence the actions and opinions of others. Definition from now-deprecated CASRAI Glossary term: intellectual-leadership/ In the context of a researcher's activities, innovation is the development of modified or novel approaches, theories, concepts, ideas or solutions. Innovation is one of four valued outcomes. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 innovation In the context of a researcher's activities, innovation is the development of modified or novel approaches, theories, concepts, ideas or solutions. Innovation is one of four valued outcomes. Definition from now-deprecated CASRAI Glossary term: innovation High-performance computing (HPC) is a process for using supercomputers and parallel processing techniques to solve complex computational problems. HPC technology focuses on developing parallel processing algorithms and systems by incorporating both administration and parallel computational techniques. 
Kristina Hettne Simon Hodson Victoria Dominguez Del Angel HPC management high performance computing management High-performance computing (HPC) is a process for using supercomputers and parallel processing techniques to solve complex computational problems. HPC technology focuses on developing parallel processing algorithms and systems by incorporating both administration and parallel computational techniques. (modified from Wikipedia)[VDA] A type of historical information or metadata about the origin, location or the source of something, or the history of the ownership or location of an object or resource including digital objects. For example, information about the Principal Investigator who recorded the data, and the information concerning its storage, handling, and migration. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 provenance A type of historical information or metadata about the origin, location or the source of something, or the history of the ownership or location of an object or resource including digital objects. For example, information about the Principal Investigator who recorded the data, and the information concerning its storage, handling, and migration. RDMT. https://terms.codata.org/rdmt/provenance A record created digitally in the day-to-day business of the organisation and assigned formal status by the organisation. Examples include: word processing documents, emails, databases, or intranet web pages. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Electronic record AL 8.2.22: Updated label from the CASRAI "Digital" to "Digital record" to better match the definition and intent of the term, and to provide clarity. digital record A record created digitally in the day-to-day business of the organisation and assigned formal status by the organisation. 
Examples include: word processing documents, emails, databases, or intranet web pages. Definition from now-deprecated CASRAI Glossary term: digital Processes and procedures designed to ensure that the results of laboratory analysis are consistent, comparable, accurate and within specified limits of precision. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 analytical quality control Processes and procedures designed to ensure that the results of laboratory analysis are consistent, comparable, accurate and within specified limits of precision. Definition from now-deprecated CASRAI Glossary term: analytical-quality-control Research data in digital form. It may have been originally created in digital form, or it may have been converted from paper, or other non-digital form to a digital representation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digital research data Research data in digital form. It may have been originally created in digital form, or it may have been converted from paper, or other non-digital form to a digital representation. RDMT. https://terms.codata.org/rdmt/digital-research-data In the context of a researcher’s activities, Managing research is the processes related to the planning, organizing, setting objectives, controlling and evaluating of RDA activities and their associated human and financial resources. It includes the provision of leadership to, and assessment of, other scientists, engineers, technologists, and/or other staff. Managing research is one of the three research contexts in which a researcher is expected to conduct his/her activities. Managing research is distinct from the position-based role of a research manager. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 managing research In the context of a researcher’s activities, Managing research is the processes related to the planning, organizing, setting objectives, controlling and evaluating of RDA activities and their associated human and financial resources. It includes the provision of leadership to, and assessment of, other scientists, engineers, technologists, and/or other staff. Managing research is one of the three research contexts in which a researcher is expected to conduct his/her activities. Managing research is distinct from the position-based role of a research manager. Definition from now-deprecated CASRAI Glossary term: managing-research To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate publishing platform for a dataset. This may, for example, involve discovering the mark-up of the platform, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness based on human-entered questionnaires or FAIRsharing resource metadata. https://orcid.org/0000-0003-2687-1982 evaluation of how to publish your data To be able to define and deploy appropriate criteria, based on the FAIR principles, to identify the appropriate publishing platform for a dataset. This may, for example, involve discovering the mark-up of the platform, the exchange formats and data models used, and the licensing information for each repository. Tools such as the FAIR evaluator or FAIRshake may be used to assess the FAIRness based on human-entered questionnaires or FAIRsharing resource metadata. [PMQ] Numbers used by the National Center for Biotechnology Information (NCBI) that are unique and citable. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 accession number Numbers used by the National Center for Biotechnology Information (NCBI) that are unique and citable. Definition from now-deprecated CASRAI Glossary term: accession-number Series of managed activities necessary to ensure continued access to digital materials for as long as necessary. All of the actions required to maintain access to digital materials beyond the limits of media failure or technological change. Those materials may be records created during the day-to-day business of an organisation; born-digital materials created for a specific purpose (such as teaching resources); or the products of digitisation projects. This definition specifically excludes the potential use of digital technology to preserve the original artefacts through digitisation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digital preservation Series of managed activities necessary to ensure continued access to digital materials for as long as necessary. All of the actions required to maintain access to digital materials beyond the limits of media failure or technological change. Those materials may be records created during the day-to-day business of an organisation; born-digital materials created for a specific purpose (such as teaching resources); or the products of digitisation projects. This definition specifically excludes the potential use of digital technology to preserve the original artefacts through digitisation. RDMT. https://terms.codata.org/rdmt/digital-preservation Also drawn from https://www.dpconline.org/digipres/what-is-digipres The practice of initiating, planning, executing, controlling, and closing the work of a team in relation to FAIR data stewardship. 
Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 project management activities The practice of initiating, planning, executing, controlling, and closing the work of a team in relation to FAIR data stewardship. PMQ Bin for types of Knowledge related to Funding acquisition management. leightonlc knowledge of funding acquisition management Monitor the authorised parties' access to data and keep it up to date. Celia van Gelder Mateusz Kuzak Yan Wang manage access control The practice of initiating, planning, executing, controlling, and closing the work of a team to achieve specific goals and meet specific success criteria at the specified time. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel PM project management The practice of initiating, planning, executing, controlling, and closing the work of a team to achieve specific goals and meet specific success criteria at the specified time. KH Established protocol of an organisation for retaining information for operational or regulatory compliance needs. The objectives of a data retention policy are to keep important information for future use or reference, to organise information so it can be searched and accessed at a later date, and to dispose of information that is no longer needed. A data retention policy must consider both the value of data over time, and regulations to which the data may be subject. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data retention policy Established protocol of an organisation for retaining information for operational or regulatory compliance needs. The objectives of a data retention policy are to keep important information for future use or reference, to organise information so it can be searched and accessed at a later date, and to dispose of information that is no longer needed. 
A data retention policy must consider both the value of data over time, and regulations to which the data may be subject. RDMT. https://terms.codata.org/rdmt/data-retention-policy Object describing the context of the data, including provenance, description, structural, and administrative information. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data representation Object describing the context of the data, including provenance, description, structural, and administrative information. RDMT. https://terms.codata.org/rdmt/data-representation The set of metadata describing a specific dataset. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 metadata dataset The set of metadata describing a specific dataset. Definition from now-deprecated CASRAI Glossary term: metadata-dataset Continued access to digital materials, or at least to the information contained in them, indefinitely. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 long-term preservation Continued access to digital materials, or at least to the information contained in them, indefinitely. RDMT. https://terms.codata.org/rdmt/long-term-preservation Research metadata format is a generic term encompassing the concept of a standardised format for research metadata. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 9.3.2022: The original CASRAI definition is unsuitable (https://casrai.org/term/research-metadata-format), therefore I have changed the definition to something more generic research metadata format Research metadata format is a generic term encompassing the concept of a standardised format for research metadata. AL, 9.3.2022 Physical or geographic location of an organisation’s data or information. 
Data residency also refers to the legal or regulatory requirements imposed on data based on the country or region in which it resides. Cloud computing, which allows organisations to deliver hosted services over the Internet, can create data residency concerns. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data residency Physical or geographic location of an organisation’s data or information. Data residency also refers to the legal or regulatory requirements imposed on data based on the country or region in which it resides. Cloud computing, which allows organisations to deliver hosted services over the Internet, can create data residency concerns. RDMT. https://terms.codata.org/rdmt/data-residency Information governance is the overall strategy for information at an organization. Information governance balances the risk that information presents with the value that information provides. Information governance helps with legal compliance, operational transparency, and reducing expenditures associated with legal discovery. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 CASRAI is the source for the term, with the definition provided separately. information governance Information governance is the overall strategy for information at an organization. Information governance balances the risk that information presents with the value that information provides. Information governance helps with legal compliance, operational transparency, and reducing expenditures associated with legal discovery. https://en.wikipedia.org/wiki/Information_governance, accessed 8.2.22 Unit of data for which the definition, identification, representation (term used to represent it), and permissible values are specified by means of a set of attributes.
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data element Unit of data for which the definition, identification, representation (term used to represent it), and permissible values are specified by means of a set of attributes. RDMT. https://terms.codata.org/rdmt/data-element Set of high-level principles that establish a guiding framework for data-related issues. A data policy may contain subsidiary sections that define approaches to strategic aspects such as data access; data protection; data management, custodianship or stewardship; data preservation; relevant legal matters; data acquisition; data ethics and other data-related issues. Distinct from a data management policy which is the subset of data policy addressing the management of a specified set of data assets. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data policy Set of high-level principles that establish a guiding framework for data-related issues. A data policy may contain subsidiary sections that define approaches to strategic aspects such as data access; data protection; data management, custodianship or stewardship; data preservation; relevant legal matters; data acquisition; data ethics and other data-related issues. Distinct from a data management policy which is the subset of data policy addressing the management of a specified set of data assets. RDMT. https://terms.codata.org/rdmt/data-policy Older data that can no longer be accessed or processed easily because they are stored in obsolete formats or systems. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 legacy data Older data that can no longer be accessed or processed easily because they are stored in obsolete formats or systems. RDMT. 
https://terms.codata.org/rdmt/legacy-data List of standardised terminology, words, or phrases, used for indexing or content analysis and information retrieval, usually in a defined information domain. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 controlled vocabulary List of standardised terminology, words, or phrases, used for indexing or content analysis and information retrieval, usually in a defined information domain. RDMT. https://terms.codata.org/rdmt/controlled-vocabulary Assurance that information can only be accessed or modified by those authorised to do so. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data integrity data integrity (access) Assurance that information can only be accessed or modified by those authorised to do so. RDMT. https://terms.codata.org/rdmt/data-integrity-access Single data element related to a PID and part of its record content. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 pid attribute Single data element related to a PID and part of its record content. RDMT. https://terms.codata.org/rdmt/pid-attribute Bin for Aptitudes related to Change management. leightonlc aptitudes for change management Continued access to digital materials beyond changes in technology for a defined period of time but not indefinitely. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 medium-term preservation Continued access to digital materials beyond changes in technology for a defined period of time but not indefinitely. RDMT. https://terms.codata.org/rdmt/medium-term-preservation Describes the processes and tasks that must be completed to produce a product or service. Different project lifecycles exist for specific products and services. 
(For example, the lifecycle followed to build a house is very different from the lifecycle followed to develop a software package.) https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 project lifecycle Describes the processes and tasks that must be completed to produce a product or service. Different project lifecycles exist for specific products and services. (For example, the lifecycle followed to build a house is very different from the lifecycle followed to develop a software package.) Definition from now-deprecated CASRAI Glossary term: project-lifecycle Know how the acceptability of research data FAIRness depends on the research community norms e.g. concepts of data and methods for deriving valid knowledge. Angus Whyte knowledge to relate fair data assessment to research community norms Data traceability follows the lifecycle of data to track all access and changes to the data. It helps demonstrate transparency, compliance and adherence to regulations. Data traceability, along with data compliance, can be considered part of a data audit process. Data traceability is fundamental to reproducible research. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data traceability Data traceability follows the lifecycle of data to track all access and changes to the data. It helps demonstrate transparency, compliance and adherence to regulations. Data traceability, along with data compliance, can be considered part of a data audit process. Data traceability is fundamental to reproducible research. RDMT. https://terms.codata.org/rdmt/data-traceability https://orcid.org/0000-0003-2687-1982 2020-10-02T14:23:22.115275Z maintaining persistent identifiers Choose the proper license considering types of project data and clearly indicate (apply) the license chosen in the project work.
Celia van Gelder Mateusz Kuzak Yan Wang select and apply license Indicates how the different components within a system are linked to fulfill the tasks. Relations are thus defined by the services they are making use of and by the interface specifications. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 relations Indicates how the different components within a system are linked to fulfill the tasks. Relations are thus defined by the services they are making use of and by the interface specifications. Definition from now-deprecated CASRAI Glossary term: relations Organised collection of data or objects in a computational format, that are generated or collected by researchers in the course of their investigations, regardless of their form or method, that form the object on which researchers test a hypothesis. This includes the full range of data: raw, unprocessed datasets, proprietary generated and processed data and secondary data obtained from third parties. The presentation of the data in the application is enabled through metadata. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 dataset Organised collection of data or objects in a computational format, that are generated or collected by researchers in the course of their investigations, regardless of their form or method, that form the object on which researchers test a hypothesis. This includes the full range of data: raw, unprocessed datasets, proprietary generated and processed data and secondary data obtained from third parties. The presentation of the data in the application is enabled through metadata. RDMT. https://terms.codata.org/rdmt/dataset Bin for Aptitudes related to Service level management. leightonlc aptitudes for service level management Text format derived from Standard Generalized Markup Language or ‘SGML’ (ISO 8879). 
Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 XML extensible markup language Text format derived from Standard Generalized Markup Language or ‘SGML’ (ISO 8879). Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere. RDMT. https://terms.codata.org/rdmt/extensible-markup-language Bin for Skills needed for Change management. leightonlc skills related to change management Techniques that produce synthesised knowledge from organised information. Process of inspecting, cleaning, transforming, and modelling data with the goal of highlighting useful information suggesting conclusions, and supporting decision making. Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names, in different business, science, and social science domains. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data analysis Techniques that produce synthesised knowledge from organised information. Process of inspecting, cleaning, transforming, and modelling data with the goal of highlighting useful information suggesting conclusions, and supporting decision making. Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names, in different business, science, and social science domains. RDMT. https://terms.codata.org/rdmt/data-analysis Resource management is the efficient and effective development of an organization's resources when they are needed. 
Such resources may include the financial resources, inventory, human skills, production resources, or information technology (IT) and natural resources. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel resource management Resource management is the efficient and effective development of an organization's resources when they are needed. Such resources may include the financial resources, inventory, human skills, production resources, or information technology (IT) and natural resources. [LLC, https://en.wikipedia.org/wiki/Resource_management] Summarising the main characteristics of a dataset using visualisation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data exploration Summarising the main characteristics of a dataset using visualisation. RDMT. https://terms.codata.org/rdmt/data-exploration Bin for Aptitudes related to Storage management. leightonlc aptitudes for storage management Management and provision of good data stewardship practice. Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Advise and enable Data Stewardship and FAIR mentorship activity data management and open research AL 9.3.22. Removed from 'FAIR4S defined activity' class to better integrate with the T4FS hierarchy. advise and mentor Management and provision of good data stewardship practice. PMQ Bin for Skills needed for Workflow technologies management. leightonlc 2019-10-17T15:15:55.86138Z skills related to workflow technologies management Those layers that sit between base technology (a computer science concern) and discipline-specific science. Value-added systems and services that can be widely shared across scientific domains, both supporting and enabling large increases in multi- and interdisciplinary science while reducing duplication of effort and resources (including hardware, software, personnel, services and organisations). 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digital infrastructure Those layers that sit between base technology (a computer science concern) and discipline-specific science. Value-added systems and services that can be widely shared across scientific domains, both supporting and enabling large increases in multi- and interdisciplinary science while reducing duplication of effort and resources (including hardware, software, personnel, services and organisations). RDMT. https://terms.codata.org/rdmt/digital-infrastructure Bin for Skills needed for Data management cost management. leightonlc skills related to data management cost management Consists of at least one PID resolver, a name schema and a defined mechanism for issuing PIDs that conform to the name schema. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 pid system Consists of at least one PID resolver, a name schema and a defined mechanism for issuing PIDs that conform to the name schema. RDMT. https://terms.codata.org/rdmt/pid-system https://orcid.org/0000-0003-2687-1982 I2. (meta)data use vocabularies that follow fair principles Research approach that explains aspects of one discipline in terms of another (e.g., the physics of music; the politics of literature). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 cross-disciplinary Research approach that explains aspects of one discipline in terms of another (e.g., the physics of music; the politics of literature). RDMT. https://terms.codata.org/rdmt/cross-disciplinary Data that are tagged with particular metadata that can be used to derive relationships between data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 semantic data Data that are tagged with particular metadata that can be used to derive relationships between data. RDMT. 
https://terms.codata.org/rdmt/semantic-data For a single identifier, the class of entity it refers to. For a PID system, the typical class of entities it is intended to be used for, such as digital objects, physical objects, bodies, actors. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 pid domain For a single identifier, the class of entity it refers to. For a PID system, the typical class of entities it is intended to be used for, such as digital objects, physical objects, bodies, actors. RDMT. https://terms.codata.org/rdmt/pid-domain Select data handling approaches likely to make data as FAIR as possible, considering what the data is about and its purpose in the research. Angus Whyte knowledge to choose fair data handling approaches appropriate to the research phenomena The activity of recording provenance for data and software. Knowledge about provenance vocabularies, for example PROV-O, and provenance models such as nanopublications. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel provenance information management The activity of recording provenance for data and software. Knowledge about provenance vocabularies, for example PROV-O, and provenance models such as nanopublications. KH Results that can be replicated using the documented data, code, and methods employed by the author or provider without the need for any additional information or needing to communicate with the author or provider. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 reproducible research Results that can be replicated using the documented data, code, and methods employed by the author or provider without the need for any additional information or needing to communicate with the author or provider. RDMT.
https://terms.codata.org/rdmt/reproducible-research https://orcid.org/0000-0003-2687-1982 A1.2 the protocol allows for an authentication and authorisation procedure, where necessary https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data stewardship technical concept Data collection that has been normalised by some established criteria to allow for effective data management. Examples include: data files that belong to a certain experiment, all files that are created by one specific simulation, all files that belong to a specific observation (same day, same place, etc.). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 canonical data collection Data collection that has been normalised by some established criteria to allow for effective data management. Examples include: data files that belong to a certain experiment, all files that are created by one specific simulation, all files that belong to a specific observation (same day, same place, etc.). RDMT. https://terms.codata.org/rdmt/canonical-data-collection Ability to clean up existing source code and version control systems, get Unique ID for the code, choose the appropriate code repository/registry [YLF, VDA, KH] Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc https://orcid.org/0000-0002-7702-4495 archival documentation of software Data harmonization is the process of making data from different sources comparable. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://github.com/terms4fairskills/FAIRterminology/issues/17 data harmonization Data harmonization is the process of making data from different sources comparable. Modified from CASRAI. https://casrai.org/term/data-harmonization/ Repository of data designed to serve a particular community of knowledge workers. A data mart contains harmonised, highly-structured, quality data integrated from multiple sources (which is also a characteristic of the data warehouse). 
It's also optimised to support well-known, predefined and repeatable analytic queries, rather than ad-hoc analysis. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data mart Repository of data designed to serve a particular community of knowledge workers. A data mart contains harmonised, highly-structured, quality data integrated from multiple sources (which is also a characteristic of the data warehouse). It's also optimised to support well-known, predefined and repeatable analytic queries, rather than ad-hoc analysis. RDMT. https://terms.codata.org/rdmt/data-mart Type of access entity that contains the services and functions which make the data object holdings and their information content and related services visible to data consumers. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 access workflow Type of access entity that contains the services and functions which make the data object holdings and their information content and related services visible to data consumers. RDMT. https://terms.codata.org/rdmt/access-workflow Bin for types of Knowledge related to Service level management. leightonlc 2019-10-17T15:18:53.73274Z knowledge of service level management Model that specifies the structure or schema of a dataset. The model provides a documented description of the data and thus is an instance of metadata. It is a logical, relational data model showing an organised dataset as a collection of tables with entity, attributes and relations. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data model Model that specifies the structure or schema of a dataset. The model provides a documented description of the data and thus is an instance of metadata. It is a logical, relational data model showing an organised dataset as a collection of tables with entity, attributes and relations. RDMT. 
https://terms.codata.org/rdmt/data-model Be aware of the potential conflicts between security and usability. Understand the importance of having an information security policy and keeping a balance between information security and usability. Be willing to establish both organizational and technical information security measures. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciate security and usability trade-offs Set of agreed-upon and documented guidelines, specifications, accepted practices, technical requirements, or terminologies that have been prepared by a standards developing organisation or group, and published in accordance with established procedures. These can be mandatory or voluntary and are distinct from Acts, regulations, and codes, although standards can be referenced in those legal instruments. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 standard Set of agreed-upon and documented guidelines, specifications, accepted practices, technical requirements, or terminologies that have been prepared by a standards developing organisation or group, and published in accordance with established procedures. These can be mandatory or voluntary and are distinct from Acts, regulations, and codes, although standards can be referenced in those legal instruments. RDMT. https://terms.codata.org/rdmt/standard The application of a comprehensive knowledge of a discipline or disciplines to the development of expertise and the generation of new knowledge through research, and the planning and presentation of courses of study for undergraduates and graduates in universities.
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 university teaching The application of a comprehensive knowledge of a discipline or disciplines to the development of expertise and the generation of new knowledge through research, and the planning and presentation of courses of study for undergraduates and graduates in universities. Definition from now-deprecated CASRAI Glossary term: university-teaching Transfer of data between storage types, formats, information technologies, or computer systems. A data migration project is usually undertaken to replace or upgrade servers or storage equipment, for a website consolidation, to conduct server maintenance or to relocate a data centre. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data migration Transfer of data between storage types, formats, information technologies, or computer systems. A data migration project is usually undertaken to replace or upgrade servers or storage equipment, for a website consolidation, to conduct server maintenance or to relocate a data centre. RDMT. https://terms.codata.org/rdmt/data-migration Process of detecting and correcting corrupt or inaccurate records from a dataset. Data cleaning is a continuous process that requires corrective actions throughout the data lifecycle. Data cleaning involves identifying, replacing, modifying or deleting incomplete, incorrect, inaccurate, inconsistent, irrelevant, and improperly formatted data. Typically, the process involves updating, correcting, standardising, and de-duplicating records to create a single view of the data, even if they are stored in multiple disparate systems. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data cleaning Process of detecting and correcting corrupt or inaccurate records from a dataset. Data cleaning is a continuous process that requires corrective actions throughout the data lifecycle. 
Data cleaning involves identifying, replacing, modifying or deleting incomplete, incorrect, inaccurate, inconsistent, irrelevant, and improperly formatted data. Typically, the process involves updating, correcting, standardising, and de-duplicating records to create a single view of the data, even if they are stored in multiple disparate systems. RDMT. https://terms.codata.org/rdmt/data-cleaning https://orcid.org/0000-0003-2687-1982 R1.2 (meta)data are associated with detailed provenance Bin for Skills needed for Identity management. leightonlc skills related to identity management The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. Nancy Hoebelheinrich https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-10-02T14:22:51.569762Z using persistent identifiers The use of persistent identifiers or PIDs to uniquely and persistently identify an entity. [PMQ 3.2022] https://orcid.org/0000-0002-7702-4495 database curator AL 15.3.22: Updated label to 'data curator' to encompass curation beyond just in databases. data curator Learn about innovation partners' interests and concerns, analyze them and position the project work/outcome properly. Reach out to innovation partners with a proactive and pragmatic innovation plan that complies with FAIR principles. Celia van Gelder Mateusz Kuzak Yan Wang engage innovation partners https://orcid.org/0000-0003-2687-1982 R1.3 (meta)data meet domain-relevant community standards Process of reducing the amount or size of stored data. This may be achieved by eliminating redundant copies of data files, deduplicating data files by removing redundant records, or by compressing the data files. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data reduction Process of reducing the amount or size of stored data.
This may be achieved by eliminating redundant copies of data files, deduplicating data files by removing redundant records, or by compressing the data files. RDMT. https://terms.codata.org/rdmt/data-reduction https://orcid.org/0000-0003-2687-1982 F4. (meta)data are registered or indexed in a searchable resource Registry for data types supporting their standardisation, uniqueness and discoverability. Data types range from complex digital objects to simple categories that occur in digital objects. An additional functionality may be to link data types to executable data processing functions. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data type registry Registry for data types supporting their standardisation, uniqueness and discoverability. Data types range from complex digital objects to simple categories that occur in digital objects. An additional functionality may be to link data types to executable data processing functions. RDMT. https://terms.codata.org/rdmt/data-type-registry Collection of data that is organised according to a conceptual structure/model describing the characteristics of these data and the relationships among their corresponding entities, supporting one or more application areas. A database allows its contents to be easily accessed, managed and updated. The type of database used depends on the requirements of the study. A common type is the relational database, where data are related to each other in a systematic manner so that they can be reorganised and accessed in a number of different ways. A database may house one or many datasets. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 database Collection of data that is organised according to a conceptual structure/model describing the characteristics of these data and the relationships among their corresponding entities, supporting one or more application areas. 
A database allows its contents to be easily accessed, managed and updated. The type of database used depends on the requirements of the study. A common type is the relational database, where data are related to each other in a systematic manner so that they can be reorganised and accessed in a number of different ways. A database may house one or many datasets. RDMT. https://terms.codata.org/rdmt/database Recognize the added value of FAIR and open research, encourage researchers to practise FAIR and open research. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciation of fair and open research Demonstrate willingness to engage with new ways of applying FAIR principles. Angus Whyte ability to learn new techniques for fair implementation Bin for Aptitudes related to Funding acquisition management. leightonlc aptitudes for funding acquisition management To be able to choose the correct data model and/or exchange format for your data, based on the repository where you plan to deposit your data. https://orcid.org/0000-0003-2687-1982 Which data model to use Which format to use choosing the appropriate model or format for your data To be able to choose the correct data model and/or exchange format for your data, based on the repository where you plan to deposit your data. [PMQ] Information concerning the creation, attribution, or version history of managed data. Provenance metadata indicates the relationship between two versions of data objects and is generated whenever a new version of a dataset is created. Provenance information is gathered along the data lifecycle as part of curation processes. A finer level of provenance metadata would be concerned only with data flowing between various stores such as curated databases and managed repositories. Provenance metadata is designed to allow queries over the relationship between versions, and includes either or both fine-grained and coarse-grained provenance data. 
Different applications may store different provenance data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 provenance metadata Information concerning the creation, attribution, or version history of managed data. Provenance metadata indicates the relationship between two versions of data objects and is generated whenever a new version of a dataset is created. Provenance information is gathered along the data lifecycle as part of curation processes. A finer level of provenance metadata would be concerned only with data flowing between various stores such as curated databases and managed repositories. Provenance metadata is designed to allow queries over the relationship between versions, and includes either or both fine-grained and coarse-grained provenance data. Different applications may store different provenance data. RDMT. https://terms.codata.org/rdmt/provenance-metadata State when data are available in a timely manner in the place and form as needed by the user. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data availability State when data are available in a timely manner in the place and form as needed by the user. RDMT. https://terms.codata.org/rdmt/data-availability Activity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into “error-free” or “erroneous” data. Data review consists of both error detection and data analysis, and can be carried out in manual or automated mode. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data review Activity through which the correctness conditions of the data are verified. It also includes the specification of the type of the error or condition not met, and the qualification of the data and its division into “error-free” or “erroneous” data. 
Data review consists of both error detection and data analysis, and can be carried out in manual or automated mode. RDMT. https://terms.codata.org/rdmt/data-review HughShanahan 2019-10-18T08:56:32.673701Z aptitudes associated with using fair tools and services Documents actions that have been undertaken to preserve a digital resource such as migrations and checksum calculations. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 preservation metadata Documents actions that have been undertaken to preserve a digital resource such as migrations and checksum calculations. RDMT. https://terms.codata.org/rdmt/preservation-metadata Change management is a broad subject and can be applied to all different types of organizational change. The most common change drivers include: technological evolution, process reviews, crisis, and consumer habit changes; pressure from new business entrants, acquisitions, mergers, and organizational restructuring. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel CM change management Change management is a broad subject and can be applied to all different types of organizational change. The most common change drivers include: technological evolution, process reviews, crisis, and consumer habit changes; pressure from new business entrants, acquisitions, mergers, and organizational restructuring. https://www.hucmi.com/en/hcmbok/ Type of repository where the original copy of data was stored and probably a data identifier registered. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 original repository Type of repository where the original copy of data was stored and probably a data identifier registered. RDMT.
https://terms.codata.org/rdmt/original-repository Techniques designed to make the risk of identifying a particular individual in a dataset negligible, whilst retaining the re-usability of the dataset. The purpose is to protect the privacy of the individual and comply with legislation, whilst enabling data sharing. Methods include removing direct and indirect identifiers such as names, addresses, social insurance numbers, or dates of birth, or using obfuscation methods such as encryption, hashing, generalisation, pseudonymisation, and perturbation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 de-identification Techniques designed to make the risk of identifying a particular individual in a dataset negligible, whilst retaining the re-usability of the dataset. The purpose is to protect the privacy of the individual and comply with legislation, whilst enabling data sharing. Methods include removing direct and indirect identifiers such as names, addresses, social insurance numbers, or dates of birth, or using obfuscation methods such as encryption, hashing, generalisation, pseudonymisation, and perturbation. RDMT. https://terms.codata.org/rdmt/de-identification Program delivery managers and support function managers, at all levels in an institution who are accountable for the direct delivery and support of programs and services within their domain of business responsibility. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 manager Program delivery managers and support function managers, at all levels in an institution who are accountable for the direct delivery and support of programs and services within their domain of business responsibility. 
Definition from now-deprecated CASRAI Glossary term: manager Organizational leadership is: (a) The ability to attract, assess, mobilize and focus energies and talent to work towards a shared purpose aligned with the mandate of the organization; (b) The ability to change culture, processes and priorities within the organization; and, (c) The ability to mentor. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 organizational leadership Organizational leadership is: (a) The ability to attract, assess, mobilize and focus energies and talent to work towards a shared purpose aligned with the mandate of the organization; (b) The ability to change culture, processes and priorities within the organization; and, (c) The ability to mentor. Definition from now-deprecated CASRAI Glossary term: organizational-leadership Data mining practice in which large volumes of data are analysed seeking any possible relationships between data. The traditional scientific method, in contrast, begins with a hypothesis and follows with an examination of the data. Data dredging often circumvents traditional data mining techniques and may lead to premature conclusions. Uncovered patterns may be presented as statistically significant without any specific hypothesis as to the underlying causality. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 Data fishing data dredging Data mining practice in which large volumes of data are analysed seeking any possible relationships between data. The traditional scientific method, in contrast, begins with a hypothesis and follows with an examination of the data. Data dredging often circumvents traditional data mining techniques and may lead to premature conclusions. Uncovered patterns may be presented as statistically significant without any specific hypothesis as to the underlying causality. RDMT. 
https://terms.codata.org/rdmt/data-dredging Data that have not been organised into a format and identifiable data structure that makes them easy to access and process. These data can often be searched as long as they are digital, but they are difficult to use for computer analyses. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Unstructured information unstructured data Data that have not been organised into a format and identifiable data structure that makes them easy to access and process. These data can often be searched as long as they are digital, but they are difficult to use for computer analyses. RDMT. https://terms.codata.org/rdmt/unstructured-data https://orcid.org/0000-0003-2687-1982 2020-11-16T22:18:14.764875Z presentation slides with interactive exercises https://orcid.org/0000-0003-2687-1982 2020-10-02T14:23:05.600474Z creating persistent identifiers The process of bringing together, from two or more different sources, data that relate to the same individual, family, place or event. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 Linkage data linkage The process of bringing together, from two or more different sources, data that relate to the same individual, family, place or event. RDMT. https://terms.codata.org/rdmt/data-linkage A type of collection that describes, and points to features of another collection. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 catalogue A type of collection that describes, and points to features of another collection. RDMT. https://terms.codata.org/rdmt/catalogue The consideration of available monetary resources for a specific time period to serve a specific purpose. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel funding acquisition management The consideration of available monetary resources for a specific time period to serve a specific purpose.
KH To be able to use programmatic methods to access a resource's API to query and extract an appropriate subset of data. https://orcid.org/0000-0003-2687-1982 API access Machine actionable search machine access via api To be able to use programmatic methods to access a resource's API to query and extract an appropriate subset of data. PMQ Continuum of data structure that includes unstructured data, semi-structured data, and structured data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data structure continuum Continuum of data structure that includes unstructured data, semi-structured data, and structured data. RDMT. https://terms.codata.org/rdmt/data-structure-continuum Facts, measurements, recordings, records, or observations about the world, collected by researchers, that are yet to be processed/interpreted/analysed. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data Facts, measurements, recordings, records, or observations about the world, collected by researchers, that are yet to be processed/interpreted/analysed. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records. RDMT. 
https://terms.codata.org/rdmt/data Data that have not been organised into a specialised repository, such as a database, but that nevertheless have associated information, such as metadata, that makes them more amenable to processing than raw data. Semi-structured data lie somewhere between structured and unstructured data. They are not organised in a complex manner that makes sophisticated access and analysis possible. However, they may have information associated with them, such as metadata tagging that allows elements contained to be addressed. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 semi-structured data Data that have not been organised into a specialised repository, such as a database, but that nevertheless have associated information, such as metadata, that makes them more amenable to processing than raw data. Semi-structured data lie somewhere between structured and unstructured data. They are not organised in a complex manner that makes sophisticated access and analysis possible. However, they may have information associated with them, such as metadata tagging that allows elements contained to be addressed. RDMT. https://terms.codata.org/rdmt/semi-structured-data https://orcid.org/0000-0003-2687-1982 2020-10-01T20:53:37.24517Z book chapter A scientist who conducts activities in: (1) Research, development and analysis (RDA); (2) Managing research; and, (3) Representation and client services. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 research scientist A scientist who conducts activities in: (1) Research, development and analysis (RDA); (2) Managing research; and, (3) Representation and client services. Definition from now-deprecated CASRAI Glossary term: research-scientist Type of digital Persistent Identifier (PID) issued by the International DOI Foundation. 
This permanent digital identifier is associated with an object that permits the object to be referenced reliably even if its location and metadata undergo change over time. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 DOI digital object identifier Type of digital Persistent Identifier (PID) issued by the International DOI Foundation. This permanent digital identifier is associated with an object that permits the object to be referenced reliably even if its location and metadata undergo change over time. RDMT. https://terms.codata.org/rdmt/digital-object-identifier https://orcid.org/0000-0003-2687-1982 F3. metadata clearly and explicitly include the identifier of the data they describe Expert in one or more of the information management disciplines that support the effective and efficient management of information. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 information management specialist Expert in one or more of the information management disciplines that support the effective and efficient management of information. RDMT. https://terms.codata.org/rdmt/information-management-specialist https://orcid.org/0000-0003-2687-1982 A2. metadata are accessible, even when the data are no longer available Identify and engage in dialogue with stakeholders affected by the research, or by making its outputs FAIR. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang stakeholder engagement on societal impact The Principal Investigator (P.I.) is a researcher who has a research leadership role and is the point of contact for a project or partnership that applies the scientific method, historical method, or other research methodology for the advancement of knowledge resulting in independent, objective, high quality, traceable, and reproducible results. The P.I. 
has primary responsibility for the intellectual direction and integrity of the research or research-related activity, including data production, findings and results, and ensures ethical conduct in all aspects of the research process including but not limited to the treatment of human and animal subjects, conflicts of interest, data acquisition, sharing and ownership, publication practices, responsible authorship, and collaborative research and reporting. While various tasks may be delegated to team members, some of whom may have greater expertise in specific areas, the P.I. is familiar with the various technical and scientific aspects of a project and how they fit together, is able to identify and remediate gaps, and ensure communication within the team and with users of the research data and results. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 principal investigator The Principal Investigator (P.I.) is a researcher who has a research leadership role and is the point of contact for a project or partnership that applies the scientific method, historical method, or other research methodology for the advancement of knowledge resulting in independent, objective, high quality, traceable, and reproducible results. The P.I. has primary responsibility for the intellectual direction and integrity of the research or research-related activity, including data production, findings and results, and ensures ethical conduct in all aspects of the research process including but not limited to the treatment of human and animal subjects, conflicts of interest, data acquisition, sharing and ownership, publication practices, responsible authorship, and collaborative research and reporting. While various tasks may be delegated to team members, some of whom may have greater expertise in specific areas, the P.I. 
is familiar with the various technical and scientific aspects of a project and how they fit together, is able to identify and remediate gaps, and ensure communication within the team and with users of the research data and results. Definition from now-deprecated CASRAI Glossary term: principal-investigator The person responsible for creating the organizational environment culture by providing clear direction and circumstances that allow people to be successful. The program manager is judged on the elements time, cost, and scope, cumulatively for all the projects and operations within the program. Program management decisions are both tactical and strategic in nature. The strategy aspects of these decisions must consider multidimensional impacts beyond the near-term delivery dates of the project. In addition to delivery and execution, the program manager has to also be concerned with the overall health and effectiveness of the program over the long term. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 program manager The person responsible for creating the organizational environment culture by providing clear direction and circumstances that allow people to be successful. The program manager is judged on the elements time, cost, and scope, cumulatively for all the projects and operations within the program. Program management decisions are both tactical and strategic in nature. The strategy aspects of these decisions must consider multidimensional impacts beyond the near-term delivery dates of the project. In addition to delivery and execution, the program manager has to also be concerned with the overall health and effectiveness of the program over the long term. Definition from now-deprecated CASRAI Glossary term: program-manager URL that points to an intermediate resolution service. The PURL resolution service associates the PURL with the actual URL and returns that URL to the client. 
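The PURL definition just given describes an indirection step: a stable URL is looked up in a resolution service, which hands back the resource's current location. A minimal sketch of that lookup follows, assuming an in-memory table in place of a real resolution service. The names `PURL_TABLE`, `resolve_purl` and `move_resource`, and the `example.org` addresses, are hypothetical and do not come from the terminology.

```python
from typing import Dict, Tuple

# Hypothetical resolver table: each PURL maps to the resource's current URL.
PURL_TABLE: Dict[str, str] = {
    "https://purl.example.org/dataset/42":
        "https://repo.example.org/files/v3/dataset42.csv",
}


def resolve_purl(purl: str, table: Dict[str, str] = PURL_TABLE) -> Tuple[int, str]:
    """Mimic a resolution service: return an HTTP-style (status, location) pair."""
    target = table.get(purl)
    if target is None:
        return 404, ""      # unknown PURL: nothing to redirect to
    return 302, target      # known PURL: redirect the client to the actual URL


def move_resource(purl: str, new_url: str, table: Dict[str, str] = PURL_TABLE) -> None:
    """Record a new location; the PURL that clients cite stays unchanged."""
    table[purl] = new_url
```

Calling `move_resource` changes only the table entry, so anything that cites the PURL keeps working after the underlying file moves.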
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 PURL persistent uniform resource locator URL that points to an intermediate resolution service. The PURL resolution service associates the PURL with the actual URL and returns that URL to the client. RDMT. https://terms.codata.org/rdmt/persistent-uniform-resource-locator Widely used metadata element set, formally titled ISO 15836-1:2017, Information and documentation — The Dublin Core metadata element set — Part 1: Core elements. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 dublin core Widely used metadata element set, formally titled ISO 15836-1:2017, Information and documentation — The Dublin Core metadata element set — Part 1: Core elements. RDMT. https://terms.codata.org/rdmt/dublin-core Yann Le Franc ylefranc 2021-03-17T16:34:14.943152Z data stewardship guideline The FAIR principles are guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data. The principles refer to three types of entities: data (or any digital object), metadata (information about that digital object), and infrastructure. https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-10-02T14:56:15.031154Z fair principle The FAIR principles are guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of digital assets. 
The principles emphasise machine-actionability (i.e., the capacity of computational systems to find, access, interoperate, and reuse data with none or minimal human intervention) because humans increasingly rely on computational support to deal with data as a result of the increase in volume, complexity, and creation speed of data. The principles refer to three types of entities: data (or any digital object), metadata (information about that digital object), and infrastructure. https://www.go-fair.org/fair-principles/ Bin for Aptitudes related to Cloud computing environment management. leightonlc aptitudes for cloud computing environment management New datasets obtained by combining data appropriately from a variety of existing files, generating new data products that did not previously exist. Repurposed data result from data wrangling. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 repurposed data New datasets obtained by combining data appropriately from a variety of existing files, generating new data products that did not previously exist. Repurposed data result from data wrangling. RDMT. https://terms.codata.org/rdmt/repurposed-data Definition of the structure and the legal elements and attributes of an XML document. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 document type definition Definition of the structure and the legal elements and attributes of an XML document. RDMT. https://terms.codata.org/rdmt/document-type-definition https://orcid.org/0000-0003-2687-1982 I3. (meta)data include qualified references to other (meta)data Long-lasting digital reference to an object that gives information about that object regardless of what happens to that object. 
Developed to address link rot, a persistent identifier can be resolved to provide an appropriate representation of an object whether that object changes its online location or goes offline. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 PID persistent identifier Long-lasting digital reference to an object that gives information about that object regardless of what happens to that object. Developed to address link rot, a persistent identifier can be resolved to provide an appropriate representation of an object whether that object changes its online location or goes offline. RDMT. https://terms.codata.org/rdmt/persistent-identifier A person who is studying or has expert knowledge of one or more of the natural or physical sciences. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 scientist A person who is studying or has expert knowledge of one or more of the natural or physical sciences. Definition from now-deprecated CASRAI Glossary term: scientist 1. The act of bringing together smaller components into a single system that functions as one. 2. In the context of information technology: The end result of a process that aims to stitch together different, often disparate, subsystems so that the data contained in each becomes part of a larger, more comprehensive system that, ideally, quickly and easily shares data when needed. This often requires that organizations build a customized architecture or structure of applications to combine new or existing hardware, software and other communications. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 integration 1. The act of bringing together smaller components into a single system that functions as one. 2. 
In the context of information technology: The end result of a process that aims to stitch together different, often disparate, subsystems so that the data contained in each becomes part of a larger, more comprehensive system that, ideally, quickly and easily shares data when needed. This often requires that organizations build a customized architecture or structure of applications to combine new or existing hardware, software and other communications. Definition from now-deprecated CASRAI Glossary term: integration Bin for types of Knowledge related to High performance computing management. leightonlc knowledge of high performance computing management The act of mentoring around FAIR data stewardship. Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 fair mentoring The act of mentoring around FAIR data stewardship. PMQ Exercise of authority, control and shared decision making (planning, monitoring and enforcement) over the management of data assets. Overall management of the availability, usability, integrity, and security of the data employed in an organisation. A sound data governance program includes a governing body or council, a defined set of procedures, and a plan to execute those procedures. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data governance Exercise of authority, control and shared decision making (planning, monitoring and enforcement) over the management of data assets. Overall management of the availability, usability, integrity, and security of the data employed in an organisation. A sound data governance program includes a governing body or council, a defined set of procedures, and a plan to execute those procedures. RDMT. https://terms.codata.org/rdmt/data-governance Scheme used for identification of resources (including people and organisations) and the sharing of data across domains, enterprises, and applications. 
XRI TC will define a Uniform Resource Identifier (URI) scheme and a corresponding Uniform Resource Namespace (URN). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 extensible resource identifier Scheme used for identification of resources (including people and organisations) and the sharing of data across domains, enterprises, and applications. XRI TC will define a Uniform Resource Identifier (URI) scheme and a corresponding Uniform Resource Namespace (URN). RDMT. https://terms.codata.org/rdmt/extensible-resource-identifier In the context of a researcher's activities, impact is the consequence of the research and new knowledge on the advancement of the specialty. Science-based policies, regulations, services and technology transfers are some examples of ways target results can be achieved and impact demonstrated. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 impact In the context of a researcher's activities, impact is the consequence of the research and new knowledge on the advancement of the specialty. Science-based policies, regulations, services and technology transfers are some examples of ways target results can be achieved and impact demonstrated. Definition from now-deprecated CASRAI Glossary term: impact A list used to grant permission matched against credentials. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 access control list A list used to grant permission matched against credentials. Definition from now-deprecated CASRAI Glossary term: access-control-list. Learn about data copyright laws, licensing and other legal aspects of data access. Understand the application of those laws at the project level. Celia van Gelder Mateusz Kuzak Yan Wang understand data ownership and access policies Meeting/conference organisation is a project management activity that encompasses all of the steps required to run a meeting or conference.
Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 meeting/conference organisation It can be considered as a negotiated agreement between the customer and vendor which are acceptable to both parties with respect to costs and expectations in order to support the business process. Kristina Hettne Leighton Christiansen Simon Hodson Victoria Dominguez Del Angel SLM service level management It can be considered as a negotiated agreement between the customer and vendor which are acceptable to both parties with respect to costs and expectations in order to support the business process. https://www.techopedia.com/definition/13848/service-level-management-slm [LLC] From an official perspective, a national standard is adopted by a national standards body (e.g., Standards Council of Canada, American National Standards Institute, British Standards Institution) and made available to the public. Practically speaking, however, a national standard is any standard that is widely used and recognized within a country. In this context, even government standards, such as those issued by the Occupational Safety and Health Administration (OSHA), can be considered national standards. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 national standard From an official perspective, a national standard is adopted by a national standards body (e.g., Standards Council of Canada, American National Standards Institute, British Standards Institution) and made available to the public. Practically speaking, however, a national standard is any standard that is widely used and recognized within a country. In this context, even government standards, such as those issued by the Occupational Safety and Health Administration (OSHA), can be considered national standards. 
Definition from now-deprecated CASRAI Glossary term: national-standard A governing culture that holds that the public has the right to access the documents and proceedings of government to allow for greater openness, accountability, and engagement. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 open government A governing culture that holds that the public has the right to access the documents and proceedings of government to allow for greater openness, accountability, and engagement. Definition from now-deprecated CASRAI Glossary term: open-government access control and management Manage the assessment, implementation and monitoring of secure storage protocols. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang storage security management 1. Exercising authority to provide direction and to undertake, coordinate, and regulate activities in support of achieving this direction and desired outcomes. Governance can be thought of as the role of an organization's board of directors or its equivalent that is focused on defining that organization's purpose and the development of the strategies, objectives, values, and policies that frame how that purpose will be pursued. It includes the development of such things as mission statements, statements of organizational objectives and values, logic models, organizational performance metrics, risk management frameworks, policies and guidelines for financial and operational matters, stakeholder relations, etc. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 governance 1. Exercising authority to provide direction and to undertake, coordinate, and regulate activities in support of achieving this direction and desired outcomes. 
Governance can be thought of as the role of an organization's board of directors or its equivalent that is focused on defining that organization's purpose and the development of the strategies, objectives, values, and policies that frame how that purpose will be pursued. It includes the development of such things as mission statements, statements of organizational objectives and values, logic models, organizational performance metrics, risk management frameworks, policies and guidelines for financial and operational matters, stakeholder relations, etc. Definition from now-deprecated CASRAI Glossary term: governance Low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 open archives initiative protocol for metadata harvesting Low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked within HTTP. RDMT. https://terms.codata.org/rdmt/open-archives-initiative-protocol-for-metadata-harvesting A series of computer instructions written in some human readable computer language, usually stored in a text file. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 computer code A series of computer instructions written in some human readable computer language, usually stored in a text file. Definition from now-deprecated CASRAI Glossary term: computer-code Defines the structure or format of data exchange and is achieved through tools such as XML or SQL standards. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 syntactic interoperability Defines the structure or format of data exchange and is achieved through tools such as XML or SQL standards. RDMT. https://terms.codata.org/rdmt/syntactic-interoperability Bin for Aptitudes related to Preservation costs management. leightonlc aptitudes for preservation costs management https://orcid.org/0000-0003-2687-1982 accessibility of digital assets The activity of providing computer system resources, especially data storage and computing power, on demand via the Internet. Kristina Hettne Simon Hodson Victoria Dominguez Del Angel cloud computing environment management The activity of providing computer system resources, especially data storage and computing power, on demand via the Internet. (modified from Wikipedia) [VDA] Set of documents that has a scientific meaning. A corpus can be produced by an individual researcher's activity (including its archival materials) or from a laboratory's research, a field campaign, a survey, or any other discrete research activity. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 corpus Set of documents that has a scientific meaning. A corpus can be produced by an individual researcher's activity (including its archival materials) or from a laboratory's research, a field campaign, a survey, or any other discrete research activity. RDMT. https://terms.codata.org/rdmt/corpus Understand how to use FAIR and open research tools and services. Hugh Shanahan the use of fair and open research tools or services Practice of making data available for checking, reproducing or reusing. The mechanisms available for achieving this are: making data available on request, as supplementary information to journal articles, or as published datasets in data repositories. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Data dissemination Data posting data sharing Practice of making data available for checking, reproducing or reusing. The mechanisms available for achieving this are: making data available on request, as supplementary information to journal articles, or as published datasets in data repositories. RDMT. https://terms.codata.org/rdmt/data-sharing Understand how the rewards for effort depend on crediting the contributions of researchers and professional groups towards making FAIR outputs. Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang appreciate importance of crediting research contributions The process of resolving a reference to useful information by using a globally available system. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 reference resolution The process of resolving a reference to useful information by using a globally available system. Definition from now-deprecated CASRAI Glossary term: reference-resolution The person who manages or coordinates resources, personnel, facilities, and operating funds-allocations in an organization conducting research, development and analysis (RDA) in the natural and physical sciences. A research manager determines the nature, priority objectives and the resources committed to their achievement within and across the organizations, and evaluates program outputs in relation to organizational objectives and policies. A research manager provides scientific advice on the direction, conduct and management of these programs. A research manager does not personally conduct research development and analysis (RDA), control and coordinate projects, or control and coordinate contracted RDA. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 research manager The person who manages or coordinates resources, personnel, facilities, and operating funds-allocations in an organization conducting research, development and analysis (RDA) in the natural and physical sciences. A research manager determines the nature, priority objectives and the resources committed to their achievement within and across the organizations, and evaluates program outputs in relation to organizational objectives and policies. A research manager provides scientific advice on the direction, conduct and management of these programs. A research manager does not personally conduct research development and analysis (RDA), control and coordinate projects, or control and coordinate contracted RDA. Definition from now-deprecated CASRAI Glossary term: research-manager Combining diverse datasets from disparate sources into one unified dataset or database. Data are accessed and extracted, moved, validated, cleaned, transformed and loaded. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data integration Combining diverse datasets from disparate sources into one unified dataset or database. Data are accessed and extracted, moved, validated, cleaned, transformed and loaded. RDMT. https://terms.codata.org/rdmt/data-integration Written methods, instructions, and tools that, when applied in different data collection contexts produce data that are ready to be harmonised or integrated without further manipulation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL, LM 22.1.25: Refined the definition of this term to apply specifically to data stewardship tasks, and to align with RDMT. 
Standard operating procedure for the collection of harmonised or integrated data standard operating procedure Written methods, instructions, and tools that, when applied in different data collection contexts produce data that are ready to be harmonised or integrated without further manipulation. RDMT. https://terms.codata.org/rdmt/standard-operating-procedure-harmonised-integrated-data Physical or digital storage location that can house, preserve, manage, and provide access to many types of digital and physical materials in a variety of formats. Materials in online repositories are curated to enable search, discovery, and reuse. There must be sufficient control for the physical and digital material to be authentic, reliable, accessible and usable on a continuing basis. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 repository Physical or digital storage location that can house, preserve, manage, and provide access to many types of digital and physical materials in a variety of formats. Materials in online repositories are curated to enable search, discovery, and reuse. There must be sufficient control for the physical and digital material to be authentic, reliable, accessible and usable on a continuing basis. RDMT. https://terms.codata.org/rdmt/repository Understand the concepts underpinning FAIR criteria. Angus Whyte knowledge of theories underlying fair implementation Know the purpose of the public use, choose the proper subjects for the audience and describe the project work for the purpose/audience chosen. 
Celia van Gelder Mateusz Kuzak Yan Wang document in lay terms https://orcid.org/0000-0003-2687-1982 2020-10-02T14:54:27.168274Z understanding persistent identifiers Collection of interrelated data often with controlled redundancy, organised according to a scheme to serve one or more applications; the data are stored so that they can be used by several programs without concern for data structures or organisation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data upload database Collection of interrelated data often with controlled redundancy, organised according to a scheme to serve one or more applications; the data are stored so that they can be used by several programs without concern for data structures or organisation. RDMT. https://terms.codata.org/rdmt/data-upload-database A string of characters used to identify or name a resource on the Internet. Such identification enables interaction with representations of the resource over a network, typically the World Wide Web, using specific protocols. MIT data management and publishing https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 uniform resource identifier A string of characters used to identify or name a resource on the Internet. Such identification enables interaction with representations of the resource over a network, typically the World Wide Web, using specific protocols. MIT data management and publishing Definition from now-deprecated CASRAI Glossary term: uniform-resource-identifier Know how to search for and identify FAIR services or tools that fit project needs. Angus Whyte how to find fair research data tools/services (catalogues) A framework whose primary purpose is to enable information sharing and reuse via the standard description and discovery of common data and the promotion of uniform data management practices. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data reference model A framework whose primary purpose is to enable information sharing and reuse via the standard description and discovery of common data and the promotion of uniform data management practices. Definition from now-deprecated CASRAI Glossary: data-reference-model A process by which a scholarly work (such as a paper or a research proposal) is checked by a group of experts in the same field to make sure it meets the necessary standards before it is published or accepted. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 peer review A process by which a scholarly work (such as a paper or a research proposal) is checked by a group of experts in the same field to make sure it meets the necessary standards before it is published or accepted. Definition from now-deprecated CASRAI Glossary term: peer-review Bin for types of Knowledge related to Provenance information management. leightonlc 2019-10-17T15:17:02.313897Z knowledge of provenance information management (noun) Sometimes called a row, a group of fields (sometimes called columns) within a table that are relevant to a specific entity. Multiple records are contained in a file or dataset. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Data record record (noun) Sometimes called a row, a group of fields (sometimes called columns) within a table that are relevant to a specific entity. Multiple records are contained in a file or dataset. RDMT. https://terms.codata.org/rdmt/record Reverse engineering process in which de-identified data are cross-referenced with other data sources to re-identify the personally identifiable information. This could occur if a de-identification process had not been successfully performed, or had not been undertaken in the first place. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 de-anonymization Reverse engineering process in which de-identified data are cross-referenced with other data sources to re-identify the personally identifiable information. This could occur if a de-identification process had not been successfully performed, or had not been undertaken in the first place. RDMT. https://terms.codata.org/rdmt/de-anonymisation HughShanahan 2019-10-18T08:45:02.980527Z application of fair tools and services Requires compliance because of a government statute or regulation, an organization internal policy, or contractual requirement. Failure to comply with a mandatory standard usually carries a sanction, such as civil or criminal penalties, or loss of employment. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 mandatory standard Requires compliance because of a government statute or regulation, an organization internal policy, or contractual requirement. Failure to comply with a mandatory standard usually carries a sanction, such as civil or criminal penalties, or loss of employment. Definition from now-deprecated CASRAI Glossary term: mandatory-standard Approach to governance that values decisions that can be backed up with data that can be verified. The success of the data-driven approach is reliant upon the quality of the data gathered and the effectiveness of its analysis and interpretation. Errors can creep into data analytics processes at any stage of the endeavour and serious issues can result when they do. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 DDDM data driven decision management Approach to governance that values decisions that can be backed up with data that can be verified. 
The success of the data-driven approach is reliant upon the quality of the data gathered and the effectiveness of its analysis and interpretation. Errors can creep into data analytics processes at any stage of the endeavour and serious issues can result when they do. RDMT. https://terms.codata.org/rdmt/data-driven-decision-management Single, well-defined version of all the data entities in an organisational ecosystem. Encompasses all the data in every system of record within a particular organisation. A well-maintained, current golden record should be a fundamental element of the master data management policy for every enterprise. The word “golden” is sometimes used in information technology to express the importance of some type of source. In the context of virtualization, for example, a golden image is a template for a virtual machine, virtual desktop, servers, or hard disk drive. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 golden record Single, well-defined version of all the data entities in an organisational ecosystem. Encompasses all the data in every system of record within a particular organisation. A well-maintained, current golden record should be a fundamental element of the master data management policy for every enterprise. The word “golden” is sometimes used in information technology to express the importance of some type of source. In the context of virtualization, for example, a golden image is a template for a virtual machine, virtual desktop, servers, or hard disk drive. RDMT. https://terms.codata.org/rdmt/golden-record Reliability and application efficiency of data. Perception or assessment of a dataset's fitness to serve its purpose in a given context. Aspects of data quality include: Accuracy, Completeness, Update status, Relevance, Consistency across data sources, Reliability, Appropriate presentation, Accessibility. 
Data quality is affected by the way data are entered, stored and managed. Maintaining data quality requires going through the data periodically and scrubbing it. Typically this involves updating, standardising, and de-duplicating records to create a single view of the data, even if it is stored in multiple disparate systems. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data quality Reliability and application efficiency of data. Perception or assessment of a dataset's fitness to serve its purpose in a given context. Aspects of data quality include: Accuracy, Completeness, Update status, Relevance, Consistency across data sources, Reliability, Appropriate presentation, Accessibility. Data quality is affected by the way data are entered, stored and managed. Maintaining data quality requires going through the data periodically and scrubbing it. Typically this involves updating, standardising, and de-duplicating records to create a single view of the data, even if it is stored in multiple disparate systems. RDMT. https://terms.codata.org/rdmt/data-quality HughShanahan 2019-10-18T08:38:59.038306Z using fair and open research tools or services https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-10-01T20:53:15.691824Z webinar Data in the form of digital materials. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digital data Data in the form of digital materials. RDMT. https://terms.codata.org/rdmt/digital-data https://orcid.org/0000-0003-2687-1982 F1. (meta)data are assigned a globally unique and persistent identifier An organization's process of defining its strategy or direction in the context of FAIR project management activities, both in the context of current knowledge and unknown factors in the future. 
Philippe Rocca-Serra Susanna Sansone https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 strategic/long-term planning An organization's process of defining its strategy or direction in the context of FAIR project management activities, both in the context of current knowledge and unknown factors in the future. AL, and https://en.wikipedia.org/wiki/Strategic_planning [17.10.19] Collective processes conducted to ensure the cleanliness of data. Data are considered clean when they are relatively error-free. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data hygiene Collective processes conducted to ensure the cleanliness of data. Data are considered clean when they are relatively error-free. RDMT. https://terms.codata.org/rdmt/data-hygiene Statistical analysis and assessment of the quality of data values within a dataset for consistency, uniqueness and logic. The data profiling process cannot identify inaccurate data; it can only identify rule violations and anomalies. The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. It can also be used to provide metrics to assess data quality and help determine whether or not metadata accurately describes the source data. Profiling tools evaluate the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across datasets. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data profiling Statistical analysis and assessment of the quality of data values within a dataset for consistency, uniqueness and logic. The data profiling process cannot identify inaccurate data; it can only identify rule violations and anomalies. The insight gained by data profiling can be used to determine how difficult it will be to use existing data for other purposes. 
It can also be used to provide metrics to assess data quality and help determine whether or not metadata accurately describes the source data. Profiling tools evaluate the actual content, structure and quality of the data by exploring relationships that exist between value collections both within and across datasets. RDMT. https://terms.codata.org/rdmt/data-profiling Set of metadata elements and their values that describe an object. Metadata elements in the record may derive from a metadata profile or standard, and may include different types of metadata (descriptive, administrative, etc.). A metadata record is typically stored within a metadata catalogue or repository. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 metadata record Set of metadata elements and their values that describe an object. Metadata elements in the record may derive from a metadata profile or standard, and may include different types of metadata (descriptive, administrative, etc.). A metadata record is typically stored within a metadata catalogue or repository. RDMT. https://terms.codata.org/rdmt/metadata-record Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 research data Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data. RDMT. https://terms.codata.org/rdmt/research-data The medium through which learning has been provided. For example, through a course, slides, video presentation, online documentation, wiki pages or others. https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-09-22T13:19:57.012225Z learning medium data manager Compilation of core electronic health data submitted by various healthcare providers and organisations, accessible by numerous authorised parties from a number of points of care, possibly even from different jurisdictions. Electronic health records typically include: contact information, information about visits to health care professionals, allergies, insurance information, family history, immunisation status, information about any conditions or diseases, a list of medications, records of hospitalisation, information about any surgeries or procedures performed. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Digital medical record electronic health record Compilation of core electronic health data submitted by various healthcare providers and organisations, accessible by numerous authorised parties from a number of points of care, possibly even from different jurisdictions. Electronic health records typically include: contact information, information about visits to health care professionals, allergies, insurance information, family history, immunisation status, information about any conditions or diseases, a list of medications, records of hospitalisation, information about any surgeries or procedures performed. RDMT. https://terms.codata.org/rdmt/electronic-health-record Techniques used to deal with parameters having different units and scales. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Data rescaling data scaling Techniques used to deal with parameters having different units and scales. RDMT. https://terms.codata.org/rdmt/data-scaling Definitions of the access relationships between the following metadata: data object name, a user name (or user group, or user role), and access permission(s). The information can be stored as metadata information associated with each data object. The information can be generated dynamically by applying the access controls of the collection that organises the data objects (if a collection sticky bit is turned on). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 access controls Definitions of the access relationships between the following metadata: data object name, a user name (or user group, or user role), and access permission(s). The information can be stored as metadata information associated with each data object. 
The information can be generated dynamically by applying the access controls of the collection that organises the data objects (if a collection sticky bit is turned on). RDMT. https://terms.codata.org/rdmt/access-controls Monitoring the risk to privacy posed by data requests from researchers, and the practices of data custodians in providing data (information governance) to ensure that confidentiality is protected. Such governance requires specialised knowledge of technology, law, and statistical methods. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 privacy governance Monitoring the risk to privacy posed by data requests from researchers, and the practices of data custodians in providing data (information governance) to ensure that confidentiality is protected. Such governance requires specialised knowledge of technology, law, and statistical methods. RDMT. https://terms.codata.org/rdmt/privacy-governance Angus Whyte Celia van Gelder Mateusz Kuzak Yan Wang research integrity, attribution, impact awareness Bin for types of Knowledge related to Cloud computing environment management. leightonlc knowledge of cloud computing environment management Data about data. It is data (or information) that defines and describes the characteristics of other data. It is used to improve the understanding and use of the data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Data documentation metadata Data about data. It is data (or information) that defines and describes the characteristics of other data. It is used to improve the understanding and use of the data. RDMT. https://terms.codata.org/rdmt/metadata Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a comma from the next column’s value and each row starts a new line. 
Along with pipe- and tab-separated values, comma-separated values are a specific example of a record-oriented data structure (usually of fixed dimension) with fields separated by an agreed delimiter. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 CSV Maybe remove as shouldn't have formats here. comma separated values Values in a table presented as a series of ASCII text lines organised so that each column value is separated by a comma from the next column’s value and each row starts a new line. Along with pipe- and tab-separated values, comma-separated values are a specific example of a record-oriented data structure (usually of fixed dimension) with fields separated by an agreed delimiter. RDMT. https://terms.codata.org/rdmt/comma-separated-values https://orcid.org/0000-0003-2687-1982 identifier Responsible for executing tasks and producing deliverables as outlined in the Project Plan and directed by the Project Manager, at whatever level of effort or participation has been defined for them. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 project team member Responsible for executing tasks and producing deliverables as outlined in the Project Plan and directed by the Project Manager, at whatever level of effort or participation has been defined for them. Definition from now-deprecated CASRAI Glossary term: project-team-member Metadata that describe a dataset or resource in such a way that people can discover and identify it. Contains information that aids with findability such as information (metadata elements) on the creator(s), affiliation(s), title, abstract, keywords, persistent identifier, related publications, etc. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 https://github.com/terms4fairskills/FAIRterminology/issues/13 descriptive metadata Metadata that describe a dataset or resource in such a way that people can discover and identify it. Contains information that aids with findability such as information (metadata elements) on the creator(s), affiliation(s), title, abstract, keywords, persistent identifier, related publications, etc. RDMT. https://terms.codata.org/rdmt/descriptive-metadata https://orcid.org/0000-0003-2687-1982 https://orcid.org/0000-0003-2687-1982 2020-10-01T21:03:55.425579Z github repository Analysing multivariate datasets using pattern recognition or other knowledge discovery techniques to identify potentially unknown and potentially meaningful data content, relationships, classification or trends. Data mining parameters include: Association (looking for patterns where one event is connected to another event); Sequence or path analysis (looking for patterns where one event leads to another later event); Classification (looking for new patterns); Clustering (finding and visually documenting groups of facts not previously known); Forecasting, or predictive analytics (discovering patterns in data that can lead to reasonable predictions about the future). https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 data mining Analysing multivariate datasets using pattern recognition or other knowledge discovery techniques to identify potentially unknown and potentially meaningful data content, relationships, classification or trends. 
Data mining parameters include: Association (looking for patterns where one event is connected to another event); Sequence or path analysis (looking for patterns where one event leads to another later event); Classification (looking for new patterns); Clustering (finding and visually documenting groups of facts not previously known); Forecasting, or predictive analytics (discovering patterns in data that can lead to reasonable predictions about the future). RDMT. https://terms.codata.org/rdmt/data-mining https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 Data preservation https://orcid.org/0000-0003-2687-1982 2021-02-17T22:18:22.641817Z data archiving Machine processable specifications which define the structure and syntax of metadata specifications in a formal schema language. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 5.12.22 Moved from 'findability of digital assets' as it is definitely not a particular data stewardship guideline. encoding schema Machine processable specifications which define the structure and syntax of metadata specifications in a formal schema language. RDMT. https://terms.codata.org/rdmt/encoding-schema In the context of computing, the process of backup and ongoing maintenance as opposed to strategies for long-term digital preservation. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 digital archiving digital archiving (computing) In the context of computing, the process of backup and ongoing maintenance as opposed to strategies for long-term digital preservation. RDMT. https://terms.codata.org/rdmt/digital-archiving-computing The process of resolving a PID to a useful state of information about a digital object by using a globally available system. 
https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 pid resolution The process of resolving a PID to a useful state of information about a digital object by using a globally available system. RDMT. https://terms.codata.org/rdmt/pid-resolution Bin for Skills needed for Storage management. leightonlc skills related to storage management Management of a national, discipline or institutional repository of published datasets. Provision of infrastructure, curation, policy and training that govern the organisation, control, and properties of the repository such as: required file formats, access control restrictions, integrity, replication, retention, disposal, etc. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 data repository management Management of a national, discipline or institutional repository of published datasets. Provision of infrastructure, curation, policy and training that govern the organisation, control, and properties of the repository such as: required file formats, access control restrictions, integrity, replication, retention, disposal, etc. RDMT. https://terms.codata.org/rdmt/data-repository-management A document creation and management specification that builds content reuse into the authoring process. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 darwin information typing architecture A document creation and management specification that builds content reuse into the authoring process. Definition from now-deprecated CASRAI Glossary term: darwin-information-typing-architecture/ The discovery of meaningful multidimensional patterns in data. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 analytics The discovery of meaningful multidimensional patterns in data. 
Definition from now-deprecated CASRAI Glossary term: analytics The personal attributes necessary to perform a task. https://orcid.org/0000-0003-2687-1982 Aptitude is the innate trait or talent that a person brings to a task or situation. It is the quality of being able to do something. An aptitude is not something that can be learned or developed unless it is there to begin with. Laura Molloy, Celia van Gelder:https://docs.google.com/presentation/d/12oNBFix39ZtsLAR4tkqESBXxMU5rVg3cCKoEDG2oK-M/edit#slide=id.p5 data stewardship soft skill Ability to apply the FAIR principles i.e. describe the data with community metadata standard for machine and human, align with existing semantic models (ontologies, controlled vocabularies, ...), get a persistent ID, attribute licence and credit for data creators, legal aspect related to data (ownership, confidentiality, ethics, ...) add data provenance. Kristina Hettne Victoria Dominguez Del Angel Yann Le Franc https://orcid.org/0000-0002-7702-4495 archival documentation of data Ability to apply the FAIR principles i.e. describe the data with community metadata standard for machine and human, align with existing semantic models (ontologies, controlled vocabularies, ...), get a persistent ID, attribute licence and credit for data creators, legal aspect related to data (ownership, confidentiality, ethics, ...) add data provenance. [YLF, VDA, KH] A machine-readable format is a structured format that can be processed by a computer. Such formats can either be intended solely for machine processing (e.g. XML or RDF), or may be both human and machine accessible via appropriate markup (e.g. HTML). https://orcid.org/0000-0002-7702-4495 https://orcid.org/0000-0003-2687-1982 AL 5.12.22: Merged with now-obsolete 'machine readable' (http://purl.obolibrary.org/obo/T4FS_0000256), as we do not need that level of granularity. 
AL 6.5.22: The original CASRAI term (https://casrai.org/term/machine-readable-format/) has an identical source definition with 'Digital materials', and therefore has been removed. machine-readable format A machine-readable format is a structured format that can be processed by a computer. Such formats can either be intended solely for machine processing (e.g. XML or RDF), or may be both human and machine accessible via appropriate markup (e.g. HTML). Modified by AL from Wikipedia, https://en.wikipedia.org/wiki/Machine-readable_data, accessed 6.5.22 Any information obtained by a person on the understanding that they will not disclose it to others, or obtained in circumstances where it is expected that they will not disclose it. https://orcid.org/0000-0002-5214-4466 https://orcid.org/0000-0003-2687-1982 confidential information Any information obtained by a person on the understanding that they will not disclose it to others, or obtained in circumstances where it is expected that they will not disclose it. Definition from now-deprecated CASRAI Glossary term: confidential-information The FAIR Cookbook’s recipes are a combination of guidance, technical, hands-on, background and review types to cover the operation steps of FAIR data management, and are classified according to the audience types, to serve all those involved in the data management life cycle. The FAIR Cookbook is for anyone working in the Life Sciences who needs guidance on applying the FAIR Principles in practice. https://orcid.org/0000-0002-7702-4495 fair cookbook recipe The FAIR Cookbook’s recipes are a combination of guidance, technical, hands-on, background and review types to cover the operation steps of FAIR data management, and are classified according to the audience types, to serve all those involved in the data management life cycle. The FAIR Cookbook is for anyone working in the Life Sciences who needs guidance on applying the FAIR Principles in practice. 
https://faircookbook.elixir-europe.org/content/recipes/introduction/FAIR-cookbook-audience.html Evaluation indicators are designed to enable the measurement of the degree of compliance against a guideline using a set of criteria. The goals of such indicators are varied, but a good example would be to improve the re-usability of the digital asset being measured for increased discoverability, interoperability and overall machine actionability. https://github.com/terms4fairskills/FAIRterminology/issues/22 evaluation indicator Evaluation indicators are designed to enable the measurement of the degree of compliance against a guideline using a set of criteria. The goals of such indicators are varied, but a good example would be to improve the re-usability of the digital asset being measured for increased discoverability, interoperability and overall machine actionability. Definition by https://orcid.org/0000-0002-7702-4495 and drawn in part from https://fairplus.github.io/Data-Maturity/docs/Indicators The fairplus dataset maturity model indicators are used within the context of the model, which is intended as a comprehensive reference model for state-of-FAIRness improvement in research datasets. Based on the FAIR guiding principles, the DSM model defines and classifies requirements that constitute an incremental path towards improving FAIRness level for a given research dataset. https://orcid.org/0000-0002-7702-4495 fairplus DSM model indicator fairplus dataset maturity model indicator The fairplus dataset maturity model indicators are used within the context of the model, which is intended as a comprehensive reference model for state-of-FAIRness improvement in research datasets. Based on the FAIR guiding principles, the DSM model defines and classifies requirements that constitute an incremental path towards improving FAIRness level for a given research dataset. 
https://fairplus.github.io/Data-Maturity/ The fairplus DSM content-related indicators relate to what is reported in the Dataset (data) & the Dataset Descriptor (metadata). https://orcid.org/0000-0002-7702-4495 fairplus DSM content-related indicator The fairplus DSM content-related indicators relate to what is reported in the Dataset (data) & the Dataset Descriptor (metadata). https://fairplus.github.io/Data-Maturity/ The fairplus DSM representation and format indicators relate to how the data object & metadata object are represented and formatted. https://orcid.org/0000-0002-7702-4495 AL 5.12.22: Please note that the 'and' in this label is a direct representation of the fairplus DSM model, and not an ontological design choice. fairplus DSM representation and format indicator The fairplus DSM representation and format indicators relate to how the data object & metadata object are represented and formatted. https://fairplus.github.io/Data-Maturity/ The fairplus DSM hosting-environment capabilities indicators relate to the capabilities of the hosting environment that enable and support the use of FAIR data https://orcid.org/0000-0002-7702-4495 fairplus DSM hosting-environment capabilities indicator The fairplus DSM hosting-environment capabilities indicators relate to the capabilities of the hosting environment that enable and support the use of FAIR data https://fairplus.github.io/Data-Maturity/ No representation of Data purposed for FAIR sharing is available https://orcid.org/0000-0002-7702-4495 DSM-0-R2 No representation of Data purposed for FAIR sharing is available https://fairplus.github.io/Data-Maturity/docs/Indicators/ Structured and/or Unstructured Data are organised into Dataset(s) created for the purpose of FAIR sharing https://orcid.org/0000-0002-7702-4495 DSM-1-R2 Structured and/or Unstructured Data are organised into Dataset(s) created for the purpose of FAIR sharing https://fairplus.github.io/Data-Maturity/docs/Indicators/ Project collected Data are 
organized into structured Dataset(s) and conform to a locally defined Dataset Model https://orcid.org/0000-0002-7702-4495 DSM-2-R2 Project collected Data are organized into structured Dataset(s) and conform to a locally defined Dataset Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Structured Data are represented as Datasets and conform to relevant Standard Dataset Model(s) for FAIR sharing https://orcid.org/0000-0002-7702-4495 DSM-3-R2 Structured Data are represented as Datasets and conform to relevant Standard Dataset Model(s) for FAIR sharing https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) content is semantically represented using Linked Data Representations conforming to a Semantic Data Model https://orcid.org/0000-0002-7702-4495 DSM-4-R2 Dataset(s) content is semantically represented using Linked Data Representations conforming to a Semantic Data Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) are granularly represented and managed at the Data Element Level (e.g. ISO 11179 MDR standard) https://orcid.org/0000-0002-7702-4495 DSM-5-R2 Dataset(s) are granularly represented and managed at the Data Element Level (e.g. 
ISO 11179 MDR standard) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Metadata is NOT formally represented in a structured Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-0-R3 Dataset Metadata is NOT formally represented in a structured Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Metadata is formally represented in the form of an Identifiable Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-1-R0 Dataset Metadata is formally represented in the form of an Identifiable Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ A representation of the Dataset Descriptor conforming to a relevant General Purpose Metadata Schema is available https://orcid.org/0000-0002-7702-4495 DSM-1-R3 A representation of the Dataset Descriptor conforming to a relevant General Purpose Metadata Schema is available https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor(s) conforms to or extends a Standard Generic Dataset Descriptor Model to describe and represent structural metadata of Dataset(s) https://orcid.org/0000-0002-7702-4495 DSM-2-R3 Dataset Descriptor(s) conforms to or extends a Standard Generic Dataset Descriptor Model to describe and represent structural metadata of Dataset(s) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor(s) use community-defined or domain-specific metadata standard https://orcid.org/0000-0002-7702-4495 DSM-3-R3 Dataset Descriptor(s) use community-defined or domain-specific metadata standard https://fairplus.github.io/Data-Maturity/docs/Indicators/ A Semantic Data Model (Metadata) used for data harmonisation across Datasets is formally defined and represented using Linked Data Representations https://orcid.org/0000-0002-7702-4495 DSM-4-R3 A Semantic Data Model (Metadata) used for data harmonisation across Datasets is formally defined and represented using Linked Data Representations 
https://fairplus.github.io/Data-Maturity/docs/Indicators/ Common Data Elements and their value sets are defined and registered in a managed Metadata Registry https://orcid.org/0000-0002-7702-4495 DSM-5-R3 Common Data Elements and their value sets are defined and registered in a managed Metadata Registry https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is NOT formally represented in any form https://orcid.org/0000-0002-7702-4495 DSM-0-R1 Contextual Metadata is NOT formally represented in any form https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is reported at summary level and represented in the Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-1-R1 Contextual Metadata is reported at summary level and represented in the Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is formally represented in the form of a locally defined Domain Model https://orcid.org/0000-0002-7702-4495 DSM-2-R1 Contextual Metadata is formally represented in the form of a locally defined Domain Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is formally represented and conforms to a standard defined Domain Model if available https://orcid.org/0000-0002-7702-4495 DSM-3-R1 Contextual Metadata is formally represented and conforms to a standard defined Domain Model if available https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata is formally represented by a defined set Common Data Elements https://orcid.org/0000-0002-7702-4495 DSM-4-R1 Contextual Metadata is formally represented by a defined set Common Data Elements https://fairplus.github.io/Data-Maturity/docs/Indicators/ Domain entities are represented by Managed Master Data Objects conforming to a Master Data Model used for data consolidation https://orcid.org/0000-0002-7702-4495 DSM-5-R1 Domain entities are represented by Managed Master Data Objects conforming to a 
Master Data Model used for data consolidation https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor (metadata) is NOT available in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-0-R4 Dataset Descriptor (metadata) is NOT available in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor is available in Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-1-R4 Dataset Descriptor is available in Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Contextual Metadata represented in the form of a Domain Model is available in a Human Readable Format https://orcid.org/0000-0002-7702-4495 DSM-2-R4 Contextual Metadata represented in the form of a Domain Model is available in a Human Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ A formal documentation of the adopted Standard Dataset Model is available in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-3-R4 A formal documentation of the adopted Standard Dataset Model is available in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ A Semantic Data Model (Metadata) describing the data is represented in a Machine Readable and Machine Interpretable format https://orcid.org/0000-0002-7702-4495 DSM-4-R4 A Semantic Data Model (Metadata) describing the data is represented in a Machine Readable and Machine Interpretable format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) are NOT available in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-0-R5 Dataset(s) are NOT available in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) available in Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-1-R5 Dataset(s) available in Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ If applicable, 
Dataset(s) available in non-proprietary Machine Readable Format relevant to the adopted standard Dataset Model https://orcid.org/0000-0002-7702-4495 DSM-3-R5 If applicable, Dataset(s) available in non-proprietary Machine Readable Format relevant to the adopted standard Dataset Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Datasets are available in a Machine Readable and Machine Interpretable format https://orcid.org/0000-0002-7702-4495 DSM-4-R5 Datasets are available in a Machine Readable and Machine Interpretable format https://fairplus.github.io/Data-Maturity/docs/Indicators/ If applicable, license information and/or permitted use and accessibility to parts of the dataset is formally represented and encoded in a Machine Readable Format https://orcid.org/0000-0002-7702-4495 DSM-4-R6 If applicable, license information and/or permitted use and accessibility to parts of the dataset is formally represented and encoded in a Machine Readable Format https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) are NOT Identifiable via Unique Identifiers https://orcid.org/0000-0002-7702-4495 DSM-0-C0 Dataset(s) are NOT Identifiable via Unique Identifiers https://fairplus.github.io/Data-Maturity/docs/Indicators/ Each Dataset purposed for FAIR sharing is assigned a unique identifier https://orcid.org/0000-0002-7702-4495 DSM-1-C0 Each Dataset purposed for FAIR sharing is assigned a unique identifier https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, data is structured in the Dataset according to the Tidy Data Principles https://orcid.org/0000-0002-7702-4495 DSM-2-C2 Where applicable, data is structured in the Dataset according to the Tidy Data Principles https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset(s) scope and content are reported in compliance with relevant community-defined Data Reporting Guidelines https://orcid.org/0000-0002-7702-4495 DSM-3-C2 Where applicable, Dataset(s) scope and 
content are reported in compliance with relevant community-defined Data Reporting Guidelines https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) content is harmonised against a designed-for-purpose Semantic Data Model https://orcid.org/0000-0002-7702-4495 DSM-4-C2 Dataset(s) content is harmonised against a designed-for-purpose Semantic Data Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset(s) include Reference Fields that enable joining related datasets https://orcid.org/0000-0002-7702-4495 DSM-2-C3 Dataset(s) include Reference Fields that enable joining related datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset Field Names use standard controlled terms as recommended by the adopted Standard https://orcid.org/0000-0002-7702-4495 DSM-3-C3 Where applicable, Dataset Field Names use standard controlled terms as recommended by the adopted Standard https://fairplus.github.io/Data-Maturity/docs/Indicators/ Key Dataset Fields are mapped to Common Data Elements as defined by the Semantic Data Model https://orcid.org/0000-0002-7702-4495 DSM-4-C3 Key Dataset Fields are mapped to Common Data Elements as defined by the Semantic Data Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Fields are linked and harmonized against enterprise managed Metadata Elements (e.g. MDR registered Data Elements) https://orcid.org/0000-0002-7702-4495 DSM-5-C3 Dataset Fields are linked and harmonized against enterprise managed Metadata Elements (e.g. 
MDR registered Data Elements) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset Field Values are standardized against a locally defined Data Dictionary within and across related Datasets https://orcid.org/0000-0002-7702-4495 DSM-2-C4 Where applicable, Dataset Field Values are standardized against a locally defined Data Dictionary within and across related Datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, Dataset Field Values are standardised against domain-specific Controlled Terminologies and/or Ontology Terms https://orcid.org/0000-0002-7702-4495 DSM-3-C4 Where applicable, Dataset Field Values are standardised against domain-specific Controlled Terminologies and/or Ontology Terms https://fairplus.github.io/Data-Maturity/docs/Indicators/ Values for key Domain Entities reported in the Dataset(s) are standardised and assigned unique Standard Identifiers https://orcid.org/0000-0002-7702-4495 DSM-4-C4 Values for key Domain Entities reported in the Dataset(s) are standardised and assigned unique Standard Identifiers https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Field values are controlled and managed via enterprise managed Reference and Master Data https://orcid.org/0000-0002-7702-4495 DSM-5-C4 Dataset Field values are controlled and managed via enterprise managed Reference and Master Data https://fairplus.github.io/Data-Maturity/docs/Indicators/ Study/Project-Level metadata is NOT reported https://orcid.org/0000-0002-7702-4495 DSM-0-C1 Study/Project-Level metadata is NOT reported https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Descriptive Study/Project-Level summary information https://orcid.org/0000-0002-7702-4495 DSM-1-C1 Dataset Descriptor includes Descriptive Study/Project-Level summary information https://fairplus.github.io/Data-Maturity/docs/Indicators/ A locally defined Domain Model contains concepts that describes the overall 
project/study design, the relationships between the Datasets, the key entities reported within the Datasets and the relationships between them. https://orcid.org/0000-0002-7702-4495 DSM-2-C1 A locally defined Domain Model contains concepts that describes the overall project/study design, the relationships between the Datasets, the key entities reported within the Datasets and the relationships between them. https://fairplus.github.io/Data-Maturity/docs/Indicators/ Where applicable, study-level / experimental metadata is reported in compliance with relevant Minimum Information Reporting Guidelines https://orcid.org/0000-0002-7702-4495 DSM-3-C1 Where applicable, study-level / experimental metadata is reported in compliance with relevant Minimum Information Reporting Guidelines https://fairplus.github.io/Data-Maturity/docs/Indicators/ A Semantic Data Model includes study design Data Elements and the relationships between them https://orcid.org/0000-0002-7702-4495 DSM-4-C1 A Semantic Data Model includes study design Data Elements and the relationships between them https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor does NOT include a reference to the Dataset it describes https://orcid.org/0000-0002-7702-4495 DSM-0-C2 Dataset Descriptor does NOT include a reference to the Dataset it describes https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Identifying & Descriptive Dataset-Level metadata https://orcid.org/0000-0002-7702-4495 DSM-1-C2 Dataset Descriptor includes Identifying & Descriptive Dataset-Level metadata https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor contains access information for the Dataset https://orcid.org/0000-0002-7702-4495 DSM-1-C3 Dataset Descriptor contains access information for the Dataset https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes reference to related Datasets and if applicable the relevant joining Dataset Fields 
https://orcid.org/0000-0002-7702-4495 DSM-2-C5 Dataset Descriptor includes reference to related Datasets and if applicable the relevant joining Dataset Fields https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor references a standard license under which the dataset can be re-used. https://orcid.org/0000-0002-7702-4495 DSM-3-C7 Dataset Descriptor references a standard license under which the dataset can be re-used. https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Field-level Metadata as prescribed by a locally defined Dataset Model https://orcid.org/0000-0002-7702-4495 DSM-2-C6 Dataset Descriptor includes Field-level Metadata as prescribed by a locally defined Dataset Model https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes standard-compliant Field-level Metadata as prescribed by the adopted standard Dataset Model. https://orcid.org/0000-0002-7702-4495 DSM-3-C6 Dataset Descriptor includes standard-compliant Field-level Metadata as prescribed by the adopted standard Dataset Model. 
https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Semantic Data Model includes a pre-defined set of Common Data Elements reported within the Datasets and the relationships between them https://orcid.org/0000-0002-7702-4495 DSM-4-C5 The Semantic Data Model includes a pre-defined set of Common Data Elements reported within the Datasets and the relationships between them https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset Descriptor includes Value-level Metadata or if applicable includes a reference to a locally defined Data Dictionary https://orcid.org/0000-0002-7702-4495 DSM-2-C7 Dataset Descriptor includes Value-level Metadata or if applicable includes a reference to a locally defined Data Dictionary https://fairplus.github.io/Data-Maturity/docs/Indicators/ Value Level Metadata includes Resolvable Identifiers for Controlled and/or Standard Terms reported in the Dataset https://orcid.org/0000-0002-7702-4495 DSM-3-C5 Value Level Metadata includes Resolvable Identifiers for Controlled and/or Standard Terms reported in the Dataset https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data or metadata is hosted in non-accessible storage (e.g., personal desktop, local file system or archive) https://orcid.org/0000-0002-7702-4495 DSM-0-H1 Data or metadata is hosted in non-accessible storage (e.g., personal desktop, local file system or archive) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Metadata hosting environment stores and maintains an identifiable Dataset Descriptor for each identifiable Dataset https://orcid.org/0000-0002-7702-4495 DSM-1-H1 Metadata hosting environment stores and maintains an identifiable Dataset Descriptor for each identifiable Dataset https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Data hosting environment's Persistence Model is aligned with a locally defined Domain Model to enable interpretation of Datasets https://orcid.org/0000-0002-7702-4495 DSM-2-H1 The Data hosting environment's 
Persistence Model is aligned with a locally defined Domain Model to enable interpretation of Datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Data hosting environment's Persistence Model is aligned with a standard Dataset model or compliant with relevant Minimum Information Reporting Guidelines https://orcid.org/0000-0002-7702-4495 DSM-3-H1 The Data hosting environment's Persistence Model is aligned with a standard Dataset model or compliant with relevant Minimum Information Reporting Guidelines https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting environment stores data in a relevant linked data store (e.g., Triple Store or Graph Database) https://orcid.org/0000-0002-7702-4495 DSM-4-H1 Data Hosting environment stores data in a relevant linked data store (e.g., Triple Store or Graph Database) https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data or metadata hosted in an accessible resource but with no retrieval capability https://orcid.org/0000-0002-7702-4495 DSM-0-H2 Data or metadata hosted in an accessible resource but with no retrieval capability https://fairplus.github.io/Data-Maturity/docs/Indicators/ The Dataset and its Descriptor are indexed and retrievable (in the same or separate hosting environments) via unique and persistent identifiers https://orcid.org/0000-0002-7702-4495 DSM-1-H2 The Dataset and its Descriptor are indexed and retrievable (in the same or separate hosting environments) via unique and persistent identifiers https://fairplus.github.io/Data-Maturity/docs/Indicators/ Retrieval of the Dataset and the Dataset Descriptor utilises a standardized communication protocol that is open, free and universally implementable https://orcid.org/0000-0002-7702-4495 DSM-1-H3 Retrieval of the Dataset and the Dataset Descriptor utilises a standardized communication protocol that is open, free and universally implementable https://fairplus.github.io/Data-Maturity/docs/Indicators/ Metadata hosting environment 
provides programmatic access and retrieval (API) for the Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-2-H2 Metadata hosting environment provides programmatic access and retrieval (API) for the Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ For each dataset, the hosting environment maintains a globally unique, persistent and resolvable identifier for access and retrieval https://orcid.org/0000-0002-7702-4495 DSM-3-H2 For each dataset, the hosting environment maintains a globally unique, persistent and resolvable identifier for access and retrieval https://fairplus.github.io/Data-Maturity/docs/Indicators/ If applicable, Dataset hosting environment offers dataset-level authentication and authorisation capabilities https://orcid.org/0000-0002-7702-4495 DSM-3-H4 If applicable, Dataset hosting environment offers dataset-level authentication and authorisation capabilities https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting Environment provides semantic querying capability https://orcid.org/0000-0002-7702-4495 DSM-4-H2 Data Hosting Environment provides semantic querying capability https://fairplus.github.io/Data-Maturity/docs/Indicators/ Dataset's Metadata is NOT searchable via keywords or elements within the Descriptor https://orcid.org/0000-0002-7702-4495 DSM-0-H3 Dataset's Metadata is NOT searchable via keywords or elements within the Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Metadata hosting environment offers the capability to browse and search contents of the Dataset Descriptor https://orcid.org/0000-0002-7702-4495 DSM-1-H4 Metadata hosting environment offers the capability to browse and search contents of the Dataset Descriptor https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data hosting environment offers the capability to browse and search related Datasets https://orcid.org/0000-0002-7702-4495 DSM-2-H3 Data hosting environment offers the capability to browse and 
search related Datasets https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting environment utilises controlled terms and/or ontology terms to search within Dataset content. https://orcid.org/0000-0002-7702-4495 DSM-3-H3 Data Hosting environment utilises controlled terms and/or ontology terms to search within Dataset content. https://fairplus.github.io/Data-Maturity/docs/Indicators/ Data Hosting Environment provides semantic querying capability https://orcid.org/0000-0002-7702-4495 DSM-4-H2 Data Hosting Environment provides semantic querying capability https://fairplus.github.io/Data-Maturity/docs/Indicators/ https://orcid.org/0000-0002-7702-4495 AL 15.3.22: Added for FAIR Cookbook integration. data scientist https://orcid.org/0000-0002-7702-4495 AL 15.3.22: Added for FAIR Cookbook integration. ontologist https://orcid.org/0000-0002-7702-4495 AL 15.3.22: Added for FAIR Cookbook integration. terminology manager https://orcid.org/0000-0002-7702-4495 AL 16.3.22 Added as part of FAIR cookbook integration. software engineer https://orcid.org/0000-0002-7702-4495 AL 16.3.22: Added as part of FAIR Cookbook integration. system administrator https://orcid.org/0000-0002-7702-4495 AL 16.3.22: Added as part of FAIR Cookbook integration. Useful for such roles to be able to understand the cost of e.g. making things FAIR (and how to deal with it), but also the benefits. procurement officer Metadata creation concerns the creation of data that provides information about characteristics, aspects or context of other data entities such as a dataset or other digital or analog object. https://orcid.org/0000-0002-7702-4495 metadata creation Metadata creation concerns the creation of data that provides information about characteristics, aspects or context of other data entities such as a dataset or other digital or analog object. AL 22.3.22, and see also FRBR User Tasks at: https://sites.google.com/site/metadatastandards/chapter-6/6-3-frbr-user-tasks. 
Obsolete Class example to be eventually removed example to be eventually removed failed exploratory term The term was used in an attempt to structure part of the ontology but in retrospect failed to do a good job. Person:Alan Ruttenberg failed exploratory term metadata complete Class has all its metadata, but is either not guaranteed to be in its final location in the asserted IS_A hierarchy or refers to another class that is not complete. metadata complete organizational term Term created to ease viewing/sorting of terms for development purposes, and will not be included in a release. organizational term ready for release Class has undergone final review, is ready for use, and will be included in the next release. Any class lacking "ready_for_release" should be considered likely to change place in hierarchy, have its definition refined, or be obsoleted in the next release. Those classes deemed "ready_for_release" will also be derived from a chain of ancestor classes that are also "ready_for_release." ready for release metadata incomplete Class is being worked on; however, the metadata (including definition) are not complete or sufficiently clear to the branch editors. metadata incomplete uncurated Nothing done yet beyond assigning a unique class ID and proposing a preferred term. uncurated pending final vetting All definitions, placement in the asserted IS_A hierarchy and required minimal metadata are complete. The class is awaiting a final review by someone other than the term editor. pending final vetting placeholder removed placeholder removed terms merged An editor note should explain which terms were merged and the reason for the merge. terms merged term imported This is to be used when the original term has been replaced by a term imported from another ontology. An editor note should indicate the URI of the new term to use. term imported term split This is to be used when a term has been split in two or more new terms.
An editor note should indicate the reason for the split and indicate the URIs of the new terms created. term split universal Hard to give a definition for. Intuitively a "natural kind" rather than a collection of any old things, which a class is able to be, formally. At the meta level, universals are defined as positives, are disjoint with their siblings, have single asserted parents. Alan Ruttenberg A Formal Theory of Substances, Qualities, and Universals, http://ontology.buffalo.edu/bfo/SQU.pdf universal defined class A defined class is a class that is defined by a set of logically necessary and sufficient conditions but is not a universal. "Definitions", in some readings, are always given by necessary and sufficient conditions. So one must be careful (and this is difficult sometimes) to distinguish between defined classes and universals. Alan Ruttenberg defined class named class expression A named class expression is a logical expression that is given a name. The name can be used in place of the expression. Named class expressions are used in order to have more concise logical definitions, but their extensions may not be interesting classes on their own. In languages such as OWL, with no provisions for macros, these show up as actual classes. Tools may wish not to show them as such, and to replace uses of the macros with their expansions. Alan Ruttenberg named class expression to be replaced with external ontology term Terms with this status should eventually be replaced with a term from another ontology. Alan Ruttenberg group:OBI to be replaced with external ontology term requires discussion A term that is metadata complete, has been reviewed, and problems have been identified that require discussion before release. Such a term requires editor note(s) to identify the outstanding issues. Alan Ruttenberg group:OBI requires discussion The term was added to the ontology on the assumption it was in scope, but it turned out later that it was not.
This obsolescence reason should be used conservatively. Typical valid examples are: unnecessary grouping classes in disease ontologies, a phenotype term added on the assumption it was a disease. https://github.com/information-artifact-ontology/ontology-metadata/issues/77 https://orcid.org/0000-0001-5208-3432 out of scope