The Semantic Mapping Vocabulary provides and defines terms used for creating and maintaining semantic mappings, in particular mapping metadata.
Semantic Mapping Vocabulary

https://www.w3.org/TR/skos-reference/ http://www.w3.org/2004/02/skos/core#
A match where the object is a broader concept than the subject.
broad match

https://www.w3.org/TR/skos-reference/ http://www.w3.org/2004/02/skos/core#
A match where the subject and object are sufficiently similar that they can be used interchangeably in some information retrieval applications.
close match

https://www.w3.org/TR/skos-reference/ http://www.w3.org/2004/02/skos/core#
A match where the subject and object are sufficiently similar, with a high degree of confidence, that they can be used interchangeably across a wide range of information retrieval applications.
exact match

https://www.w3.org/TR/skos-reference/ http://www.w3.org/2004/02/skos/core#
A mapping (alignment) link between two concepts.
mapping relation

https://www.w3.org/TR/skos-reference/ http://www.w3.org/2004/02/skos/core#
A match where the object is a narrower concept than the subject.
narrow match

https://www.w3.org/TR/skos-reference/ http://www.w3.org/2004/02/skos/core#
A match where the subject and the object are inherently related, but where neither is in any way more general than the other.
related match

https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
A match where the subject is considered analogous to the object in a different taxonomic grouping, but the object refers to a broader concept.
FBbt:00007091 (D. melanogaster “subperineurial glial sheath”) and UBERON:0000202 (taxon-neutral “glial blood brain barrier”) are a cross-species broad match.
cross-species broad match

https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
A match where the subject and the object belong to different taxonomic groupings, but refer to concepts similar enough that they can be used interchangeably.
cross-species close match

https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
A match where the subject is considered analogous to the object in a different taxonomic grouping.
FBbt:00005074 (D. melanogaster “muscle cell”) and CL:0000187 (taxon-neutral “muscle cell”) are a cross-species exact match.
cross-species exact match

https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
A match where the subject is considered analogous to the object in a different taxonomic grouping, but the object refers to a narrower concept.
cross-species narrow match

https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
A match where the subject is isomorphic to the object, i.e. considered of identical or similar form, shape, or structure.
isomorphic match

https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
A match where the subject cannot be considered isomorphic to the object, i.e. they are not considered of identical or similar form, shape, or structure. The object corresponds to exactly one subject in the subject_source.
non-isomorphic match

A matching process that exploits background knowledge from external resources, commonly referred to as background knowledge resources. This approach is also known as indirect matching, BK-based matching or context-based matching.
A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge.
background knowledge-based matching process

A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these), with a single blank character.
The string 'ontology  matching' (two spaces between 'ontology' and 'matching') becomes 'ontology matching' (one space).
blank normalisation

cardinality filtering

A normalization technique converting each alphabetic character in a string to its lower case counterpart.
The string 'Ontology' is normalised to 'ontology' (lower case).
case normalization

A matching process based on multiple, possibly intertwined, matching approaches.
An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural.
composite matching process

A normalization technique replacing diacritic signs (e.g. accents, hats) with their most frequent replacements.
The string 'Protégé' is normalised to 'Protege'.
diacritics suppression

A normalization technique removing all numbers in a string.
The string 'ontology1' becomes 'ontology'.
digit suppression

edit distance

hamming distance

A matching process based on individual representations (or instances).
A match between a subject A and an object B was established because they share the same instances.
instance-based matching process

language-based similarity measure

The process of reducing the different forms of a word to one single form.
lemmatization

levenshtein distance

A matching process based on a lexical comparison between one or more syntactic features of the subject and one or more syntactic features of the object.
The label of a subject entity matches an exact synonym of an object entity.
lexical matching process

A lexical matching process based on a minimum threshold of a score from a comparison based on a lexical similarity algorithm.
A match between a subject and an object was established because they had a Levenshtein score higher than 0.8.
lexical similarity threshold-based matching process

A normalization technique replacing specific links between words, such as apostrophes, dashes, underscores, etc., with dashes or blanks.
The string 'alzheimer's' becomes 'alzheimers'.
link stripping

logical consistency filtering

A matching process based on the inferences made by a logical reasoner.
Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK.
logical reasoning process

A matching process that is performed by a human agent and is based on human judgement and domain knowledge.
A human curator determines that a subject should be mapped to an object by virtue of their domain expertise.
manual mapping curation

A triple <s,p,o> comprising a subject entity s, an object entity o and a mapping predicate p.
The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate.
mapping

A process that relates to the creation, confirmation, rejection or curation of a mapping.
Matching is a mapping activity that results in the creation of a mapping; mapping review is an activity that results in the confirmation of a mapping.
mapping activity

A matching process based on the traversing of multiple mappings.
A match between a subject A and an object B was established because A was mapped to C, C was mapped to D and D was mapped to B.
mapping chaining-based matching process

mapping flipping, mapping inversion, mapping inverting, semantic walk-backs
A matching process based on the inverting or flipping of the subject with the object of a mapping in accordance with the semantics of the mapping predicate.
A match between a subject A and an object B using the skos:narrowMatch mapping relation was established because B was mapped to A using the skos:broadMatch mapping relation.
mapping inversion-based matching process

A process that is concerned with determining if a mapping “candidate” (otherwise determined) is reasonable/correct.
A match between a subject A and an object B was established elsewhere, and a human reviewer determined that the mapping is true (or false) based on an independent evaluation.
mapping review

matching operation, matching task
A process that results in a mapping between a subject and an object entity.
The label of a subject entity matches an exact synonym of an object entity.
matching process

A process applied to a set of mappings, usually with the intention of changing it.
matching process post-processing

A preliminary processing of inputs prior to performing matching.
matching process pre-processing

A family of preprocessing techniques for reducing strings to be compared to a common format.
normalization

A normalization technique removing all punctuation characters from a string.
The string 'e.g.' becomes 'eg'.
punctuation elimination

A preprocessing method transforming a string by matching a regular expression and then removing that match.
The regex match “[ -]phenotype” is removed from the label field of the subject entity in the mapping.
regex removal

A preprocessing method transforming a string by matching a regular expression and then replacing that match with a specified substitution string.
The regex match “[ -]phenotype” is replaced by “-disease” for the label field of the subject entity in the mapping.
regex replacement

A matching process based on a minimum threshold of a score from a comparison based on a semantic similarity algorithm.
A match between a subject and an object entity was established because they had a Jaccard score higher than 0.8 based on the set of (common) superclasses.
semantic similarity threshold-based matching process

A technique for determining a score that characterises the similarity between two entities.
similarity measure

stable marriage filtering

The process of reducing a word to its word stem.
stemming

A preprocessing method transforming a string by removing a list of stop words.
The stop-words “of, and, is, a” are removed from the label field of the subject entity in the mapping.
stop-word removal

A similarity measure based on the comparison of strings.
string-based similarity measure

A string-based similarity measure which involves determining if two strings associated with mapping entities are equal.
string equality

Structural matching does not involve looking at "values" of properties.
A match between a subject and an object was established because of the similarity of their structural features, e.g., the number of direct properties of a class.
structural matching process

substring similarity

substring test

named entity recognition

term extraction

threshold filtering

token-based distance

tokenization

A matching process based on an unspecified comparison.
A mapping between a subject and an object was established, but it is no longer clear how or why.
unspecified matching process

ngram similarity
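
Several of the normalization terms and the lexical similarity threshold-based matching process can be illustrated with a short Python sketch. This is not part of the vocabulary: the function names are hypothetical, and the Levenshtein "score" is assumed here to be 1 minus the edit distance divided by the length of the longer string, since the vocabulary does not define how the score is computed.

```python
import re
import string
import unicodedata


def blank_normalisation(text: str) -> str:
    """Replace each run of whitespace characters with a single space."""
    return re.sub(r"\s+", " ", text)


def case_normalization(text: str) -> str:
    """Convert alphabetic characters to lower case."""
    return text.lower()


def diacritics_suppression(text: str) -> str:
    """Decompose characters (NFKD) and drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def digit_suppression(text: str) -> str:
    """Remove all digits from the string."""
    return re.sub(r"\d", "", text)


def punctuation_elimination(text: str) -> str:
    """Remove all ASCII punctuation characters from the string."""
    return text.translate(str.maketrans("", "", string.punctuation))


def levenshtein_distance(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance, one row at a time."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def levenshtein_score(a: str, b: str) -> float:
    """ASSUMPTION: similarity = 1 - distance / length of the longer string."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein_distance(a, b) / longest


def is_lexical_match(a: str, b: str, threshold: float = 0.8) -> bool:
    """Threshold-based lexical match on normalized labels (hypothetical helper)."""
    def normalise(s: str) -> str:
        return blank_normalisation(case_normalization(diacritics_suppression(s)))
    return levenshtein_score(normalise(a), normalise(b)) > threshold
```

For instance, `is_lexical_match("Alzheimers  disease", "alzheimer disease")` compares the strings after diacritics suppression, case normalization and blank normalisation, and reports a match only if the assumed score exceeds 0.8. The order of the normalization steps is a design choice of this sketch, not something the vocabulary prescribes.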