Converted from a Word document
The global project of data conversion of the notable Italian ‘Zeri Photo Archive’ into a Linked and Open Dataset
The project is supported by the Zeri Foundation with the University of Bologna and started in 2014description model of the available records, so as to define a collection of suitable ontologies to describe such a complex domain.
Indeed, the uniqueness of the Zeri collection
Fondazione Zeri, Photo Archive Catalog,
http://catalogo.fondazionezeri.unibo.it
The full project entails, together with the ontological modeling, the production of the RDF dataset, the creation of proper links to the LOD cloud, and the definition of the user interface for browsing the dataset.
The first activity of the project had been the
formalization of the
Scheda F (‘Fotografia’) (Mibact, 1999)
, – the metadata standard of the Italian Istituto Centrale per il Catalogo e la Documentazione (ICCD) – by mapping the schema onto CIDOC-CRM model (Crofts et al., 2011). In the conversion we initially considered the specific flavor of the Scheda F used by the Zeri Foundation, i.e. its subset of 113 fields (based on the experimental 1.04 version of the ICCD standard) and an handful of custom extensions to it. A deep analysis of the schema of the Scheda F showed that it is organized in semantically independent sections (called “paragrafi”, or
paragraphs), each one belonging to a specific FRBR concept (Work, Manifestation, Item); this allows the mapping to proceed by logical sections affecting only a limited number of entities and relating these entities to the data documented by the fields of the schema.
Before accomplishing the mapping, we proceeded with the creation of a
new ontology called FEO (F Entry Ontology
So, through the use of well-known ontologies – i.e. CIDOC-CRM, but also PROV-O (Lebo et al., 2013), and FaBiO (Peroni and Shotton, 2012), as part of the SPAR Ontologies
ad hoc, most of the content expressed as descriptive entries in the Scheda F have been already formally represented (Gonano et al., 2014).
In this presentation we propose an extension of our work on the Zeri Photo Archive by introducing a
See the ICCD cataloging standards at:
http://www.iccd.beniculturali.it/index.php?it/473/standard-catalografici
We are planning to publish both the F and the OA mapping to CIDOC-CRM in the next few months, according with the ICCD activitiesnew ontology for representing works of art and their related information, namely, the
OA Entry Ontology
Scheda OA (‘Opera d’Arte’) – another ICCD metadata standard
Moreover, in this paper we further investigate a way for providing a clear and shareable representation of questionable information, i.e., the
authorship attribution of works of art. In the Zeri Photo Archive there are particular authorship attribution created by either the Zeri Foundation cataloguers and/or by Federico Zeri (collector of photographs) himself, and such attributions (that could be accepted or discarded at certain point) are accompanied with the criteria that corroborate the cataloguers’ choice.
In order to provide a precise characterisation of all these aspects, we discuss the adoption of
HiCO,
Historical Context Ontology (Daquino and Tomasi, 2015) as a way for enabling a definition of authoritative attributions based on Zeri cataloguers’ own criteria. HiCO
With the hico:InterpretationAct class it’s possible to
describe the interpretation act as a process:
Two fundamental object properties complete the process: the hico:hasInterpretationType property and the hico:hasInterpretationCriterion property. The former states an arbitrary classification of the interpretation, which can be simply defined as philological, historical, semiotic, linguistic etc. The latter is a briefly explanation of the criterion used to state information extracted from a source, e.g. a literally transcription, a hypothesis, or the adoption of the literature about a specific argument.
A crucial aspect of the project is the
correct formalization of statements so as to allow the ontologically-consistent coexistence of data created by different actors that express contradictory statements about the same subject (e.g., authorship attribution data of a work of art), in order to guarantee the data integration with Pharos (Reist et al., 2015) project partners, of which the Zeri Foundation is a member. By the use of SWRL rules applied to relations between sources, criteria and context information used by an agent to explain his interpretation,
we could formally infer when an interpretation can be considered authoritative.
We could give an
example. We state aside the interpretation (i.e. the assertion “An artist X is author of a specific work of art Y in a specific time Z”) which sort of interpretation we are dealing with (i.e. authorship attribution), and which criteria have been used by the cataloguer to assert such proposition. A provenance statement ensures both author of assertion (i.e. interpretation) and author of data conversion are fully described, ensuring that no contradictory statements will affect data validity.
When necessary conditions for stating authorship are fulfilled, an attribution may be inferred as authoritative. In the example (Fig. 1), we have an attribution which respects minimal requirements for being considered an
authoritative assertion:
This obviously doesn’t entail that the attribution is surely correct, but it can be a useful tool for historians when searching for different attributions and related criteria adopted in interpretative process.
To conclude, as said before it will be the natural completion of the Zeri Project the RDF triple store set up, the creation of links to other datasets, and the definition of the user interface for browsing the linked open dataset. All these activities are now object of our research group industry and the final publication of the project is expected for the middle of 2016. Data had already been transformed in a RDF/XML dataset, according to above mentioned ontologies. Next steps of the project involve thus data publication, ensuring access to them through a SPARQL endpoint. When published, data will be enriched with further RDF links to major datasets and authority files online (e.g. VIAF for people and works, Getty thesauri, GeoNames for places
A first check on
https://datahub.io/dataset