{ "cells": [ { "cell_type": "code", "execution_count": 1, "id": "private-destiny", "metadata": {}, "outputs": [], "source": [ "# Load credentials from a local file.\n", "# This cell is meant to be deleted before publishing.\n", "import yaml\n", "\n", "with open(\"../creds.yml\", \"r\") as ymlfile:\n", "    cfg = yaml.safe_load(ymlfile)\n", "\n", "uri = cfg[\"sonar_creds\"][\"uri\"]\n", "user = cfg[\"sonar_creds\"][\"user\"]\n", "password = cfg[\"sonar_creds\"][\"pass\"]" ] }, { "cell_type": "markdown", "id": "diverse-great", "metadata": {}, "source": [ " SoNAR (IDH) - HNA Curriculum \n", "\n", "Notebook 3: SoNAR (IDH)" ] }, { "cell_type": "markdown", "id": "infinite-yeast", "metadata": {}, "source": [ "This curriculum was created for the SoNAR (IDH) project. SoNAR (IDH) is at its core a graph-based approach to structuring and linking large amounts of historical data (more on the SoNAR (IDH) project and database can be found in Notebook 3). Therefore, the whole curriculum focuses on graph theory and network analysis. \n", "\n", "This notebook provides an introduction to the SoNAR (IDH) database and its underlying Neo4j graph database technology, as well as the Cypher query language, which is part of the Neo4j ecosystem. " ] }, { "cell_type": "markdown", "id": "western-classics", "metadata": {}, "source": [ "# Project summary " ] }, { "cell_type": "markdown", "id": "speaking-brazilian", "metadata": {}, "source": [ "[SoNAR (IDH)](https://sonar.fh-potsdam.de/) is short for **Interfaces to Data for Historical Social Network Analysis and Research**. The main objective of the project is to examine and evaluate approaches to building and operating an advanced research technology environment that supports HNA. 
\n", "\n", "SoNAR (IDH) is a research project carried out in collaboration between the following institutions:\n", "\n", "* [Deutsches Forschungszentrum für Künstliche Intelligenz](https://www.dfki.de/)\n", "* [Fachhochschule Potsdam](http://uclab.fh-potsdam.de/)\n", "* [Humboldt-Universität zu Berlin](https://www.ibi.hu-berlin.de/)\n", "* [Staatsbibliothek zu Berlin](https://staatsbibliothek-berlin.de/en/)\n", "* [Heinrich-Heine-Universität Düsseldorf](https://www.uniklinik-duesseldorf.de/en/department-of-the-history-philosophy-and-ethics-of-medicine)\n", "\n", "\n", "One of the main elements of the SoNAR (IDH) project is a [Neo4j](https://neo4j.com/) graph database. This database contains the merged data of multiple archives and libraries. \n", "See [Chapter 2](#Data-Description) for more details about the structure and the contents of the SoNAR (IDH) database." ] }, { "cell_type": "markdown", "id": "treated-national", "metadata": {}, "source": [ "# Data description\n" ] }, { "cell_type": "markdown", "id": "antique-renewal", "metadata": {}, "source": [ "The SoNAR (IDH) database consists of nodes and edges. Every node and every edge has additional properties that provide rich metadata. \n", "\n", "This data description section provides details about the data sources and the overall characteristics of the data. The section is based on the state of the SoNAR (IDH) database in February 2021. A diagram of the database schema can be found [here](https://camo.githubusercontent.com/9262db5eb53360acb5ccc2249ff97b4b7d82ee9199bdcb8563980f16b9d7cc95/68747470733a2f2f7472656c6c6f2d6174746163686d656e74732e73332e616d617a6f6e6177732e636f6d2f3564323530353865393136326235363762383630313439662f3565336331336262363037323836353631636335366635372f62646664383838363964376633656465616663366232633130326361666663342f556d6c4d6f64656c2e737667).\n", "\n", "\n", "
SocialRelations, however, are implicit edges that were derived from Resources.\"\"\" ... \"\"\" for the query to tell Python that we are writing a character string that spans multiple lines. We do this so that the query looks tidy and well structured. You could also write the full query on a single line, but this hurts readability and makes debugging more difficult.\n",
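The point about triple quotes can be illustrated with a small sketch. The query below is a hypothetical example, not one taken from this notebook: it assumes only that nodes carry an `Id` property, and the variable names are made up for illustration. It shows that the multi-line and single-line forms describe the same query and differ only in whitespace.

```python
# Hypothetical Cypher query written as a triple-quoted, multi-line string.
# The node Id value is only an illustrative placeholder.
multi_line_query = """
MATCH (n)
WHERE n.Id = "(DE-588)118629743"
RETURN n
"""

# The same query written on a single line: valid, but harder to read and debug.
one_line_query = 'MATCH (n) WHERE n.Id = "(DE-588)118629743" RETURN n'

# Collapsing all whitespace shows that both strings contain the same query text.
normalized = " ".join(multi_line_query.split())
print(normalized == one_line_query)
```

Cypher treats line breaks like any other whitespace, so the multi-line form can be passed to the database unchanged.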
"Id property. The Id property is a combination of the ISIL (International Standard Identifier for Libraries and Related Organisations) and the GND-ID. For example, the Id of Max Weber is (DE-588)118629743: DE-588 is the ISIL code of the GND (Gemeinsame Normdatei) and 118629743 is the GND-ID of Max Weber.\n",
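As a minimal sketch of how such an Id decomposes, the snippet below splits an Id string into its ISIL prefix and the identifier that follows it. The helper name `split_sonar_id` and the regular expression are assumptions made for illustration; they are not part of the SoNAR (IDH) project or of any library, and they assume every Id follows the `(ISIL)identifier` pattern described above.

```python
import re
from typing import Tuple

# Assumed pattern: an ISIL code in parentheses followed by the local identifier.
ID_PATTERN = re.compile(r"^\((?P<isil>[^)]+)\)(?P<local_id>.+)$")


def split_sonar_id(node_id: str) -> Tuple[str, str]:
    """Split an Id such as '(DE-588)118629743' into (ISIL, local identifier)."""
    match = ID_PATTERN.match(node_id)
    if match is None:
        raise ValueError(f"Unexpected Id format: {node_id!r}")
    return match.group("isil"), match.group("local_id")


isil, gnd_id = split_sonar_id("(DE-588)118629743")
print(isil)    # DE-588
print(gnd_id)  # 118629743
```

Raising an error on unexpected input is a deliberate choice here: an Id that does not match the assumed pattern is reported instead of being split incorrectly.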
"SocialRelation nodes are derived from Resource nodes. The Source property of a SocialRelation is the Id of the corresponding Resource.\n",
"