Task 3.2 Selected SSH Ontologies and Vocabularies in SSHOC Work package 3 Lifting Technologies and Services into the SSH Cloud aimed to foster the use of selected global ontologies in the social sciences and humanities, regarding occupational titles, educational categories, sectors of industry, geographical regions, food items, and religions. These ontologies service the usage of vocabularies for classifying text corpora and predefined response categories to facilitate self-identification in survey questions.
This deliverable is a report on SSHOC metadata and data interoperability problems. It pertains to Task 3.5 under responsibility of CESSDA/FSD. In addition to an inventory of data and metadata interoperability problems, this deliverable includes choices for SSHOC recommended formats and priorities for providing conversion services and planning solutions.
This report delivers a comprehensive overview of data citation methodologies in use within the social sciences and humanities, and lays the groundwork for defining a common citation mechanism designed to render SSH data both findable and actionable.
The European Question Bank (EQB) aims to provide a central search facility across all the Consortium of European Social Science Data Archives’ (CESSDA) survey holdings. It uses a question-level metadata schema based on the DDI-Lifecycle standard (DDI Alliance 2020). The ambition is for users to be able to find survey questions, their translations to applicable languages, answer categories, pre- and postquestion texts, and the study title.
In the SSHOC project Task 3.2 (Selected SSH Ontologies and Vocabularies) several efforts were taken to foster the use of selected global ontologies regarding occupational titles, educational categories, sectors of industry, geographical regions, food items, and religions.
Citation is a pillar for the construction of knowledge. By creating proper citations in a standardized way researchers can constitute a mesh of linked information for various purposes (from credit to reuse). This becomes increasingly important as the SSHOC Task 3.4 team confronts the realities of Social Sciences and Humanities Research in a digital age, when machine actionability takes on a renewed and vital importance.
The SSHOC project task 3.5 has worked with metadata and data format interoperability issues and built an interoperability hub consisting of a portal (called Conversion Hub) and selected metadata conversion solutions that the task team is currently developing. The work is based on earlier work of task 3.5 that delivered Deliverable 3.1 Report on SSHOC (meta)data interoperability problems, which provided an inventory of metadata and data formats used by the SSHOC communities, and recommendations for metadata standards and data file formats.
This deliverable summarises the activities within Task 3.3. that were led by CLARIN ERIC with the main partners CLARIN/Athena, CLARIN/CUNI, and DARIAH/UGOE, and additional collaboration with SciencesPo. Over the course of M1-M38 the partners developed three demonstration scenarios that highlight the value of NLP (Natural Language Processing) technologies for the SSH field and investigated which aspects of the outcomes of T3.3. and in which form can be shared via SSH Open Marketplace.
SSHOC Task 3.6 (Making Data Re-usable and Actionable) focuses on two services originally from the CLARIN infrastructure domain, which in discussion and collaboration with the SSHOC communities are generalised and adapted to serve the broader Humanities and Social Sciences. Specifically with the purpose of facilitating better sharing and re-use of data and services in innovative ways.
Those services are the SSH Switchboard also known as the CLARIN Language Resource Switchboard and the SSH Virtual Collection Registry also known as the CLARIN Virtual Collection Registry (VCR).
This deliverable pertains to SSHOC Task 3.1 which was responsible for investigating and providing resources and tools to support the multilingual aspects of the future pan-EU SSH infrastructure.
Making data and services accessible and usable in SSH is very much also a matter of providing relevant translations, translation of metadata concepts, multilingual vocabularies, terminology extraction across languages, multilingual databases.