Between January 2019 and April 2022 SSHOC will deliver a series of services and tools for daily use by SSH researchers. The timeline indicates their earliest availability. Scroll down to browse the catalogue.
Cultural and scientific data cannot be understood without knowledge about the provenance (the origin, context or history). Provenance provides a critical foundation for assessing authenticity, enabling trust, and allowing reproducibility. Provenance metadata are data describing objects, people, places, times which are causally related by events. They are event centric and must be described in a historical order to ensure that there are no references to non-existent (non-recorded) events or objects.
A website & repository, maintained by SHARE/CENTERDATA, with multilingual ontologies, such as occupations, industries, tasks, education, regions, religions, - all coded according to ruling standards - to be used for respondent's self-selection or interviewer selection in the relevant survey questions in multi country surveys, such as SHARE, ESS, EVS, Gender & Generation, and alike.
The website hosts the ontologies and has - at the backend - software for using these ontologies in web/personal/telephone surveys and for batch coding.
The Ethnic and Migrant Minorities (EMM) Survey Registry, a free online database and tool that displays detailed information about existing quantitative surveys conducted with EMM populations in Europe.
A database of survey questionnaires’ texts. The first version is compiled from questionnaires from the of European Social Survey (ESS) and the European Values Study (EVS) in the English source language and their translations into Catalan, Czech, French, German, Norwegian, Portuguese, Spanish and Russian.
Audio data in the form of voice recorded interviews from the Generations and Gender Survey, processed and analyzed by CLARIN using automatic speech recognition, speaker attribution, part-of-speech-tagging, named entity labelling, and other NLP tools.
A package of training resources and exercises bundled together with guidelines for setting up a training intervention, and a standard evaluation form - for use at Train-the-Trainer bootcamps and in building cross-disciplinary cooperation within a shared framework.
Based on a proof of concept for the cross-pollination of information between research infrastructures, the guidelines under development will describe how to design interview questionnaires to facilitate the capture of digital language data and its integration into the traditional data collection process.
A registry of (meta)data conversion services featuring the most relevant SSH (meta)data formats and encompassing links to services, format recommendations for increasing interoperability, software recommendations, and a software library. Selected conversion tools will be created where necessary.
Based on the already functional CLARIN VCR, the SSHOC VCR service will enable researchers to create integrated, coherent sets of links to digital objects. These virtual collections will provide persistent identifiers and federated login.
The collection metadata is openly available and accessible via the Virtual Language Observatory.
An extension of the current CLARIN Language Resource Switchboard, the SSHOC Switchboard will match language resources and tools to data types, automatically guiding researchers to the appropriate language analysis application, and enabling the publication, sharing and reuse of the resulting data via B2SHARE.
Emerging from use-case studies, an evaluation of existing platforms, and a pilot in which secure, safe-room remote desktop connections will be established and tested between partners, the specifications will include Data Use Agreements and a schema for assessing disclosure risk in data.
The repository will be built upon the community-driven open source Dataverse software platform. Its modular design facilitates integration with other data services such as DataCite or ROpenScience, allows for distributed file storage, and supports the development of additional functionality and services.
SSHOC will build on partner experience with rich, descriptive metadata such as CDMI, CIDOC, CRM, DDI and DataCite to provide software that renders the full range of SSH data-sets citable and exploitable.