Between January 2019 and April 2022 SSHOC will deliver a series of services and tools for daily use by SSH researchers. The timeline indicates their earliest availability. Scroll down to browse the catalogue. 

SSHOC Services and Tools Timeline

Filter by category


Fifty model survey questions, each with TWO foreign language versions, produced in a pilot study comparing Human-Aided Machine Translation and human translation methodologies.

Type:
Dataset
Service Catalogue Category:
Datasets

An open-access repository of survey questions aligned at the sentence level and to be transformed into domain-specific corpora to facilitate natural language processing, 

These domain-specific corpora will be made accessible to other users in Computer Assisted Translation (CAT) format. Based on this experience, general guidelines for incorporating survey specific parallel corpora into MT systems will be produced.

Type:
Dataset
Service Catalogue Category:
Datasets

Audio data in the form of voice recorded interviews from the Generations and Gender Survey, processed and analyzed by CLARIN using automatic speech recognition, speaker attribution, part-of-speech-tagging, named entity labelling, and other NLP tools.

Type:
Dataset
Service Catalogue Category:
Datasets