Survey Infrastructures systematically interview tens of thousands of individuals across Europe each year. Respondents are selected at random from all walks of life, and the hour-long interviews provide a range of data which has value for researchers and subsequently policy makers.
While complex life histories or events may be coded into the structured taxonomies required for cutting-edge sociological research, a large proportion of the information conveyed in an interview is lost. A respondent's tone of voice, linguistic fluidity, and depth of vocabulary for example can provide insights about cognitive function, socio-economic status or verbal reasoning skills.
Making use of this lost data requires the integration of social survey and linguistic infrastructures. Such integration underpins the EOSC vision. As such, the basis for the work within SSHOC on analysing voice recorded interviews seeks to provide both a proof of concept and a framework for future research that explores this approach.
After the presentations, participants will have time for discussion with the presenters in separate breakout rooms.
Dr Judith Koops is a researcher at NIDI. Her research focuses on inequalities in family demography. She works for the Generations and Gender Programme (GGP). GGP is a social science research infrastructure that provides open access micro- and macro-level data with the aim to improve knowledge about family life and life course trajectories in low-fertility countries. In SSHOC she leads tasks 4.4 (Voice recorded interviews and audio analysis) and 4.5 (Social policy APIs for social surveys).
Joris Mulder is a researcher at CentERdata, Tilburg University. He is the Coordinator of the Dutch LISS panel and responsible for the quality of the panel (e.g. representativeness, data quality and survey response) and promoting the use and awareness of the LISS panel. With a background in both computer science and social sciences Joris focuses on managing innovative research projects, such as combining online survey research methods with sensor based data collection and data science techniques.
Dr Henk van den Heuvel is director of the Centre for Language and Speech Technolog (CLST) at Radboud University. CLST is participating in SSHOC as a linked third party of CLARIN ERIC. He has been involved in the collection, compilation and validation of many spoken and written language resources at the national and international level. In various projects automatic speech recognisers were built for Dutch and Frisian in the domains of audio mining, language learning, and healthcare. Currently he is involved in CLARIAH PLUS tasks for improving speech recognition of dialect speech and data curation. In SSHOC he works on tasks related to integrating language and speech technology into survey tools, and in sharing sensitive data through remote access.