09 July 2019 - 07:00
Utrecht, The Netherlands


DH2019 Workshop: The Case Of Interview Data

A multidisciplinary approach to the use of technology in research using interview methods

Read the associated blog post.

When & Where

Tuesday, 9 July 2019 from 09:00 to 13:00 (CEST)

TivoliVredenburg - Co-located with Digital Humanities (DH) Conference 2019
Vredenburg 89 
3511 DX Utrecht



Arjan van Hessen, Stefania Scagliola, Louise Corti, Silvia Calamai, Norah Karrouche, Christoph Draxler, Henk van den Heuvel


The workshop

The event will take place as part of the Digital Humanities Conference 2019, held in Utrecht from the 9th to the 12th of July, and it is organised in collaboration with the SSHOC project, CLARIN ERIC and the UK Data Service.

This pre-conference workshop at DH2019 in Utrecht will present the complete workflow for processing spoken interview data, e.g. from the domain of Oral History. The workshop will focus on free web services and tools.

  • Preprocessing of audio signal data
  • Automatic speech recognition for the transcription of speech data
  • Manual correction of the transcript
  • Automatic time-alignment of the transcript
  • Additional analysis and annotation of the data, e.g. emotion recognition

To register for the event, first register and pay on the DH2019 website, then follow up on the dedicated Eventbrite page.


SSHOC's Involvement

This workshop addresses the data creation phase of the data lifecycle within the Social Sciences and Humanities, which is one of the major goals of the SSHOC project. This goal is tackled within WP4, where SSHOC partners, who are also lecturers at the workshop, are collecting, processing and analyzing audio data in the form of voice-recorded interviews with automatic speech recognition. They are also developing a survey module specifically adapted to integrate audio recordings and their processing into the traditional data collection process.


About the Group:

The Group is a multidisciplinary group of European scholars, tools and data professionals, spanning the fields of speech technology, social sciences, human-computer interaction, oral history and linguistics, interested in strengthening the position of interview data in Digital Humanities. Since 2016 they have organized a series of workshops on this topic, supported by CLARIN (see this website).

Their first concrete output was the development of the T-Chain, a tool that supports transcription and alignment of audio and text in multiple languages. Second, they developed a format for experimenting with a variety of annotation, text analysis and emotion recognition tools as they apply to interview data.


Organising Committee

  • Louise Corti, UK Data Archive, University of Essex
  • Christoph Draxler, University of Munich
  • Arjan van Hessen, University of Twente
  • Henk van den Heuvel, Radboud Universiteit Nijmegen
  • Stef Scagliola, University of Luxembourg



Draft programme

9.00 - 9.30

Introduction and short presentation on ‘Digital Humanities approaches to interview data - can historians, linguists and social scientists share tools?’

9.30 - 11.00

Preparing your audio data, uploading the audio to the portal and automatically recognizing the speech

Correcting the ASR results

Downloading the (corrected) results and improving the readability

11.00 - 11.20

Coffee/Tea break

11.20 - 12.00

Introducing linguistic analysis of text: free tools

12.00 - 12.30

Introducing and exploring emotion extraction

12.30 - 13.00



Video: DH in Practice - Retrieving Text from Spoken Data