SSHOC Workshop: Using Corpora for Implementing Validation. Workflows that combine quantity and quality

Date:

30 September 2019 - 07:45 to 14:00

Location:

Leipzig, Germany

Read the associated blog post

Description & aims of the workshop

The formula to validate, validate, validate“ findings from quantitative corpus analysis (Grimmer/Stewart 2013) has become a staple in discussion on the potentials and perils of quantitative approaches to text and is also central to the SSHOC project (Social Sciences & Humanities Open Cloud). However, technically, restrictions to implement validation remain quite high. Usually, dedicated resources for setting up and maintaining a server-based environment with a graphical user interface are still required. Lowering the costs of integrating quantitative and qualitative steps in workflows using corpora in social science research designs is a major objective of Research Infrastructures such as CLARIN, and of the polmineR package, an open source R package available at the Comprehensive R Archive Network / CRAN.

In this workshop, we introduce the polmineR package and explore three basic scenarios using it:

Validating the results obtained from dictionary-based sentiment analysis and classification,
Validating the results of LDA topic modelling,
Giving substantial meaning to the results of cooccurrence analyses.

We will discuss whether to potentially combine the scenarios with semi-supervised learning, and how to leverage of machine learning (MI) approaches. As the dataset and tool combination, we will use the polmineR R package in combination with a multilingual corpus of the UN General Assembly.

Lecturers

Prof. Dr. Andreas Blätte and Christoph Leonhardt

Audience

The workshop is intended for political and social scientists who are interested in using large text collections in their research. No programming skills are needed but a general familiarity with basic statistical operations on texts will be helpful. Please bring your own laptop for the hands-on session.

SSHOC involvement

This workshop addresses the challenges that specific user communities experience when contributing to SSHOC, the availability of procedures, tools and services to address these challenges, and the extent that these procedures, tools and services are sufficiently applicable for specific user communities, which is one of the major goals of the SSHOC project, tackled within WP9.

Practical details

Registration link: https://forms.gle/rZsoTWP9RoUeUiEt6. The number of participants of the workshop is limited to 25. We kindly ask you to register as soon as possible. The registration will be closed when the limit is reached.
Workshop attendance is free of charge. Refreshments (coffee breaks and lunch) are provided by the organisers.
This workshop is co-located with CLARIN Annual Conference. Workshop participants who are interested in attending the CLARIN Annual Conference should contact conference organisers (via events@clarin.eu) to check whether this would be feasible.

Address

University of Leipzig
The Paulinum – Assembly Hall and University Church of St. Paul, Augustusplatz 10
Leipzig
Germany

SSHOC Workshop: Using Corpora for Implementing Validation. Workflows that combine quantity and quality

Description & aims of the workshop

Lecturers

Audience

SSHOC involvement

Practical details

Address

News

Science Clusters Position statement on operational commitment to EOSC and Open Research

SSHOC, the SSH Open Science Cluster has a New Chair and Vice-Chair in 2024

OSCARS project funded to foster the uptake of Open Science in Europe

Strengthening Cross-Cluster Collaboration: Highlights from the 2nd SSH Open Cluster Assembly

SSHOC Continues to Build Stronger European SSH Community: Highlights from the 1st SSH Open Cluster Assembly