The video recordings from the ParlaCLARIN Workshop: Creating and Using Parliamentary Corpora, Miyazaki 2018 have been uploaded to CLARIN Videolecture.net page.
Parliamentary data is a major source of socially relevant content. It is available in ever larger quantities, is multilingual, has rich metadata, and has the distinguishing characteristic that it is essentially a transcription of spoken language produced in controlled circumstances, which is now increasingly released also in audio and video formats. All those factors in combination require solutions related to its archiving, structuring, synchronization, visualization, querying and analysis. Furthermore, adequate approaches to its exploitation also have to take into account the need of researchers from vastly different Humanities and Social Sciences fields, such as political sciences, sociology, history, and psychology.
An inspiring CLARIN-PLUS cross-disciplinary workshop “Working with parliamentary records” that was held in Sofia, Bulgaria, in Spring 2017, and a comprehensive overview of a multitude of the existing parliamentary resources within the CLARIN infrastructure clearly indicated a need for better harmonization, interoperability and comparability of the resources and tools relevant for the study of parliamentary discussions and decisions, not only in Europe but worldwide.
This workshop aims to bring together researchers interested in compiling, annotating, structuring, linking and visualising parliamentary records that are suitable for research in a wide range of disciplines in the Humanities and Social Sciences. We invite unpublished original work focusing on the collection, analysis and processing of parliamentary records.
The workshop was a part of the 11th edition of the Language Resources and Evaluation Conference (LREC2018) held in Miyazaki, Japan on Monday, 7 May, 2018.
Welcome and introduction
Welcome and introduction | Franciska de Jong
CLARIN Corpora for Parliamentary Discourse Research | Darja Fišer
Keynote talk
Applying Multi-Perspective Approaches to the Analysis of Parliamentary Data | Cornelia Ilie
Session 1: Creating parliamentary corpora
SlovParl 2.0: The Collection of Slovene Parliamentary Debates from the Period of Secession | Tomaž Erjavec
Polish Parliamentary Corpus | Maciej Ogrodniczuk
ParlAT beta: Corpus of Austrian Parliamentary Records | Tanja Wissik
A Corpus of Grand National Assembly of Turkish Parliament's Transcripts | Onur Güngör
Session 2: Enriching parliamentary corpora
UKParl: A Semantified and Topically Organized Corpus of Political Speeches | Federico Nanni
Preserving Metadata from Parliamentary Debates | Alina Karakanta
Annotation of the Corpus of the Saeima with Multilingual Standards | Roberts Darģis
A Sentiment-labelled Corpus of Hansard Parliamentary Debate Speeches | Gavin Abercrombie
Session 3: Parliamentary data in computational social sciences 1
Automatically labeled data generation for classification of reputation defence strategies | Nona Naderi
Exploring the Political Agenda of the Greek Parliament Plenary Sessions | Maria Pontiki
Findings from the Hackathon on Understanding Euroscepticism Through the Lens of Textual Data | Federico Nanni
Session 4: Parliamentary data in computational social sciences 2
A Pilot Gender Study of the Danish Parliament Corpus | Lene Offersgaard
The Parliamentary Debates as a Resource for the Textometric Study of the French Political Discourse | Francesca Frontini
Using Data Packages to Ship Annotated Corpora of Parliamentary Protocols: The GermaParl R Package | Andreas Blätte