General description
In this module, you will learn how to analyse linguistic data with R, Rstudio, and the tidyverse. We will cover descriptive and inferential statistics as well as linear modelling. Each lesson consists of a theoretical part followed by hands-on exercises in R.
As outcome of the course, you will be able to:
- visualise and summarise your data
- analyse continuous data with a linear model which includes assumption checks, data transformation, contrast coding, interpretation of the model outcome (including interactions)
- carry out inferential statistics (hypothesis formulating and testing)
- communicate your results
- conduct open and reproducible research
Topics:
- descriptive statistics, plotting of data
- introduction to linear modelling with one categorical or continuous predictor (this includes data transformation, contrast coding)
- multiple linear models (two predictors) and interactions (this includes data transformation)
- inferential statistics and communicating results
- open research
As from 2024 onwards, this module can be followed together with module 1 Introduction to R in a complete Novice track for this Summer School.
Target audience and course prerequisites
The course is aimed at linguistic researchers who have a background in statistics or who wish to refresh their basic skills and knowledge. Participants should have the R skills as taught in module 1 Introduction to R of this Summer School.
Course materials
All course materials will be provided online and include slides, exercises and solutions, R markdown files, datasets and further reading.
Teacher bio
Vinicius Macuch Silva is a post-doctoral researcher working in the Visual Communication (ViCom) Priority Programme at the Goethe University Frankfurt, Germany. He is interested in how people use language to create meaning in communication, both when interacting (face-to-face) with one another and when producing and interpreting language in various other settings. In his research Vinicius primarily uses quantitative empirical methods, including controlled experimentation as well as computational and corpus analysis, to investigate questions related to pragmatic inferencing, strategic communication, argumentation and stance-taking, as well as expressive and affective meaning.
Schedule
- Wednesday 16/07/2025, 14:00-15:30 & 16:00-17:30
- Thursday 17/07/2025, 9:00-10:30 & 11:00-12:30 & 14:00-15:30 & 16:00-17:30
- Friday 18/07/2025, 9:00-10:30 & 11:00-12:30 & 14:00-15:30 & 16:00-17:30