
Topic modeling is an umbrella term used to denote a host of semi-automated or fully automated corpus linguistic methods that aim to map the content of texts by identifying dominant themes. When driven by an algorithm that is trained in either a supervised or unsupervised manner on a dataset, the method makes it possible to…

Data collection is the base for any empirical study and corpus analysis is a well-established method used within critical discourse studies. Jędrzej Olejniczak in his blogpost wrote about data collection, management and processing in CORECON. Here I explain the motivations behind methodological choices and our rationales in data collection and data annotation protocols used when…

During the FORTHEM meeting at the University of Opole in November 2024, several academics were invited to share good practices and experiences of international cooperation with Ukrainian scholars participating in the project “Supporting cooperation between the University of Opole and Ukrainian universities within the framework of the FORTHEM Alliance” of the NAWA organization. (Details of…

The fifth biennial conference of the Brussels Institute for Journalism Studies (BIJU) and the Department of Applied Linguistics of Vrije Universiteit Brussel (VUB) in Belgium on 12-13 December 2024 brought together over 60 researchers from Europe, America, Asia and Africa, to discuss the topics of “Look Who’s Talking: Voices and Sources in the News.” (Details of the…

On 24 October 2024, we were invited to take part in the Alternative Education Week at Onisifor Ghibu Theoretical High School in Sibiu. The Alternative Education Week is a yearly nationwide initiative designed to help students develop their creativity and their socio-emotional abilities. The workshops were conducted with two groups of 9th-graders. By solving media…

The research carried out within CORECON is based on a large database that consists of hundreds of news articles and social media entries in Romanian, Polish and English. These constitute a corpus (plural: corpora): a large set of linguistic data (e.g., collected from news outlets and social media) that can be processed with the use…