Category: corpus linguistics
How we collect, manage and process our research material for CORECON
The research carried out within CORECON is based on a large database of hundreds of news articles and social media entries in Romanian, Polish and English. Together these constitute a corpus (plural: corpora): a large set of linguistic data (e.g., collected from news outlets and social media) that can be processed with the use…