Information package & Course catalogue

Palacký University Olomouc

Study programmes & Course catalogue

for academic year 2026/2027
Palacký University Olomouc

Česky

Search

Course: Corpus Linguistics

« Back

Course title	Corpus Linguistics
Course code	KBH/KORE
Organizational form of instruction	Seminar
Level of course	Bachelor
Year of study	not specified
Semester	Winter and summer
Number of ECTS credits	3
Language of instruction	Czech
Status of course	Compulsory-optional
Form of instruction	Face-to-face
Work placements	This is not an internship
Recommended optional programme components	None
Course availability	The course is available to visiting students

Lecturer(s)
Pořízka Petr, PhDr. Ph.D.
Course content
1. Basic concepts, literature and software, types of corpora 2. Methodology: data collection, corpus size, representativeness of data 3. Czech corpora of written and spoken language; other projects: electronic dictionaries, literary databases 4. Corpus tools and methods (KWIC, concordance, collocation, regular and Boolean expressions statistics, frequency distribution) 5. Linguistic annotation: lemmatization, morphological and syntactic tagging (main models) 6. Complex and structured data mining - query language CQL 7. Working with data in various linguistic corpus tools
Learning activities and teaching methods
Lecture, Dialogic Lecture (Discussion, Dialog, Brainstorming), Work with Text (with Book, Textbook), Methods of Written Work, Demonstration
Learning outcomes
The aim of the course is to acquaint participants with the basic concepts of corpus linguistics and prepare them for work with corpora, which in recent years become one of the fundamental tools for the scientific study of language. Introduction to Corpus Linguistics is divided into three parts: firstly participants will learn the basic concepts; secondly they will learn to deal with some important Czech language corpora. Within the third part, students will create their own small corpora for language data analysis. Knowledge in the basic concepts of corpus linguistics. The aim of the course is to acquaint participants with the basic concepts of corpus linguistics and prepare them for work with corpora, which in recent years become one of the fundamental tools for the scientific study of language.
Prerequisites
unspecified
Assessment methods and criteria
Written exam, Analysis of Activities ( Technical works), Seminar Work (1) Regular class attendance and active participation (includes completion of tasks assigned) (2) Realization of a class project
Recommended literature
Baker, P. - Hardie, A. - McEnery, T. A Glossary of Corpus Linguistics. Edinburgh 2006. Benko, V. a kol. (2019). Webové korpusy Aranea. Bratislava. Čermák - Klímová - Petkevič. Studie z korpusové lingvistiky. Praha 2000.. Čermák, F. - Blatná, R. (eds.). Jak využívat Český národní korpus. Praha 2005. Čermák, F. - Blatná, R. Korpusová lingvistika: Stav a modelové přístupy. Praha 2006.. Čermák, F. (2017). Korpus a korpusová lingvistika. Praha. Kol. Manuál práce s ČNK (wikidokumentace). Osolsobě, K. (2014). Česká morfologie a korpusy. Praha. Pořízka, P. (2014). Tvorba korpusů a vytěžování jazykových dat (metody, modely, nástroje). Olomouc.

Study plans that include the course

Faculty	Study plan (Version)	Category of Branch/Specialization	Recommended year of study	Recommended semester
Faculty: Faculty of Arts	Study plan (Version): Czech Philology for News Media Editors (2019)	Category: Philological sciences	-	Recommended year of study:-, Recommended semester: -
Faculty: Faculty of Arts	Study plan (Version): Czech Philology for News Media Editors (2025)	Category: Philological sciences	-	Recommended year of study:-, Recommended semester: -

Palacký University Olomouc, date of update: 19.06.2026 23:53. Data created for academic year 2026/2027