Course: Basics of Corpus Linguistics

» List of faculties » FIF » KOL
Course title Basics of Corpus Linguistics
Course code KOL/VKORP
Organizational form of instruction Seminar
Level of course Master
Year of study not specified
Semester Winter and summer
Number of ECTS credits 4
Language of instruction Czech
Status of course Compulsory-optional
Form of instruction Face-to-face
Work placements This is not an internship
Recommended optional programme components None
Lecturer(s)
  • Čech Radek, Mgr. Ph.D.
  • Vrabeľ Ondřej, Mgr.
  • Pavlas Dalibor, Mgr.
Course content
(1) Corpus linguistics. History and present. Intuition vs. corpus analysis. (2) Language corpus - basic characteristics (types, representativeness, extent, annotations). (3) Basic statistical methods. (4) Czech National Corpus. (5) Prague Dependency Treebank. Its properties and characteristics. Tools for searching. (6) Searching by means of the so-called regular expressions, morphological texts and lemmata. (7) Collocations, filters, sources, attributes. (8) Sorting, creation of sub-corpora (9) Other tools for text analyses: AntConc, WordSmith, Collocate. (10) Practical exercises - orthography, morphology (11) Practical exercises - morphology (12) Practical exercises - syntax (13) Practical exercises - lexicology

Learning activities and teaching methods
Monologic Lecture(Interpretation, Training), Dialogic Lecture (Discussion, Dialog, Brainstorming), Work with Text (with Book, Textbook)
Learning outcomes
Students will be introduced to basic characteristics of corpus linguistics. They will acquire familiarity with work with the Czech National Corpus so as to be able to independently analyse any given linguistic problem with respect to possibility of use of corpus data.
Orientation in the field of corpus linguistics Ability to work with the Czech National Corpus Familiarity with work with software tools dedicated to analysing of texts (e.g. AntConc) Ability to analyse selected linguistics problems by means of language corpora
Prerequisites
Passive knowledge of English on the level necessary for reading of scholarly texts.

Assessment methods and criteria
Student performance, Seminar Work

(1) Continuous preparation for class based on relevant topics (2) Active participation in class (focused mostly on acquisition of practical skills of work with the Czech National Corpus) (3) Test (testing 1) theoretical knowledge 2) ability to process assigned linguistic problems using the Czech National Corpus
Recommended literature
  • Cvrček, V. - Kovaříková, D. (2010). Možnosti a meze korpusové lingvistiky. Naše řeč, 94, s. 113-133..
  • Čermák, F. - Blatná, R. (eds.). (2006). Korpusová lingvistika: Stav a modelové přístupy. Praha.
  • McEnery, Hardie, A. (2011). Corpus Linguistics: Method, Theory and Practice. Cambridge.
  • O'Keeffe, A. - McCarthy, M. (2010). The Routledge Handbook of Corpus Linguistics. London & New York.


Study plans that include the course
Faculty Study plan (Version) Category of Branch/Specialization Recommended year of study Recommended semester
Faculty: Faculty of Arts Study plan (Version): General Lingvistics and Theory of Communication (2014) Category: Philological sciences - Recommended year of study:-, Recommended semester: -