Lecturer(s)
|
-
Matlach Vladimír, Mgr. Ph.D.
-
Špička Jiří, prof. Mgr. Ph.D.
|
Course content
|
(1) Introduction: digital humanities, data, big data and data-mining Sources of data from Twitter, Facebook, and Instagram, to street turnout, digital records of meetings, ebooks, descriptions of cultures, and how to get something out of them. (2) Basic data-mining relationships Extracting information from relationships between people, products and virtually anything that is (un)reported. Basic graph theory, practical tools and applications. (3) Introduction to the R language for data-mining and its alternatives. The basics of working with the R language from data retrieval to data processing. (4) Advanced data-mining relationships. Finding relationships based on similar properties of objects, categorizing them, interpreting them. From multivariate analysis (MDS, PCA, SVD), to clustering, to neural networks. (5) Text analysis and processing Specifics of text data mining from bag-of-words model to purely quantitative descriptions. (6) Automatic decision making and mining Fundamentals of machine learning from data and practical applications.
|
Learning activities and teaching methods
|
Lecture
|
Learning outcomes
|
(1) The role of artificial neural networks and artificial intelligence in data mining and processing - computer vision - language and text processing - the question of human creativity (2) Working with metadata and meta-information - distinction from data and information - how to process and think about them (3) The role of text corpora in analysing human behaviour and cultures - how to gain cultural insights from texts (4) Human behaviour as sequence and text - Sequence analysis, edit distances, hidden Markovian processes, etc. - applications (5) Practical examples - Application of all acquired knowledge to selected problems (epidemiology, cash flow analysis, etc.)
|
Prerequisites
|
The lecture is just for PhD students.
|
Assessment methods and criteria
|
Oral exam
(1) 100% attendance (2) Completion of a seminar paper
|
Recommended literature
|
-
Hajičová, Panevová, Sgall. (2003). Úvod do teoretické a počítačové lingvistiky. Praha.
-
Sells, P. (1985). Lectures on Contemporary Syntactic Theories. Stanford.
-
Stockwell, R. M. (1977). Fundations of Syntactic Theory. New Persey.
|