Whereas computers can reliably process text documents in terms of their factual content (e.g. for classifying them into topics, searching or extracting information from them), they have difficulty interpreting opinions, ...read more
The project deals with an important mechanism in Chinese writing, the use of “phonetic loan characters”, as exemplified in the Dunhuang manuscript corpus and with an emphasis of non-canonical material ...read more
Stylometry, a subfield of Digital Humanities, offers new methods for segregating different writing styles. So far, stylometry has been especially popular in authorship attribution studies. This project will approach issues ...read more
In the beginning of 2014, a new collaborative project on the analysis of the syntax of Medieval Chinese was initiated in the framework of the Ghent Centre for Buddhist Studies ...read more
The main aim of this project is to develop a new and original prototype tool that supports cross-disciplinary (CD) collaboration. The prototype focuses on problems related to language and methodology. ...read more
The Corpus of Historical Low German the first treebank of historical Low German (Old Saxon/OS and Middle Low German/MLG). Particularly the syntax of MLG still forms an urgent research desideratum (cf. already Peters ...read more
The SCATE project primarily aims at the improvement of translators' efficiency. Current commercial translation tools do not meet the productivity requirements imposed by the globalisation of business activities and the ...read more
The ARCHIVE project aims to discover the differences in language use and expression of subjectivity in corporate reporting between varieties of English such as British and American English and ...read more
In the AMiCA project, we identify possibly threatening situations on social networks by means of text and image analysis. Our goal includes ensuring the online safety of children. There are three critical ...read more
The MuST project aims to extract all domain-specific terms from a multilingual technical corpus as well as the semantic relationships between these terms. The automatic detection of synonymy and hyponymy ...read more