Measuring orthographic and semantic similarity between Byzantine Greek epigrams

Start - End

2021 - 2025 (ongoing)

Type

PhD research

URL

https://lt3.ugent.be/projects/measuring-orthographic-and-semantic-similarity-bet…

Department(s)

Department of Translation, Interpreting and Communication

Research group(s)

LT3 - Language and Translation Technology Team

Research Focus

Language technology

Linguistics

Tabgroup

Abstract

The overall goal of this project is to detect and link similar hemistichs (half verses), verses and epigrams, which will result in a more dynamic system to connect related epigrams in the Database of Byzantine Book Epigrams (DBBE). To achieve this aim, insights and approaches from two active research lines within natural language processing (NLP) will be investigated, viz. automatic linguistic processing of text and machine learning approaches to measure orthographic and semantic similarity between text strings. In addition, a pilot study will be performed that extends the intra-lingual search for similar epigrams in the medieval Greek DBBE to an inter-lingual search for related epigrams in other languages, and more specifically Latin. To this end, the thriving new NLP research line of cross-lingual embeddings will be investigated.

People

Supervisor(s)

Els Lefever

Department of Translation, Interpreting and Communication

Co-supervisor(s)

Ilse De Vos

Phd Student(s)

Colin Swaelens

Department of Linguistics

Publications

Creating, enriching and valorizing treebanks of Ancient Greek(2019)
- Alek Keersmaekers
- Wouter Mercelis
- Colin Swaelens
- Toon Van Hal
Database of Byzantine Book Epigrams(2023)
- Kristoffel Demoen
- Gilbert Bentein
- Klaas Bentein
- Floris Bernard
- Julián Bértola
- Julie Boeten
- Mathijs Clement
- Cristina Cocola
- Eline Daveloose
- Sien De Groot
- Pieterjan De Potter
- Ilse De Vos
- Krystina Kubina
- Hanne Lauwers
- Paulien Lemay
- Renaat Meesters
- Delphine Nachtergaele
- Marthe Nemegeer
- Joachim Nielandt
- Mace Ojala
- Lisa-Lou Péchillon
- Raf Praet
- Rachele Ricceri
- Anne-Sophie Rouckhout
- Jeroen Schepens
- Febe Schollaert
- Lev Shadrin
- Nina Sietis
- Dimitrios Skrekas
- Colin Swaelens
- Maria Tomadaki
- Sarah-Helena Van den Brande
- Merel Van Nieuwerburgh
- Lotte Van Olmen
- Noor Vanhoe
- Nina Vanhoutte
Lemmatisation of Medieval Greek : against the limits of transformer’s capabilities?(2024)
- Colin Swaelens
- Pranaydeep Singh
- Ilse De Vos
- Els Lefever
Linguistic Annotation of Byzantine Book Epigrams(2023)
- Colin Swaelens
- Ilse De Vos
- Els Lefever