Paulien Lemay obtained a bachelor's degree in Ancient Greek Linguistics in 2013 and a master’s degree in Multilingual Professional Communication in 2014. She subsequently worked as a consultant in research-driven software development across a variety of R&D projects, ranging from blockchain applications to natural language processing.
In 2025, she began her PhD research specializing in Ancient and Byzantine Greek. Her work applies NLP techniques at scale to extensive textual corpora to investigate orthographic and semantic patterns, their historical development, and their relation to social identity and interpersonal dynamics in Greek texts. This research is conducted within the context of the ANNOPHIS project, which provides a multilingual annotation platform for historical texts. Users can upload their own research datasets for annotation and also generate annotations partially automatically through integration with pretrained models that she helps supply. In parallel, she contributes as a software developer to the database of Byzantine book epigrams in collaboration with GhentCDH.