Experimental computational framework for semantic exploration of the Voynich Manuscript using transformer embeddings, medieval corpora, semantic clustering and digital humanities methodologies.
-
Updated
May 29, 2026 - HTML
Experimental computational framework for semantic exploration of the Voynich Manuscript using transformer embeddings, medieval corpora, semantic clustering and digital humanities methodologies.
This repository contains the data and code developed for my Master’s thesis “The Model of Unseen Species Applied to Chivalric Literature in Iberian Languages.” The study applies quantitative methods — particularly the unseen-species model — to analyze patterns of transmission, preservation, and loss in Iberian chivalric literature.
A curated collection of open datasets and ontologies from the TALOS AI4SSH project (University of Crete), supporting research in Digital Humanities, Computational Philology, Greek NLP, Semantic Web, and cultural heritage data.
Post-OCR normalization pipeline for historical French corpora from Gallica
Add a description, image, and links to the computational-philology topic page so that developers can more easily learn about it.
To associate your repository with the computational-philology topic, visit your repo's landing page and select "manage topics."