-
UTwente
- Enschede
- www.caromein.nl
Popular repositories Loading
-
PageXML_regionname_normalisation
PageXML_regionname_normalisation PublicPageXML regions may be labelled differently over time or because collections are merged. This script helps to to create an overview and relabel where necessary/desired.
Python 2
-
PageXML_Entity_Recognition_Resolutions_Overijssel
PageXML_Entity_Recognition_Resolutions_Overijssel PublicRecogizing dates and people in the resolutions of Overijssel, using the models from the Republic project.
Python 1
-
PageXML_RegionandLineRecalculation
PageXML_RegionandLineRecalculation PublicForked from cconzen/ReadingOrderRecalculation
Post-process PageXMLs to improve their region reading order, including updating the internal line order
Python 1
-
PageXML_RegionCalculator
PageXML_RegionCalculator PublicA basic tool to calculate how many regions the PageXML that is being checked contain and which labels have been applied (and how many regions are there without any label)
Python 1
-
PageXML_indexing_tool
PageXML_indexing_tool PublicA Python tool for extracting and indexing tagged entities from PageXML files. The tool automatically discovers all tags in PageXML collections, groups them by category, and generates an interactive…
Python 1
-
PageXML_Empty_Page_Finder
PageXML_Empty_Page_Finder PublicNot all pages contain text. Empty sheets in books/volumes do not require transcriptions. However, sometimes an (upload) error may have occurred and one could want to check whether there might be em…
Python 1
If the problem persists, check the GitHub status page or contact support.