Here is a comprehensive list of code, data, and models created during the Ajax Multi-Commentary project (AjMC), a four-year research project funded by the Swiss National Science Foundation (grant PZ00P1_186033). The project team included M. Romanello (principal investigator), C. Pletcher (research software engineer), and S. Najem-Meyer (PhD student).
- ajmc-kodon: Minimal computing implementation of the Ajax multi-commentary platform based on Kōdōn
- Kōdōn: Minimal-computing JavaScript library built upon Svelte to publish classical commentaries
- ajmc-elixir: Elixir implementation of the Ajax multi-commentary platform (discontinued)
- ajmc-pipeline: Pipeline to process digitised classical commentaries, implemented in Python
- ajmc_annotation_utils: Small Python package for dealing with documents annotated in INCEpTION
- inception-recommender: An external recommender for INCEpTION with support for Classics NER
TBA
- ajmc-public-commentaries: Image data of public domain commentaries
- OCR_artificial_data_sample: Synthetic dataset for multi-script text line recognition (sample)
- ReadableAjax: Basic HTML rendering of critical editions of Sophocles' Ajax encoded in TEI/XML
- ajmc-tei: TEI/XML exports of public domain commentaries from the Ajax Multi-Commentary project
- OCR groundtruth: Ground-truth data for OCR of digitised classical commentaries
- PLA groundtruth: Ground-truth data for PLA of digitised classical commentaries
- NE-annotated corpus: Annotated corpus to support the tasks of named entity recognition, entity linking, and citation mining on classical commentaries
- Lemma Linkage corpus (coming soon!): Annotated corpus to support the recognition and linking of commentary lemmas.
- OCR (optical character recognition)
  - OCR kraken models: Pre-trained OCR Kraken models for historical classical commentaries
  - OCR transformer model
- PLA (page layout analysis)
- Language Models
  - ...
- Task-specific models
  - NER
  - Lemma linkage (recognition)