GitHub
Tesseract training pages generator (based on PAGE XML and Cutouts output)
Snapshot: April 2026