Releases: OCR-D/ocrd_tesserocr
Releases · OCR-D/ocrd_tesserocr
v0.20.1
v0.20.0
Changed:
- adapt to (and require)
ocrd>=3.0
– allows running- with pages in parallel (
OCRD_MAX_PARALLEL_PAGES
) in tandem with METS Server - with page timeout (
OCRD_PROCESSING_PAGE_TIMEOUT
) - with page failure fallback copycat (
OCRD_MISSING_OUTPUT=COPY
), new default isSKIP
instead ofABORT
(now via--debug
) - with page completion re-runs (
OCRD_EXISTING_OUTPUT=SKIP
), which is the new default instead ofABORT
(now via--overwrite
)
- with pages in parallel (
- switched to pyproject.toml build, tracking version via
ocrd-tool.json
Added:
- more test coverage (esp. modes w/o METS Server, METS caching, instance-caching, page-parallel)
- Docker image includes preconfigured
ocrd-all-tool.json
for these processors
Fixed:
- no more logging side effects between tests
v0.19.1
v0.19.0
v0.18.0
v0.17.0
Fixed:
- segment/recognize: fix
shrink_polygons
- segment/recognize: fix reinit scope (for
xpath_model
andauto_model
) - CI: test multiple Python versions independent of ocrd/core image
- CI: speed up build for EOL Python 3.6
- CI: chmod o+w tessdata directory of PPA/OS Tesseract
- deps-ubuntu: allow installation of PPA Tesseract to fail (for newer OS)
Changed:
- adapted to Shapely v2
- *: inherit from recognize (but override logger)
- segment*: delegate constructor instead of wrapping instance
- requires ocrd==2.48
v0.16.0
v0.15.0
Added:
- binarize:
dpi
numerical parameter to specify pixel density, #186 - binarize:
tiseg
boolean parameter to specify whether to calltessapi.AnalyseLayout
for text-image separation, #186
Changed:
- regonize: improved polygon handling, #186
- resources: proper support for
moduledir
, companion to OCR-D/core#904, #187