Replies: 1 comment 3 replies
-
Additionally, the docs appear to conflict with regards to skipping ocr. Here is a section called Optimize images without performing OCR, which recommends In the Advanced section, however, it says no image processing takes place when
Is one of those incorrect, or is there extra context I am missing? |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm looking at the Don’t actually OCR my PDF section of the docs and trying that, but I'm still getting ocr'd results. (I'm checking by opening pdf in chrome, no text selection before and text selection after. I can also tell it's doing ocr because it takes a while on multipage documents)
Do I misunderstand the api?
The command:
ocrmypdf --tesseract_timeout=0 --output-type=pdf in.pdf out.pdf
Version: 16.0.3
Platform: macos 14.3.1
Beta Was this translation helpful? Give feedback.
All reactions