-
-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Show progress during postprocessing #1313
Comments
That's when we ask Ghostscript to do PDF/A. Unfortunately, it doesn't give much feedback, so there's not much I can work with it. At least I'm not aware of any behavior I can monitor. It's also single threaded. Color space conversion of large images can be quite expensive in Ghostscript and is often responsible for long delays. |
But, in the above case, I used In the above case, I guess that most of the time was consumed for doing the equivalent of the following (obtained by running with -v1 on a different file):
If I run this:
I get:
So, you can probably monitor the number of pages processed, which you can use to show the progress. |
Might take time for big files. Pdf.open() potentially is expensive as well, but QPDF doesn't give us progress feedback for that. Closes Show progress during postprocessing #1313
Thanks for "OCR" progress bar issue report - fixed. After "Total file size..." nothing is happening except copying the finished file from temporary storage to its final output location. Unless you're dealing with very large PDFs (GBs), this suggests network issues or file system contention. How long is "too much time?" |
Probably also cleaning up all the temp files generated (for e.g., the images) When ocrmypdf is at this step, I can see the output file in the target directory (with the correct filesize, which means that it is likely not just a placeholder). So, I think that cleaning the temp files is actually what is taking the time.
Maybe 2-3 minutes. It is not too much when compared to the total time taken. But, it feels too much when you don't know what is happening and how long it is going to last. So, adding a progress here also would be nice. |
For large files, postprocessing takes a lot of time. Showing some progress here would make the UX better.
The main motivation behind this request was that ocrmypdf is stuck on this step (postprocessing) for about 30 min.
And now, it is stuck on this step:
The text was updated successfully, but these errors were encountered: