-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Description
It would be very helpful to have an option to exclude a list of pages from processing in Docling.
For example, sometimes PDFs contain cover pages, table of contents, or appendices that should not be parsed.
Suggested way
A simple parameter (e.g., exclude_pages=[1, 2, 10]
) could be passed when processing a document.
This would allow users to skip unnecessary pages and improve performance.
Benefit
- Cleaner output by ignoring irrelevant sections
- Faster processing by reducing unnecessary parsing
- More flexibility for different use cases
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request