Releases: sbritorodr/PDFtoTXT
PDFtoTXT 3.0
Changes
- Added a progress bar for each element, using ProgressBar.
- Removed all unnecessary print output from the console.
- If a page is over 5,000 characters, the script skips it and keeps translating the other pages.
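The page-length rule in the last item can be sketched roughly as follows. This is an illustration only, not the project's actual code: `translate_pages`, `MAX_CHARS`, and the `translate` callable are hypothetical names, and the real script may handle skipped pages differently.

```python
from typing import Callable, List, Optional

MAX_CHARS = 5000  # assumed limit, taken from the release note above


def translate_pages(
    pages: List[str],
    translate: Callable[[str], str],
    max_chars: int = MAX_CHARS,
) -> List[Optional[str]]:
    """Translate each page, skipping pages longer than max_chars.

    Skipped pages are stored as None so page positions stay aligned.
    """
    results: List[Optional[str]] = []
    for text in pages:
        if len(text) > max_chars:
            # Page is too long: ignore it and move on to the next page.
            results.append(None)
            continue
        results.append(translate(text))
    return results
```

Any function that takes a string and returns a string can be passed as `translate`, so the skipping logic can be checked without calling a real translation service.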
PDFtoTXT 2.5
Update README.MD (#9)
- Update README.md
- Update README.md
- Rename main.py to PDFtoTXT.py
- Update README.md
PDFtoTXT 2.0
PDFtoTXT + translate to Spanish
Tesseract OCR and the proper trained data must be installed for the script to work.
This script transforms a PDF file to TXT. It supports multiple pages as well as scanned pages.
If the user requests it, the script also translates the file to Spanish.
The script works on Python 3.10.
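Before running the script, it may help to confirm the Tesseract requirement mentioned above. The sketch below is not part of PDFtoTXT; `installed_tesseract_languages` is a hypothetical helper that assumes the `tesseract` executable is on the PATH and relies on its `--list-langs` option.

```python
import shutil
import subprocess
from typing import List


def installed_tesseract_languages() -> List[str]:
    """Return the language codes Tesseract reports, or [] if it is not found."""
    if shutil.which("tesseract") is None:
        return []
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Depending on the Tesseract version, the list goes to stdout or stderr.
    lines = (result.stdout + result.stderr).splitlines()
    # Drop blank lines and the 'List of available languages ...' header.
    return [
        line.strip()
        for line in lines
        if line.strip() and not line.startswith("List of")
    ]
```

An empty list suggests Tesseract is not installed or not on the PATH. If the code for the language of the scanned document (for example `eng`) is missing, the matching trained data probably still needs to be installed.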
Before translation module
This version is a fallback in case the translate function doesn't work.