How to read a pdf in Indian Languages #1840
Replies: 2 comments 1 reply
This has to do with the font used, not with the language of the text. If you like, please share the PDF and I will take a look.
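One rough way to confirm that the font is the cause is to check which Unicode block the extracted characters fall into. The sketch below is an illustration only: `kannada_ratio` is a hypothetical helper, not part of PyMuPDF, and it assumes that correctly mapped Kannada text lands in the Unicode Kannada block (U+0C80 to U+0CFF). If the ratio is near zero for a page that visibly shows Kannada, the font is probably mapping its glyphs to Latin code points, which is typical of legacy non-Unicode fonts.

```python
def kannada_ratio(text: str) -> float:
    """Return the share of non-whitespace characters in the Kannada Unicode block."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    # The Kannada block spans U+0C80 through U+0CFF.
    kannada = sum(1 for c in chars if "\u0c80" <= c <= "\u0cff")
    return kannada / len(chars)


# A fragment of the extracted text reported in this thread (Latin code points only).
extracted = "£ÀªÀÄä£ÀÄß ºÉZÀÄÑ"
# Genuine Unicode Kannada for comparison (the word "Kannada").
unicode_sample = "\u0c95\u0ca8\u0ccd\u0ca8\u0ca1"

print(kannada_ratio(extracted))
print(kannada_ratio(unicode_sample))
```

In practice the string returned by `page.get_text("text")` would be passed to the helper in place of the hard-coded samples.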
8th-language-kannada-1-without-watermark.pdf Hi Jorj,
Hi,
I am trying to read a Kannada PDF book.
# Sample code
import fitz  # PyMuPDF

pdf = fitz.open(text_book)  # text_book: path to the PDF file
page = pdf.load_page(24)  # page index (0-based)
print(page.get_text("text"))
When I extract the text, the output is garbled:
“£ÀªÀÄä£ÀÄß ºÉZÀÄÑ ºÉZÀÄÑ zÀÄr¹PÉÆ¼ÀÄîwÛÃj. PÀrªÉÄ
DºÁgÀ ¤ÃqÀÄwÛÃj. £ÀªÀÄä ¸ÉÃªÉ ¨ÉÃPÀÄ; £ÁªÀÅ ªÀiÁvÀæ
¨ÉÃqÀ C®èªÉÃ? £ÀªÀÄä ¸ÉêÉAiÀÄ£ÀÄß ªÀÄgÉAiÀÄ¢j.”
“zÀAiÉĬÄgÀ° ¸ÀPÀ® ¥ÁætÂUÀ¼À°èè.”
Is there any way to set the language or font while extracting text?