this post was submitted on 25 Nov 2023
1 points (100.0% liked)
Side Project
29 readers
1 users here now
A community for sharing and receiving constructive feedback on side projects.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
How to you handle PDFs that include images with text? Any OCR advice?
You can use Tesseract via the pytesseract python library.