tesseract-ocr
? You can download it via apt or something similar.
Self-Hosted Main
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
For Example
- Service: Dropbox - Alternative: Nextcloud
- Service: Google Reader - Alternative: Tiny Tiny RSS
- Service: Blogger - Alternative: WordPress
We welcome posts that include suggestions for good self-hosted alternatives to popular online services, how they are better, or how they give back control of your data. Also include hints and tips for less technical readers.
Useful Lists
- Awesome-Selfhosted List of Software
- Awesome-Sysadmin List of Software
paperless-ngx has built in ocr but I don't think it would fit your needs
I will check it up
Windows 11 has this built in if you take a screenshot
Didn't know that,i use flameshot for screenshots,i will take a look thnx
You could spin up paperless-ngx. Or use pdf24 creator. Beware paperless consume will delete the file.
I used paperless-ngx before and it works pretty good.
I will check it up, i have Stirlingpdf and I see it also has ocr support
I'm not sure I understand you correctly. Do you want to apply OCR to PDFs or to Screenshots?
For PDFs there's the excellent ocrmypdf which paperless-ngx uses under the hood.
Nextcloud AIO (all-in-one) comes with full text search installed, which brings tesseract to nextcloud. so you can let tesseract-ocr run over all documents and then they will be searchable with Elasticsearch.