this post was submitted on 29 Apr 2025
10 points (100.0% liked)

Free and Open Source Software

20508 readers
21 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 3 years ago
MODERATORS
 

I found Adobe acrobat was good at ocr but I’m on Linux now.

top 3 comments
sorted by: hot top controversial new old
[–] Toes@ani.social 4 points 5 months ago (1 children)

I'm not super familiar with the subject but it'll probably be something based on Tesseract.

Maybe try gImageReader.

[–] e0qdk@reddthat.com 3 points 5 months ago* (last edited 5 months ago)

You can use tesseract -l jpn input.png - on the command line to have it print out the text from input.png into the console if you've got the language files for Japanese installed. (There's also language files for vertical text and a few others for script in my package manager.) Alternatively give the filename (w/o extension) instead of - to write the output into a .txt file.

On Mint, I think I did sudo apt install tesseract-ocr tesseract-ocr-jpn to get it working for the simple case of horizontal text; been a while though.

[–] Successful_Try543@feddit.org 4 points 5 months ago* (last edited 5 months ago)

Tesseract along with the desired language pack should do the OCR part and as a GUI, you can e.g. use lios or others.