this post was submitted on 20 Nov 2023
1 points (100.0% liked)

Machine Learning

1 readers
1 users here now

Community Rules:

founded 11 months ago
MODERATORS
 

I am beginner in ml,how do I get myself updated with current state of OCR. If I want to get better results than Tesseract or EasyOCR ,what path should I follow.i basically want near 100% accuracy in identifying typed/digital characters and their location in image. Is this solved ?? Any guidance would be helpful πŸ™πŸ™

top 8 comments
sorted by: hot top controversial new old
[–] EarProfessional8356@alien.top 1 points 10 months ago

Look at Meta’s Nougat OCR, use the API, run a flask server and play with it

[–] zoontechnicon@alien.top 1 points 10 months ago (1 children)

"Guys I'm new to medicine, I'm gonna solve cancer, any guidance would be helpful."

[–] AbjectDrink3276@alien.top 1 points 10 months ago

πŸ˜‚πŸ˜‚

[–] BH_Kim@alien.top 1 points 10 months ago

If you're interested in a new task, it's good for you to start by exploring Papers with Code leaderboard. You can find recent research papers related to it.

https://paperswithcode.com/dataset/textocr

[–] LinuxSpinach@alien.top 1 points 10 months ago

As of about a year ago, I haven't seen anything that really outperforms Tesseract across multiple benchmarks. You can get near 100% accuracy if the image is clean and the font isn't anything weird. But if you have image noise, you need to lower your expectations.

[–] BusinessReplyMail1@alien.top 1 points 10 months ago

Azure Form Recognizer OCR is very good.

[–] Jackrecke@alien.top 1 points 10 months ago

hey there, as a beginner in ml, staying updated on OCR is key. to surpass Tesseract or EasyOCR, focus on deep learning models, like CNNs or transformers. achieving near 100% accuracy is tough, but pre-processing, data augmentation, and model fine-tuning can help get you there. it's not fully solved, but keep experimenting and learning. good luck! πŸ™

[–] sbrealxclusive@alien.top 1 points 10 months ago

have you checked out the latest research papers on OCR? following top conferences like CVPR, ICCV, and NeurIPS can help you stay updated. consider exploring deep learning models like Transformers for improved accuracy. good luck! πŸš€