Comment

Comment on Why is OCR for handwritten content still that bad?

That’s perfect. Now I’m just wondering why chatGPT is apparently much better in OCR than a dedicated OCR model like EasyOCR or Tesseract.

Btw, Deepseek did a good job but not perfect. I also fed chatGPT a full page of notes and the transcription to markdown worked quite well, although not perfect. However, if I supply the same note as part of a larger pdf, it will refuse to transcribe it, stating that it’s unreadable.

source

Sort:hotnew top

thefactremains@lemmy.world ⁨1⁩ ⁨year⁩ ago
Because it can fill in gaps where the recognition fails.

source
- executivechimp@discuss.tchncs.de ⁨1⁩ ⁨year⁩ ago
  Which can be problematic. If it makes a mistake and isn’t obviously wrong, that could go unnoticed.
  
  source
  - thefactremains@lemmy.world ⁨1⁩ ⁨year⁩ ago
    100% agreed. But it doesn’t change the answer of why they are apparently better than OCR.
    
    source
    executivechimp@discuss.tchncs.de ⁨1⁩ ⁨year⁩ ago
    Yep
    
    source
homesweethomeMrL@lemmy.world ⁨1⁩ ⁨year⁩ ago
If I had to guess, I’d say it was the dot paper confusing the OCR reader. I suppose the LLM has some way to cancel out the dots and thereby gets a better scan of it.

source
cyrano@lemmy.dbzer0.com ⁨1⁩ ⁨year⁩ ago
Try gemini 2 it seems is pretty good at that as well

source