Anyone know any tool to transcribe text from PDF files?

Lemmy_Mouse · 2 years ago

Anyone know any tool to transcribe text from PDF files?

loathsome dongeater · 2 years ago

If you want ocr you can use tesseract-ocr. If you want to extract actual text from a pdf then you can use something like pdf2text from poppler tools but you will have to fix the formatting a lot.

Makan · 2 years ago

Anything with good formatting is fine in my book.

Or at least one that gets the words right.

Lemmy_Mouse · 2 years ago

Gotcha thanks I’m going to look into this