Debian Accessibility Project
Debian Accessibility Optical Character Recognition (OCR)

This metapackage will install packages which are useful for Optical Character Recognition (OCR).

Description


If you discover a project that looks like a good candidate for Debian Accessibility, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Accessibility mailing list.


Debian Accessibility Optical Character Recognition (OCR) packages

Official Debian packages with high relevance

ebook-speaker
E-book reader that reads aloud with a synthetic voice
Versions of package ebook-speaker
Release   Version          Architectures
sid       6.2.0-7          amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
forky     6.2.0-7          amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
trixie    6.2.0-6          amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
bookworm  6.2.0-4+deb12u1  amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
bullseye  5.5.2-1          amd64, arm64, armhf, i386
Debtags of package ebook-speaker:
accessibility: speech
interface: commandline
role: program
scope: utility
sound: player
works-with: file
works-with-format: epub
Popcon: 19 users (18 upd.)*
License: DFSG free

This package provides a command-line e-reader that reads electronic text aloud using speech synthesis. The program has a simple user interface suitable for Braille terminals.

The following formats are currently supported (some formats require additional packages, as suggested by this package):

 AportisDoc
 ASCII mail text
 ASCII text
 Broadband eBooks (BBeB)
 Composite Document File (Microsoft Office Word)
 DAISY3 DTBook
 EPUB ebook data
 GIF image data
 GutenPalm zTXT
 GNU gettext message catalogue
 HTML document
 ISO-8859 text
 JPEG image data
 Microsoft Reader eBook Data
 Microsoft Windows HtmlHelp Data
 Microsoft Word 2007+
 Mobipocket E-book
 MS Windows HtmlHelp Data
 Netpbm PPM data
 OpenDocument Text
 PDF document
 PeanutPress PalmOS
 PNG image data
 POSIX shell script text
 PostScript document
 Rich Text Format
 troff or preprocessor text (e.g. Linux man-pages)
 UTF-8 Unicode mail text
 UTF-8 Unicode text
 WordPerfect
 XML document text
gocr
Command-line optical character recognition (OCR)
Versions of package gocr
Release   Version   Architectures
bullseye  0.52-3    amd64, arm64, armhf, i386
bookworm  0.52-6    amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
trixie    0.52-6.1  amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
forky     0.52-6.1  amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
sid       0.52-6.1  amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
Debtags of package gocr:
accessibility: ocr
interface: commandline
role: program
scope: application
use: converting
works-with: image, image:raster, text
Popcon: 148 users (66 upd.)*
License: DFSG free

This is a multi-platform OCR (Optical Character Recognition) program.

The program can read pnm, pbm, pgm, ppm, some pcx, and tga image files.

At present the program should handle scans well when their text is in a single column without tables. Font sizes of 20 to 60 pixels are supported.

If you want to write your own OCR, libgocr is provided in a separate package. Documentation and a graphical wrapper are also provided in separate packages.
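
As an illustration of command-line use, the following is a minimal sketch in Python that builds and runs a basic gocr call. It assumes gocr is installed and on the PATH and uses its -i (input image) and -o (output file) options. The helper names build_gocr_command and run_gocr are hypothetical and exist only for this example, and the suffix check is a convenience derived from the format list above, not something gocr itself requires.

```python
import shutil
import subprocess
from pathlib import Path

# Image formats named in the package description; only some pcx files work.
SUPPORTED_SUFFIXES = {".pnm", ".pbm", ".pgm", ".ppm", ".pcx", ".tga"}


def build_gocr_command(image_path, output_path):
    """Return the argument list for a basic gocr run (hypothetical helper)."""
    suffix = Path(image_path).suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        raise ValueError(f"unsupported image format: {suffix or '(none)'}")
    # -i names the input image, -o names the text file to write.
    return ["gocr", "-i", str(image_path), "-o", str(output_path)]


def run_gocr(image_path, output_path):
    """Run gocr if it is installed; return True when it exits successfully."""
    if shutil.which("gocr") is None:
        return False  # gocr is not on PATH
    result = subprocess.run(build_gocr_command(image_path, output_path))
    return result.returncode == 0
```

Images in other formats, such as PNG or JPEG, would first need converting to one of the supported formats with a separate tool.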

lios
Linux-Intelligent OCR Solution
Maintainer: Samuel Thibault
Versions of package lios
Release   Version               Architectures
bullseye  2.7.2-2               all
sid       2.7.2+git20221124-1   all
forky     2.7.2+git20221124-1   all
trixie    2.7.2-8               all
bookworm  2.7.2-6               all
Popcon: 50 users (30 upd.)*
License: DFSG free

Lios provides a graphical interface on top of the Cuneiform and Tesseract OCR engines to make OCR processing easier for users with disabilities, with fully automatic rotation, brightness optimization, rectangle selection, audio feedback, etc.

tesseract-ocr
Tesseract command-line OCR tool
Versions of package tesseract-ocr
Release   Version    Architectures
bullseye  4.1.1-2.1  amd64, arm64, armhf, i386
bookworm  5.3.0-2    amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el, s390x
trixie    5.5.0-1    amd64, arm64, armel, armhf, i386, ppc64el, riscv64, s390x
forky     5.5.0-1    amd64, arm64, armhf, i386, ppc64el, riscv64, s390x
sid       5.5.0-1    amd64, arm64, armel, armhf, i386, mips64el, ppc64el, riscv64, s390x
Debtags of package tesseract-ocr:
accessibility: ocr
interface: commandline
role: program
Popcon: 2166 users (646 upd.)*
License: DFSG free

Tesseract is an Optical Character Recognition (OCR) engine. It can be used directly or (for programmers) through an API to extract printed text from images. It supports a wide variety of languages. This package contains the command-line tool.
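
To show direct use of the command-line tool, the following is a minimal sketch in Python, assuming tesseract is installed and on the PATH. It relies on the usual calling form "tesseract IMAGE OUTPUTBASE -l LANG", where the recognized text is written to OUTPUTBASE.txt. The helper names build_tesseract_command and ocr_to_text are hypothetical, and the sketch assumes that the data for the requested language is installed.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Return the argument list for a basic tesseract run (hypothetical helper)."""
    # tesseract appends ".txt" to output_base when writing plain text.
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def ocr_to_text(image_path, output_base, lang="eng"):
    """Run tesseract and return the recognized text, or None if it is not installed."""
    if shutil.which("tesseract") is None:
        return None  # tesseract is not on PATH
    # check=True raises CalledProcessError if tesseract reports a failure.
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    with open(f"{output_base}.txt", encoding="utf-8") as handle:
        return handle.read()
```

A call such as ocr_to_text("page.png", "page", lang="dan") would request Danish recognition, provided the corresponding language data is available on the system.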


Debian packages in contrib or non-free

cuneiform
multi-language OCR system
Versions of package cuneiform
Release   Version                    Architectures
sid       1.1.0+dfsg-13 (non-free)   amd64, arm64, armel, armhf, i386, mips64el, ppc64el
bullseye  1.1.0+dfsg-8 (non-free)    amd64, arm64, armhf, i386
bookworm  1.1.0+dfsg-9 (non-free)    amd64, arm64, armel, armhf, i386, mips64el, mipsel, ppc64el
trixie    1.1.0+dfsg-12 (non-free)   amd64, arm64, armel, armhf, i386, ppc64el
forky     1.1.0+dfsg-13 (non-free)   amd64, arm64, armhf, i386, ppc64el
Debtags of package cuneiform:
accessibility: ocr
interface: commandline
role: program
scope: utility
use: converting
works-with: image, image:raster
Popcon: 41 users (69 upd.)*
License: non-free

Cuneiform is an OCR system. In addition to text recognition it also does layout analysis and text format recognition.

The following languages are supported: Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, French, German, Hungarian, Italian, Latvian, Lithuanian, Polish, Portuguese, Romanian, Russian, Serbian, Slovenian, Spanish, Swedish, Turkish and Ukrainian.

*Popularity contest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 268224.