Difference between revisions of "Tesseract"
Older revision: (Improved description, added categories.)
Newer revision: (Update URL)
Line 3:
  |Short description=Optical Character Recognition: turn an image to text
  |Full description=OCR can be used to e.g. scan books and turn them into text, which is more flexible and smaller in terms of file size.
− |Homepage URL=
+ |Homepage URL=https://github.com/tesseract-ocr/tesseract
  |Computer languages=C, C++
  |Related projects=Clara OCR, Ocre, Hocr, GOCR, WeOCR, GNU Ocrad
  |Keywords=text, graphics, ocr
− |Last review by=
+ |Last review by=Sweil
− |Last review date=
+ |Last review date=2018/02/20
  |Submitted by=mviinama
  |Submitted date=2013-04-11
Revision as of 10:42, 20 February 2018
Tesseract
https://github.com/tesseract-ocr/tesseract
Optical character recognition engine
Tesseract is an optical character recognition (OCR) engine with very high accuracy. It supports many languages, output text formatting, hOCR positional information and page layout analysis. Several image formats are supported through the Leptonica library. It can also detect whether text is monospaced or proportional.
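The hOCR positional information mentioned above is HTML in which each recognized word carries its bounding box in a `title` attribute. The following is a minimal sketch of reading those boxes with Python's standard library. The sample markup is hand-written in the hOCR style for illustration and is not real Tesseract output, and the names `WordBoxParser`, `parse_bbox` and `extract_words` are hypothetical helpers, not part of Tesseract.

```python
from html.parser import HTMLParser

# Hand-written sample in the hOCR style (an assumption for illustration,
# not captured Tesseract output).
SAMPLE_HOCR = """
<div class='ocr_page' title='bbox 0 0 200 50'>
  <span class='ocrx_word' title='bbox 10 12 58 30; x_wconf 96'>Hello</span>
  <span class='ocrx_word' title='bbox 64 12 120 30; x_wconf 91'>world</span>
</div>
"""


def parse_bbox(title):
    """Return (x0, y0, x1, y1) from an hOCR title string, or None."""
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) == 5 and parts[0] == "bbox":
            return tuple(int(p) for p in parts[1:])
    return None


class WordBoxParser(HTMLParser):
    """Collect (text, bbox) pairs for elements with class ocrx_word."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._bbox = None  # box of the word currently being read, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if "ocrx_word" in classes:
            self._bbox = parse_bbox(attrs.get("title") or "")

    def handle_data(self, data):
        # Attach the first non-blank text after a word tag to its box.
        text = data.strip()
        if self._bbox is not None and text:
            self.words.append((text, self._bbox))
            self._bbox = None


def extract_words(hocr):
    """Parse an hOCR string and return a list of (text, bbox) tuples."""
    parser = WordBoxParser()
    parser.feed(hocr)
    return parser.words
```

Calling `extract_words(SAMPLE_HOCR)` yields one entry per word, pairing its text with pixel coordinates. Real hOCR files also contain line, paragraph and page elements, which this sketch ignores.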
This package contains an OCR engine, libtesseract, and a command-line program, tesseract.
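As a rough illustration of driving the command-line program from a script, the sketch below builds a `tesseract` invocation and runs it with Python's `subprocess` module. It assumes the `tesseract` executable is installed and on `PATH`, that the requested language data (here `eng`) is available, and that the special output base `stdout` and the `hocr` config name behave as in common Tesseract releases. The function names `build_command` and `run_ocr` are hypothetical.

```python
import shutil
import subprocess


def build_command(image_path, lang="eng", hocr=False):
    """Build the argument list: tesseract IMAGE stdout -l LANG [hocr]."""
    cmd = ["tesseract", image_path, "stdout", "-l", lang]
    if hocr:
        # Trailing config name asking for hOCR instead of plain text.
        cmd.append("hocr")
    return cmd


def run_ocr(image_path, lang="eng", hocr=False):
    """Run tesseract on one image and return what it wrote to stdout."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_command(image_path, lang=lang, hocr=hocr),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

Keeping the command construction separate from execution makes the arguments easy to inspect or log before anything is run. A call such as `run_ocr("page.png")` would return the recognized text, where `page.png` is a placeholder file name.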
Licensing
| License | Verified by | Verified on | Notes |
|---|---|---|---|
| | Genium | 11 April 2020 | |
Leaders and contributors
Resources and communication
| Audience | Resource type | URI |
|---|---|---|
| GitHub | VCS Repository Webview | https://github.com/tesseract-ocr/tesseract |
| | Python (Ref) | https://pypi.org/project/tesseract |
| | Ruby (Ref) | https://rubygems.org/gems/tesseract |
| | Debian (Ref) | https://tracker.debian.org/pkg/tesseract |
| | R (Ref) | https://cran.r-project.org/web/packages/tesseract |
Software prerequisites
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the page “GNU Free Documentation License”.
The copyright and license notices on this page only apply to the text on this page. Any software or copyright-licenses or other similar notices described in this text has its own copyright notice and license, which can usually be found in the distribution or license text itself.