March 15, 2007
Google OCRopus
New Google Project in the Works:
http://code.google.com/p/ocropus/
ocropus
open source document analysis and OCR system
OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.
Background
The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-1990s and deployed by the US Census Bureau, and novel high-performance layout analysis methods.
OCRopus development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. We expect that it will also be an excellent OCR system for many other applications.
Releases are scheduled throughout Q1-Q3 2007.
Posted by hag at March 15, 2007 12:05 PM