« ICT/AHDS: Digital Collections, Best Practice Descriptions | Main | Academic Blog Portal Wiki »

March 15, 2007

Google Ocropus

New Google Project in the Works:

http://code.google.com/p/ocropus/
ocropus
open source document analysis and OCR system

OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.
Background

The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-90's and deployed by the US Census bureau, and novel high-performance layout analysis methods.

OCRopus is development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. We expect that it will also be an excellent OCR system for many other applications.

Release dates throughout Q1-Q3, 2007.

Posted by hag at March 15, 2007 12:05 PM

Comments

Post a comment

Thanks for signing in, . Now you can comment. (sign out)

(If you haven't left a comment here before, you may need to be approved by the site owner before your comment will appear. Until then, it won't appear on the entry. Thanks for waiting.)


Remember me?