Archives
 
 
 
  Special
 
 
 
  About Us
 
 
 

Newsletter
Free E-mail Newsletter from BYTE.com

 
    
           
Visit the home page Browse the four-year online archive Download platform-neutral CPU/FPU benchmarks Find information for advertisers, authors, vendors, subscribers Request free information on products written about or advertised in BYTE Submit a press release, or scan recent announcements Talk with BYTE's staff and readers about products and technologies

ArticlesOCR: A Lomg Way fr@m the O!d Dajs


March 1996 / Reviews / The Paperless Cubicle / OCR: A Lomg Way fr@m the O!d Dajs

Over the past 10 years, OCR has come a long way. Early desktop-based OCR solutions required tedious user training. The software would build a database, assigning an ASCII value to the bit-map image of each character it came across. Once trained, the OCR engine compared scanned character images to the images in its database, returning an ASCII character that represented the best match. Besides consuming time and resources, trainable packages did not work well when faced with a variety of fonts.

Instead of bit-map matching, omnifont technology uses feature extraction to distinguish characters, breaking down a character into its component parts and recognizing an image based on its uniqu e combination of shapes. Omnifont requires no training, consumes less d isk space, and, because a character retains the same basic features regardless of the font, works well across a broad range of typefaces.

Technology developed by Caere (maker of OmniPage Pro 6.0, WordScan Plus 4.0, and OmniPage Lite OCR, which comes bundled with the PaperPort software) not only recognizes single characters but evaluates entire words to make better predictions. The technology is based on the hidden Markov model, a complex mathematical model used in predictive technologies such as speech recognition and DNA matching.

The OCR engine looks at all the pixels that make up a word and then uses probability algorithms, including contextual evidence and language rules, to select the most probable word. Trigram analysis and dictionaries further improve accuracy. Trigram algorithms consider the probabilities of three words appearing together and can adjust the prediction of a single word based on the trigram probabilities.


Up to the Reviews section contentsGo to previous article: OCR: A Lomg Way fr@m the O!d DajsGo to next article: ScanMan Scans 24-Bit ColorSearchSend a comment on this articleSubscribe to BYTE or BYTE on CD-ROM  
Flexible C++
Matthew Wilson
My approach to software engineering is far more pragmatic than it is theoretical--and no language better exemplifies this than C++.

more...

BYTE Digest

BYTE Digest editors every month analyze and evaluate the best articles from Information Week, EE Times, Dr. Dobb's Journal, Network Computing, Sys Admin, and dozens of other CMP publications—bringing you critical news and information about wireless communication, computer security, software development, embedded systems, and more!

Find out more

BYTE.com Store

BYTE CD-ROM
NOW, on one CD-ROM, you can instantly access more than 8 years of BYTE.
 
The Best of BYTE Volume 1: Programming Languages
The Best of BYTE
Volume 1: Programming Languages
In this issue of Best of BYTE, we bring together some of the leading programming language designers and implementors...

Copyright © 2005 CMP Media LLC, Privacy Policy, Your California Privacy rights, Terms of Service
Site comments: webmaster@byte.com
SDMG Web Sites: BYTE.com, C/C++ Users Journal, Dr. Dobb's Journal, MSDN Magazine, New Architect, SD Expo, SD Magazine, Sys Admin, The Perl Journal, UnixReview.com, Windows Developer Network