Archives
 
 
 
  Special
 
 
 
  About Us
 
 
 

Newsletter
Free E-mail Newsletter from BYTE.com

 
    
           
Visit the home page Browse the four-year online archive Download platform-neutral CPU/FPU benchmarks Find information for advertisers, authors, vendors, subscribers Request free information on products written about or advertised in BYTE Submit a press release, or scan recent announcements Talk with BYTE's staff and readers about products and technologies

ArticlesImage Retrieval for Compound Documents


August 1994 / Cover Story / Image Retrieval for Compound Documents
Tom R. Halfhill

Archived information is useless if it can't be retrieved, and the easiest kind of information to retrieve is textual: documents that either originated in machine-readable form or were converted to ASCII text by OCR scanning. A much more difficult challenge is to retrieve image files or compound documents in which the target of the search is a graphic, a video clip, or a sound bite. As the definition of what constitutes a ``document'' evolves to include files with multiple embedded data types, the ability to search for attributes unique to those data types becomes increasingly important.

Fortunately, image-recognition technology is advancing at a rapid pace, driven by industrial, military, and law-enforcement needs, as well as business applications. Manuf acturers are relying more and more on machine vision and pattern recognition to automate their inspection and grading processes. For example, a Windows-based color vision program called Way-2C from Ronald A. Massa Associates (Cohasset, MA) is used to grade lumber, inspect soda crackers, and sort pills. The military uses similar technology for target acquisition and automated sentry posts. Police departments are using pattern recognition to identify fingerprints and match photos of suspects to digitized mug shots in computerized databases.

In the past, the only reliable way to index images was to tag them with keywords describing their content. This is still a worthwhile method, but the effort required to keypunch a caption for each image is justified only if you expect to retrieve the images often, as in the case of a stock-photo agency. Another drawback is that you can't always anticipate the parameters of a search; someday, you may want to locate an image that isn't described by any of its keywords.

Pattern-recognition algorithms, some based on neural networks, are getting smart enough to search common image formats such as TIFF files for specific shapes, colors, or textures. A leading product in this field is the Excalibur EFS document imaging system from Excalibur Technologies (San Diego, CA), which runs on PC, Macintosh and X-Terminal clients. Excalibur has developed a technique called adaptive pattern recognition that can analyze all types of digital data, including graphics and sound. The vision routines are so accurate that Weyerhauser uses them to sort different grades of hardboard siding by examining the wood grain.

IBM (White Plains, NY) recently announced a similar product called Visualizer UltiMedia Query, an OS/2-based DB2 client. Visualizer uses a technology known as QBIC (query by image content), developed by IBM's Almaden Research Center and the Santa Teresa Laboratory. A photo editor could use Visualizer to retrieve pictures of flowers containing a specific shade of yellow o r a particular arrangement of blossoms.

Highly specialized applications may require custom solutions, and tools are available for this purpose. For instance, Excalibur sells its recognition routines as a collection of C libraries called the XRS toolkit. A similar package of C routines, the Matrox Imaging Library, is available from Matrox (Dorval, Quebec, Canada). You can even buy PC-based tools for creating pattern-recognition programs with neural networks, such as DS2000 from Design Sciences (Vienna, VA), which supports several different network models.

As computers become more adept at handling multimedia data types, the need to efficiently store and retrieve compound documents will become as crucial to business as the paper-filled file cabinets that comprise the bulk of corporate archives today. Luckily, the technology isn't lagging too far behind the demand.


Figure: Illustration: Photograph: IBM's Visualizer Ultimedia Query can locate images from a database by shape or color. In this example, the product has located all grey, circular-shaped architectural images. Results are displayed in a closest-to-sample order.
Tom R. Halfhill is a BYTE senior news editor based in San Mateo, California. You can reach him on the Internet or BIX at thalfhill@bix.com .

Up to the Cover Story section contentsGo to previous article: Distributed Document Management with OLE and OpenDocSearchSend a comment on this articleSubscribe to BYTE or BYTE on CD-ROM  
Flexible C++
Matthew Wilson
My approach to software engineering is far more pragmatic than it is theoretical--and no language better exemplifies this than C++.

more...

BYTE Digest

BYTE Digest editors every month analyze and evaluate the best articles from Information Week, EE Times, Dr. Dobb's Journal, Network Computing, Sys Admin, and dozens of other CMP publications—bringing you critical news and information about wireless communication, computer security, software development, embedded systems, and more!

Find out more

BYTE.com Store

BYTE CD-ROM
NOW, on one CD-ROM, you can instantly access more than 8 years of BYTE.
 
The Best of BYTE Volume 1: Programming Languages
The Best of BYTE
Volume 1: Programming Languages
In this issue of Best of BYTE, we bring together some of the leading programming language designers and implementors...

Copyright © 2005 CMP Media LLC, Privacy Policy, Your California Privacy rights, Terms of Service
Site comments: webmaster@byte.com
SDMG Web Sites: BYTE.com, C/C++ Users Journal, Dr. Dobb's Journal, MSDN Magazine, New Architect, SD Expo, SD Magazine, Sys Admin, The Perl Journal, UnixReview.com, Windows Developer Network