Archives
 
 
 
  Special
 
 
 
  About Us
 
 
 

Newsletter
Free E-mail Newsletter from BYTE.com

 
    
           
Visit the home page Browse the four-year online archive Download platform-neutral CPU/FPU benchmarks Find information for advertisers, authors, vendors, subscribers Request free information on products written about or advertised in BYTE Submit a press release, or scan recent announcements Talk with BYTE's staff and readers about products and technologies

ArticlesHidden Markov Models


December 1995 / Features / Talking to Machines / Hidden Markov Models

Hidden Markov models (HMMs) consist of states connected by directional arcs or transitions containing probability information. A. A. Markov formulated the basic structure in 1913 to describe letter sequences in Russian. Each state in Markov's model corresponded to a single letter, while the transition linking A to B represented the probability that B would follow A . James Baker of Carnegie Mellon University first applied HMMs to speech recognition in the 1970s.

In Markov's original model, a state emits its unique letter, making the path easily discernible. What makes an HMM "hidden" is that it's impossible to determine the path taken through the model on the b asis of the intermediate outputs. Outputs of HMMs are simply the result of applying probabilities to the input and don't necessarily tell you what state produced them.

Speech recognition constructs the HMM for a word from spoken samples of that word. Each state contains acoustic information about a segment of the word, including acoustic variability. Transitions contain probabilities to determine the likelihood that one state will follow another state. Because they allow a recognition algorithm to move from one state to another based on the input data, HMMs are "nondeterministic" systems.


Navigating Through an HMM

illustration_link (16 Kbytes)

This diagram illustrates how a speech-recognition algorithm might identify a word by comparing a series of input vectors (i.e., speech samples) with a five-state stored HMM. Here we see three possible paths, all starting at the same state. The orange line indicates the "best path" through this HMM, the one that most closely matches the characteristics of the HMM. For this solution, the first two inputs keep the path in the first state of the HMM. In physical terms, if this HMM represented the word six, the orange path might suggest that the speaker lengthened the s sound at the start of the word.


Up to the Features section contentsGo to previous article: Talking to MachinesGo to next article: DragNETSearchSend a comment on this articleSubscribe to BYTE or BYTE on CD-ROM   E
Flexible C++
Matthew Wilson
My approach to software engineering is far more pragmatic than it is theoretical--and no language better exemplifies this than C++.

more...

BYTE Digest

BYTE Digest editors every month analyze and evaluate the best articles from Information Week, EE Times, Dr. Dobb's Journal, Network Computing, Sys Admin, and dozens of other CMP publications—bringing you critical news and information about wireless communication, computer security, software development, embedded systems, and more!

Find out more

BYTE.com Store

BYTE CD-ROM
NOW, on one CD-ROM, you can instantly access more than 8 years of BYTE.
 
The Best of BYTE Volume 1: Programming Languages
The Best of BYTE
Volume 1: Programming Languages
In this issue of Best of BYTE, we bring together some of the leading programming language designers and implementors...

Copyright © 2005 CMP Media LLC, Privacy Policy, Your California Privacy rights, Terms of Service
Site comments: webmaster@byte.com
SDMG Web Sites: BYTE.com, C/C++ Users Journal, Dr. Dobb's Journal, MSDN Magazine, New Architect, SD Expo, SD Magazine, Sys Admin, The Perl Journal, UnixReview.com, Windows Developer Network