Archives
 
 
 
  Special
 
 
 
  About Us
 
 
 

Newsletter
Free E-mail Newsletter from BYTE.com

 
    
           
Visit the home page Browse the four-year online archive Download platform-neutral CPU/FPU benchmarks Find information for advertisers, authors, vendors, subscribers Request free information on products written about or advertised in BYTE Submit a press release, or scan recent announcements Talk with BYTE's staff and readers about products and technologies

ArticlesDouble-Byte Languages


August 1996 / International Features / Asian Internet Fever / Double-Byte Languages

Searching for information is sometimes doubly difficult.

English-language software is written in what's called single-byte code. In contrast, Chinese, Japanese, and other character-based software products are written in double-byte code. "Double-byte code, as the name suggests, consists of bytes that when put together make a single character, as used in many Asian languages," explains Dion Wiggins, director of strategic development for Asia On-Line, a Hong Kong-based Internet service provider (ISP) that specializes in furnishing content in Chinese, Japanese, Thai, and other Asian languages.

Double-byte software, however, is sometimes more expensive than single-byte pack ages. Plus, it poses problems in machine translation with single-byte-based syste ms, says Hong Chen, chief executive of Aimnet, which develops Internet software.

Adds Wiggins: "When searching English data [on the Internet], you can search for part of a word. But this is not possible with double-byte character sets as the characters themselves are the whole word."

Asia On-Line is developing a search engine based on neural network technology. "Neural networks learn from patterns and events, so when the same or even a similar event happens, it can make decisions based on previous events. If you apply this technology to a search facility, it can be used for any language, including all double-byte character sets and English," Wiggins says.


Up to the International Features section contentsGo to previous article: Double-Byte LanguagesGo to next article: Government CensorshipSearchSend a comment on this articleSubscribe to BYTE or BYTE on CD-ROM  
Flexible C++
Matthew Wilson
My approach to software engineering is far more pragmatic than it is theoretical--and no language better exemplifies this than C++.

more...

BYTE Digest

BYTE Digest editors every month analyze and evaluate the best articles from Information Week, EE Times, Dr. Dobb's Journal, Network Computing, Sys Admin, and dozens of other CMP publications—bringing you critical news and information about wireless communication, computer security, software development, embedded systems, and more!

Find out more

BYTE.com Store

BYTE CD-ROM
NOW, on one CD-ROM, you can instantly access more than 8 years of BYTE.
 
The Best of BYTE Volume 1: Programming Languages
The Best of BYTE
Volume 1: Programming Languages
In this issue of Best of BYTE, we bring together some of the leading programming language designers and implementors...

Copyright © 2005 CMP Media LLC, Privacy Policy, Your California Privacy rights, Terms of Service
Site comments: webmaster@byte.com
SDMG Web Sites: BYTE.com, C/C++ Users Journal, Dr. Dobb's Journal, MSDN Magazine, New Architect, SD Expo, SD Magazine, Sys Admin, The Perl Journal, UnixReview.com, Windows Developer Network