Archives
 
 
 
  Special
 
 
 
  About Us
 
 
 

Newsletter
Free E-mail Newsletter from BYTE.com

 
    
           
Visit the home page Browse the four-year online archive Download platform-neutral CPU/FPU benchmarks Find information for advertisers, authors, vendors, subscribers Request free information on products written about or advertised in BYTE Submit a press release, or scan recent announcements Talk with BYTE's staff and readers about products and technologies

ArticlesPlanning and Implementing a Data Warehouse


June 1997 / Reseller / Planning and Implementing a Data Warehouse

How resellers can profit from data-warehouse opportunities.

David Baum

Data warehouses promise easy access to business data and a faster way to answer complex questions. However, between the theory and practical use of a data warehouse lies a rocky road. This means danger for data-warehouse users and opportunities for resellers who can handle the architectural planning and extensive integration work necessary for successful implementations.

What are these opportunities? Small reseller shops can find lucrative niche markets by specializing in individual pieces of a data-warehouse launch, such as designing data models or Web-enabling legacy databases. Larger resellers can win warehouse customers by constructing vertical solutions for r etail, banking, manufacturing, and other industry segments. The advantage here is that once a reseller solves the hard problems in the first implementation, it can repeatedly sell the solution to other companies in that industry segment.

The large consulting arms of data-warehouse vendors sell everything from piecemeal development services to turnkey, packaged data marts. Take all three of these segments into account, and you'll find a market that will reach $6.9 billion by 1999, according to the Gartner Group.

Data-warehouse projects are sparkling reseller opportunities because they require many specialized skills to design, develop, and deploy. While shrink-wrapped warehouse solutions are beginning to reach the market, most warehouse efforts are still custom jobs calling for the integration of a wide range of hardware, software, and network components. The basic pieces include a data model, a warehouse server and applications servers, the middleware layer, a coh esive network infrastructure, client-side analysis tools, data-scrubbing utilities, data-transport utilities, replication engines, and metadata repositories. But don't let this laundry list intimidate you -- the first five items are the key components.

Modeling Data

Data warehouses separate day-to-day data that production applications use from the historical data that strategic planners use to uncover new sales patterns or spot other trends that can boost business. Another difference: Data-entry professionals

and applications update production data continuously, while data-warehouse applications refresh historical data at set times, usually during off-hours when network and CPU use are light.

One of your first steps will be to decide how to store the data. Any relational DBMS (RDBMS), such as Oracle7, Sybase System 10, or IBM's DB2/400, can work as a warehouse repository. However, some organizations opt for multidimensional DBMSes (MDBMSes) designed for data warehousing, such as Oracle Express and Red Brick Systems' Red Brick VPT. A traditional RDBMS lets end users view data in two dimensions (e.g., by product and by region). With an MDBMS, you can look at data in multiple dimensions (e.g., by product and by region over time).

On another front, universal servers, from Oracle, Sybase, Informix, Computer Associates, and IBM (see "RDBMSes Get a Make-Over" and "How to Improve RDBMSes," April BYTE), are object-relational hybrids that store not only text but complex objects such as images, animation, and sound. These RDBMSes are important for companies that post multiple data types on Web sites.

"The data warehouses emerging today can potentially support lots of data types in addition to text," says Mike Thompson, MIS manager at Integrated Device Technology (IDT) in Santa Clara, California. "For example, we often have business-critical information arriving in fax format. We could scan these images and store them in a database with some kind of logic behind them."

Since 1996, Thompson and his colleagues have been immersed in a project to construct a data warehouse for several areas of the company, including marketing, sales, order processing, and finance. After buying a ready-made data model from Applied Data Resource Management (see the text box "Packaged Data Model: A Gift for Warehouse Developers" below), the team went to work extracting data from production databases on a variety of host computers. They loaded the data into an Informix data warehouse that ran using a bank of SparcServers from Sun Microsystems.

Next, IDT will deploy the warehouse applications on the company's intranet. IDT chose the Informix product for its standard universal-server capability to handle complex data and because it lets developers embed SQL statements in HTML documents, so Web users can see query results dynamically displayed. Donald DePalma, an analyst at Forrester Research, says such multimedia warehouses will soon be the norm.

What's in the Middle

Mi ddleware connects warehouse databases and front-end decision-support tools. While standard database middleware can handle this task, specialized middleware for data warehouses is arriving. Colin White, principal consultant at DataBase Associates International, a database and data-warehouse consultancy, says specialized warehouse middleware can help companies monitor, track, and control access to warehouse data. "Users need to access data belonging to other departments for cross-business function analysis," White explains.

Some middleware products, including Sybase IQ and Information Builders' popular EDA, offer copy management (also called data staging) to select, edit, summarize, combine, and load the data warehouse with information from operational databases. Quality-analysis programs and filters identify patterns and data structures in the operational data. The patterns help summarize the data and construct views useful for analysis and reporting.

Network Considerations

Data warehouses typically imply a distributed-data architecture, with bulk transfers of data during off-hours and heavy interactive querying at peak hours of the day. Without proper planning, the performance of the network can suffer. Here's where resellers can help.

There are two primary methods for populating the warehouse with data: bulk downloads, in which the entire database is refreshed on a periodic basis, and change-based replication, where the system copies over just the changes.

Transmitting only the changes puts less stress on the network but requires more complex programming to set up. Bulk downloads can heavily tax the network -- a 20-GB database implies a 20-GB transfer -- but such warehouses are easier to set up and maintain.

In either case, warehouse architects must pay close attention to the frequency and scheduling of data updates (perhaps job-scheduling software is required). It's usually easy to determine the optimum time for refreshing the warehouse with new data, such as fol lowing a billing or sales cycle.

Browser Access

The final piece of the data-warehouse package is end-user access. Increasingly today, that means a Web connection such as the one Rand built when it linked a data warehouse and an intranet-based reporting system called Oasis. Rand, a nonprofit research firm, helps organizations develop public-policy strategies.

"Instead of having a copy of warehouse-reporting applications on every desktop, users can access server-based versions through their Web browsers," says Ken Krug, a treasurer at Rand. "This streamlines electronic access to corporate financial and man-power data."

Rand created Oasis on top of the Oracle7 database and Oracle Project Accounting software. Andersen Consulting helped with the initial financial-software selection. Oasis incorporates statistical and project-accounting programs built using the Oracle software in conjunction with Microsoft Excel and Netscape Web Server. Every two weeks, data from Oracle Project Accounting flows to an Oracle7 data warehouse that Rand uses to generate dozens of ad hoc reports.

Phase 1 of Oasis includes 45 reports accessible via the company's intranet to help users track the thousands of simultaneous projects and tasks ongoing at the organization. These reports are available anytime at all via Netscape Navigator.

A new generation of decision-support tools is helping VARs build next-generation applications, such as MicroStrategy's DSS Web, an analytical engine for translating user queries from Web browsers into an optimal SQL execution plan. Other products in this category include Oracle Express Server, Information Builders' WebFocus, Seagate Software's Crystal Reports, BusinessObjects' soon-to-be-released Darwin, and IQ Software's IQ LiveWeb.

Expanding Opportunities

The Internet's flexibility as an information-delivery vehicle and the growing cultivation of corporate data will continue to mean much custom development work for resellers. Even in an e ra of shrink-wrapped solutions, each data warehouse requires a unique architecture and includes a specific set of business requirements.

"For us, it comes back to what's best for our customers," says Rick Roy , vice president/information products division of reseller M&I Data Services. "Our real thrust with each data-warehouse initiative is to help companies identify the information that is truly important to them."


Where to Find


Business Objects

Cupertino, CA
Phone:    408-973-9300
Internet: 
http://www.businessobjects.com


Computer Systems Advisors

Woodcliff Lake, NJ
Phone:    201-391-6500
Internet: 
http://www.silverrun.com


DataBase Associates International

Morgan Hill, CA
Phone:    408-779-0436
Internet: 
http://www.dbaint.com


Digital Equipment Corp.

Maynard, MA
Phone:    800-332-4636
Internet: 
http://www.digital.com


Information Builders, Inc.

New York, NY
Phone:    800-969-4636
Internet: 
http://www.ibi.com


Informix

Menlo Park, CA
Phone:    800-331-1763
Internet: 
http://www.informix.com


IQ Software

Norcross, GA
Phone:    800-458-0386
Internet: 
http://www.iqsc.com


M&I Data Services

Brown Deer, WI
Phone:    800-236-3282
Internet: 
http://www.midata.com


MicroStrategy, Inc.

Vi
enna, VA
Phone:    703-848-8600
Internet: 
http://www.microstrategy.com


Oracle Corp.

Redwood Shores, CA
Phone:    800-392-2999
Internet: 
http://www.oracle.com


Pioneer Electronics

Cleveland, OH
Phone:    800-657-0168
Internet: 
http://www.pios.com


Platinum Technology

Oakbrook Terrace, IL
Phone:    800-442-6861
Internet: 
http://www.platinum.com


Prism Solutions

Sunnyvale, CA
Phone:    408-752-1888
Internet: 
http://www.tandem.com


Red Brick Systems

New York, NY
Phone:    800-777-2585
Internet: 
http://www.redbrick.com


Sybase, Inc.

Emeryville, CA
Phone:    800-879-2273
Internet: 
http://www.sybase.com


HotBYTEs
 - information on products covered or advertised in BYTE


Rick Roy

photo_link (41 Kbytes)

"Our real thrust with each data-warehouse initiative is to help companies identify the information that is truly important to them."


David Baum is a freelance technology writer residing in Santa Barbara, California. You can reach him at dwbaum@silcom.com .

Up to the Reseller section contentsGo to next article: Packaged Data Model: A Gift for Warehouse DevelopersSearchSend a comment on this articleSubscribe to BYTE or BYTE on CD-ROM  
Flexible C++
Matthew Wilson
My approach to software engineering is far more pragmatic than it is theoretical--and no language better exemplifies this than C++.

more...

BYTE Digest

BYTE Digest editors every month analyze and evaluate the best articles from Information Week, EE Times, Dr. Dobb's Journal, Network Computing, Sys Admin, and dozens of other CMP publications—bringing you critical news and information about wireless communication, computer security, software development, embedded systems, and more!

Find out more

BYTE.com Store

BYTE CD-ROM
NOW, on one CD-ROM, you can instantly access more than 8 years of BYTE.
 
The Best of BYTE Volume 1: Programming Languages
The Best of BYTE
Volume 1: Programming Languages
In this issue of Best of BYTE, we bring together some of the leading programming language designers and implementors...

Copyright © 2005 CMP Media LLC, Privacy Policy, Your California Privacy rights, Terms of Service
Site comments: webmaster@byte.com
SDMG Web Sites: BYTE.com, C/C++ Users Journal, Dr. Dobb's Journal, MSDN Magazine, New Architect, SD Expo, SD Magazine, Sys Admin, The Perl Journal, UnixReview.com, Windows Developer Network