|
Data Warehousing, Enterprise Portals, and the Information Supply Chain This column has been devoted to covering Real-Time Data Warehousing for the last several months. This month we turn our attention to the emerging role of Enterprise Information Portals (EIP) in managing what we will call the Information Supply Chain. An EIP will become an essential component of both traditional and real-time data warehouse environments What is unique about DW projects that succeed? When we researched this question, we found a common theme that should not be surprising in retrospect. The teams that succeed understand their consumers. In a more complete sense, they understand the role each consumer plays in creating, using and acting on information. Information analysis or decision support is not just a one step process of generating a report and reaching a conclusion. Raw data and intermediate results flow from individual to individual through many channels. Along the way numerous facts are generated, trends are analyzed, hypotheses are tested, and decisions are made. This flow of content from original source(s) to ultimate action(s) is what we call the Information Supply Chain. Information Supply Chain Picture this: Information is a fluid that comes from many sources and is blended in unique ways as it is moved in buckets from one location to another as it is needed. A data warehouse might be seen as a refinery that takes in undifferentiated raw material and produces many distilled variants that serve a specific purpose well. This creates the optimal product with which to begin analysis. The error is to think of this stage as the end anything other than the duty assignment of this one analyst. It is only the beginning of the chain; information still moves in buckets the rest of the way. (1) Very few information consumers use raw data. Why? The easy answer is to blame the technology or peoples’ willingness to accept the technology. “If only the tools were easier!” “If only we didn’t have so many computer-phobic users!” Yes, technology advances are reducing barriers to broader use. Yes, more computer-literate employees and better training will mean more people can use what you offer. But the degree of data warehouse adoption has a (low) natural ceiling that cannot be exceeded by traditional means. Regardless of the degree of aggregation and summary or the amount of transformation and derivation, most data warehouses are purveyors of raw data. The simple truth is the vast majority of information consumers do not use raw data. Their work begins with intermediate results, such as spreadsheets and documents, prepared by someone else. The average consumer adds value to these results and then sends this modified content on to be used by others. Most information consumers are not “end users” at all; they are “middle users”. (2) Most effective information is 5-7 steps from the last IT managed source. With these definitions in mind, we suggest you conduct a trial sources and uses study. Identify a true decision maker in your organization. Find the presentation documents that contain the effective information she relies on. Trace these documents back through all the intermediate destinations to the IT managed source(s). If your enterprise is typical, you will find, as we did, that data has been manually manipulated in 5 to 7 separate steps on its way from the original raw data source to final action. For instance, at a major computer manufacturer in the early 1990’s, top managers relied on the green, red, blue, and yellow sheets to make decisions. The green sheet contained current sales with notable highlights such as large orders or key customer trends. This content originated from numerous systems and was actively massaged and repacked at least 4 times before being hand assembled into the green sheet. The data for the cost sheet (red) went through so many hands we never knew for sure. The blue backlog sheet was sometimes accused of being wild guesses and little white lies. Nobody knew for sure where the supporting data came from or how it was used to produce the results. The outside influences sheet (yellow) was composed of news and clues and espionage that did not come from IT sources at all. A well-designed data warehouse will eliminate sourcing problems and may eliminate several steps in the overall flow. But it will never eliminate the management dynamic that exists in all organizations. The person who first gets the data may not use it. The person who uses the data may not act on it. The person who acts on it may not know the underlying data. Very rarely is the decision maker the data analyst much less the original data gatherer. This is the essence of the information supply chain from get to use to act. See the sidebar entitled: “An Information Supply Chain Example”. (3) The greatest productivity and consistency loss occurs in the middle of the chain. A data warehouse is generally designed to serve the first one, or at most few, steps in the information supply chain. We help the initial consumer get the raw data. We may provide support for early usage steps. Rarely does our reach extend to the middle, much less to the end, of the chain. The cost of not serving the middle to end of the chain can be high. The desktop processing at the upper end of the chain is massively inefficient when expensive personnel are used as data entry clerks. Consistency is jeopardized by high error rates caused by re-keying and manual manipulation, the lack of checks and balances, and capriciously volatile business processes. Several factors are the direct cause of loss of productivity and information consistency in the upper parts of the chain.
Enter the Enterprise Portal I will not define Enterprise Information Portals in detail since much has been published on this topic. However, it is critical for you to understand that I am referring to a specific subset of the broad range of contenders for this title. The subset includes those portal products that provide an extensive array of document and unstructured information management services in addition to structured data access and analysis. The large number of “business intelligence” portals, which primarily offer report, query, and OLAP services, need not apply. This form of EIP provides top-down integration to complement the bottom-up integration of data warehousing. The EIP provides the mechanism to catalog the document sets that are the content delivery vehicles for the middle-to-end of the Information Supply Chain. An EIP can address the productivity and consistency issues from the last section. First, an EIP provides a mechanism for content creators to catalog their results for others to use. Navigation facilities help information consumers find relevant information regardless of the document or data type, the storage form, or the location. Second, an EIP helps eliminate reinventing the wheel while cataloging multiple answers to the same question along with the supporting evidence. When people cannot easily find what they need, the first problem is the productivity loss in reproducing the results. The second problem may be more severe. People unwittingly introduce a redundant “answer” to the information flow, which is highly likely to be inconsistent in source, form, or results with other analyses. The existence of multiple, even conflicting, answers is not itself the problem. It is reasonable for different analysts to produce different results when their assumptions, their sources, or their methods are different. The information value is higher when a decision maker has multiple conjectures available supported by clearly defined calculation methods. Often, though, the multiple results are found and presented without the defining background that may account for their differences. If an analyst comes across multiple “answers”, they must either accept one on faith or be forced to reconcile them. This outcome is better then the typical scenario where conflicting results are presented to a key decision maker without explanation. Her trust in the whole process is undermined. Third, an EIP can be used to prepare and present a complete information package. It allows you to store together all content (documents and data) about a topic. It can include references or copies of news articles with supporting facts or ideas. It can include a link to any intranet or Internet source that may be relevant. Because it is based on the interactive media of the World Wide Web, it can include audio clips, video presentations, tutorials or demonstrations. Fourth, an Enterprse Information Portal can create a more egalitarian environment for information sharing. The existence of a familiar and ubiquitous mechanism for publishing and subscribing to information breaks down traditional barriers between departments and business units. Those whose power comes solely from limiting the flow of information will resist the introduction of an EIP into their domain. Be watchful for this hidden motivation. The highest goal of information management should be optimizing the Information Supply Chain of the enterprise. This requires increased attention on how information flows through your organization. Data Warehouse services can be extended further up the supply chain by supporting multiple stages of structured data analysis. An Enterprise Portal can be introduced to handle the organization of the document-centric middle-to-end of the supply chain. Instead of today’s bucket brigade, using our fluid example, we will be building subject-oriented pipelines to move content efficiently throughout the enterprise. |
|||||||||||||||||||||||||||||||||
|
home | about us | solutions | products | partners | clients | careers | contact us | search |
|||
|
|
FACE="Verdana, Helvetica, Arial">Texas.78746 tel(512)329-6646,.fax(512)329-0767 | ||