The Data Warehouse Toolkit
Second Edition
The Complete Guide to Dimensional Modeling

Ralph Kimball
Margy Ross

Wiley Computer Publishing
John Wiley & Sons, Inc.
NEW YORK, CHICHESTER, WEINHEIM, BRISBANE, SINGAPORE, TORONTO

  Dimension Table Attributes
  Date Dimension
  Product Dimension
  Store Dimension
  Promotion Dimension
  Degenerate Transaction Number Dimension
  Retail Schema in Action
  Retail Schema Extensibility
  Resisting Comfort Zone Urges
  Dimension Normalization (Snowflaking)
  Too Many Dimensions
  Surrogate Keys
  Market Basket Analysis

Chapter 3 Inventory
  Introduction to the Value Chain
  Inventory Models
  Inventory Periodic Snapshot
  Inventory Transactions
  Inventory Accumulating Snapshot
  Value Chain Integration
  Data Warehouse Bus Architecture
  Data Warehouse Bus Matrix
  Conformed Dimensions
  Conformed Facts

Chapter 4 Procurement
  Procurement Case Study
  Procurement Transactions
  Multiple- versus Single-Transaction Fact Tables
  Complementary Procurement Snapshot
  Slowly Changing Dimensions
  Type 1: Overwrite the Value
  Type 2: Add a Dimension Row
  Type 3: Add a Dimension Column
  Hybrid Slowly Changing Dimension Techniques
  Predictable Changes with Multiple Version Overlays
  Unpredictable Changes with Single Version Overlay
  More Rapidly Changing Dimensions

Chapter 5 Order Management
  Introduction to Order Management
  Order Transactions
  Fact Normalization
  Dimension Role-Playing
  Product Dimension Revisited
  Customer Ship-To Dimension
  Deal Dimension
  Degenerate Dimension for Order Number
  Junk Dimensions
  Multiple Currencies
  Header and Line Item Facts with Different Granularity
  Invoice Transactions
  Profit and Loss Facts
  Profitability: The Most Powerful Data Mart
  Profitability Words of Warning
  Customer Satisfaction Facts
  Accumulating Snapshot for the Order Fulfillment Pipeline
  Lag Calculations
  Multiple Units of Measure
  Beyond the Rear-View Mirror
  Fact Table Comparison
  Transaction Fact Tables
  Periodic Snapshot Fact Tables
  Accumulating Snapshot Fact Tables
  Designing Real-Time Partitions
  Requirements for the Real-Time Partition
  Transaction Grain Real-Time Partition
  Periodic Snapshot Real-Time Partition
  Accumulating Snapshot Real-Time Partition

Chapter 6 Customer Relationship Management
  CRM Overview
  Operational and Analytical CRM
  Packaged CRM
  Customer Dimension
  Name and Address Parsing
  Other Common Customer Attributes
  Dimension Outriggers for a Low-Cardinality Attribute Set
  Large Changing Customer Dimensions
  Implications of Type 2 Customer Dimension Changes
  Customer Behavior Study Groups
  Commercial Customer Hierarchies
  Combining Multiple Sources of Customer Data
  Analyzing Customer Data from Multiple Business Processes

Chapter 7 Accounting
  Accounting Case Study
  General Ledger Data
  General Ledger Periodic Snapshot
  General Ledger Journal Transactions
  Financial Statements
  Budgeting Process
  Consolidated Fact Tables
  Role of OLAP and Packaged Analytic Solutions

Chapter 8 Human Resources Management
  Time-Stamped Transaction Tracking in a Dimension
  Time-Stamped Dimension with Periodic Snapshot Facts
  Audit Dimension
  Keyword Outrigger Dimension
  AND/OR Dilemma
  Searching for Substrings
  Survey Questionnaire Data

Chapter 9 Financial Services
  Banking Case Study
  Dimension Triage
  Household Dimension
  Multivalued Dimensions
  Minidimensions Revisited
  Arbitrary Value Banding of Facts
  Point-in-Time Balances
  Heterogeneous Product Schemas
  Heterogeneous Products with Transaction Facts

Chapter 10 Telecommunications and Utilities
  Telecommunications Case Study
  General Design Review Considerations
  Granularity
  Date Dimension
  Degenerate Dimensions
  Dimension Decodes and Descriptions
  Surrogate Keys
  Too Many (or Too Few) Dimensions
  Draft Design Exercise Discussion
  Geographic Location Dimension
  Location Outrigger
  Leveraging Geographic Information Systems

Chapter 11 Transportation
  Airline Frequent Flyer Case Study
  Multiple Fact Table Granularities
  Linking Segments into Trips
  Extensions to Other Industries
  Cargo Shipper
  Travel Services
  Combining Small Dimensions into a Superdimension
  Class of Service
  Origin and Destination
  More Date and Time Considerations
  Country-Specific Calendars
  Time of Day as a Dimension or Fact
  Date and Time in Multiple Time Zones

Chapter 12 Education
  University Case Study
  Accumulating Snapshot for Admissions Tracking
  Factless Fact Tables
  Student Registration Events
  Facilities Utilization Coverage
  Student Attendance Events
  Other Areas of Analytic Interest

Chapter 13 Health Care
  Health Care Value Circle
  Health Care Bill
  Roles Played By the Date Dimension
  Multivalued Diagnosis Dimension
  Extending a Billing Fact Table to Show Profitability
  Dimensions for Billed Hospital Stays
  Complex Health Care Events
  Medical Records
  Fact Dimension for Sparse Facts
  Going Back in Time
  Late-Arriving Fact Rows
  Late-Arriving Dimension Rows

Chapter 14 Electronic Commerce
  Web Client-Server Interactions Tutorial
  Why the Clickstream Is Not Just Another Data Source
  Challenges of Tracking with Clickstream Data
  Specific Dimensions for the Clickstream
  Clickstream Fact Table for Complete Sessions
  Clickstream Fact Table for Individual Page Events
  Aggregate Clickstream Fact Tables
  Integrating the Clickstream Data Mart into the Enterprise Data Warehouse
  Electronic Commerce Profitability Data Mart

Chapter 15 Insurance
  Insurance Case Study
  Insurance Value Chain
  Draft Insurance Bus Matrix
  Policy Transactions
  Dimension Details and Techniques
  Alternative (or Complementary) Policy Accumulating Snapshot
  Policy Periodic Snapshot
  Conformed Dimensions
  Conformed Facts
  Heterogeneous Products Again
  Multivalued Dimensions Again
  More Insurance Case Study Background
  Updated Insurance Bus Matrix
  Claims Transactions
  Claims Accumulating Snapshot
  Policy/Claims Consolidated Snapshot
  Factless Accident Events
  Common Dimensional Modeling Mistakes to Avoid

Chapter 16 Building the Data Warehouse
  Business Dimensional Lifecycle Road Map
  Road Map Major Points of Interest
  Project Planning and Management
  Assessing Readiness
  Scoping
  Justification
  Staffing
  Developing and Maintaining the Project Plan
  Business Requirements Definition
  Requirements Preplanning
  Collecting the Business Requirements
  Postcollection Documentation and Follow-up
  Lifecycle Technology Track
  Technical Architecture Design
  Eight-Step Process for Creating the Technical Architecture
  Product Selection and Installation
  Lifecycle Data Track
  Dimensional Modeling
  Physical Design
  Aggregation Strategy
  Initial Indexing Strategy
  Data Staging Design and Development
  Dimension Table Staging
  Fact Table Staging
  Lifecycle Analytic Applications Track
  Analytic Application Specification
  Analytic Application Development
  Deployment
  Maintenance and Growth
  Common Data Warehousing Mistakes to Avoid

Chapter 17 Present Imperatives and Future Outlook
  Ongoing Technology Advances
  Political Forces Demanding Security and Affecting Privacy
  Conflict between Beneficial Uses and Insidious Abuses
  Who Owns Your Personal Data?
  What Is Likely to Happen? Watching the Watchers...
  How Watching the Watchers Affects Data Warehouse Architecture
  Designing to Avoid Catastrophic Failure
  Catastrophic Failures
  Countering Catastrophic Failures
  Intellectual Property and Fair Use
  Cultural Trends in Data Warehousing
  Managing by the Numbers across the Enterprise
  Increased Reliance on Sophisticated Key Performance Indicators
  Behavior Is the New Marquee Application
  Packaged Applications Have Hit Their High Point
  Application Integration Has to Be Done by Someone
  Data Warehouse Outsourcing Needs a Sober Risk Assessment
  In Closing

Glossary
Index