Data Warehouse ETL Toolkit : Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data by Ralph Kimball and Joe Caserta (2025, Trade Paperback)

Great Book Prices Store (337447)
96.6% positive Feedback
Price:
US $44.08
Approximately£32.59
+ $19.99 postage
Estimated delivery Mon, 30 Jun - Thu, 10 Jul
Returns:
14 days return. Buyer pays for return postage. If you use an eBay delivery label, it will be deducted from your refund amount.
Condition:
New
Data Warehouse ETL Toolkit : Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data, Paperback by Kimball, Ralph; Caserta, Joe, ISBN 0764567578, ISBN-13 9780764567575, Brand New, Free shipping in the US Discusses how to use an ELT system, covering such topics as choosing an architecture, building a data cleaning subsystem, and finetuning the ELT process for optimum performance.

About this product

Product Identifiers

PublisherWiley & Sons, Incorporated, John
ISBN-100764567578
ISBN-139780764567575
eBay Product ID (ePID)30765163

Product Key Features

Number of Pages528 Pages
LanguageEnglish
Publication NameData Warehouse ETL Toolkit : Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
Publication Year2025
SubjectData Modeling & Design, Databases / Data Warehousing
TypeTextbook
Subject AreaComputers
AuthorRalph Kimball, Joe Caserta
FormatTrade Paperback

Dimensions

Item Height1.2 in
Item Weight31.3 Oz
Item Length9.1 in
Item Width7.4 in

Additional Product Features

Intended AudienceScholarly & Professional
LCCN2004-016909
Dewey Edition22
TitleLeadingThe
IllustratedYes
Dewey Decimal005.74
Table Of ContentAcknowledgments. About the Authors. Introduction. Part I: Requirements, Realities, and Architecture. Chapter 1: Surrounding the Requirements. Chapter 2: ETL Data Structures. Part II: Data Flow. Chapter 3: Extracting. Chapter 4: Cleaning and Conforming. Chapter 5: Delivering Dimension Tables. Chapter 6: Delivering Fact Tables. Part III: Implementation and operations. Chapter 7: Development. Chapter 8: Operations. Chapter 9: Metadata. Chapter 10: Responsibilities. Part IV: Real Time Streaming ETL Systems. Chapter 11: Real-Time ETL Systems. Chapter 12: Conclusions. Index.
SynopsisThe single most authoritative guide on the most difficult phase of building a data warehouse The extract, transform, and load (ETL) phase of the data warehouse development life cycle is far and away the most difficult, time-consuming, and labor-intensive phase of building a data warehouse. Done right, companies can maximize their use of data storage; if not, they can end up wasting millions of dollars storing obsolete and rarely used data. Bestselling author Ralph Kimball, along with Joe Caserta, shows you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation-ready format. Serving as a road map for planning, designing, building, and running the back-room of a data warehouse, this book provides complete coverage of proven, timesaving ETL techniques. Beginning with a quick overview of ETL fundamentals, it then looks at ETL data structures, both relational and dimensional. The authors show how to build useful dimensional structures, providing practical examples of techniques. Along the way youll learn how to: Plan and design your ETL system Choose the appropriate architecture from the many possible options Build the development/test/production suite of ETL processes Build a comprehensive data cleaning subsystem Tune the overall ETL process for optimum performance, Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copies Delivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) process Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse Offers proven time-saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality, Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copies.
LC Classification NumberQA76.9.D37

All listings for this product

Buy it now
Any condition
New
Pre-owned

Ratings and reviews

4.0
2 product ratings
  • 1 users rated this 5 out of 5 stars
  • 0 users rated this 4 out of 5 stars
  • 1 users rated this 3 out of 5 stars
  • 0 users rated this 2 out of 5 stars
  • 0 users rated this 1 out of 5 stars

Would recommend

Good value

Compelling content

Most relevant reviews

  • Need this book to help with the most difficult part of designing a data warehouse, ETLing incoming data.

    Excellent companion to a data warehouse design project. As I began studying the process of designing a data warehouse (DW) it quickly became apparent that more information and specifics were required to understand the most important aspect next to DW design, the data extraction, transformation, and loading of relevant and accurate data into the newly designed DW. It does not matter how excellent the design and architecture of a DW system appears, if the data is not processed properly and accurately that is loaded into the DW from the outset. The dependent processes that show the value of the DW for continued acceptance and funding, generally excluding 'direct' or 'pass through' data sources, such as daily business decision and other adhoc queries, dependent non/relational databases, and other canned enterprise applications will never satisfy its users, while potentially bad decisions can be made from otherwise incorrect data loaded into the DW. There are many details covered that ensure that ETL teams investigate various data scenarios so they can create input data into the DW that can be relied on and changed easily as data sources change or new data sources are incorporated into the critical ETL process ensuring a long life to a very expensive investment.

    Verified purchase: YesCondition: Pre-owned