This curated list of books covers data engineering, from beginner-friendly introductions to specialized and advanced topics. Highlights include foundational texts such as 'The Data Warehouse Toolkit' by Ralph Kimball and practical guides such as 'Designing Data-Intensive Applications' by Martin Kleppmann. The list also features resources on modern technologies and practices, including data pipelines, cloud patterns, and machine learning with Spark.
