Best of Data Architecture2025

  1. 1
    Article
    Avatar of techworld-with-milanTech World With Milan·49w

    What I learned from the book Designing Data-Intensive Applications

    A comprehensive review of Martin Kleppmann's "Designing Data-Intensive Applications" after two complete readings. The book provides foundational knowledge about distributed systems, covering reliability, scalability, and maintainability principles. Key topics include data models (relational vs document vs graph), storage engines (B-trees vs LSM-trees), replication strategies, partitioning, transactions, and stream processing. The review highlights the book's strengths in explaining trade-offs and connecting theory to practice, while noting limitations like outdated examples and dense theoretical content. Recommended for experienced engineers working with data-intensive systems.

  2. 2
    Article
    Avatar of medium_jsMedium·1y

    Building a modern Data Warehouse from scratch

    Learn how to build a modern data warehouse using SQL Server. The project guides you through designing data architecture with Medallion Architecture, setting up ETL pipelines, developing data models, and creating data analytics and reporting solutions. Key steps include setting up project tools, implementing data quality checks, and creating bronze, silver, and gold layers for data processing and hierarchy. Resources and detailed instructions are provided for each phase.

  3. 3
    Article
    Avatar of swizecswizec.com·1y

    I was wrong about databases

    The post discusses misconceptions about database performance with arrays and clarifies how PostgreSQL handles wide table rows using TOAST records and tombstones. It emphasizes the importance of measuring performance based on specific data access patterns, noting that modern SSDs make data packing less critical.

  4. 4
    Article
    Avatar of bytebytegoByteByteGo·50w

    EP166: What is Event Sourcing?

    Event sourcing is a design paradigm that stores events leading to state changes rather than current state data, providing determinism and recoverability. The approach uses an append-only event store with sequenced events to rebuild application state. The newsletter also covers software deployment pipelines, data lake architecture, Netflix's distributed counter implementation, and TCP handshake mechanics.

  5. 5
    Video
    Avatar of youtubeYouTube·1y

    SQL Data Warehouse from Scratch | Full Hands-On Data Engineering Project

    Learn how to build a modern SQL data warehouse from scratch, incorporating real-world practices used in companies like Mercedes-Benz. The project covers data architecture design, ETL processes, and data modeling basics. By the end, you'll have a professional portfolio project to showcase your skills.

  6. 6
    Article
    Avatar of programmingdigestProgramming Digest·47w

    Which Data Architecture Should I Choose for My Workplace? — A Data Engineer’s Approach

    A comprehensive guide comparing four major data architecture approaches: Data Warehouse, Data Lake, Data Lakehouse, and Data Mesh. The article explains when to use each approach, their advantages and challenges, and provides platform recommendations. It focuses on the Medallion Architecture with its Bronze, Silver, and Gold layers for modern data warehouse design, emphasizing the importance of requirement analysis and proper architectural selection based on data types, analytical needs, and organizational structure.

  7. 7
    Article
    Avatar of tdsTowards Data Science·1y

    4 Things I Learned Building a Data Platform using Medallion Architecture in the Last 4 Years

    Celebrating four years of working with a medallion architecture data platform, the author shares key lessons learned. These include the importance of flexibility in applying the architecture, the potential need for additional data layers, the significance of proper data cataloging, and the balance between flexibility and maintainability. These insights aim to help others working with similar data organization approaches.