We'll talk about data processing at Uber and how they revamped their ETL platform to make it modular and scalable. Plus, software testing anti-patterns and how to get better at finishing your side projects.

Quastor is dedicated to providing resources for software developers and technology enthusiasts, focusing on software architecture, design patterns, and system scalability. Its articles, case studies, and architecture reviews show how robust, scalable software systems are built. Developers can learn about architectural principles, distributed systems, and microservices design in order to build scalable and maintainable applications.

Quastor Daily

Uber handles its massive data needs with an exabyte-scale ETL system, scaling its data processing with Apache Spark and a custom framework called Sparkle. The revamped architecture focuses on modularity, reliability, and observability, and processes the extensive data generated by Uber's services. Key tools in its ETL processes include Apache Spark, dbt, Apache Airflow, AWS Glue, and Google Cloud Dataflow.

How Uber Built an Exabyte-Scale System for Data Processing

The Architecture of Uber’s ETL Platform