Uber successfully migrated 18,000 Hive ETL workflows, which generate 5 million queries per month, to Spark SQL, achieving a 50% reduction in runtime and resource usage. The migration involved building three core services: Query Translation Service for converting HiveQL to Spark SQL, Data Validation Service for ensuring output

15m read time. From uber.com
Table of contents
Motivation
Architecture
Migration Strategy
Automated Migration Service (AMS)
Query Translation Service
Data Validation Service
Bridging the Gap Between Hive and Spark
