A guide to setting up Apache Spark and Airflow on a local machine using Docker. The setup includes the Airflow UI, Jupyter Notebook, the Spark UI, the Spark History Server, and the Spark Shell. The post covers the architecture, the directory structure, and the configuration steps, including cloning the GitHub repository, generating SSH keys, building the Docker images, and running the setup.

6m read time, from blog.devgenius.io
Table of contents
Apache Spark & Airflow in Docker: Step by Step guide
Architecture
