Best of Data Engineering — February 2024
- 1
- 2
Taro·2y
The Fundamentals of Data Engineering - Preface + Chapter 1: Data Engineering Described
This post provides an overview of the book 'Fundamentals of Data Engineering', discussing the motivations behind the book and the importance of data engineering in relation to data science and machine learning. It covers the Data Science Hierarchy of Needs, the Data Engineering Lifecycle, and the skills and activities of a Data Engineer. The post also discusses the stages of Data Maturity and the different types of Data Engineers. The next blog post will cover Chapter 2 of the book.
- 3
Python in Plain English·2y
Setting up PostgreSQL and PgAdmin4 using Docker: A Simplified Approach.
This post provides a simplified approach to setting up PostgreSQL and PgAdmin4 using Docker. It explains the benefits of containerization and guides readers through the process of installing Docker, running the Hello World container, and setting up PostgreSQL and PgAdmin4 containers. The post emphasizes the ease and efficiency of managing databases with containerization.
- 4
- 5
KDnuggets·2y
Collection of Free Courses to Learn Data Science, Data Engineering, Machine Learning, MLOps, and LLMOps
A collection of free courses for learning data science, data engineering, machine learning, MLOps, and LLMOps is provided. The courses are self-paced and community-based, offering valuable resources for beginners and experienced professionals.
- 6
MotherDuck·2y
DuckDB & Python : end-to-end data engineering project
This post discusses an end-to-end data engineering project using DuckDB and Python. It explores the architecture, ingestion pipeline, and sinking data using DuckDB. The post also provides instructions on how to fetch data from PyPi using Python and DuckDB.
- 7