I’m running a free live session on YouTube covering: Steps you can take today to transition from a data analyst(or similar role) to data engineering. https://www.startdataengineering.com/post/data-analyst-to-data-engineer/
active · last success 2026-06-19 00:07
I’m running a free live session on YouTube covering: Steps you can take today to transition from a data analyst(or similar role) to data engineering. https://www.startdataengineering.com/post/data-analyst-to-data-engineer/
A practical guide to detecting and handling RI violations before your stakeholders see NULL data. Blog: https://www.startdataengineering.com/post/why-referential-integrity-matters/ Code: https://github.com/josephmachado/referential_integrity/tree/main
Blog: https://www.startdataengineering.com/post/data-engineering-roadmap/ Trying to upskill as a data engineer? You most likely have come across one of the many data engineering roadmaps that list a long set of tools. If you are: * Wondering how to convince recruiters and...
I’m running a free live session on YouTube covering: 1. Why orchestration exists in the first place (the mental model most tutorials skip) 2. Full-refresh vs incremental pipelines, and how Airflow handles each 3. Managing long dependencies with asset-based scheduling 4. The...
Code: https://github.com/josephmachado/data-engineering-course-sample/ This is a sample from my upcoming Data Engineering Course
Code: https://github.com/josephmachado/advanced_spark_sql_for_data_engineers/tree/main Full Course: https://josephmachado.podia.com/advanced-spark-sql-workshop-for-data-engineers Feedback Link: https://form.typeform.com/to/f51flAI1 1. Date & Time June 21st, 2025 1:00 PM -...
Creating a pipeline for screen casting with auto subtitle generation with Open AI Whisper code at: https://github.com/josephmachado/scripts/tree/main/recording
Signup for my free DE101 course: https://www.startdataengineering.com/email-course/ Blog: https://www.startdataengineering.com/post/use-structs-sql/ Code: https://github.com/josephmachado/adv_data_transformation_in_sql/tree/main/concepts/nested_data_types
We will go over: 1. How the workshop is structured, who its meant for, pre-requisites and the format of the workshop. 2. Answer any questions you may have about the workshop.
Code: https://github.com/josephmachado/de_project Blog: https://github.com/josephmachado/de_project/blob/main/setup-data-project.ipynb Feedback: https://form.typeform.com/to/AyUYk4RZ Description: We will go over the critical steps involved in setting up a data project.
Code: https://github.com/josephmachado/adv_data_transformation_in_sql/tree/main?tab=readme-ov-file#advanced-data-transformation-in-sql-workshop Feedback form: https://jrir55dxz0v.typeform.com/to/FH21xsvY Description: SQL is the bread and butter of data engineering! Data...
No full content extracted yet.
Extracting…
Demo for blog at: https://www.startdataengineering.com/post/dbt-data-build-tool-tutorial/#2-dbt-the-t-in-elt Code at https://github.com/josephmachado/simple_dbt_project/
Setting up an ELT data-ops workflow with multiple environments for developers is often extremely time consuming. What if there was a way to speed up this process, so that you could concentrate on modeling your data and delivering value to your end users? The good news is that...