• maiweb v0.1.0
  • ★
  • Feedback

StartDataEngineering

active · last success 2026-06-19 00:07

Visit site ↗ · Feed ↗

  • StartDataEngineering youtube.com channel data-engineering video youtube 2026-06-16 20:51
    ↗

    I’m running a free live session on YouTube covering: Steps you can take today to transition from a data analyst(or similar role) to data engineering. https://www.startdataengineering.com/post/data-analyst-to-data-engineer/

    ▶ Watch on YouTube Opens in a new tab
    I’m running a free live session on YouTube covering: Steps you can take today to transition from a data analyst(or similar role) to data engineering. https://www.startdataengineering.com/post/data-analyst-to-data-engineer/
  • StartDataEngineering youtube.com channel data-engineering video youtube 2026-04-22 22:39
    ↗

    A practical guide to detecting and handling RI violations before your stakeholders see NULL data. Blog: https://www.startdataengineering.com/post/why-referential-integrity-matters/ Code: https://github.com/josephmachado/referential_integrity/tree/main

    ▶ Watch on YouTube Opens in a new tab
    A practical guide to detecting and handling RI violations before your stakeholders see NULL data. Blog: https://www.startdataengineering.com/post/why-referential-integrity-matters/ Code: https://github.com/josephmachado/referential_integrity/tree/main
  • StartDataEngineering youtube.com channel data-engineering video youtube 2026-04-15 19:04
    ↗

    Blog: https://www.startdataengineering.com/post/data-engineering-roadmap/ Trying to upskill as a data engineer? You most likely have come across one of the many data engineering roadmaps that list a long set of tools. If you are: * Wondering how to convince recruiters and...

    ▶ Watch on YouTube Opens in a new tab
    Blog: https://www.startdataengineering.com/post/data-engineering-roadmap/ Trying to upskill as a data engineer? You most likely have come across one of the many data engineering roadmaps that list a long set of tools. If you are: * Wondering how to convince recruiters and non-technical hiring managers to interview you, when you don’t “know” a tool * New to the career and overwhelmed by the proliferation of tools * Worried that LLMs will take away all data jobs This video is for you. 00:00 The problem with DE roadmaps 00:40 Fundamentals and best practices 02:47 Process to learn a new tool 03:11 Example: Apache Iceberg 05:31 Conclusion
  • StartDataEngineering youtube.com channel data-engineering video youtube 2026-04-12 06:50
    ↗

    I’m running a free live session on YouTube covering: 1. Why orchestration exists in the first place (the mental model most tutorials skip) 2. Full-refresh vs incremental pipelines, and how Airflow handles each 3. Managing long dependencies with asset-based scheduling 4. The...

    ▶ Watch on YouTube Opens in a new tab
    I’m running a free live session on YouTube covering: 1. Why orchestration exists in the first place (the mental model most tutorials skip) 2. Full-refresh vs incremental pipelines, and how Airflow handles each 3. Managing long dependencies with asset-based scheduling 4. The Airflow 3.0 features and best practices worth knowing now If you’re trying to land a Data Engineering role or level up, this is the session to catch live. Code: ​https://github.com/josephmachado/airflow-tutorial P.S. I’m opening enrollment for my Data Engineering course on April 26th — 75 seats only. More details coming soon.
  • StartDataEngineering youtube.com channel data-engineering video youtube 2026-04-02 16:59
    ↗

    Code: https://github.com/josephmachado/data-engineering-course-sample/ This is a sample from my upcoming Data Engineering Course

    ▶ Watch on YouTube Opens in a new tab
    Code: https://github.com/josephmachado/data-engineering-course-sample/ This is a sample from my upcoming Data Engineering Course
  • StartDataEngineering youtube.com channel data-engineering video youtube 2025-06-21 18:10
    ↗

    Code: https://github.com/josephmachado/advanced_spark_sql_for_data_engineers/tree/main Full Course: https://josephmachado.podia.com/advanced-spark-sql-workshop-for-data-engineers Feedback Link: https://form.typeform.com/to/f51flAI1 1. Date & Time June 21st, 2025 1:00 PM -...

    ▶ Watch on YouTube Opens in a new tab
    Code: https://github.com/josephmachado/advanced_spark_sql_for_data_engineers/tree/main Full Course: https://josephmachado.podia.com/advanced-spark-sql-workshop-for-data-engineers Feedback Link: https://form.typeform.com/to/f51flAI1 1. Date & Time June 21st, 2025 1:00 PM - 2:00 PM EST (10:00 AM - 11:00 AM PST) 2. What You Will Learn * How to use JOINs to validate data and identify underlying data issues * How to use advanced aggregation functions & check data quality with GROUP BY 3. Who This Workshop Is For 3.1. Prequisites: * SQL basics, especially JOIN & GROUP BY basics (see basics here) * Basic understanding of fact and dimension tables * GitHub codespaces or Docker compose (if running locally) 3.2. Perfect for: * People with some experience in SQL * People who work with SQL regularly 3.3. Not suitable for: * People who don't know SQL basics, especially JOIN & GROUP BY basics (see basics here) * People looking for topics other than advanced JOIN and GROUP BY techniques 4. How to Join * Format: YouTube live workshop with hands-on coding * Participation: You are expected to code along * Interaction: Live Q&A session included * Practice: Exercises provided
  • StartDataEngineering youtube.com channel data-engineering video youtube 2025-05-23 19:21
    ↗

    Creating a pipeline for screen casting with auto subtitle generation with Open AI Whisper code at: https://github.com/josephmachado/scripts/tree/main/recording

    ▶ Watch on YouTube Opens in a new tab
    Creating a pipeline for screen casting with auto subtitle generation with Open AI Whisper code at: https://github.com/josephmachado/scripts/tree/main/recording
  • StartDataEngineering youtube.com channel data-engineering video youtube 2025-02-23 11:48
    ↗

    Data Wrangler Visual Studio Code Extension

    ▶ Watch on YouTube Opens in a new tab
    Data Wrangler Visual Studio Code Extension
  • StartDataEngineering youtube.com channel data-engineering video youtube 2024-10-20 06:11
    ↗

    Signup for my free DE101 course: https://www.startdataengineering.com/email-course/ Blog: https://www.startdataengineering.com/post/use-structs-sql/ Code: https://github.com/josephmachado/adv_data_transformation_in_sql/tree/main/concepts/nested_data_types

    ▶ Watch on YouTube Opens in a new tab
    Signup for my free DE101 course: https://www.startdataengineering.com/email-course/ Blog: https://www.startdataengineering.com/post/use-structs-sql/ Code: https://github.com/josephmachado/adv_data_transformation_in_sql/tree/main/concepts/nested_data_types
  • StartDataEngineering youtube.com channel data-engineering video youtube 2024-10-13 06:10
    ↗

    We will go over: 1. How the workshop is structured, who its meant for, pre-requisites and the format of the workshop. 2. Answer any questions you may have about the workshop.

    ▶ Watch on YouTube Opens in a new tab
    We will go over: 1. How the workshop is structured, who its meant for, pre-requisites and the format of the workshop. 2. Answer any questions you may have about the workshop.
  • StartDataEngineering youtube.com channel data-engineering video youtube 2024-09-29 07:38
    ↗

    Code: https://github.com/josephmachado/de_project Blog: https://github.com/josephmachado/de_project/blob/main/setup-data-project.ipynb Feedback: https://form.typeform.com/to/AyUYk4RZ Description: We will go over the critical steps involved in setting up a data project.

    ▶ Watch on YouTube Opens in a new tab
    Code: https://github.com/josephmachado/de_project Blog: https://github.com/josephmachado/de_project/blob/main/setup-data-project.ipynb Feedback: https://form.typeform.com/to/AyUYk4RZ Description: We will go over the critical steps involved in setting up a data project.
  • StartDataEngineering youtube.com channel data-engineering video youtube 2024-08-11 07:24
    ↗

    Code: https://github.com/josephmachado/adv_data_transformation_in_sql/tree/main?tab=readme-ov-file#advanced-data-transformation-in-sql-workshop Feedback form: https://jrir55dxz0v.typeform.com/to/FH21xsvY Description: SQL is the bread and butter of data engineering! Data...

    ▶ Watch on YouTube Opens in a new tab
    Code: https://github.com/josephmachado/adv_data_transformation_in_sql/tree/main?tab=readme-ov-file#advanced-data-transformation-in-sql-workshop Feedback form: https://jrir55dxz0v.typeform.com/to/FH21xsvY Description: SQL is the bread and butter of data engineering! Data engineers must know how to use SQL to process data effectively. Many job requirements ask for "advanced SQL," but there is no clear consensus on what that means. Understanding the patterns of data processing in SQL empowers you to effectively tackle any problem. This webinar will provide you with the 'what/Why/How' of popular SQL techniques (Windows and CTEs), ensuring that your code is not only easy to read and maintain, but also practical and applicable in real-world scenarios. Are you ready to level up your data skills? I am excited to invite you to my upcoming free webinar on Advanced data processing in SQL workshop! 📅 Date: August 10th, 2024 ⏰ Time: 1 PM - 3 PM EST (10 AM - 1 PM PST) 📍 Where: YouTube live link​ 💰 Cost: Free Prerequisites: 1. SQL basics (SELECT, WHERE, JOINs & GROUP BY) 2. Setup: https://github.com/josephmachado/adv_data_transformation_in_sql/tree/main?tab=readme-ov-file#prerequisites In this hands-on workshop, you'll learn: 1. How to use window functions effectively 2. How to write maintainable SQL with CTEs 3. Common analytical queries and SQL templates to answer them Who Should Attend? 1. Anyone looking to improve their SQL skills 2. Anyone interested in understanding the window functions, CTEs, and advanced data processing techniques in SQL 3. You are familiar with SQL basics (SELECT, WHERE, JOINs & GROUP BY)
  • StartDataEngineering youtube.com channel data-engineering video youtube 2024-08-01 18:34
    ↗

    ▶ Watch on YouTube Opens in a new tab

    No full content extracted yet.

    Extracting…
  • StartDataEngineering youtube.com channel data-engineering video youtube 2024-05-31 19:09
    ↗

    Demo for blog at: https://www.startdataengineering.com/post/dbt-data-build-tool-tutorial/#2-dbt-the-t-in-elt Code at https://github.com/josephmachado/simple_dbt_project/

    ▶ Watch on YouTube Opens in a new tab
    Demo for blog at: https://www.startdataengineering.com/post/dbt-data-build-tool-tutorial/#2-dbt-the-t-in-elt Code at https://github.com/josephmachado/simple_dbt_project/
  • StartDataEngineering youtube.com channel data-engineering video youtube 2021-03-15 23:15
    ↗

    Setting up an ELT data-ops workflow with multiple environments for developers is often extremely time consuming. What if there was a way to speed up this process, so that you could concentrate on modeling your data and delivering value to your end users? The good news is that...

    ▶ Watch on YouTube Opens in a new tab
    Setting up an ELT data-ops workflow with multiple environments for developers is often extremely time consuming. What if there was a way to speed up this process, so that you could concentrate on modeling your data and delivering value to your end users? The good news is that there is a way. You can leverage dbt cloud to setup an ELT data-ops workflow in a very short time. In this post, we cover how to setup a data-ops workflow for an ELT system. We will go over how to setup dbt, snowflake, CI and schedule jobs. This data-ops workflow can be easily modified and built upon as your data team's needs evolve. dbt tutorial: https://youtu.be/gtZ8h8Aynmw
  • End of feed
Maibook — your private personalized AI community
  • rcanand.com
  • mlaillc.com
  • @rcanand (X)
  • LinkedIn
  • Feedback
  • Credits