Skip to content

data-engineering-helpers/data-engineering-skilling

Repository files navigation

Knowledge Sharing - Data Engineering skilling

Table of Content (ToC)

Created by gh-md-toc

Overview

This project aims at collecting in a single place training resources for the up skilling of data engineers.

Even though the members of the GitHub organization may be employed by some companies, they speak on their personal behalf and do not represent these companies.

References

Data Engineering helpers

Articles and posts

Spark on Luminousmen on Substack

Anatomy of Spark applications

Data Engineering resources by Ahmed Alsaket

Article - 2025-08

Learning

  1. Master Python: https://lnkd.in/e5rCbvP8
  2. Learn SQL: https://lnkd.in/efMKFkfX
  3. Learn MySQL: https://lnkd.in/efk-Mi3c
  4. Learn MongoDB: https://lnkd.in/eMKPWtqX
  5. Dominate PySpark: https://lnkd.in/exwA2hKz
  6. Learn Bash, Airflow & Kafka: https://lnkd.in/eyN6u2yd
  7. Learn Git & GitHub: https://lnkd.in/eX_Q8s99
  8. Learn CICD basics: https://lnkd.in/epKGivFY
  9. Decode Data Warehousing: https://lnkd.in/eKnVbFAB
  10. Learn DBT: : https://lnkd.in/eG9eaEuE
  11. Learn Data Lakes: https://lnkd.in/eQ9xxAJT
  12. Learn DataBricks: https://lnkd.in/ePZpCv86
  13. Learn Azure Databricks: https://lnkd.in/eBij4akJ
  14. Learn Snowflake: https://lnkd.in/erETmtFU
  15. Learn Apache NiFi: http://bit.ly/43btwYy
  16. Learn Debezium: http://bit.ly/3K6W5gL

Portfolio with 5 must-try projects

  1. Reddit ETL Pipeline - https://lnkd.in/ekmgzGc8
  2. Surfline Dashboard - https://lnkd.in/e6AdaDzz
  3. Finnhub Streaming Data Pipeline - https://lnkd.in/eCF5kZvE
  4. Audiophile End-To-End ELT Pipeline - https://lnkd.in/ercYzXtX
  5. Streamify - https://lnkd.in/ePiEwH5k

Data Engineering articles by Mayurkumar Surani

Data Engineering Q&A by Sachin Chandrashekhar

Data Engineering on DataBricks by Jakub Lasak

Data Engineering illustrations by Riya Khandelwal

Books

Fundamentals of Data Engineering

Designing Data Intensive Applications

Curricula

DataBricks Growth Path

DataExpert

Data Engineering training

Transition from data science to data engineering

Specific topics

Python

Data Analysis with SQL

Duplication removal

Web sites

Data Engineering toolkit by Second brain

About

Skilling/training resources for data engineers

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published