Data Engineer
Instagram Data Analysis using Python and Power BI
ETL Process Using Airflow and Docker
This project provides a Docker-based setup to explore advanced PySpark DataFrame concepts using Jupyter notebooks. The environment includes all necessary dependencies, making it easy to get started with PySpark for data processing and analysis.
Apache PySpark by Example
Code, quizzes, and notes from the DeepLearning.AI Data Engineering Professional Certificate specialization, covering the practical projects, the skills developed, and the capstone project.