Open to opportunities

I build the
data pipelines
that power decisions.

Data Engineer with 2+ years at Ipsos, processing 5M+ records across global datasets. I design ETL systems in Python, SQL, and Spark that teams actually rely on. AWS certified. Master's in Data Analytics. Based in Dublin.

pipeline_status.py
# rohit_yadav.pipeline_stats
records_processed = "5M+"
time_saved = "40%"
datasets_managed = 15
countries = 8
error_reduction = "25%"
 
stack = [
"Python", "SQL", "Spark",
"dbt", "AWS", "Airflow"
]
 
status = "ready_to_build" |
scroll
Python/SQL/Apache Spark/dbt/AWS/Airflow/ETL Pipelines/Data Modelling/Kafka/Power BI/Python/SQL/Apache Spark/dbt/AWS/Airflow/ETL Pipelines/Data Modelling/Kafka/Power BI/
01

Experience

Aug 2022 — Sep 2024 Mumbai, India

Data Engineer-I @ Ipsos

  • Designed and maintained ETL pipelines processing 5M+ records from global market research surveys using Python, SQL, and Apache Spark
  • Implemented dbt models to standardise data transformations, improving documentation, testing, and repeatability
  • Automated reporting workflows — reduced manual data preparation time by 40%
  • Built data validation frameworks across 15+ datasets, reducing downstream errors by 25%
  • Supported migration of on-premise infrastructure to AWS (S3, Lambda, EC2)
  • Developed reusable Spark jobs and Python scripts adopted as the team's core toolkit
PythonSQLSparkdbtAWSPandas
May 2025 — Present Dublin, Ireland

Sales Assistant @ Eurospar

  • Managing inventory and customer operations while completing Master's in Data Analytics
  • Developed communication and problem-solving skills in a fast-paced retail environment
02

Projects

Data Pipeline

Web Retail Revenue Data Warehouse

End-to-end data warehouse and BI solution for online retail — star-schema design, SSIS ETL orchestration, and Tableau dashboards.

Analytics

Customer Churn Insights Dashboard

Modelled customer lifecycle data with dimensional modelling. Built interactive Power BI dashboards tracking churn, retention, and revenue impact.

Automation

Survey Reporting Automation

Automated cleaning, aggregation, and scheduled publishing of survey data — Excel and PDF reports refreshed on a cadence for stakeholders.

03

Skills

Data Engineering

Apache Spark / PySpark, ETL/ELT Pipelines, dbt, Apache Airflow, Apache Kafka, Data Modelling (Star & Snowflake), Batch & Stream Processing

Cloud & Infrastructure

AWS (S3, EC2, Lambda, Redshift), Databricks, Git & GitHub, CI/CD, Linux, Shell Scripting, Docker basics

Programming

Python (Pandas, NumPy, Boto3), SQL (PostgreSQL, MySQL, SQL Server), PySpark, Scala (basic), Java (Core)

Analytics & Visualisation

Power BI, Tableau, Looker Studio, DAX & Power Query, Excel Dashboards, Data Visualisation, A/B Testing

04

Education & Certs

2025 — 2026

Master's in Data Analytics

Dublin Business School

Pipeline Architecture, ML, Cloud Computing, Data Governance
2019 — 2022

BSc Computer Science

Mumbai University

Core CS, Databases, Software Engineering, Java
AWS Certified Cloud Practitioner
Core Java Certification
Streaming Pipelines — Apache Kafka
Career Skills in Data Analytics
05

Let's connect

Open to Data Engineer & Data Analyst roles in Dublin and remote. Got a data problem? Let's chat.