Raj Patel

C003

Principal Data Engineer

Bangalore, India

Summary

Data engineer with 6 years of experience building large-scale data platforms. Specialized in real-time streaming pipelines and ML infrastructure. Built systems processing 50 TB of data daily.

Years Experience: 6.3

Expected Salary: $95,000

Work Mode: Hybrid

Skills

Languages: Python, Scala, Java, SQL
Frameworks: Apache Spark, Kafka, Airflow, Flink
Databases: Snowflake, PostgreSQL, Cassandra, Hive
Cloud: AWS, GCP
Tools: Docker, Kubernetes, MLflow, dbt

Work Experience

Lead Data Engineer

Flipkart

2021-04 - present

45 months

Leading data platform team of 8 engineers. Built real-time recommendation pipeline processing 100M+ events/day.

Python, Scala, Apache Spark, Kafka, Airflow, Snowflake, AWS

Senior Data Engineer

Infosys

2019-06 - 2021-03

21 months

Designed ETL pipelines for Fortune 500 banking client. Reduced data processing time from 8 hours to 45 minutes.

Python, Java, Apache Spark, Hive, HDFS, Airflow, GCP

Data Engineer

TCS

2018-07 - 2019-05

10 months

Built data warehouse for telecom client. Migrated legacy Oracle systems to Hadoop ecosystem.

Python, SQL, Hadoop, Hive, Sqoop

Projects

StreamETL (open-source)

6 months

Framework for building streaming ETL pipelines with exactly-once semantics. Used by 50+ companies.

Scala, Apache Spark, Kafka, Delta Lake, GitHub

DataQuality Monitor (personal)

4 months

Automated data quality monitoring tool with Slack/PagerDuty alerting integration.

Python, Great Expectations, Airflow, PostgreSQL, GitHub

Education

Master's in Data Science

IIT Bombay

Graduated 2018 • GPA: 3.9

Certifications

  • AWS Data Analytics Specialty
  • Databricks Certified Data Engineer Professional