Data Engineer • AWS | PySpark | Real-Time Pipelines
Building production-grade PySpark ETLs on AWS Glue & EMR on EKS • Migrated mainframe/COBOL workloads for Vanguard
I'm a Data Engineer with production experience delivering high-scale, regulated financial workloads at TCS for Vanguard.
I specialize in modernizing legacy ETL systems: migrating mainframe/COBOL and DB2 jobs to serverless and container-native AWS architectures built on PySpark, Glue, Lambda, Step Functions, and EMR on EKS.
Currently focused on building real-time streaming pipelines, data lakehouses with Apache Iceberg, and Kubernetes-native data platforms using Argo Workflows and Karpenter.
Tata Consultancy Services (TCS)
Vanguard Project — Client-Embedded Team
Indore, India
Built a 50k TPS streaming pipeline (Kinesis → PySpark streaming → SageMaker inference) that cut false positives by 41% (streaming sketch below).
Built a zero-ETL lakehouse with schema evolution, reducing Athena query costs by 68% (lakehouse sketch below).
Ran a 90% Spot Graviton fleet on EMR on EKS provisioned by Karpenter, with Slack-triggered Step Functions orchestration, for 72% cost savings (job-submission sketch below).
Implemented active-passive disaster recovery for 2 TB of analytics data using S3 Cross-Region Replication, DynamoDB Global Tables, and Route 53 health checks, automated with Terraform.
Generated Bedrock Titan embeddings with PySpark on EKS, indexed them into OpenSearch, and surfaced plain-English Slack alerts via LLM summaries (embedding sketch below).
Built a Kubernetes-native data mesh managed with ArgoCD, with isolated scaling and cost tagging.
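
A minimal sketch of the streaming pattern behind the 50k TPS highlight. It assumes the Kinesis Structured Streaming connector available on EMR/Glue (option names vary slightly by connector version); the stream name, event schema, and SageMaker endpoint are hypothetical placeholders, not the production values.

```python
# Sketch only: Kinesis -> PySpark streaming -> SageMaker scoring.
# Stream name, schema, and endpoint name are placeholders.
import json

import boto3
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import StructType, StructField, StringType, DoubleType

spark = SparkSession.builder.appName("txn-stream-scoring").getOrCreate()

event_schema = StructType([
    StructField("txn_id", StringType()),
    StructField("account_id", StringType()),
    StructField("amount", DoubleType()),
])

raw = (
    spark.readStream
    .format("kinesis")                                  # EMR/Glue Kinesis connector
    .option("streamName", "txn-events")                 # placeholder stream
    .option("endpointUrl", "https://kinesis.us-east-1.amazonaws.com")
    .option("startingPosition", "LATEST")
    .load()
)

# The connector exposes the record payload as a binary `data` column.
events = (
    raw.select(F.from_json(F.col("data").cast("string"), event_schema).alias("e"))
    .select("e.*")
)

def score_batch(batch_df, batch_id):
    """Score each micro-batch against a SageMaker endpoint (placeholder name)."""
    runtime = boto3.client("sagemaker-runtime")
    for row in batch_df.toLocalIterator():  # a per-partition call would be used at scale
        runtime.invoke_endpoint(
            EndpointName="txn-anomaly-endpoint",
            ContentType="application/json",
            Body=json.dumps({"amount": row["amount"]}),
        )
        # ...route high-score responses to an alerts sink

(
    events.writeStream
    .foreachBatch(score_batch)
    .option("checkpointLocation", "s3://example-bucket/checkpoints/txn-stream/")
    .start()
    .awaitTermination()
)
```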
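The lakehouse highlight relies on in-place schema evolution; the sketch below assumes Apache Iceberg as the table format (per the profile summary) with the AWS Glue catalog. The catalog, database, and table names and the S3 warehouse path are placeholders.

```python
# Sketch only: Iceberg table in the Glue catalog with metadata-only schema evolution.
# Assumes the iceberg-spark-runtime and Iceberg AWS bundle jars are on the classpath;
# the `lake` catalog, `finance` database, and S3 warehouse path are placeholders.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("iceberg-lakehouse-sketch")
    .config("spark.sql.catalog.lake", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.lake.catalog-impl", "org.apache.iceberg.aws.glue.GlueCatalog")
    .config("spark.sql.catalog.lake.warehouse", "s3://example-lake/warehouse/")
    .config("spark.sql.catalog.lake.io-impl", "org.apache.iceberg.aws.s3.S3FileIO")
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .getOrCreate()
)

# Hidden partitioning: Athena can query the same table without a separate ETL copy.
spark.sql("""
    CREATE TABLE IF NOT EXISTS lake.finance.trades (
        trade_id   STRING,
        account_id STRING,
        amount     DOUBLE,
        trade_ts   TIMESTAMP
    )
    USING iceberg
    PARTITIONED BY (days(trade_ts))
""")

# Schema evolution is a metadata-only change: no data files are rewritten.
spark.sql("ALTER TABLE lake.finance.trades ADD COLUMNS (settlement_ccy STRING)")

# New writes carry the evolved schema; existing snapshots stay readable.
spark.sql("""
    INSERT INTO lake.finance.trades
    VALUES ('t-1001', 'a-42', 250.75, TIMESTAMP '2024-06-01 10:15:00', 'USD')
""")
```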
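In the EMR on EKS setup, the Slack-triggered Step Functions flow ends in an EMR containers StartJobRun call; the boto3 equivalent is sketched below for illustration. The virtual cluster ID, role ARN, release label, and S3 paths are placeholders, and the Karpenter/Spot fleet configuration sits outside this snippet.

```python
# Sketch only: submitting a PySpark job to an EMR on EKS virtual cluster.
# In the real setup Step Functions invokes the same StartJobRun API.
import boto3

emr = boto3.client("emr-containers", region_name="us-east-1")

response = emr.start_job_run(
    name="nightly-positions-etl",
    virtualClusterId="abc123examplecluster",                      # placeholder
    executionRoleArn="arn:aws:iam::123456789012:role/EmrEksJobRole",
    releaseLabel="emr-6.15.0-latest",
    jobDriver={
        "sparkSubmitJobDriver": {
            "entryPoint": "s3://example-bucket/jobs/positions_etl.py",
            "sparkSubmitParameters": (
                "--conf spark.executor.instances=4 "
                "--conf spark.kubernetes.executor.request.cores=2"
            ),
        }
    },
    configurationOverrides={
        "monitoringConfiguration": {
            "s3MonitoringConfiguration": {"logUri": "s3://example-bucket/emr-logs/"}
        }
    },
)
print(response["id"])  # job run ID to poll or surface back to Slack
```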
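For the embeddings pipeline, the core step is a Bedrock Titan embedding call per document. A self-contained sketch follows: the model ID is the public Titan text-embeddings model, while the sample documents are invented and the PySpark UDF wrapper and OpenSearch bulk write are omitted.

```python
# Sketch only: Titan embeddings via Bedrock, prepared for an OpenSearch k-NN index.
# Documents and downstream plumbing are placeholders or omitted.
import json

import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

def embed(text: str) -> list[float]:
    """Return the Titan embedding vector for one piece of text."""
    resp = bedrock.invoke_model(
        modelId="amazon.titan-embed-text-v1",
        contentType="application/json",
        accept="application/json",
        body=json.dumps({"inputText": text}),
    )
    return json.loads(resp["body"].read())["embedding"]

# In the pipeline this runs inside a PySpark pandas UDF / mapPartitions;
# a plain loop keeps the sketch self-contained.
docs = ["GL batch 042 failed schema validation", "Position feed delayed 35 minutes"]
indexed = [{"text": d, "vector": embed(d)} for d in docs]
# ...bulk-index `indexed` into an OpenSearch k-NN index (omitted).
```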
AWS Certified Data Engineer – Associate
Issued 2025
AWS Certified Developer – Associate
Issued 2025
AWS Certified Solutions Architect – Associate
Issued 2025
AWS Certified Cloud Practitioner
Issued 2023
All certifications are publicly verifiable on Credly