ANANT SHARMA

Data Engineering & AI Consultant

Building enterprise data platforms & AI-powered solutions

What I Do

Data Engineering

13+ years architecting enterprise data platforms, data lakes, CI/CD, data governance and ETL/ELT pipelines at scale. Delivered 2.5 PB migrations with $1M+ annual savings.

AI & Machine Learning

Building agentic AI systems, multi-agent workflows, and Vision AI platforms. Winner of ProVision AI Competition. Expert in LLMs, RAG, and computer vision.

Cloud Architecture

AWS & Azure certified architect. Designing multi-cloud solutions, infrastructure as code, and serverless architectures for enterprise-scale systems.

TECHNICAL STACK

Data Platforms

Snowflake • AWS Redshift
BigQuery • Data Lakes
ETL/ELT Pipelines
Apache Spark • PySpark
Apache Kafka

Cloud Services

AWS (Solutions Architect)
Azure (Certified)
Google Cloud Platform
Terraform • Infrastructure as Code

Development

Python • SQL • JavaScript
PySpark • Apache Spark
Git • CI/CD • Docker

AI & Machine Learning

OpenAI • Claude • Cortex
Azure AI Foundry • RAG
LangChain • Computer Vision
Spark ML • SageMaker

FEATURED PROJECTS

Snowflake Summit 2024 Speaker

Invited to present at Snowflake Summit 2024 in San Francisco, CA - sharing insights on architecting and executing a massive 2.5 PB data migration to Snowflake that delivered $1M+ annual cost savings. This presentation showcased real-world challenges and solutions in enterprise-scale data transformation, performance optimization, and cloud cost management.

2500+ Attendees 2.5 PB Migration $1M+ Savings San Francisco
SARAS AI Robot

SARAS AI Robot

Built an AI-powered robot with computer vision and natural language processing. Features autonomous navigation, object recognition, and interactive conversations using OpenAI GPT.

AI/ML Computer Vision Robotics
Home Lab Server Build

Home Lab Server Build

Custom home lab server setup for running local AI models, data pipelines, and development environments. Complete infrastructure setup with virtualization and container orchestration.

Infrastructure Virtualization Self-Hosted
Smart Home Automation

Smart Home Automation

Comprehensive home automation system with IoT sensors, voice control, and automated workflows. Integrated with AI for predictive automation and energy optimization.

IoT Home Assistant Automation

PROFESSIONAL JOURNEY

Principal Data Engineer

ProCogia | Jan 2021 - Present
  • Invited to present at Snowflake Data Summit 2024 in San Francisco on large-scale data migration
  • Architected and executed 2.5 PB data migration to Snowflake, achieving $1M+ annual cost savings
  • Designed and built data lake architecture from scratch on AWS + Snowflake with Kafka integration
  • Optimized ML model performance by 4000x, reducing cloud costs significantly by refactoring Scikit-learn to Sedona & Spark ML
  • Implemented anonymization pipeline using AWS EMR + Snowpark for data lake standardization
  • Orchestrated complex ETL workflows using Apache Airflow and Azure Data Factory

Lead Consultant

HCL Technologies | Jan 2019 - Dec 2020
  • Designed dimensional data models for AWS Redshift and fraud detection systems
  • Developed data modeling solutions for Group Risk Model for leading Malaysian banking institutions
  • Automated data validation processes using Python, reducing processing time from days to hours
  • Created regulatory reporting solutions for Bank Negara compliance

Senior Consultant

PwC | Apr 2016 - Jan 2019
  • Implemented AWS data lakes using S3, Glue, and Athena for scalable ad-hoc reporting
  • Built real-time data pipelines using Kinesis Firehose and Google Firebase
  • Deployed PwC data models on ERwin Data Modeler for BFSI domain clients
  • Designed NoSQL data models for MongoDB using Hackolade
  • Provided on-site consulting in South Africa for CTR Automation & Assurance projects

Project Engineer

Wipro | Apr 2014 - Apr 2016
  • Served as ETL Lead for major Data Warehousing project with 39 source systems and 20+ years of historical data
  • Utilized SAS Enterprise Suite (DI Studio, Enterprise Guide, Web Report Studio) and Teradata FSLDM
  • Created reusable ETL components for Oracle, DB2, and SQL Server integration (awarded SPOT Award)
  • Developed automation scripts adopted as BVM process standard across all EDW projects

CERTIFICATIONS

AWS Certified

Solutions Architect
Data Analytics Specialty

View Credential

Snowflake Certified

SnowPro Core
Data Cloud Platform

View Credential

Azure Certified

Data Fundamentals
Cloud Platform

View Credential

IBM Certified

Python Core
Data Analytics

View Credential

EDUCATION

Bachelor of Technology

Information Technology

Dr. A.P.J. Abdul Kalam Technical University

LET'S COLLABORATE

Ready to transform your data infrastructure? Let's discuss how we can work together on your next big project.