Open to AI/ML Opportunities

|

12+ years applying statistical modeling, causal inference, and machine learning across federal, telecom, and financial sectors. Specializing in Agentic AI, RAG architectures, and end-to-end ML deployment.

LangChain Agentic AI RAG LlamaIndex GCP / AWS MLOps
12+ Years Experience
85% Hallucination Reduction
$10M+ Business Impact

About Me

Data Science Manager with 12+ years applying statistical modeling, causal inference, and machine learning to drive data products in federal, telecom, and financial sectors. Expertise in end-to-end development and deployment of AI solutions including LLM-based architectures, RAG systems, forecasting models, and cohort and churn analyses.

Proven ability to partner with engineering, product, and sales teams to shape analytics strategy and deliver actionable insights โ€” reducing hallucination risk by 85% and achieving 90% accuracy in fee forecasting.

๐Ÿง  Agentic AI & LLMs

RAG design, multi-agent orchestration, LangChain, LangGraph, LlamaIndex, CrewAI, AutoGen

๐Ÿข Industries

Federal Government, Telecom, Healthcare, Hospitality, Financial Services

โšก MLOps & Cloud

GCP BigQuery, AI Platform, AWS, 50+ Airflow DAGs, CI/CD, ML governance frameworks

๐Ÿ“Š Leadership

Data Science Manager, team mentoring, executive stakeholder communication, analytics strategy

Technical Proficiency

Agentic AI & LLMs
Python / ML Frameworks
Statistical Modeling
Cloud & MLOps (GCP/AWS)
Data Visualization

Featured Projects

๐Ÿค–

Agentic RAG โ€” Medical Literature System

LangChain Llama 3.2 Chroma Agentic AI

Engineered an autonomous multi-step RAG system capable of synthesizing 14,000+ pages of medical literature (~21K chunks). Implemented self-directed context grounding with Chroma vector database and automated testing pipelines.

85% Hallucination Reduction
21K+ Chunks Indexed
๐Ÿฅ

Medicaid A/B Testing & Experimentation

A/B Testing Causal Inference Statistics

Designed and executed a controlled A/B test for a national Medicaid client, doubling click-through rates and generating an estimated $8โ€“10M in retained managed-care coverage value through data-driven experimentation design.

2x CTR (15% โ†’ 30%)
$10M Coverage Value
๐Ÿ“ˆ

Financial Fee Forecasting Platform

Time Series Ensemble ML Forecasting

Built high-precision predictive models for distinct financial portfolios using advanced time series methods and ensemble techniques, deployed across multi-domain resource and capacity planning challenges for federal clients.

90% Forecast Accuracy
$3.2M Net-New Revenue
๐Ÿ“ก

Verizon Dispatch Prediction Model

XGBoost Optimization Operations

Spearheaded a predictive dispatch reduction model at Verizon, classifying slow/moderate/fast dispatches to route jobs by efficiency. Improved accuracy 15% over baseline and significantly reduced operational costs.

25% Dispatch Reduction
+15% Accuracy vs Baseline

Technical Expertise

๐Ÿง 

Agentic AI & LLMs

LangChain LangGraph LlamaIndex CrewAI AutoGen RAG LangSmith Chroma / Pinecone
๐Ÿค–

Machine Learning

XGBoost Random Forest LightGBM Neural Networks Time Series Scikit-learn TensorFlow
๐Ÿ“

Statistical Methods

Causal Inference A/B Testing Hypothesis Testing Regression Analysis Clustering SMOTE
โ˜๏ธ

Cloud & MLOps

GCP BigQuery AWS Airflow Docker Spark CI/CD n8n / Retool
๐Ÿ’ป

Programming

Python SQL R Scala Bash PowerShell
๐Ÿ“Š

Data & Visualization

Tableau Power BI Plotly Matplotlib Hadoop ETL Pipelines

Skills Network

Interactive map of skills and projects โ€” hover to explore connections, drag to rearrange

LLMs & AI Machine Learning Statistics Cloud & MLOps Programming Projects

Hover to highlight connections  ยท  Drag nodes  ยท  Click to pin

AI Playground

Live AI-powered demos โ€” all running against real resume data

Paste any job description and see how Sai's profile matches the role.

Select a project for a detailed technical breakdown generated on demand.

Questions hiring managers actually ask โ€” click any to get an instant answer.

Experience

Mar 2020 โ€“ Present

Data Science Manager

Deloitte Consulting LLP

  • Engineered an autonomous multi-step RAG system (LangChain + Llama 3.2) synthesizing 14,000+ pages of medical literature, reducing hallucination risk by 85%
  • Designed A/B test for national Medicaid client โ€” raised CTR from 15% to 30%, generating $8โ€“10M in retained managed-care coverage value
  • Built high-precision forecasting models for federal financial portfolios achieving 90% accuracy using ensemble time series techniques
  • Architected scalable data lake on GCP (BigQuery, AI Platform) and orchestrated 50+ Airflow DAGs, cutting ETL processing time by 60%
  • Engineered fraud risk models with 70% detection precision on highly skewed government datasets using SMOTE-augmented imbalanced learning
  • Delivered data-driven strategies generating $3.2M net-new revenue; established CI/CD practices and ML governance frameworks, accelerating time-to-production by 30%
Nov 2019 โ€“ Mar 2020

Data Analyst

Hilton

  • Implemented LightGBM Regressor with 90% accuracy to predict per-property revenue for real-time financial planning
  • Designed robust ARIMA forecasting model enabling real-time revenue predictions and strategic planning
  • Automated SQL pipelines for dataset creation, reducing data preparation time by 50%
Mar 2017 โ€“ Nov 2019

Statistical Data Analyst

Verizon

  • Developed predictive dispatch reduction model achieving 25% decrease in unnecessary dispatch hours, improving resource allocation across the logistics team
  • Improved model accuracy by 15% over baseline by classifying slow/moderate/fast dispatches for efficiency-based job routing
  • Executed A/B testing on design changes โ€” 15% increase in sign-ups and 20% reduction in onboarding drop-off
  • Analyzed FiOS cancellation drivers using customer feedback analytics, improving satisfaction scores by 15%
Jul 2015 โ€“ May 2016

Research Data Analyst

George Mason University

  • Built classification ML models to identify optimal restaurant locations for Golden Corral and Ovation brands โ€” identified 4 prime expansion locations
  • Developed predictive models for currency exchange rate forecasting using time series analysis
2012 โ€“ 2014

Data Analyst / Sr. Project Engineer

Regalix & IBM โ€” India

  • Maximized AdWords campaign ROI by 20% through keyword refinement and data-driven campaign management at Regalix
  • Implemented SQL Server performance tuning and query optimization at IBM, improving system workload efficiency

Education & Certifications

Education

๐ŸŽ“

Post Graduate Certificate

Generative AI for Business Applications

The University of Texas at Austin

2026 (In Progress)
๐ŸŽ“

Master of Science

Data Analytics Engineering

George Mason University

2016
๐ŸŽ“

Bachelor of Technology

Computer Science Engineering

JNTU

2011

Certifications

โ˜๏ธ

Google Cloud Professional

Machine Learning Engineer
โ˜๏ธ

AWS

Machine Learning Specialty
๐Ÿ“Š

Tableau Desktop

Specialist
๐Ÿ“Š

Power BI

Data Analyst

Publications

Peer-reviewed research spanning AI, machine learning, and applied computer science โ€” including construction risk management, generative AI applications, and drone-based crop health monitoring.

Let's Connect

Get In Touch

I'm always interested in discussing new opportunities, innovative AI/ML projects, and potential collaborations in data science and generative AI.

Phone

(270) 421-3344

LinkedIn

Connect with me