Open to work

Apaar
Saroj

AI Engineer · Data Scientist · ML Engineer

MS in Information Technology at Arizona State (4.0 GPA). 3+ years in analytics, machine learning, and AI. I build data and ML systems end to end, from raw data through modeling to deployment.

3+Years Industry
4.0GPA at ASU
AIFirst Focus
AS
Apaar Saroj
AI Engineer · New York, NY
ML & Statistical ModelingAdvanced
SHAP & ExplainabilityAdvanced
Data Engineering & ETLAdvanced
Analytics & SQLExpert
Apaar Saroj

Analytics roots.
AI trajectory.

I have 3+ years of experience in analytics, machine learning, and AI, and I recently finished my MS in Information Technology at Arizona State with a 4.0 GPA. I'm targeting AI, Data Science, and ML Engineering roles where I can build end-to-end pipelines and Gen-AI products, and bring systems thinking to problems that actually matter.

My capstone is an end-to-end ML pipeline that scores healthcare financing efficiency across 52 countries. I used Shannon entropy to weight 13 indicators objectively across WHO, OECD, and World Bank data, modeled the drivers with fixed-effects regression and SHAP-explained Random Forests, and shipped it as a live, Dockerized Streamlit dashboard with a policy simulator.

Before grad school I spent three years working with production data. At EXL I built and ran daily data engineering scripts processing 100K+ financial transactions for a regulated portfolio of around 10 million households, plus the Power BI reporting leadership used for compliance and planning.

At Team Computers I built Python predictive models on sensor data to find the drivers of equipment downtime, which fed into a 20% drop in unplanned downtime and a 15% cut in maintenance costs.

I'm strongest in Python and SQL, classical ML, and the statistical side of the work, with a growing focus on Gen-AI. If you're working on a hard data problem in any domain, I'd be glad to talk.

🎓
Arizona State UniversityMS Information Technology · 4.0 GPA · May 2026
Production Data Experience3+ years across data engineering, predictive modeling, and BI at EXL and Team Computers
🤖
End-to-End ML & AIFrom data sourcing and modeling through SHAP explainability to a deployed, Dockerized dashboard
📍
New York, NYOpen to remote, hybrid, or on-site in the US

Work that ships.

Team Computers · Wind Energy

Predictive Maintenance & Turbine Analytics

Built Python predictive models on wind turbine sensor data to identify the drivers of unplanned downtime, integrating weather and IoT feeds to catch early failure signals and improve energy production planning.

Outcomes
↓ 20% downtime · ↓ 15% maintenance costs · ↑ 15% energy efficiency
PythonMLTableauIoTForecastingSQL
ASU · IFT 533 · Data VisualizationTableau

LendingClub Credit Risk Analytics Dashboard

3-phase end-to-end analytics project on 14.5M LendingClub loan records (2007–2018). Built an interactive Tableau dashboard for credit risk analysts covering default patterns, rejection analysis, DTI behavior, and geographic risk distribution across all US states.

Key Finding
20% default rate · Grade A→G correlates directly with rising default risk
TableauTableau PrepCredit RiskData VizEDA
◈ Live Dashboard
ASU · NLP ExplorationResearch

Cross-Domain Tone Classification

Studied whether you can train BERT models on incompatible label systems, one labeled by tone and one by emotions, and still map them to a shared target using a small calibration set. The finding: domain alignment matters more than dataset size.

Key Insight
Smaller domain-aligned dataset (2.6k) beat the larger misaligned one (8k), 54.6% vs 46.5%
BERTPyTorchHuggingFaceNLPTransfer Learning

Where I've built.

Sep 2022 — Jul 2024
Business Analyst
EXL · Gurugram, India
  • Embedded with the data team for a Big Six UK energy provider, building and running 20+ daily data engineering scripts that processed over 100K financial transactions across cash matching, unallocated transactions, and balance reconciliation, for a regulated portfolio serving around 10 million households.
  • Built 6+ Power BI dashboards for regulatory compliance, cash flow tracking, and final credits reporting, used directly by finance leadership for strategic planning and audit readiness.
  • Wrote and automated recurring financial reports in SQL and Python, cutting turnaround on time-sensitive queries from 24 hours to under 1, and took ownership of processes handed over from the onshore team.
Jul 2021 — Sep 2022
Developer, BI & Analytics
Team Computers · Gurugram, India
  • Built Python predictive models on wind turbine sensor data to identify downtime drivers, with real-time dashboards for performance and energy forecasting.
  • Integrated weather forecast data into the analytics pipeline using SQL and Python, building energy production models that improved resource planning and operational efficiency by 15%.
  • Reduced unplanned turbine downtime by 20% and maintenance costs by 15% via predictive models.

Tech stack.

AI & Modeling
LLMs & Prompt Engineering Familiar
BERT / Transformers Proficient
scikit-learn / XGBoost Advanced
SHAP / Explainability Advanced
K-Means / Clustering Advanced
Panel & Statistical Modeling Proficient
Data & Engineering
Python (pandas, NumPy) Expert
SQL Expert
Streamlit Advanced
Docker Proficient
ETL & Data Pipelines Advanced
Git / Version Control Proficient
Visualization & Cloud
Tableau Expert
Power BI Advanced
Matplotlib / Seaborn Advanced
AWS Foundational
Statistical Analysis Advanced
API Integration Proficient

Credentials.

MS Information Technology
Arizona State University
2024 – May 2026Tempe, AZ
★ 4.0 GPA
B.Tech Computer Science
Kurukshetra University
2015 – 2019Haryana, India
Certifications
AWS Academy ML Foundations
Amazon Web Services
Prompt Engineering for Developers
DeepLearning.AI

What leaders say.

"A pivotal role in developing complex analytics processes and delivering critical MI reports that reduced client costs."
MG
Manvi Gupta
Sr. AVP · EXL
"Unparalleled professionalism and data-driven insight that guided our organization toward optimal outcomes."
AK
Ajeet Singh Kaintura
Manager, Transformation & Solutioning · EXL

Let's build
something.

Open to full-time AI Engineer and Data Scientist roles. Also happy to talk about internships, research collaborations, or interesting AI problems.

View ResumeLatest version · PDF · 1 page
View →