AVAILABLE — ACTIVELY INTERVIEWING

Shane Thakkar

DATA · AI · ML

Building data solutions and machine learning systems.

/ 01

Selected work

02 LIVE PRODUCTS
/ 02

Projects

04 PROJECTS
in-sample46%+ distress61%+ recovered69%PRECISION → HORIZON
Published · MAY 25 '26SEC · NLP · 10-K Analysis

Failing Companies Tell on Themselves in Their Annual Reports

A model that reads what companies say about themselves in SEC filings catches 79% of bankruptcies across 24 cases. The 'false positives' include Nordstrom, Walgreens, Macy's, Kohl's, CVS, and Lucid Motors, all of whose decline didn't end in court.

pythonsklearnpandasnlpedgar
read article
22%11%GO-FOR-IT %19992025
Published · APR 22 '26NFL · WPA · XGBoost

Fourth Down Is Still Football's Biggest Coaching Problem

107k decisions from 1999–2025, scored against historically optimal calls. Coaches still leave ~one free win on the table every year — and the conservative ones make most of their mistakes in the red zone.

pythonpandasnumpyxgboostnflfastR
read article
HAMPERVERALBTSU
Published · APR 18 '26F1 · Bayesian

Who Is Actually the Best F1 Driver?A Bayesian approach to separating skill from the car

A hierarchical model on 2014–2025 race data decomposes finishing position into driver effect, car effect, and DNF risk. The result reveals the Verstappen Paradox — and Hamilton at the top with 85% confidence.

pythonpymcbayesianarvizfastf1
read article
HEIGHTVELOCITY
Published · MAY 13 '25MLB · Regression

Why Height Doesn't Predict Velocity in Major League Baseball

Physics says taller pitchers should throw harder. The data says they don't. The story is selection bias: by the time you reach MLB, the relationship that dominates youth ball has been compressed away by survival.

pythonsklearnpandasnumpystatcast
read article
/ 03

Tech I work with

STACK
drag to pick up
python
r
sql
jupyter
pandas
scikit-learn
tensorflow
keras
snowflake
databricks
docker
tableau
power bi
excel
react
next.js
claude code
  • python
  • r
  • sql
  • jupyter
  • pandas
  • scikit-learn
  • tensorflow
  • keras
  • snowflake
  • databricks
  • docker
  • tableau
  • power bi
  • excel
  • react
  • next.js
  • claude code
/ 04

About

WHO

Hi, I’m Shane.

Recent Business Analytics & AI graduate with a strong foundation in machine learning and data analytics. I build end-to-end data solutions, from data processing and modeling to deploying live applications that deliver real insights.

My work spans advanced statistical modeling, predictive analytics, and building production-grade tools. Currently seeking full-time opportunities in Data Science, Analytics, and Machine Learning where I can apply technical skills to solve complex problems.

Quick Facts

[ 05 ]
Education
B.S. Business Analytics & AI, UT Dallas (May 2026)
Based
Frisco, Texas
Focus
Analytics · Data Science · ML · BI
Stack
Python · SQL · R
Teams
Chicago BearsChicago CubsChicago BullsMcLaren F1
/ 05

Get in touch

LET'S TALK

Open to the next problem worth solving.

Exploring full-time roles in analytics, ML, data science, and BI. Always happy to connect about projects, ideas, or interesting problems.