Skip to main content

Who this track is for

The Data Scientist track is for analysts and engineers who want to build models that drive real decisions. You’ll go deep on statistical foundations, experimental design, and the full ML lifecycle — from feature engineering to production monitoring. Best fit if you:
  • Come from an analytics, research, or quant background
  • Want to run experiments and build models at product companies
  • Are interested in the why behind ML, not just the how

Curriculum

Level 1 — Statistical Foundations (free)

ModuleKey Topics
Advanced StatisticsBayesian inference, distributions, sampling, CLT
Experimental DesignA/B testing, power analysis, p-values, practical significance

Level 2 — Machine Learning

ModuleKey Topics
Supervised Learning Deep DiveRegression, trees, ensembles, model selection
Feature EngineeringEncoding, imputation, scaling, feature importance
Model EvaluationCross-validation, ROC/AUC, calibration, leakage

Level 3 — Production ML

ModuleKey Topics
ML Pipelinessklearn pipelines, preprocessing, reproducibility
Model DeploymentFastAPI, batch inference, model registries
Monitoring & DriftDistribution shift, data quality, alerting

Level 4 — Advanced Topics

ModuleKey Topics
Time SeriesForecasting, seasonality, ARIMA, Prophet
NLP FundamentalsText classification, embeddings, transformers for DS
Causal InferenceDiff-in-diff, IV, propensity scoring

Level 5 — Capstone

A Kaggle-style competition on a real dataset — judged by final F1 score and a written analysis submitted to your cohort.