Problem
Move beyond notebook-level ML by deploying a reliable API that can handle real traffic, maintain low latency, and scale correctly in Kubernetes.
MLOps Coursework Project
Built for MIDS W255, this project packages a DistilBERT sentiment model behind a production-style FastAPI service with Redis caching, containerized deployment, Kubernetes orchestration, and performance monitoring under sustained load.

Problem
Move beyond notebook-level ML by deploying a reliable API that can handle real traffic, maintain low latency, and scale correctly in Kubernetes.
Approach
Implemented FastAPI endpoints with typed request/response schemas, baked model artifacts into the container image, added Redis caching, and deployed to AKS with service routing.
Outcome
Delivered a full MLOps lifecycle project with automated tests, containerized inference, and load-test observability using k6 and Grafana.

