Cat Weiss

Data Engineer

About Me

I am a Data Engineer at Algolia and I hold a Master of Information and Data Science from UC Berkeley. I design, deploy, and productionize data pipelines and data models that turn complex product and behavioral data into measurable business outcomes. My current work centers on modeling customer and prospect success to improve forecasting, targeting, and strategic decision-making. Over 8 years of building technical solutions for SaaS platforms and custom database integrations, I have worked in data engineering, analytics engineering, and applied machine learning. I focus on building reliable, scalable systems that support both fast experimentation and long-term growth. I am especially interested in roles at the intersection of data engineering and machine learning, including ML engineering, advanced analytics engineering, and production ML systems. Long term, I want to own end-to-end data and ML platforms that take models from exploration to robust, observable, and scalable deployment.

Download Resume

Featured Projects

Thesis

KubeBench

Master's capstone project: a Kubernetes-native benchmark and model-serving platform for code-generating LLMs, with fine-tuned domain models, live cluster-based evaluation, and cloud deployment infrastructure.

Python, Kubernetes, FastAPI, QLoRA, Terraform, GCP + DigitalOcean

Case Study | Code | Live Demo

Catville

Autonomous social simulation inspired by the Stanford/Google generative agents paper. Multi-agent characters interact daily using a local Mistral model, and the system publishes narrative daily summaries via Buttondown.

Python, Ollama, Mistral, Agent Simulation, Poetry, Buttondown API

Case Study | Code | Live Demo

Coursework

Persona-Aware RAG Assistant

Final GenAI project for UC Berkeley DATASCI 267. Built an end-to-end RAG system with Qdrant retrieval, Cohere reranking, and persona-specific prompting for engineering and marketing users.

Python, LangChain, Qdrant, Mistral-7B, Cohere Rerank, RAG Evaluation

Case Study | Code

Coursework

MLOps Sentiment API

End-to-end machine learning API project for MLOps coursework: FastAPI + PyTorch sentiment inference, Redis caching, Dockerized deployment to Kubernetes on Azure, and performance validation with k6 + Grafana.

FastAPI, PyTorch, Redis, Docker, Kubernetes (AKS), k6 + Grafana

Case Study | Code

Coursework

AI vs Real Image Classifier

Coursework project building a computer-vision classifier to detect AI-generated versus real images, including preprocessing, augmentation, model iteration, and evaluation on held-out subsets.

TensorFlow/Keras, CNN, Computer Vision, NumPy, Jupyter, Model Tuning

Case Study | Code