ML in Production

Models that ship and stay up

A prototype in a notebook is the easy 20%. These are the systems built for the other 80%: serving, monitoring, retraining, and graceful failure.

Real-time ranking service

In production

Low-latency ranking model serving personalized results, retrained nightly with automated evaluation gates.

Python, FastAPI, Docker, Redis, MLflow
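
For illustration only, an automated evaluation gate for a nightly-retrained ranking model could look roughly like the sketch below. The metric (NDCG@10), the latency budget, the thresholds, and the names `EvalResult` and `passes_gate` are all assumptions made for this example; they are not taken from the actual service.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalResult:
    # Hypothetical metrics; the real gate may track different ones.
    ndcg_at_10: float      # offline ranking quality on a held-out set
    p99_latency_ms: float  # serving latency measured on replayed traffic


def passes_gate(
    candidate: EvalResult,
    baseline: EvalResult,
    max_ndcg_drop: float = 0.005,     # assumed tolerance, not a known value
    latency_budget_ms: float = 50.0,  # assumed budget, not a known value
) -> bool:
    """Return True only if the candidate is considered safe to promote."""
    # Quality must not regress by more than the tolerance versus the baseline.
    quality_ok = candidate.ndcg_at_10 >= baseline.ndcg_at_10 - max_ndcg_drop
    # Latency must stay inside the absolute budget.
    latency_ok = candidate.p99_latency_ms <= latency_budget_ms
    return quality_ok and latency_ok
```

A retraining job would call `passes_gate` after offline evaluation and keep serving the previous model whenever it returns False.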

In production

Orchestrated daily forecasting jobs with data-quality checks, backfills, and drift monitoring.

Airflow, dbt, BigQuery, Python
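
As a rough sketch of what drift monitoring can mean in practice, the snippet below computes the Population Stability Index (PSI) between a reference sample and a recent sample of one numeric feature. PSI is swapped in here as a common, simple choice; the pipeline described above may use a different statistic. The function name `psi`, the bin count, and the smoothing constant are assumptions for this example.

```python
import math
from typing import Sequence


def psi(expected: Sequence[float], actual: Sequence[float],
        bins: int = 10, eps: float = 1e-6) -> float:
    """Population Stability Index of `actual` relative to `expected`."""
    if not expected or not actual:
        raise ValueError("both samples must be non-empty")

    # Equal-width bins are derived from the reference sample only.
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins or 1.0  # avoid zero width for constant data

    def proportions(values: Sequence[float]) -> list:
        counts = [0] * bins
        for v in values:
            # Values outside the reference range go to the edge bins.
            idx = min(max(int((v - lo) / width), 0), bins - 1)
            counts[idx] += 1
        # Floor empty bins at eps so the logarithm stays defined.
        return [max(c / len(values), eps) for c in counts]

    e, a = proportions(expected), proportions(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))
```

A widely quoted rule of thumb treats PSI above roughly 0.25 as significant drift, but any alerting threshold should be tuned per feature.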

Pilot

Hybrid pipeline using a small fine-tuned model with an LLM fallback for long-tail cases.

Python, Transformers, LLM API, Prometheus
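
One plausible way to route between a small model and an LLM fallback is a confidence threshold, sketched below. The routing rule, the threshold value, and the names `predict_with_fallback`, `small_model`, and `llm_fallback` are hypothetical; the pilot may decide which cases count as long-tail in another way.

```python
from typing import Callable, Tuple


def predict_with_fallback(
    text: str,
    small_model: Callable[[str], Tuple[str, float]],  # returns (output, confidence)
    llm_fallback: Callable[[str], str],               # slower, costlier path
    min_confidence: float = 0.8,                      # assumed threshold
) -> Tuple[str, str]:
    """Return (output, route), where route names the path that answered."""
    output, confidence = small_model(text)
    if confidence >= min_confidence:
        # Common case: the fine-tuned model is confident enough.
        return output, "small_model"
    # Long-tail case: defer to the LLM.
    return llm_fallback(text), "llm_fallback"
```

Returning the route alongside the output makes it easy to count how often the fallback fires, for example with a metrics counter.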