GPT-5.2 changed behaviour on Feb 10, 2026 — did your prompts break?

Your LLM Just Changed.
Did You Notice?

GPT-5.2 Instant silently updated on Feb 10, 2026. OpenAI described it as "more measured and grounded in tone" — developers described it as "our prompts stopped working." DriftWatch catches these changes in minutes, not weeks.

Start Free — 3 prompts included ↗ Live Demo Dashboard

🔒 No card required · Free tier: 3 prompts · Upgrade to £99/mo for automated monitoring

12+
Developers monitoring
6+
Prompts watched
<5min
Alert latency
£0
Extra infra to manage
⚡ Trigger Event — 30 Days Ago
"GPT-5.2 Instant improves response style and quality... more measured and grounded in tone."

— OpenAI Model Release Notes, Feb 10, 2026 · source ↗ · full breakdown →

"We caught GPT-4o drifting this week... OpenAI changed GPT-4o in a way that significantly changed our prompt outputs. Zero advance notice."

— r/LLMDevs, February 2025

"In early 2025, developers reported that gpt-4o-2024-08-06 (a supposedly fixed, dated version) had changed behaviour."

— Agenta.ai Engineering Blog, 2025

Real Drift Detection — Live Data

These results were generated minutes ago against the Claude API. Same model, consecutive runs — watch the natural variance.

drift_check — claude-3-haiku-20240307
2026-03-12 18:51 UTC · 5 prompts · avg drift: 0.213
MEDIUM
Single word response instruction-following
⚠️ Regression: exact_match — baseline: "Neutral." (with period), current: "Neutral" (period dropped)
0.575 +period dropped
MEDIUM
JSON extraction — strict schema format
Different whitespace formatting — still valid JSON but different bytes
0.316 +whitespace
LOW
Numbered list format instruction-following
Different wording, same structure — all validators pass
0.173 rewording
NONE
JSON array extraction format
Identical response — stable
0.000 ✓ stable
NONE
Nested JSON schema format
Identical response — stable
0.000 ✓ stable

This is natural LLM variance. When OpenAI or Anthropic update their models, this drift can spike to 0.8+ — and break your product.

Open Full Dashboard →
How DriftWatch Works
Set up once. Get alerts forever.
1

Upload Your Test Prompts

Add the prompts your product depends on — JSON parsers, classifiers, extractors. We provide example prompts to get started in under 5 minutes.

2

We Run Them Hourly

DriftWatch runs every prompt against your LLM endpoint every hour. We track format compliance, semantic drift, and instruction following.

3

Get Instant Alerts

The moment we detect a regression, you get a Slack or email alert with exactly which prompts changed, what changed, and by how much.

4

Debug With Full History

Every run is stored. Compare any two runs to see exactly when and how the model changed — full history from day one.
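The core loop above fits in a few lines. This is an illustrative sketch, not DriftWatch's implementation: the function names are invented, the 0.3 threshold is the default mentioned in the FAQ below, and raw string similarity stands in for the real scorer (which also weighs validators and semantic similarity).

```python
import difflib

def drift_score(baseline: str, current: str) -> float:
    """Toy drift score: 0.0 = identical output, 1.0 = completely different.
    A production scorer would also fold in validator results, semantic
    similarity, and length drift; string similarity is a stand-in here."""
    similarity = difflib.SequenceMatcher(None, baseline, current).ratio()
    return round(1.0 - similarity, 3)

def check_prompt(baseline: str, current: str, threshold: float = 0.3) -> tuple:
    """Compare the latest response against the stored baseline."""
    score = drift_score(baseline, current)
    return ("ALERT" if score >= threshold else "OK", score)

# Identical responses score 0.0; a rewritten response trips the threshold.
print(check_prompt("Neutral", "Neutral"))  # ('OK', 0.0)
print(check_prompt("Neutral", "The sentiment is neutral overall."))
```

Storing every (baseline, current, score) triple per run is what makes the "compare any two runs" debugging view possible.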

Everything Your Team Needs

Hourly Monitoring

Run your full test suite every 60 minutes. Never be caught off guard by a silent model update again.

📊

Drift Score Metrics

Quantified behavioral change: validator regression, semantic similarity, format compliance, and length drift — all tracked over time.

🚨

Instant Alerts

Slack webhook, email, or API webhook. Alert within 5 minutes of detecting a regression above your threshold.
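An alert payload of this shape is all a Slack incoming webhook needs. A hedged sketch, not DriftWatch's actual alert format: `build_alert_payload` and the message text are invented for illustration, and the payload follows Slack's minimal incoming-webhook `{"text": ...}` shape.

```python
import json
import urllib.request

def build_alert_payload(prompt_name: str, score: float, threshold: float) -> dict:
    """Message body for a Slack incoming webhook; {"text": ...} is the
    minimal shape Slack accepts."""
    return {
        "text": (
            f":rotating_light: Drift detected on '{prompt_name}': "
            f"score {score:.3f} (threshold {threshold})"
        )
    }

def post_to_slack(webhook_url: str, payload: dict) -> None:
    """POST the JSON payload to the webhook URL configured in Slack."""
    req = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    urllib.request.urlopen(req)

payload = build_alert_payload("Single word response", 0.575, 0.3)
```

The same payload builder can feed an email body or a generic API webhook; only the transport changes.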

🔀

Multi-Model Comparison

Track GPT-4o, Claude 3.5 Sonnet, Gemini 1.5 Pro, and local Llama models side by side. See which model drifts least.

📅

Full Audit History

Every test run stored indefinitely. Export your drift history as CSV for compliance or model evaluation reports.

🧩

Example Prompt Library

Get started fast with our curated example prompts covering JSON compliance, instruction following, classification, and more. Or add your own in minutes.

What Drift Monitoring Catches
Real classes of failures DriftWatch is built to detect

JSON format regression. The model starts adding whitespace or dropping trailing punctuation from field values. json.loads() still succeeds — your downstream string comparison silently fails.

Category: format compliance drift
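A minimal reproduction of that failure mode (the "sentiment" field name is invented for illustration):

```python
import json

baseline = json.loads('{"sentiment": "Neutral."}')
current  = json.loads('{"sentiment": "Neutral"}')   # period silently dropped

# Both responses parse without error, so nothing raises...
# ...but any downstream exact string comparison now disagrees:
print(baseline["sentiment"] == current["sentiment"])   # False
```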

Instruction compliance regression. A single-word classifier returns "Neutral." instead of "Neutral" — the trailing period causes exact-match parsers to fall through to the wrong branch.

Category: instruction following drift — measured drift score: 0.575
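The fall-through looks like this in practice. The `route()` function and its handler names are hypothetical, but the bug is exactly the one measured above:

```python
def route(label: str) -> str:
    """Naive exact-match router over a single-word classifier's output."""
    if label == "Positive":
        return "positive_handler"
    if label == "Negative":
        return "negative_handler"
    if label == "Neutral":
        return "neutral_handler"
    return "fallback"   # any unexpected output lands here

print(route("Neutral"))    # neutral_handler
print(route("Neutral."))   # fallback -- the trailing period misroutes it
```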

Verbosity drift. Terse-answer prompts start returning paragraphs. No error, no alert from your stack — but your UI layout breaks and token costs spike.

Category: output length drift
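Verbosity drift is cheap to quantify. One plausible length-drift component (an assumption for illustration, not DriftWatch's exact formula):

```python
def length_drift(baseline: str, current: str) -> float:
    """Relative change in word count; 0.0 means no change in length."""
    b, c = len(baseline.split()), len(current.split())
    return abs(c - b) / b if b else 0.0

# A one-word answer ballooning into a sentence: 1 word -> 9 words.
print(length_drift("Neutral", "The overall sentiment of this passage is broadly neutral"))  # 8.0
```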
Common Questions
How do I know if OpenAI changed my model without telling me?

You can't know from OpenAI directly — they don't send notifications when model behaviour changes. DriftWatch detects it automatically by running your test prompts hourly and comparing outputs against a stored baseline. When the output shifts beyond a 0.3 drift score, you get an email or Slack alert within 60 minutes.

Does pinning gpt-4o-2024-08-06 prevent behaviour changes?

No — not reliably. In January 2025, gpt-4o-2024-08-06 silently changed behaviour despite being a dated snapshot. OpenAI reserves the right to update any model for safety or policy reasons without notice. Version pinning reduces surface area; it does not eliminate drift.

How often does GPT-4o or GPT-5 change behaviour?

Multiple times per year — plus undisclosed minor patches. In 2025–2026 alone: gpt-4o-2024-08-06 (Jan 2025), the base GPT-4o model (multiple undisclosed updates), and GPT-5.2 Instant (Feb 10, 2026) all had documented silent behaviour changes. Developers typically find out 2–7 days later, from user complaints.

What's the difference between LLM observability and LLM drift detection?

LLM observability (LangSmith, Langfuse, Helicone) monitors your pipeline — latency, token usage, errors. Drift detection monitors whether the model itself changed. Observability tells you your app is slow. Drift detection tells you your prompts stopped working because GPT updated silently. You need both; they solve different problems.

How do I get an alert when my LLM prompt stops working?

Sign up free (no card, 3 prompts included). Paste your prompt and add your API key. We run it hourly and alert you by email or Slack the moment output drifts. Setup takes under 5 minutes.

Simple, Transparent Pricing
Early access pricing — locked in for life when you sign up today
Starter
£99/month
For indie devs and small teams building LLM-powered products
  • 100 test prompts
  • Hourly monitoring
  • Email + Slack alerts
  • 3 LLM endpoints
  • 90-day history
  • Dashboard access
Get Started — £99/mo

14-day free trial on all plans · Cancel anytime · We'll help you migrate from any competitor

GPT-5.2 Changed on Feb 10. Are Your Prompts Still Working?

Most teams find out from user complaints — weeks later. Sign up free in 60 seconds.

Start Free — No Card Required 👀 See Live Demo (no signup)