Changelog
Notable user-facing changes per release.
v0.1.0
The first documented release. merido is a local-first, OpenAI-compatible AI gateway in Rust that ships as a single binary serving the API, the dashboard, a landing page, and these docs on one port.
Gateway & request path
- OpenAI-compatible data plane:
/v1/chat/completions,/v1/messages(Anthropic),/v1/responses,/v1/embeddings,/v1/images/generations,/v1/audio/*,/v1/models, and more — all translated through one canonical representation. - Routing to 40+ upstream providers via API key or OAuth, with encryption at rest.
Routing & reliability
- Virtual models with
failover,load_balance,weighted,cost_optimized, andlatency_basedstrategies. - Cost/latency/health-aware target selection, fallback across accounts → targets → tiers, and a circuit breaker with cooldowns.
Token saving & caching
- Automatic
tool_resultcompression filters and opt-in Caveman output-compression mode. - A savings ledger (
/api/savings,merido gain). - Exact request dedup and an opt-in semantic cache with a precision guard and revalidation sampling.
The Token-Optimization Advisor
- Usage analysis with recommendations (
merido advise/discover,/api/advisor). - An apply engine with a probation window and auto-rollback.
Deploy & operate
- Profiles:
local(SQLite, single-user) andcloud(Postgres, multi-tenant). - Docker and Fly.io deployment; production hardening via a dashboard password hash and required API keys.
- Opt-in self-serve multi-tenancy and Redis-backed high availability.
merido migrateto move data between SQLite and Postgres; Helicone import.