Changelog

Notable user-facing changes per release.

v0.1.0

The first documented release. merido is a local-first, OpenAI-compatible AI gateway in Rust that ships as a single binary serving the API, the dashboard, a landing page, and these docs on one port.

Gateway & request path

  • OpenAI-compatible data plane: /v1/chat/completions, /v1/messages (Anthropic), /v1/responses, /v1/embeddings, /v1/images/generations, /v1/audio/*, /v1/models, and more — all translated through one canonical representation.
  • Routing to 40+ upstream providers, authenticated via API key or OAuth, with credentials encrypted at rest.

Routing & reliability

  • Virtual models with failover, load_balance, weighted, cost_optimized, and latency_based strategies.
  • Cost/latency/health-aware target selection, fallback across accounts → targets → tiers, and a circuit breaker with cooldowns.
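
As an illustration of how a circuit breaker with cooldowns can combine with failover, here is a minimal sketch. It is not merido's implementation: the CircuitBreaker type, its fields, and pick_target are hypothetical names, and the threshold and cooldown values are arbitrary.

```rust
use std::time::{Duration, Instant};

/// Hypothetical per-target breaker state (illustrative, not merido's type).
struct CircuitBreaker {
    failure_threshold: u32,
    cooldown: Duration,
    consecutive_failures: u32,
    opened_at: Option<Instant>,
}

impl CircuitBreaker {
    fn new(failure_threshold: u32, cooldown: Duration) -> Self {
        Self { failure_threshold, cooldown, consecutive_failures: 0, opened_at: None }
    }

    /// A target is usable while the breaker is closed, or once its cooldown has elapsed.
    fn allows(&self, now: Instant) -> bool {
        match self.opened_at {
            None => true,
            Some(opened) => now.duration_since(opened) >= self.cooldown,
        }
    }

    /// A success closes the breaker and resets the failure count.
    fn record_success(&mut self) {
        self.consecutive_failures = 0;
        self.opened_at = None;
    }

    /// Reaching the threshold opens the breaker and starts the cooldown.
    fn record_failure(&mut self, now: Instant) {
        self.consecutive_failures += 1;
        if self.consecutive_failures >= self.failure_threshold {
            self.opened_at = Some(now);
        }
    }
}

/// Failover: index of the first target (in priority order) that accepts traffic.
fn pick_target(breakers: &[CircuitBreaker], now: Instant) -> Option<usize> {
    breakers.iter().position(|b| b.allows(now))
}

fn main() {
    let start = Instant::now();
    let mut breakers = vec![
        CircuitBreaker::new(2, Duration::from_secs(30)),
        CircuitBreaker::new(2, Duration::from_secs(30)),
    ];

    // Two consecutive failures open the breaker on the primary target.
    breakers[0].record_failure(start);
    breakers[0].record_failure(start);
    println!("during cooldown: {:?}", pick_target(&breakers, start));

    // After the cooldown the primary is tried again; a success closes it.
    let later = start + Duration::from_secs(31);
    println!("after cooldown: {:?}", pick_target(&breakers, later));
    breakers[0].record_success();
}
```

In this sketch a target is skipped only while its breaker is open and the cooldown is still running, so a recovered upstream gets traffic again without manual intervention.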

Token saving & caching

  • Automatic tool_result compression filters and opt-in Caveman output-compression mode.
  • A savings ledger (/api/savings, merido gain).
  • Exact request dedup and an opt-in semantic cache with a precision guard and revalidation sampling.
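
Exact request dedup can be pictured as a lookup keyed on the full request, so that a byte-identical repeat never reaches the upstream. The sketch below assumes an in-memory map keyed by model and request body. ExactCache, get_or_call, and the model name are hypothetical, and the release notes do not say how merido builds its keys.

```rust
use std::collections::HashMap;

/// Hypothetical exact-match cache keyed by (model, request body).
struct ExactCache {
    entries: HashMap<(String, String), String>,
    hits: u64,
}

impl ExactCache {
    fn new() -> Self {
        Self { entries: HashMap::new(), hits: 0 }
    }

    /// Return the stored response for an identical request, or call upstream once and store it.
    fn get_or_call<F>(&mut self, model: &str, body: &str, upstream: F) -> String
    where
        F: FnOnce() -> String,
    {
        let key = (model.to_string(), body.to_string());
        if let Some(cached) = self.entries.get(&key) {
            self.hits += 1;
            return cached.clone();
        }
        let response = upstream();
        self.entries.insert(key, response.clone());
        response
    }
}

fn main() {
    let mut cache = ExactCache::new();
    let mut upstream_calls = 0;

    // Three identical requests: only the first should reach the upstream.
    for _ in 0..3 {
        let reply = cache.get_or_call("example-model", r#"{"messages":[]}"#, || {
            upstream_calls += 1;
            "hello".to_string()
        });
        assert_eq!(reply, "hello");
    }

    println!("upstream calls: {}, cache hits: {}", upstream_calls, cache.hits);
}
```

A semantic cache differs in that it matches on similarity rather than equality, which is why it needs safeguards such as the precision guard and revalidation sampling listed above.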

The Token-Optimization Advisor

  • Usage analysis with recommendations (merido advise, merido discover, /api/advisor).
  • An apply engine with a probation window and auto-rollback.
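
One way to read "probation window and auto-rollback": a recommended change is applied, its outcomes are observed for a while, and the change is reverted if it performs worse than the baseline. The sketch below assumes an error-rate comparison with a fixed tolerance. Config, Verdict, judge, and apply_with_probation are hypothetical names, and the criteria merido actually uses are not stated in these notes.

```rust
/// Outcome of a probation window (hypothetical names).
#[derive(Debug, PartialEq)]
enum Verdict {
    Keep,
    Rollback,
}

/// Hypothetical setting that an advisor recommendation might toggle.
struct Config {
    compression_enabled: bool,
}

/// Compare the error rate seen during probation with the baseline plus a tolerance.
/// Each entry in `outcomes` is true for a successful request.
fn judge(baseline_error_rate: f64, outcomes: &[bool], tolerance: f64) -> Verdict {
    if outcomes.is_empty() {
        // No traffic during probation means no evidence of a regression.
        return Verdict::Keep;
    }
    let errors = outcomes.iter().filter(|ok| !**ok).count();
    let rate = errors as f64 / outcomes.len() as f64;
    if rate > baseline_error_rate + tolerance {
        Verdict::Rollback
    } else {
        Verdict::Keep
    }
}

/// Apply the proposed config, then restore the previous one if probation fails.
fn apply_with_probation(
    current: &mut Config,
    proposed: Config,
    baseline_error_rate: f64,
    outcomes: &[bool],
    tolerance: f64,
) -> Verdict {
    let previous = std::mem::replace(current, proposed);
    let verdict = judge(baseline_error_rate, outcomes, tolerance);
    if verdict == Verdict::Rollback {
        *current = previous;
    }
    verdict
}

fn main() {
    let mut config = Config { compression_enabled: false };

    // 3 failures in 10 requests (30%) against a 5% baseline with 5% tolerance.
    let outcomes = [true, true, false, true, false, true, true, false, true, true];
    let verdict = apply_with_probation(
        &mut config,
        Config { compression_enabled: true },
        0.05,
        &outcomes,
        0.05,
    );

    println!("{:?}, compression_enabled = {}", verdict, config.compression_enabled);
}
```

Keeping the previous configuration around until the verdict is in is what makes the rollback automatic: no separate undo step has to be reconstructed after the fact.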

Deploy & operate

  • Profiles: local (SQLite, single-user) and cloud (Postgres, multi-tenant).
  • Docker and Fly.io deployment; production hardening via a dashboard password hash and required API keys.
  • Opt-in self-serve multi-tenancy and Redis-backed high availability.
  • merido migrate for moving data between SQLite and Postgres, plus a Helicone import.

MIT / Apache-2.0 licensed.