
What are the API rate limits?

The Free plan allows 60 requests per minute, Pro allows 600, and Enterprise allows 6,000. Short bursts of up to 10x the sustained rate are tolerated. Limits are enforced with a standard token bucket.

How rate limits work

Each API key has its own bucket, which refills at the plan's sustained rate. Bursts of roughly 10x that rate are tolerated for about 10 seconds. Once the bucket is exhausted, the API returns a 429 response with a Retry-After header.
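
As a rough illustration of the behaviour described above, here is a minimal token-bucket sketch in Python. It is not the service's actual implementation. The class name TokenBucket, its fields, and the way the burst allowance is modelled as bucket capacity are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class TokenBucket:
    """Illustrative per-key token bucket (hypothetical, not the real service code)."""

    rate_per_sec: float  # sustained refill rate, e.g. 60 req/min -> 1.0 token/s
    capacity: float      # assumed burst allowance: the most tokens the bucket can hold
    last: float = 0.0    # timestamp of the last refill, in seconds
    tokens: float = field(init=False, default=0.0)

    def __post_init__(self) -> None:
        # Start with a full bucket so an idle key can burst immediately.
        self.tokens = self.capacity

    def _refill(self, now: float) -> None:
        # Add tokens for the time elapsed, never exceeding capacity.
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate_per_sec)
        self.last = now

    def try_consume(self, now: float, cost: float = 1.0) -> tuple[bool, float]:
        """Return (allowed, retry_after_seconds)."""
        self._refill(now)
        if self.tokens >= cost:
            self.tokens -= cost
            return True, 0.0
        # Time until the deficit refills: the kind of value a
        # Retry-After header on a 429 response could carry.
        return False, (cost - self.tokens) / self.rate_per_sec
```

For example, a Free-tier key could be modelled as TokenBucket(rate_per_sec=1.0, capacity=100.0), where the capacity of 100 is one reading of "10x the rate for 10 seconds". Under that assumption, 100 back-to-back requests succeed, the next is rejected with a one-second wait, and a request one second later is allowed.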

Receipt issuance vs reads

Both count against the rate limit. Writes (receipt issuance) cost more than reads: 3 tokens per write versus 1 token per read.
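
To see what the token costs mean for a mixed workload, the short Python sketch below estimates how many writes fit into a minute once reads are accounted for. It assumes the published per-minute limit is the token budget (the page does not say so explicitly), and the function name max_writes_per_minute is made up for this example.

```python
READ_COST = 1   # tokens per read, as stated above
WRITE_COST = 3  # tokens per receipt issuance (write), as stated above


def max_writes_per_minute(limit_per_min: int, reads_per_min: int) -> int:
    """Writes that fit in one minute's budget after subtracting reads.

    Assumes the plan limit (e.g. 60 for Free) is a budget in tokens.
    """
    remaining = limit_per_min - reads_per_min * READ_COST
    return max(0, remaining // WRITE_COST)
```

Under that assumption, a Free key doing only writes gets max_writes_per_minute(60, 0) = 20 writes per minute, and a Pro key doing 300 reads per minute has room for max_writes_per_minute(600, 300) = 100 writes.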

Anomaly detector calls

Anomaly detector calls are internal and don't count against your rate limit.

Enterprise burst

For Enterprise customers with predictable burst patterns (CI deployment windows, end-of-month evidence pack generation), we configure higher burst tolerance.

Self-hosted rate limits

You operate the deployment yourself, so we impose no rate limit. In practice, your database connection pool is the ceiling.
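
For a back-of-the-envelope estimate of that ceiling, Little's law gives sustained throughput as roughly pool size divided by the average time a request holds a connection. The Python sketch below assumes each request holds exactly one connection for its duration. The function and parameter names are hypothetical, and real deployments will land below this figure.

```python
def pool_throughput_ceiling(pool_size: int, avg_hold_ms: float) -> float:
    """Approximate upper bound on requests per second for a connection pool.

    Little's law: concurrency = throughput * time in system, so
    throughput <= pool_size / average connection hold time.
    """
    if pool_size <= 0 or avg_hold_ms <= 0:
        raise ValueError("pool_size and avg_hold_ms must be positive")
    return pool_size * 1000.0 / avg_hold_ms
```

For instance, a pool of 20 connections with an average hold time of 50 ms tops out at about 400 requests per second.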

Monitoring usage

/admin/api-usage shows your current rate consumption. Set up your own alerts on these usage metrics in Datadog or Prometheus.
