Rate Limits

Rate limits protect API stability for all users. Limits are enforced per API key, and per IP address on public endpoints.

Limits by Plan

| Plan       | Requests / min | Monthly Quota |
|------------|----------------|---------------|
| Free       | 10             | 100           |
| Starter    | 50             | 2,000         |
| Growth     | 200            | 10,000        |
| Enterprise | 1,000          | Custom        |
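
One way to stay under a per-minute limit is to space requests evenly on the client. The following is a minimal sketch, not an official client: the `Throttle` class and the `PLAN_LIMITS_PER_MIN` mapping are hypothetical names, and the per-minute values are copied from the plan table above. How the server measures its one-minute window is not documented here, so even spacing is simply a conservative choice.

```python
import time

# Requests-per-minute limits copied from the plan table (assumed current).
PLAN_LIMITS_PER_MIN = {"free": 10, "starter": 50, "growth": 200, "enterprise": 1000}


class Throttle:
    """Spaces calls evenly so they do not exceed a per-minute limit."""

    def __init__(self, requests_per_min, clock=time.monotonic, sleep=time.sleep):
        self.interval = 60.0 / requests_per_min  # minimum seconds between calls
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = clock()

    def wait(self):
        """Block until the next request may be sent; return the seconds slept."""
        now = self._clock()
        delay = max(0.0, self._next_allowed - now)
        if delay > 0:
            self._sleep(delay)
        # Schedule the following slot one interval after this one.
        self._next_allowed = max(now, self._next_allowed) + self.interval
        return delay
```

Calling `throttle.wait()` before each request keeps a Free-plan client at one request every 6 seconds. The `clock` and `sleep` parameters exist only so the class can be exercised without real waiting.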

IP-Based Limits (Public Endpoints)

Auth endpoints (/auth/signup, /auth/login) have separate IP-based rate limits to prevent abuse:

| Endpoint     | Limit       | Window           |
|--------------|-------------|------------------|
| /auth/signup | 5 requests  | 5 minutes per IP |
| /auth/login  | 10 requests | 1 minute per IP  |

Rate Limit Headers

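When a rate limit is exceeded, the API returns HTTP 429 (Too Many Requests) with a Retry-After header giving the number of seconds to wait before retrying. The JSON body repeats the same value in detail.retry_after.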

HTTP/1.1 429 Too Many Requests
Retry-After: 60
Content-Type: application/json

{
  "detail": {
    "error": "rate_limit_exceeded",
    "message": "Too many requests from your IP. Try again in 60 seconds.",
    "retry_after": 60
  }
}
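
A client can read the wait time from either place in this response. The sketch below assumes the response has already been reduced to a status code, a header mapping, and a body string; `retry_delay_seconds` is a hypothetical helper, not part of any SDK. It prefers the Retry-After header and falls back to `detail.retry_after` from the body shown above.

```python
import json


def retry_delay_seconds(status, headers, body, default=1.0):
    """Return how long to wait after a response, or None if it was not a 429."""
    if status != 429:
        return None
    # Prefer the Retry-After header; HTTP header names are case-insensitive.
    for name, value in headers.items():
        if name.lower() == "retry-after":
            try:
                return float(value)
            except ValueError:
                break  # not a number of seconds (e.g. an HTTP-date); try the body
    # Fall back to detail.retry_after from the JSON error body.
    try:
        return float(json.loads(body)["detail"]["retry_after"])
    except (ValueError, KeyError, TypeError):
        return default  # nothing usable found; use a caller-chosen default
```

The `default` value is an assumption for the case where neither field can be parsed; pick whatever floor suits your client.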

Best Practices

Implement exponential backoff

On a 429 response, wait for the duration given in Retry-After before retrying. If the retry is also rate limited, increase the delay on each subsequent attempt.
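
A minimal sketch of this retry loop, under stated assumptions: `send` is a hypothetical callable that performs one request and returns a `(status, retry_after_seconds_or_None)` pair, and the defaults for `max_attempts`, `base_delay`, and `max_delay` are illustrative rather than recommended by the API. The added random jitter is a common refinement that the guidance above does not require.

```python
import random
import time


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=120.0,
                      sleep=time.sleep):
    """Call send() until it returns a non-429 status or attempts run out."""
    for attempt in range(max_attempts):
        status, retry_after = send()
        if status != 429:
            return status
        if attempt == max_attempts - 1:
            break  # no point sleeping after the final attempt
        # Exponential component doubles each attempt: base, 2*base, 4*base, ...
        backoff = min(max_delay, base_delay * 2 ** attempt)
        # Never wait less than the server asked for; jitter spreads out clients.
        delay = max(retry_after or 0.0, backoff) + random.uniform(0, base_delay)
        sleep(delay)
    raise RuntimeError(f"still rate limited after {max_attempts} attempts")
```

With the defaults, a client told to wait 60 seconds sleeps between 60 and 61 seconds on the first retry, and later retries never wait less than the doubled backoff value.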

Use the batch endpoint

Instead of sending 50 individual requests, use /v1/batch to process them in one call. The batch counts as 50 quota units but only 1 rate-limit unit.
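
The accounting above can be sketched as follows. The request format of /v1/batch and its maximum batch size are not described here, so this example only groups documents and computes the units consumed; `chunk`, `units_consumed`, and the `batch_size` parameter are hypothetical.

```python
import math


def chunk(items, batch_size):
    """Split items into consecutive lists of at most batch_size elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]


def units_consumed(num_documents, batch_size):
    """Return (quota_units, rate_limit_units) for sending documents in batches.

    Each document costs one quota unit; each batch call costs one rate-limit unit.
    """
    return num_documents, math.ceil(num_documents / batch_size)
```

For 50 documents, `units_consumed(50, 50)` gives `(50, 1)`, while sending them one at a time, `units_consumed(50, 1)`, gives `(50, 50)`: the quota cost is identical, but batching uses far fewer rate-limit units.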

Cache results when possible

Identical documents return the same output. Cache extraction results keyed by file hash to avoid redundant API calls.
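
A minimal in-memory sketch of such a cache, assuming SHA-256 as the file hash and a hypothetical `extract` callable that wraps the actual API request. A production cache would likely persist results and expire old entries; neither is shown here.

```python
import hashlib


def file_hash(data: bytes) -> str:
    """Return the SHA-256 hex digest of a document's bytes."""
    return hashlib.sha256(data).hexdigest()


class ExtractionCache:
    """Caches extraction results keyed by the hash of the file contents."""

    def __init__(self, extract):
        self._extract = extract  # hypothetical callable: bytes -> result
        self._results = {}
        self.api_calls = 0       # how many times extract was actually invoked

    def get(self, data: bytes):
        key = file_hash(data)
        if key not in self._results:
            # Cache miss: call the API once and remember the result.
            self.api_calls += 1
            self._results[key] = self._extract(data)
        return self._results[key]
```

Because the key is derived from the file contents rather than the filename, a re-uploaded or renamed copy of the same document is served from the cache.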

Monitor your usage

Poll /v1/usage periodically and alert before reaching 80% of your monthly quota.
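
The threshold check itself is simple arithmetic. The response format of /v1/usage is not documented here, so this sketch takes the used and total counts as plain numbers; `quota_fraction` and `should_alert` are hypothetical helpers, and the 0.8 default mirrors the 80% figure above. Pass a lower `threshold` to be warned earlier.

```python
def quota_fraction(used, quota):
    """Return the fraction of the monthly quota consumed so far."""
    if quota <= 0:
        raise ValueError("quota must be positive")
    return used / quota


def should_alert(used, quota, threshold=0.8):
    """Return True once usage has reached the given fraction of the quota."""
    return quota_fraction(used, quota) >= threshold
```

On the Starter plan (2,000 requests per month), `should_alert(1600, 2000)` is true and `should_alert(1599, 2000)` is false.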

Enterprise plans: Need higher limits? Contact us for Enterprise pricing with custom rate limits, dedicated infrastructure, and SLA guarantees.