Kimara API Docs

Rate Limits

Understand API rate limits, headers, and error handling.

All API endpoints are rate-limited to ensure fair usage across all users. Limits vary by subscription tier.

Rate Limit Headers

Every API response includes rate limit headers:

HeaderDescription
X-RateLimit-Minute-LimitMaximum requests per minute
X-RateLimit-Minute-RemainingRequests remaining in current minute
X-RateLimit-Minute-ResetUnix timestamp when the minute window resets

Execution endpoints (running apps/workflows) also include hourly limits:

HeaderDescription
X-RateLimit-Hour-LimitMaximum executions per hour
X-RateLimit-Hour-RemainingExecutions remaining in current hour
X-RateLimit-Hour-ResetUnix timestamp when the hour window resets

General Rate Limit

All endpoints share a general rate limit of 120 requests per minute per user, regardless of subscription tier.

Execution Rate Limits

Run endpoints (POST /api/v1/apps/:id and POST /api/v1/runs) have additional per-tier limits:

TierPer MinutePer Hour
Starter10100
Plus20200
Max30300
Ultra90900

Handling Rate Limits

When rate-limited, the API returns 429 Too Many Requests with a Retry-After header indicating how many seconds to wait:

{
  "error": {
    "code": "RATE_LIMITED",
    "message": "Rate limit exceeded",
    "retry_after": 30
  }
}

Best practice: Check the X-RateLimit-Minute-Remaining header before making requests. When remaining hits zero, wait until the X-RateLimit-Minute-Reset timestamp.

GPU Quota

In addition to rate limits, execution endpoints enforce GPU usage quotas. When exceeded, you receive 429 with code GPU_QUOTA_EXCEEDED.

On this page