Advanced Retry Configuration

Cortex supports advanced retry logic with exponential backoff, jitter, and Retry-After header support.

Overview
Quick Start
Retry Policy Presets
Custom Retry Configuration
Configuration Levels
Backoff Strategies
Jitter Types
Retryable Errors
Database Schema
Examples

Overview

The retry system automatically retries failed requests for transient errors (network issues, rate limits, temporary server errors) while respecting Retry-After headers and applying intelligent backoff strategies.

Key Features

Exponential Backoff: Progressively increase delay between retries
Jitter: Randomize delays to prevent thundering herd
Retry-After Support: Honor server-provided retry timing
Error Classification: Automatic detection of retryable vs non-retryable errors
Configurable Policies: Preset and custom retry configurations
Per-Provider Settings: Different retry behavior for different providers

Automatically Retried Status Codes

408 - Request Timeout
429 - Too Many Requests (respects Retry-After header)
500 - Internal Server Error
502 - Bad Gateway
503 - Service Unavailable (respects Retry-After header)
504 - Gateway Timeout

Quick Start

Using Preset Policies

oauth:
  retry:
    policy: "conservative"  # Recommended default

providers:
  anthropic:
    oauth:
      enabled: true
      retry:
        policy: "aggressive"  # Override for specific provider

Custom Configuration

providers:
  openai:
    oauth:
      enabled: true
      retry:
        policy: "custom"
        max_attempts: 5
        base_delay: 1s
        max_delay: 30s
        multiplier: 2.0
        backoff_strategy: "exponential"
        jitter_type: "full"
        respect_retry_after: true

Retry Policy Presets

Conservative (Recommended Default)

Best for production environments. Balances reliability with responsiveness.

retry:
  policy: "conservative"

Parameters:

Max attempts: 3
Base delay: 1s
Max delay: 30s
Multiplier: 2.0
Jitter: Full
Backoff: Exponential

Retry Timeline:

Initial request fails
Wait ~1s (0.5s - 1.5s with jitter)
Retry 1 fails
Wait ~2s (1s - 3s with jitter)
Retry 2 fails
Wait ~4s (2s - 6s with jitter)
Retry 3 (final attempt)

Aggressive

Use for unreliable networks or critical requests where retries are preferred.

retry:
  policy: "aggressive"

Parameters:

Max attempts: 5
Base delay: 500ms
Max delay: 30s
Multiplier: 2.0
Jitter: Full
Backoff: Exponential

Retry Timeline:

Initial request fails
Wait ~500ms (250ms - 750ms with jitter)
Retry 1 fails
Wait ~1s (500ms - 1.5s with jitter)
Retry 2 fails
Wait ~2s (1s - 3s with jitter)
Retry 3 fails
Wait ~4s (2s - 6s with jitter)
Retry 4 fails
Wait ~8s (4s - 12s with jitter)
Retry 5 (final attempt)

None

Disable retries completely. Use for testing or fast-fail scenarios.

retry:
  policy: "none"

Parameters:

Max attempts: 1 (no retries)

Custom Retry Configuration

For specific requirements, define your own retry policy.

Configuration Options

Option	Type	Description	Default
`policy`	string	Preset policy name	`conservative`
`max_attempts`	int	Maximum number of retry attempts	3
`base_delay`	duration	Initial delay between retries	1s
`max_delay`	duration	Maximum delay between retries	30s
`multiplier`	float	Backoff multiplier (≥ 1.0)	2.0
`backoff_strategy`	string	Backoff algorithm	`exponential`
`jitter_type`	string	Jitter algorithm	`full`
`respect_retry_after`	bool	Honor Retry-After headers	`true`

Example: High-Priority with Quick Retries

retry:
  policy: "custom"
  max_attempts: 7
  base_delay: 100ms
  max_delay: 10s
  multiplier: 1.5
  backoff_strategy: "exponential"
  jitter_type: "decorrelated"
  respect_retry_after: true

Example: Linear Backoff

retry:
  policy: "custom"
  max_attempts: 5
  base_delay: 2s
  max_delay: 10s
  multiplier: 1.0
  backoff_strategy: "linear"
  jitter_type: "equal"
  respect_retry_after: true

Configuration Levels

Retry configuration can be specified at multiple levels with the following precedence (highest to lowest):

Provider OAuth Level - Most specific, applies only to that provider's OAuth token refresh
Global OAuth Level - Applies to all providers using OAuth
Default - Conservative policy if nothing is specified

Example: Multi-Level Configuration

# Global default for all OAuth providers
oauth:
  retry:
    policy: "conservative"

providers:
  # Uses global conservative policy
  anthropic:
    oauth:
      enabled: true

  # Overrides with aggressive policy
  openai:
    oauth:
      enabled: true
      retry:
        policy: "aggressive"

  # Completely custom configuration
  gemini:
    oauth:
      enabled: true
      retry:
        policy: "custom"
        max_attempts: 5
        base_delay: 1s
        max_delay: 60s
        multiplier: 2.0
        backoff_strategy: "exponential"
        jitter_type: "full"

Backoff Strategies

Exponential (Recommended)

Delay doubles with each retry (modified by multiplier).

backoff_strategy: "exponential"
multiplier: 2.0  # Each retry waits 2x longer than previous

Delay Sequence (base=1s, multiplier=2.0):

Retry 1: 1s
Retry 2: 2s
Retry 3: 4s
Retry 4: 8s
Retry 5: 16s (capped at max_delay)

Linear

Delay increases by a constant amount.

backoff_strategy: "linear"
base_delay: 2s  # Increment for each retry

Delay Sequence (base=2s):

Retry 1: 2s
Retry 2: 4s
Retry 3: 6s
Retry 4: 8s
Retry 5: 10s (capped at max_delay)

Constant

Same delay for all retries.

backoff_strategy: "constant"
base_delay: 3s

Delay Sequence:

All retries: 3s

Jitter Types

Jitter randomizes delays to prevent synchronized retries across clients (thundering herd problem).

Full Jitter (Recommended)

Random delay between 0 and calculated backoff.

jitter_type: "full"

Example: If backoff = 4s, actual delay will be random between 0s and 4s.

Equal Jitter

Half the backoff plus random value up to half.

jitter_type: "equal"

Example: If backoff = 4s, actual delay will be 2s + random(0, 2s) = between 2s and 4s.

Decorrelated Jitter

Delay based on previous delay, not just the base backoff.

jitter_type: "decorrelated"

More sophisticated algorithm that considers the previous actual delay.

No Jitter

Use exact calculated backoff with no randomization.

jitter_type: "none"

Use only when: Testing or debugging, not recommended for production.

Retryable Errors

Automatically Retried

The following conditions trigger automatic retries:

Network errors - Connection timeouts, DNS failures, etc.
HTTP 408 - Request Timeout
HTTP 429 - Too Many Requests (with Retry-After support)
HTTP 500 - Internal Server Error
HTTP 502 - Bad Gateway
HTTP 503 - Service Unavailable (with Retry-After support)
HTTP 504 - Gateway Timeout

Never Retried

The following errors are considered non-retryable:

HTTP 4xx (except 408, 429) - Client errors
HTTP 401 - Unauthorized
HTTP 403 - Forbidden
HTTP 404 - Not Found
HTTP 400 - Bad Request
Validation errors - Invalid input, parse errors

Retry-After Header Support

When a server returns HTTP 429 or 503 with a Retry-After header, Cortex will:

Parse the header value (seconds or HTTP date)
Use the specified delay instead of calculated backoff
Continue with normal retry logic after the specified delay

HTTP/1.1 429 Too Many Requests
Retry-After: 10

# Cortex will wait 10 seconds before retrying

Database Schema

Retry configuration is stored in the database (migration 009) for persistence.

Providers Table

ALTER TABLE providers ADD COLUMN retry_policy TEXT DEFAULT 'conservative';
ALTER TABLE providers ADD COLUMN retry_max_attempts INTEGER DEFAULT 0;
ALTER TABLE providers ADD COLUMN retry_base_delay_ms INTEGER DEFAULT 0;
ALTER TABLE providers ADD COLUMN retry_max_delay_ms INTEGER DEFAULT 0;
ALTER TABLE providers ADD COLUMN retry_multiplier REAL DEFAULT 0.0;
ALTER TABLE providers ADD COLUMN retry_backoff_strategy TEXT DEFAULT '';
ALTER TABLE providers ADD COLUMN retry_jitter_type TEXT DEFAULT '';
ALTER TABLE providers ADD COLUMN retry_respect_retry_after BOOLEAN DEFAULT TRUE;

Provider OAuth Table

ALTER TABLE provider_oauth ADD COLUMN retry_policy TEXT DEFAULT 'conservative';
-- (same columns as providers table)

Server Config (Global Defaults)

INSERT INTO server_config (key, value) VALUES ('retry.policy', 'conservative');
-- (additional retry config keys)

Examples

Example 1: Production Setup

Conservative defaults with aggressive retry for critical provider:

oauth:
  retry:
    policy: "conservative"

providers:
  anthropic:
    oauth:
      enabled: true
      # Uses global conservative policy

  critical-service:
    oauth:
      enabled: true
      retry:
        policy: "aggressive"  # More retries for critical service

Example 2: Development/Testing

Fast-fail for quick feedback:

providers:
  all-providers:
    oauth:
      enabled: true
      retry:
        policy: "none"  # No retries during development

Example 3: Unreliable Network

Custom configuration optimized for poor connectivity:

providers:
  mobile-provider:
    oauth:
      enabled: true
      retry:
        policy: "custom"
        max_attempts: 7
        base_delay: 500ms
        max_delay: 60s
        multiplier: 1.5
        backoff_strategy: "exponential"
        jitter_type: "decorrelated"
        respect_retry_after: true

Example 4: Rate-Limited API

Respect rate limits with longer delays:

providers:
  rate-limited-api:
    oauth:
      enabled: true
      retry:
        policy: "custom"
        max_attempts: 3
        base_delay: 5s
        max_delay: 120s
        multiplier: 3.0
        backoff_strategy: "exponential"
        jitter_type: "full"
        respect_retry_after: true

Best Practices

Use Conservative as Default: The conservative preset works well for most production scenarios
Enable Retry-After: Always set respect_retry_after: true to honor server guidance
Add Jitter in Production: Use full or equal jitter to prevent thundering herd
Don't Over-Retry: More retries != better; 3-5 attempts is usually sufficient
Monitor Retry Metrics: Track retry rates to identify problematic providers
Test Both Paths: Test both success and retry scenarios
Consider Costs: Each retry consumes resources; balance reliability with efficiency

Troubleshooting

Retries Not Working

Check policy is not set to none
Verify error is retryable (check status code)
Ensure max_attempts > 1
Check logs for retry attempts

Too Many Retries

Reduce max_attempts
Increase base_delay and max_delay
Consider using none policy for non-critical operations

Long Delays

Reduce max_delay
Use linear or constant backoff instead of exponential
Reduce multiplier

Rate Limits Still Occurring

Ensure respect_retry_after: true
Increase base_delay
Use larger multiplier (3.0 or higher)
Consider reducing concurrent requests

Migration

Existing configurations without retry settings will automatically use the conservative policy. No migration is required, but you can opt-in to custom policies by adding the retry configuration.

Upgrading from Previous Versions

Migration 009 automatically adds retry configuration columns with conservative defaults. Existing OAuth providers will use conservative retry policy unless explicitly configured otherwise.

FilesExpand file tree

RETRY_CONFIGURATION.md

Latest commit

History

RETRY_CONFIGURATION.md

File metadata and controls

Advanced Retry Configuration

Table of Contents

Overview

Key Features

Automatically Retried Status Codes

Quick Start

Using Preset Policies

Custom Configuration

Retry Policy Presets

Conservative (Recommended Default)

Aggressive

None

Custom Retry Configuration

Configuration Options

Example: High-Priority with Quick Retries

Example: Linear Backoff

Configuration Levels

Example: Multi-Level Configuration

Backoff Strategies

Exponential (Recommended)

Linear

Constant

Jitter Types

Full Jitter (Recommended)

Equal Jitter

Decorrelated Jitter

No Jitter

Retryable Errors

Automatically Retried

Never Retried

Retry-After Header Support

Database Schema

Providers Table

Provider OAuth Table

Server Config (Global Defaults)

Examples

Example 1: Production Setup

Example 2: Development/Testing

Example 3: Unreliable Network

Example 4: Rate-Limited API

Best Practices

Troubleshooting

Retries Not Working

Too Many Retries

Long Delays

Rate Limits Still Occurring

Migration

Upgrading from Previous Versions

Related