feat: retry on 429 #2244

jjhwan-h · 2025-08-21T05:22:24Z

Summary

This PR introduces a round-robin retry mechanism for HTTP 429 (Too Many Requests) responses.
Users can control the number of retries and the delay between them using two new flags:

--retry-rounds

--retry-delay

Changes

Options

Added RetryRounds (int) → number of retries for 429 responses

Added RetryDelay (int) → delay between retries (in ms)

Added validation to ensure retries and delay values are valid

Runner

Integrated a new retryLoop with drainedCh synchronization

Introduced retryJob struct for retry queue management

On 429 responses, jobs are pushed into the retryCh for delayed execution

Used atomic.Int64 to track remaining retries and close drainedCh when done

Tests

Added TestRunner_Process_And_RetryLoop

srv1: Always returns 429 → retries exhausted, expected 3×429, 0×200

srv2: Returns 2×429 then 200 → expected 2×429, 1×200

Verified drain and retry logic

Example

httpx -l urls.txt --retry-rounds 2 --retry-delay 500

A request returning 429 will be retried up to 2 times

Each retry happens after 500ms delay

If retries are exhausted, the request is considered failed

Related Issue

#1078

Summary by CodeRabbit

New Features
- Automatic retries for HTTP 429 (rate-limited) responses with configurable rounds and delay.
- New CLI flags: --retry-rounds and --retry-delay.
- Run completion now waits for all retry attempts to finish.
- Validation: when retry rounds > 0, retry delay must be > 0.
Tests
- Added a concurrency test verifying retry behavior and handling of rate-limited responses.

coderabbitai · 2025-08-21T05:22:31Z

Walkthrough

Adds configurable HTTP 429 retry support: new Options fields and CLI flags with validation; introduces a retryJob type, retry channel, and retryLoop with scheduling and atomic draining; propagates retry channel through processing (including nested probes); updates Runner method signatures and adds tests.

Changes

Cohort / File(s)	Summary
Options API and CLI flags `runner/options.go`	Adds `RetryRounds` and `RetryDelay` fields to `Options`, registers `--retry-rounds` and `--retry-delay` CLI flags, and validates `RetryDelay > 0` when `RetryRounds > 0`.
Retry mechanism core `runner/runner.go`, `runner/...`	Adds `retryJob` type (`runner/types.go`), introduces `retryCh` channel, implements `retryLoop` with scheduling and atomic in-flight tracking, enqueues retries on HTTP 429 up to `RetryRounds` with `RetryDelay`, and threads `retryCh` through `Process`/`process` and nested probe calls.
Tests `runner/runner_test.go`	Adds `TestRunner_Process_And_RetryLoop` to exercise 429->retry behavior across servers; includes a minor whitespace alignment tweak in another test.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  actor User as RunEnumeration
  participant W as Worker/process
  participant RL as RetryLoop
  participant S as HTTP Server
  participant O as Output

  rect rgba(230,245,255,0.6)
    User->>W: process(target, retryCh)
    W->>S: analyze request
    S-->>W: response (200 / 429)
    alt 429 and attempts < RetryRounds
      W->>RL: enqueue retryJob (when = now + RetryDelay)
    end
    W-->>O: emit result
  end

  rect rgba(240,255,240,0.6)
    loop until drained
      RL-->>RL: wait until(job.when)
      RL->>S: analyze retry request
      S-->>RL: response
      RL-->>O: emit retry result
      alt 429 and attempts < RetryRounds
        RL->>RL: re-enqueue retryJob
      end
    end
  end

  note over User,RL: Drained signaled when initial + retry jobs complete

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Poem

I nibbled code paths, hop by hop,
Queued tiny retries when servers said "stop".
A delay and a round, then another try—
Counting attempts as the packets fly.
When channels are empty and burrows clear,
I twitch my nose and thunder forth a cheer. 🐇✨

Tip

🔌 Remote MCP (Model Context Protocol) integration is now available!

Pro plan users can now connect to remote MCP servers from the Integrations page. Connect with popular remote MCPs such as Notion and Linear to add more context to your reviews and chats.

✨ Finishing Touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR/Issue comments)

Type @coderabbitai help to get the list of available commands.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Status, Documentation and Community

Visit our Status Page to check the current availability of CodeRabbit.
Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

runner/runner.go (1)
1354-1355: Consider documenting the retry attempt tracking logic.

The condition if job.attempt == 1 adds to the remaining counter only for first attempts. While this is correct for tracking initial jobs vs retries, it might benefit from a clarifying comment.

Add a comment to clarify the tracking logic:
+				// Track only initial attempts (attempt=1) to know when all original jobs are done
 				if job.attempt == 1 {
 					remaining.Add(1)
 				}

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 5f55bfa and 9936878.

📒 Files selected for processing (4)

runner/options.go (3 hunks)
runner/runner.go (9 hunks)
runner/runner_test.go (3 hunks)
runner/types.go (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (4)

runner/options.go (2)

runner/runner.go (1)

New (114-389)

common/httpx/httpx.go (1)

New (46-213)

runner/types.go (3)

common/httpx/httpx.go (1)

HTTPX (33-43)

common/httpx/types.go (1)

Target (4-8)

runner/options.go (1)

ScanOptions (53-110)

runner/runner_test.go (4)

runner/runner.go (1)

New (114-389)

runner/options.go (1)

Options (173-353)

runner/types.go (1)

Result (35-103)

common/httpx/http2.go (1)

HTTP (15-15)

runner/runner.go (4)

common/httpx/httpx.go (1)

HTTPX (33-43)

common/httpx/types.go (1)

Target (4-8)

runner/options.go (1)

ScanOptions (53-110)

runner/types.go (1)

Result (35-103)

🔇 Additional comments (11)

runner/types.go (1)

123-132: LGTM! Well-structured retry state management type.

The retryJob struct appropriately captures all necessary state for implementing HTTP 429 retry logic. The fields are properly organized and the use of time.Time for scheduling is appropriate.

runner/options.go (2)

277-278: Good addition of retry configuration fields.

The new RetryRounds and RetryDelay fields are appropriately typed and well-positioned within the Options struct alongside other rate-limiting related configurations.

535-536: Clear and well-documented CLI flags for retry configuration.

The flags are properly grouped under "Optimizations" with clear descriptions. The default delay of 500ms is reasonable for HTTP 429 retry scenarios.

runner/runner_test.go (1)

322-409: Well-designed test for concurrent retry behavior.

The test thoroughly validates the retry mechanism with proper concurrent execution handling and accurate result counting. The test setup with two servers having different 429 response patterns effectively covers the retry logic edge cases.

runner/runner.go (7)

1261-1263: Good channel and goroutine management for retry mechanism.

The retry channel creation and retry loop initialization are properly placed before the main processing begins.

1306-1308: Proper synchronization with drain channel.

The conditional wait on the drain channel when RetryRounds > 0 ensures all retry attempts complete before closing the output channel. This prevents premature termination.

1333-1393: Well-implemented retry loop with proper cancellation and draining.

The retry loop implementation is robust with:

Proper context cancellation support

Atomic counter for tracking in-flight jobs

Timer-based scheduling with cleanup

Correct drain signaling when all work completes

1481-1493: Correct retry job scheduling for 429 responses.

The retry job creation properly captures all necessary state and schedules with the configured delay. The check for RetryRounds > 0 ensures retries only happen when configured.

1499-1514: Good propagation of retry channel through TLS/CSP probe paths.

The retry channel is correctly passed through all recursive r.process calls in both TLS and CSP probing paths, ensuring consistent retry behavior across all request types.

1548-1560: Consistent retry handling for custom port requests.

The retry logic for custom port requests mirrors the default port handling, maintaining consistency across different request paths.

1566-1570: Proper retry channel propagation in custom port TLS probing.

The retry channel is correctly passed through the TLS probe paths for custom ports.

runner/options.go

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (4)

runner/runner_test.go (4)
325-344: Make the server conditions more explicit (< vs !=) for readability

Using “not equal to N” works, but “less than N” communicates intent (first N-1 calls) more clearly and avoids edge-case surprises if the counter increments beyond the sentinel.

Apply this diff to clarify:
-    if atomic.AddInt32(&hits1, 1) != 4 {
+    if atomic.AddInt32(&hits1, 1) < 4 {
         w.WriteHeader(http.StatusTooManyRequests)
         return
     }
@@
-    if atomic.AddInt32(&hits2, 1) != 3 {
+    if atomic.AddInt32(&hits2, 1) < 3 {
         w.WriteHeader(http.StatusTooManyRequests)
         return
     }
345-351: Tiny nit: document RetryDelay units to avoid future confusion

Options.RetryDelay is treated as milliseconds by the new retry loop. Consider a short comment to prevent accidental second-based assumptions during future edits.

Example:
 r, err := New(&Options{
     Threads:     1,
     RetryRounds: 2,
-    RetryDelay:  5,
+    RetryDelay:  5, // milliseconds
     Timeout:     3,
 })
373-389: Compare by prefix rather than strict equality to make URL matching robust

Depending on how Result.URL is populated, it may include a trailing slash or path. Using strict equality with srvX.URL risks brittle matches. HasPrefix is sufficient here and resilient to minor formatting differences.

Apply this diff:
-            switch res.StatusCode {
+            switch res.StatusCode {
             case http.StatusTooManyRequests:
-                if res.URL == srv1.URL {
+                if strings.HasPrefix(res.URL, srv1.URL) {
                     s1n429++
                 } else {
                     s2n429++
                 }
             case http.StatusOK:
-                if res.URL == srv1.URL {
+                if strings.HasPrefix(res.URL, srv1.URL) {
                     s1n200++
                 } else {
                     s2n200++
                 }
             }
401-409: Strengthen assertions with server hit counts

In addition to output-based assertions, verify the exact number of HTTP calls made to each server. This catches cases where the output channel behavior changes but retries still occur (or vice versa).

Apply this diff:
     // srv2: should have 2x 429 responses and 1x 200 (succeeds on 3rd attempt)
     require.Equal(t, 2, s2n429)
     require.Equal(t, 1, s2n200)
+
+    // Also assert exact server invocations:
+    // srv1: 1 initial + 2 retries = 3 total
+    require.EqualValues(t, 3, atomic.LoadInt32(&hits1))
+    // srv2: 1 initial + 2 retries = 3 total, ends with 200 on 3rd
+    require.EqualValues(t, 3, atomic.LoadInt32(&hits2))

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 9936878 and 7fe6a48.

📒 Files selected for processing (2)

runner/options.go (3 hunks)
runner/runner_test.go (3 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

runner/options.go

🧰 Additional context used

🧬 Code graph analysis (1)

runner/runner_test.go (3)

runner/runner.go (1)

New (114-389)

runner/options.go (1)

Options (173-353)

runner/types.go (1)

Result (35-103)

🔇 Additional comments (2)

runner/runner_test.go (2)

4-11: Imports look correct and minimal

The newly added imports are necessary for the test logic (servers, status codes, atomics, sync utils). No concerns.

Also applies to: 19-19

236-239: Formatting-only struct field alignment

This change is cosmetic and does not affect behavior. Safe to keep.

coderabbitai · 2025-08-21T06:50:51Z

runner/runner_test.go

+	_, drainedCh := r.retryLoop(context.Background(), retryCh, output, r.analyze)
+


💡 Verification agent

🧩 Analysis chain

Don’t ignore the retry loop’s stop handle; avoid a goroutine leak

retryLoop appears to return a stop/cleanup handle (you currently discard it with _). If the loop isn’t explicitly stopped, the goroutine can remain blocked on retryCh after the test finishes, causing leaks and potential flakiness in suite runs.

Apply this diff to wire cancellation correctly and ensure cleanup even on early test failures:

- _, drainedCh := r.retryLoop(context.Background(), retryCh, output, r.analyze) + ctx, cancel := context.WithCancel(context.Background()) + stopRetry, drainedCh := r.retryLoop(ctx, retryCh, output, r.analyze) + t.Cleanup(func() { + // Ensure the loop exits and resources are released + stopRetry() + cancel() + })

If you want to double-check the signature and the semantics of the first return value:

🏁 Script executed:

#!/bin/bash # Inspect retryLoop signature/returns to confirm cleanup expectations rg -nP -C2 'func\s+$r\s+\*Runner$\s+retryLoop$' runner

Length of output: 342

🏁 Script executed:

#!/bin/bash # Retrieve the full signature (including return values) of retryLoop to confirm the first return value’s type and semantics rg -nP -C10 'func\s+\(r\s+\*Runner$\s+retryLoop\(' runner/runner.go

Length of output: 691

🏁 Script executed:

#!/bin/bash # Extract the full body of retryLoop to confirm how stop() and drainedCh operate sed -n '1333,1400p' runner/runner.go

Length of output: 1518

🏁 Script executed:

#!/bin/bash # Locate usages of retryLoop in the test file rg -n -C5 'retryLoop' runner/runner_test.go # Show full context around lines 350–380 to inspect how stop and drainedCh are handled sed -n '340,380p' runner/runner_test.go

Length of output: 1276

Wire up the retryLoop stop handle to prevent goroutine leaks

The test currently invokes retryLoop with a background context and discards its first return value (stop func()), so the internal goroutine never sees a cancellation signal. This can leave the loop blocked on retryCh after the test ends, leading to leaked goroutines and flakiness.

• File: runner/runner_test.go
• Lines: 356–357

Suggested change:

- _, drainedCh := r.retryLoop(context.Background(), retryCh, output, r.analyze) + // Create a cancellable context and capture the stop function + ctx, cancel := context.WithCancel(context.Background()) + stopRetry, drainedCh := r.retryLoop(ctx, retryCh, output, r.analyze) + // Ensure the loop exits and the context is cleaned up after the test + t.Cleanup(func() { + stopRetry() + cancel() + })

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

_, drainedCh := r.retryLoop(context.Background(), retryCh, output, r.analyze)

// Create a cancellable context and capture the stop function

ctx, cancel := context.WithCancel(context.Background())

stopRetry, drainedCh := r.retryLoop(ctx, retryCh, output, r.analyze)

// Ensure the loop exits and the context is cleaned up after the test

t.Cleanup(func() {

stopRetry()

cancel()

})

🤖 Prompt for AI Agents

In runner/runner_test.go around lines 356–357, the test calls r.retryLoop(context.Background(), ...) but ignores the returned stop func, leaving the internal goroutine unable to observe cancellation; capture the first return value (stop func) when calling retryLoop and ensure you call it (preferably via defer stop() immediately after the call) so the retryLoop is signaled to exit and no goroutine is leaked.

coderabbitai · 2025-08-21T06:50:51Z

runner/runner_test.go

+	wg.Wait()
+	<-drainedCh
+	close(output)
+	drainWG.Wait()


🛠️ Refactor suggestion

Guard against hangs: time-bound the wait for drainedCh

If the retry loop logic regresses, <-drainedCh can block indefinitely and hang the test suite. Use a bounded wait to fail fast and surface the defect.

Apply this diff:

- wg.Wait() - <-drainedCh - close(output) + wg.Wait() + select { + case <-drainedCh: + // drained successfully + case <-time.After(5 * time.Second): + t.Fatalf("retry loop did not drain within timeout") + } + close(output)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

wg.Wait()

<-drainedCh

close(output)

drainWG.Wait()

wg.Wait()

select {

case <-drainedCh:

// drained successfully

case <-time.After(5 * time.Second):

t.Fatalf("retry loop did not drain within timeout")

}

close(output)

drainWG.Wait()

Mzack9999 · 2025-08-27T18:06:07Z

Nice one! I was thinking about something similar with automatic detection of rate limit from RL headers combined with https://github.com/projectdiscovery/ratelimit/tree/feat-per-key-limit to make the tool compliant with them without slowness.
Just curious, were you testing many different paths towards the same target in order to hit 429? Generally httpx use case is to send few requests to a very large number of targets.

jjhwan-h · 2025-08-28T14:37:11Z

Thanks for the feedback!

In my testing I was indeed sending many different paths to the same target, which resulted in 429 responses. However, in real-world environments with CDNs and WAFs, 429s can also occur sporadically across multiple targets, even with only a few requests per host.

The --retry-rounds flag is therefore intended as an optional safeguard to preserve result accuracy in such cases. It does not alter the default behavior and can be enabled only when needed.

if you feel additional tests or adjustments to the implementation would make this more useful or easier to maintain, I’d be happy to revise the PR accordingly!

jjhwan-h added 4 commits August 21, 2025 14:00

feat: add round-robin retries via Retry Rounds/Retry Delay

b8e75fa

fix: remove short aliases (-rr, -rd) to avoid redefinition

8c2f4ff

fix: add max retries reached

6147653

feat: add round-robin retries via Retry Rounds/Retry Delay

ae4b2e9

jjhwan-h force-pushed the feat/retry-on-429 branch from a4ffbdd to 9936878 Compare August 21, 2025 05:42

coderabbitai bot reviewed Aug 21, 2025

View reviewed changes

runner/options.go Outdated Show resolved Hide resolved

jjhwan-h force-pushed the feat/retry-on-429 branch from 9936878 to 230afce Compare August 21, 2025 06:38

refactor: retryLoop with atomic counter and drained channel

7fe6a48

jjhwan-h force-pushed the feat/retry-on-429 branch from 230afce to 7fe6a48 Compare August 21, 2025 06:40

coderabbitai bot reviewed Aug 21, 2025

View reviewed changes

Mzack9999 self-requested a review August 27, 2025 18:02

dogancanbakir linked an issue Aug 28, 2025 that may be closed by this pull request

round robin system for 429 status code #1078

Open

jjhwan-h changed the title ~~Feat/retry on 429~~ feat: retry on 429 Oct 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: retry on 429 #2244

feat: retry on 429 #2244

jjhwan-h commented Aug 21, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Aug 21, 2025 •

edited

Loading

Chat

Support

CodeRabbit Commands (Invoked using PR/Issue comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Status, Documentation and Community

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Aug 21, 2025

Uh oh!

coderabbitai bot Aug 21, 2025

Uh oh!

Mzack9999 commented Aug 27, 2025

Uh oh!

jjhwan-h commented Aug 28, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		_, drainedCh := r.retryLoop(context.Background(), retryCh, output, r.analyze)

-	_, drainedCh := r.retryLoop(context.Background(), retryCh, output, r.analyze)
+    // Create a cancellable context and capture the stop function
+    ctx, cancel := context.WithCancel(context.Background())
+    stopRetry, drainedCh := r.retryLoop(ctx, retryCh, output, r.analyze)
+    // Ensure the loop exits and the context is cleaned up after the test
+    t.Cleanup(func() {
+        stopRetry()
+        cancel()
+    })

Uh oh!

feat: retry on 429 #2244

Are you sure you want to change the base?

feat: retry on 429 #2244

Conversation

jjhwan-h commented Aug 21, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Options

Runner

Tests

Example

Related Issue

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

Chat

Support

CodeRabbit Commands (Invoked using PR/Issue comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Status, Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 21, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 21, 2025

Choose a reason for hiding this comment

Uh oh!

Mzack9999 commented Aug 27, 2025

Uh oh!

jjhwan-h commented Aug 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jjhwan-h commented Aug 21, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Aug 21, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

jjhwan-h commented Aug 28, 2025 •

edited

Loading