Skip to content

Reprocessing workflow cause many DLQ messages #708

Closed
@iliapolo

Description

@iliapolo

Whenever our reprocess workflow is triggered, we see a bunch of DLQ message that say:

Rate exceeded (Service: AmazonECS; Status Code: 400; Error Code: ThrottlingException; Request ID: 1db00da3-2470-4c18-b97a-ffca0c6aa602; Proxy: null)

This happens because reprocessing means spawning around 40,000 ECS tasks, while our current service limit stands on 1000.

I think we need a two vector solution here:

  1. Increase service limits to make reprocessing faster.
  2. Tweak the retry policy or some other mechanism to prevent this from happening.

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingp1

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions