Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed dispatcher skipping delivery of a batch during concurrent replays #5499

Merged
merged 1 commit into from
Oct 30, 2019

Conversation

merlimat
Copy link
Contributor

Motivation

There is a race condition when replaying messages on the PersistentDispatcherMultipleConsumers. This is apparently only happening in the context of delivering messages after a scheduled delay.

The Dispatcher.readMoreEntries() can get invoked in multiple occasions:

  • Piggybacking on the completion of the previous read operation
  • When the delaye tracker gets triggered

If there is already a replay read operation pending, we're currently ignoring the message ids returned by the delay tracker. These messages will not be retried on any other occasion, leading to backlog building up (until a consumer disconnects/reconnects).

Modifications

If we know we have a pending replay operation, don't fetch message ids from the delay tracker.

@merlimat merlimat added the type/bug The PR fixed a bug or issue reported a bug label Oct 29, 2019
@merlimat merlimat added this to the 2.4.2 milestone Oct 29, 2019
@merlimat merlimat self-assigned this Oct 29, 2019
@merlimat
Copy link
Contributor Author

This issue apparently was introduced in #5391

@jerrypeng
Copy link
Contributor

rerun java8 tests

@wolfstudy
Copy link
Member

org.apache.pulsar.functions.worker.PulsarFunctionE2ESecurityTest.testAuthorizationWithAnonymousUser

run java8 tests

@wolfstudy wolfstudy merged commit 01ca24b into apache:master Oct 30, 2019
@merlimat merlimat deleted the delayed-delivery-bug branch October 30, 2019 13:39
wolfstudy pushed a commit that referenced this pull request Nov 20, 2019
…ys (#5499)

There is a race condition when replaying messages on the `PersistentDispatcherMultipleConsumers`. This is apparently only happening in the context of delivering messages after a scheduled delay. 

The `Dispatcher.readMoreEntries()` can get invoked in multiple occasions: 
 * Piggybacking on the completion of the previous read operation
 * When the delaye tracker gets triggered

If there is already a replay read operation pending, we're currently ignoring the message ids returned by the delay tracker. These messages will not be retried on any other occasion, leading to backlog building up (until a consumer disconnects/reconnects).

(cherry picked from commit 01ca24b)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type/bug The PR fixed a bug or issue reported a bug
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants