
Concurrent produce() sequence number fix #1050

Merged
11 commits merged into tulios:master on Feb 9, 2022

Conversation

@t-d-d (Contributor) commented Mar 21, 2021

Read and update sequence numbers immediately before the produce request.
Fixes #1005. Possibly also fixes #598.

Putting this up for review and input, but I'm still unsure about the failure modes. I'm wondering whether we should decrement the sequence numbers if the broker.produce() call fails.

Read/update sequence numbers immediately before produce request
@t-d-d (Contributor, Author) commented Mar 22, 2021

More info on dealing with failures with in-flight requests > 1. Although I don't think we need this full implementation, as I don't think we need to guarantee ordering for concurrent requests (or at least we should state that we don't).

I think this PR as-is is an improvement over the current behaviour. But I think there are open questions around concurrent invocations of produce() and in-flight requests in idempotent mode, especially around error handling.

@Nevon (Collaborator) commented May 11, 2021

Although I don't think we need this full implementation, as I don't think we need to guarantee ordering for concurrent requests (or we at least state that we don't.)

This is correct. We state that you need to set maxInFlightRequests to 1 in order to use the idempotent producer.
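
For reference, that combination looks roughly like this (a minimal sketch; it assumes a kafka client instance already created with new Kafka(...)):

const producer = kafka.producer({
  idempotent: true,        // enable the idempotent producer
  maxInFlightRequests: 1,  // documented requirement when idempotent is enabled
})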

The KIP you linked to has some ideas about improving this by reassigning sequence numbers if there's an error, but I'd say that's a future improvement.

The case we're trying to solve here is where people are essentially doing:

await Promise.all([
  producer.send({ topic, messages: [{ value: 'a' }] }),
  producer.send({ topic, messages: [{ value: 'b' }] }),
  producer.send({ topic, messages: [{ value: 'c' }] })
])

While the order between these three is of course arbitrary, assuming that there are no errors, all three requests should succeed. Currently, this is very likely to fail because the sequence number for all three requests is likely to be the same. This PR changes that so that sequence numbers are always incremented as soon as they are assigned. In fact, a future refactoring could change the interface to the EoS manager so that getting a sequence number automatically increments it, instead of relying on updating it separately.
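
A minimal sketch of what that refactored interface could look like; the names here (acquireSequence and the Map-based bookkeeping) are illustrative, not the existing EoS manager API:

// Reading a sequence number also reserves it, so two concurrent produce()
// calls for the same topic-partition can never be handed the same value.
const createSequenceBook = () => {
  const sequences = new Map() // `${topic}|${partition}` -> next sequence to hand out

  return {
    acquireSequence(topic, partition, recordCount) {
      const key = `${topic}|${partition}`
      const sequence = sequences.get(key) || 0
      sequences.set(key, sequence + recordCount) // reserve before any await happens
      return sequence
    },
  }
}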

What this PR doesn't solve are the failure cases - but as it also doesn't make them worse, I'm inclined to accept it anyway. Some notes about the failure cases below:


These notes are not really related to this PR, but I'm putting them here to increase our shared understanding.

From KIP-98:

For a given PID, sequence numbers will start from zero and be monotonically increasing, with one sequence number per topic partition produced to. The sequence number will be incremented by the producer on every message sent to the broker.

There was a comment in #598 that indicated that the sequence number should increase by one per request. This statement from KIP-98 can be read that way, but that's not what they mean. What they mean is just that there's a sequence number per topic partition, not that it's incremented by one per produce request. Our implementation is correct in that sense: it increases by the number of records produced. Nothing to do here, just clearing something up.
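
A small illustration of that reading (plain arithmetic, not the actual EoS manager code):

// One counter per PID/topic-partition; it advances by the number of records
// in each produce request, not by one per request.
let sequence = 0
const nextBatch = recordCount => {
  const first = sequence
  sequence += recordCount
  return { first, last: sequence - 1 }
}

console.log(nextBatch(3)) // { first: 0, last: 2 } (request with 3 records)
console.log(nextBatch(2)) // { first: 3, last: 4 } (next request with 2 records)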

It goes on:

The broker will reject a produce request if its sequence number is not exactly one greater than the last committed message from that PID/TopicPartition pair. Messages with a lower sequence number result in a duplicate error, which can be ignored by the producer. Messages with a higher number result in an out-of-sequence error, which indicates that some messages have been lost, and is fatal.
...
The Producer will raise an OutOfOrderSequenceException if the broker detects data loss. In other words, if it receives a sequence number which is greater than the sequence it expected. This exception will be returned in the Future and passed to the Callback, if any. This is a fatal exception, and future invocations of Producer methods like send, beginTransaction, commitTransaction, etc. will raise an IllegalStateException.

This makes me think that the current design of the transactional producer is a bit wonky. What I guess should happen is that if we receive an OUT_OF_ORDER_SEQUENCE_NUMBER, we should transition to a new state in the EoS manager that rejects immediately on any attempt to send any more. This is currently not handled anywhere that I can see. While we could reject any pending requests as well, those should receive an OUT_OF_ORDER_SEQUENCE_NUMBER anyway from the broker, so it's maybe not worth the hassle (getting them out of the queue and rejecting them can be a bit tricky).

There is also the gnarlier situation where we don't get an OUT_OF_ORDER_SEQUENCE_NUMBER, but the request fails anyway, for example due to a connection error where we run out of retries. In such a case, we can't know what the next sequence number is. In this case, it seems the only safe thing to do is to handle it in the same way as if we got an OUT_OF_ORDER_SEQUENCE_NUMBER error and reject any further use of the transactional producer, and require a clean slate with a new producer id etc.
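
A sketch of what such a guard could look like; the state handling and names are hypothetical, not the current EoS manager interface:

// Once a fatal error is recorded (OUT_OF_ORDER_SEQUENCE_NUMBER, or a request
// that failed in a way that leaves the next sequence number unknown), every
// later send is rejected until a fresh producer id/epoch is initialized.
const createFatalGuard = () => {
  let fatalError = null

  return {
    markFatal(error) {
      fatalError = error
    },
    assertUsable() {
      if (fatalError != null) {
        throw new Error(`Producer is unusable after fatal error: ${fatalError.message}`)
      }
    },
    reset() {
      fatalError = null // only after re-initializing the producer id/epoch
    },
  }
}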

@t-d-d (Contributor, Author) commented Jul 25, 2021

@Nevon I have added code that reverts the sequence numbers in case of a failure. Now it will at least recover in the normal case of sequential produce() calls (for a topic-partition).

For errors when there are concurrent produce() calls ongoing (for a topic-partition), the idempotent producer will die with an UNKNOWN_PRODUCER_ID or OUT_OF_ORDER_SEQUENCE_NUMBER error. This could be made to work if the sequence number handling were moved inside the mutex lock that limits maxInFlightRequests. It would be pretty simple to add another lock (per broker, or maybe even per partition?) in sendMessages.js.
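
A sketch of such a lock, as a promise-chain mutex keyed per topic-partition (none of these names exist in sendMessages.js; this is only an illustration of the idea):

// Serializes "read sequence -> broker.produce -> update or roll back sequence"
// per topic-partition, so a failure can roll back without racing a concurrent call.
const createPartitionLocks = () => {
  const chains = new Map() // `${topic}|${partition}` -> tail of the promise chain

  return (topic, partition, criticalSection) => {
    const key = `${topic}|${partition}`
    const previous = chains.get(key) || Promise.resolve()
    const current = previous.catch(() => {}).then(criticalSection)
    chains.set(key, current)
    return current
  }
}

// const withPartitionLock = createPartitionLocks()
// await withPartitionLock(topic, partition, async () => { /* read sequence, produce, update or revert */ })

The trade-off is that concurrent sends to the same partition would be serialized, which is probably acceptable given the ordering guarantees the idempotent mode wants anyway.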

Test cases added that capture the current behaviour.

I just noticed that

await Promise.all(requests)

should probably be an allSettled(). It would cause problems if not all the promises are settled before the retry.
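
The difference being that Promise.all rejects as soon as one request fails while others may still be in flight, whereas allSettled only resolves after every request has settled, so a retry cannot overlap a still-pending request. Roughly (generic Promise semantics, not this PR's actual retry code):

const produceAndMaybeRetry = async requests => {
  // Wait for every outcome; nothing from this round is still in flight afterwards.
  const results = await Promise.allSettled(requests)
  const failed = results.filter(result => result.status === 'rejected')

  if (failed.length > 0) {
    // safe point to retry or revert sequence numbers, without racing in-flight requests
  }

  return results
}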

@t-d-d requested a review from Nevon on July 25, 2021 09:56
@Nevon (Collaborator) left a comment


Just one minor note about whether or not we can remove the sleeps from the tests, but overall it looks good to me. Thanks for your patience!

)
})

it('concurrent produce() calls > all messages are written to the partition once', async () => {
Collaborator

Don't worry about changing this, but just for future reference, you can just wrap these test cases in a describe block to group the tests together, instead of repeating the prefix in each test case:

describe('concurrent produce() calls', () => {
  test('all messages are written to the partition once')
})

src/producer/__tests__/idempotentProduceMessages.spec.js (outdated review comment, resolved)
/**
 * @template T
 */
function allSettled(promises) {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice. We should actually bump the requirement on Node to 12 at this point, but that's outside the scope of this PR.
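
For context, Promise.allSettled only shipped natively in later Node releases (12.9+), which is presumably why the PR carries a helper. A typical fallback looks roughly like this (a generic sketch, not necessarily the implementation in this PR):

/**
 * Same shape as Promise.allSettled: resolves once every input promise has
 * either fulfilled or rejected, never rejecting itself.
 * @template T
 * @param {Array<Promise<T>>} promises
 */
const allSettled = promises =>
  Promise.all(
    promises.map(promise =>
      Promise.resolve(promise)
        .then(value => ({ status: 'fulfilled', value }))
        .catch(reason => ({ status: 'rejected', reason }))
    )
  )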

aikoven referenced this pull request in aikoven/kafkajs Jan 16, 2022
@Nevon Nevon merged commit 7cc43da into tulios:master Feb 9, 2022
@pi2sqr commented Dec 12, 2022

Hi,

I have a question.
I'm trying to run a performance test against a default topic: --topic kafka-test --replication-factor 2 --partitions 1.
No other settings differ from the defaults.

I have a Producer and a Consumer written in Java.
The Producer asynchronously sends 10 messages in a loop, one by one, each a ~10 kB record:

for (int i = 0; i < numberOfRecords; i++) {
    start_time = System.currentTimeMillis();
    ProducerRecord<String, String> record = new ProducerRecord<>(kafkaProp.getProperty("kafka.topic"), "key " + i, start_time + message);
    Future sendFuture = producer.send(record);
    if (sendFuture.isDone()) {
        try {
            sendFuture.get();
        } catch (InterruptedException var7) {
            Thread.currentThread().interrupt();
            throw new KafkaException("Interrupted", var7);
        } catch (ExecutionException var8) {
            throw new KafkaException("Send failed", var8.getCause());
        }
    }
}
producer.flush();
producer.close();

But even with one Producer I'm getting:
"...org.apache.kafka.common.errors.OutOfOrderSequenceException: The broker received an out of order sequence number. for topic-partition kafka-test-0 with producerId 24053, epoch 851, and sequence number 1
13:19:36:569 [kafka-producer-network-thread | producer-1] INFO org.apache.kafka.clients.producer.internals.TransactionManager - [Producer clientId=producer-1] ProducerId set to 24053 with epoch 852
"

What is strange to me is that the first 2050 messages are sent in less than a second.
Then it takes some time to send the next messages, 20 ms and up, but 2050 x 10270 B is still not the size of the topic's buffer.
The more Producers I have for one partition, the more errors I get.
On the Consumer side I count how many messages I don't receive, and it's from 200 to over 2000 with 1 Producer. With more Producers it's over 60% of messages that are not received (though maybe sent correctly).

Can you advise?

Successfully merging this pull request may close these issues.

The idempotent producer doesn't work properly when sending R requests, R > # of partitions