eventhub-scaler works on new hubs where initially no storage checkpoint exists #798

christle · 2020-04-30T14:48:47Z

eventhub-scaler can scale even when no storage checkpoint exists.

To scale up without checkpoint, the first calculation based only on eventhub partition infos. The tricky part was to differentiate between a partition without messages and exactly 1 unprocessed message.

Fixes #797

jeffhollan · 2020-04-30T16:36:34Z

Thank you @christle - it may have been this same issue but I'd even seen issues where checkpoints exist (in that Azure Storage has partition records), but the offset was null becaues that partition had never received a message. Do you know if this will catch that case too?

christle · 2020-05-04T05:30:10Z

Hi @jeffhollan, as fas as i know, the scaler ignores the offset value, so this pr dont fix this issue. But i have made a similar observation for the offset and have an idea, what's the cause for this and in which case it's happening. It took's me a while to find this out.
In our project, we have some hubs in our Eventhub and everything works with keda as expected, but then we add a new hub for an Endpoint from an IOT hub and with or without messages, the scaler scales out to the maximum pod number. The offset inside the checkpoint where null. Like i said the offset is ingored by the scaler, but in this case the lastsequencenumber inside the partition stays on -1. No matter how much messages comes from the IOT hub Endpoint. With -1 the scaler runs inside the azure_eventhub_scaler.go file to this line:

unprocessedEventCountInPartition = (math.MaxInt64 - partitionInfo.LastSequenceNumber) + checkpoint.SequenceNumber

I think this should ends in an overflow Exception.The result is, the scaler returns an very high metric value (everytime the same value, no matter how much messages).

I dont know why messages, which are routed from the IOT Hub to an EventHub, don't increase the lastsequencenumber and the offset, but i think the scaler should ignore the -1 and do nothing instead of scaling up.

SatishRanjan · 2020-05-20T20:46:45Z

pkg/scalers/azure_eventhub_scaler.go

@@ -116,15 +117,22 @@ func parseAzureEventHubMetadata(metadata, resolvedEnv map[string]string) (*Event
 }

 //GetUnprocessedEventCountInPartition gets number of unprocessed events in a given partition
-func (scaler *AzureEventHubScaler) GetUnprocessedEventCountInPartition(ctx context.Context, partitionID string) (newEventCount int64, err error) {
+func (scaler *AzureEventHubScaler) GetUnprocessedEventCountInPartition(ctx context.Context, partitionID string) (newEventCount int64, checkpoint Checkpoint, err error) {
 	partitionInfo, err := scaler.client.GetPartitionInformation(ctx, partitionID)


This is the redundant call to GetPartitionInformation for Partition id in GetUnprocessedEventCountInPartition, we can pass the reference of partitionInfo eventhub.HubPartitionRuntimeInformation from IsActive and GetMetrics methods where this information has already been retrieved. Other than that this change will solve the issue of "Until the an event hub processor is scaled to at least 1 manually, the keda-operator will never be able to get the checkpoint data"

Ok thank you. I can change that. But now there are some merge conflicts. I will try to fix it.

in isActive is no partitionInfo, only runtimeInfo but i can add it

ok i'm done. I eliminate one call to GetPartitionInformation and fix the merge conflicts.

… exists Signed-off-by: Christian Leinweber <christian.leinweber@maibornwolff.de>

ahmelsayed · 2020-05-21T05:24:24Z

Thanks @christle for your contribution :)

christle requested review from ahmelsayed and zroubalik as code owners April 30, 2020 14:48

zroubalik assigned ahmelsayed May 11, 2020

ahmelsayed approved these changes May 16, 2020

View reviewed changes

ahmelsayed requested a review from SatishRanjan May 16, 2020 22:36

evillgenius75 mentioned this pull request May 18, 2020

Event Hub Scaler logic is not properly accounting for initial state of empty EventHub partitions #830

Closed

SatishRanjan reviewed May 20, 2020

View reviewed changes

christle force-pushed the master branch from 3455d74 to 6348faa Compare May 21, 2020 01:29

eventhub: can scale on new hubs where initially no storage checkpoint…

19224b2

… exists Signed-off-by: Christian Leinweber <christian.leinweber@maibornwolff.de>

christle force-pushed the master branch from 6348faa to 19224b2 Compare May 21, 2020 01:41

ahmelsayed merged commit 85735b2 into kedacore:master May 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

eventhub-scaler works on new hubs where initially no storage checkpoint exists #798

eventhub-scaler works on new hubs where initially no storage checkpoint exists #798

christle commented Apr 30, 2020

jeffhollan commented Apr 30, 2020

christle commented May 4, 2020

SatishRanjan May 20, 2020 •

edited

Loading

christle May 20, 2020

christle May 21, 2020

christle May 21, 2020

ahmelsayed commented May 21, 2020

eventhub-scaler works on new hubs where initially no storage checkpoint exists #798

eventhub-scaler works on new hubs where initially no storage checkpoint exists #798

Conversation

christle commented Apr 30, 2020

jeffhollan commented Apr 30, 2020

christle commented May 4, 2020

SatishRanjan May 20, 2020 • edited Loading

Choose a reason for hiding this comment

christle May 20, 2020

Choose a reason for hiding this comment

christle May 21, 2020

Choose a reason for hiding this comment

christle May 21, 2020

Choose a reason for hiding this comment

ahmelsayed commented May 21, 2020

SatishRanjan May 20, 2020 •

edited

Loading