Fix flaky Iceberg parquet metadata cache test #24415

nmahadevuni · 2025-01-22T06:50:13Z

Description

Fixes flaky test described in #22422

Motivation and Context

It seems jmx metrics may get mixed up if we are using the query runner and multiple workers. The jmx metrics are bound to coordinator cache object only and owing to the very tiny data read in the test query, if the coordinator doesn't process any split, it will just report zero metrics. So, making the number of nodes to only 1, so this problem is eliminated.

Impact

No impact

Test Plan

Release Notes

Please follow release notes guidelines and fill in the release notes below.

== NO RELEASE NOTE ==

nmahadevuni · 2025-01-23T09:11:06Z

@hantangwangd can you please review this?

hantangwangd · 2025-01-23T19:35:17Z

@nmahadevuni Sorry for the late response. I took some time to figure out why coordinator and workers in the query runner will impact each other's jmx metrics (it seems that one will override the other). The core reason is that, all nodes in the query runner have the same MBeanServer instance and then registered their own parquet metadata cacheStatsMBean into this singleton MBeanServer.

Set the number of nodes to only 1 can definitely fix this flaky test. But if we want to completely solve the problem, we need to inject separate MBeanServers for different coordinators and workers, which may involve a small refactoring of IcebergPlugin and IcebergConnectorFactory. So, do you think it's necessary to do this refactoring to thoroughly address the mutual influence of jmx metrics between different nodes in a query runner? @tdcmeehan

nmahadevuni · 2025-01-27T06:52:09Z

Thanks @hantangwangd. Yes I think that helps us get the real world setup in the query runner too. Will help a lot with metrics integration tests.

Fix flaky Iceberg parquet metadata cache test

7fd5618

nmahadevuni requested review from hantangwangd, ZacBlanco and a team as code owners January 22, 2025 06:50

nmahadevuni requested a review from presto-oss January 22, 2025 06:50

prestodb-ci added the from:IBM PR from IBM label Jan 22, 2025

prestodb-ci requested review from a team, aaneja and infvg and removed request for a team January 22, 2025 06:50

tdcmeehan approved these changes Jan 23, 2025

View reviewed changes

tdcmeehan merged commit 3534361 into prestodb:master Jan 23, 2025
52 checks passed

shangm2 pushed a commit to shangm2/presto that referenced this pull request Jan 23, 2025

Fix flaky Iceberg parquet metadata cache test (prestodb#24415)

64b211f

This was referenced Jan 28, 2025

Add release notes for 0.291 #24445

Open

Add release notes for 0.291 #24448

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix flaky Iceberg parquet metadata cache test #24415

Fix flaky Iceberg parquet metadata cache test #24415

nmahadevuni commented Jan 22, 2025

nmahadevuni commented Jan 23, 2025

hantangwangd commented Jan 23, 2025

nmahadevuni commented Jan 27, 2025

Fix flaky Iceberg parquet metadata cache test #24415

Fix flaky Iceberg parquet metadata cache test #24415

Conversation

nmahadevuni commented Jan 22, 2025

Description

Motivation and Context

Impact

Test Plan

Release Notes

nmahadevuni commented Jan 23, 2025

hantangwangd commented Jan 23, 2025

nmahadevuni commented Jan 27, 2025