[SPARK-47547] BloomFilter fpp degradation #50933

ishnagy · 2025-05-19T09:37:45Z

What changes were proposed in this pull request?

This change fixes a performance degradation issue in the current BloomFilter implementation.

The current bit index calculation logic does not use any part of the indexable space above the first 31bits, so when the inserted item count approaches (or exceeds) Integer.MAX_VALUE, it will produce significantly worse collision rates than an (ideal) uniformly distributing hash function.

Why are the changes needed?

This should qualify as a bug.

The upper bound on the bit capacity of the current BloomFilter implementation in spark is approx 137G bits (64 bit longs in an Integer.MAX_VALUE sized array). The current indexing scheme can only address about 2G bits of these.

On the other hand, due to the way the BloomFilters are used, the bug won't cause any logical errors, it will gradually render the BloomFilter instance useless by forcing more-and-more queries on the slow path.

Does this PR introduce any user-facing change?

No

How was this patch tested?

new test

One new java testclass was added to sketch to test different combinations of item counts and expected fpp rates.

common/sketch/src/test/java/org/apache/spark/util/sketch/TestSparkBloomFilter.java

testAccuracyEvenOdd
in N number of iterations inserts N even numbers (2*i), and leaves out N odd numbers (2*i+1) from the BloomFilter.

The test checks the 100% accuracy of mightContain=true on all of the even items, and measures the mightContain=true (false positive) rate on the not-inserted odd numbers.

testAccuracyRandom
in 2N number of iterations inserts N pseudorandomly generated numbers in two differently seeded (theoretically independent) BloomFilter instances. All the random numbers generated in an even-iteration will be inserted into both filters, all the random numbers generated in an odd-iteration will be left out from both.

The test checks the 100% accuracy of mightContain=true for all of the items inserted in an even-loop. It counts the false positives as the number of odd-loop items for which the primary filter reports mightContain=true but secondary reports mightContain=false. Since we inserted the same elements into both instances, and the secondary reports non-insertion, the mightContain=true from the primary can only be a false positive.

patched

One minor (test) issue was fixed in

common/sketch/src/test/scala/org/apache/spark/util/sketch/BloomFilterSuite.scala

where the potential repetitions in the randomly generated stream of insertable items resulted in slightly worse fpp measurements than the actual. The problem affected the those testcases more where the cardinality of the tested type is low (the chance of repetition is high), e.g. Byte and Short.

removed from the default runs

Running these test as part of the default build process was turned off with adding @Disabled annotation to the new testclass.

Was this patch authored or co-authored using generative AI tooling?

No

…errors in scala suite

…of the combined hash

peter-toth · 2025-05-19T12:57:47Z

common/sketch/src/test/java/org/apache/spark/util/sketch/TestSparkBloomFilter.java

+            }
+        }
+
+        long mightContainEven = 0;


Please rename these 2 in this test case to clarify that these are actually indices of numbers in a randomly generated stream.

peter-toth · 2025-05-19T13:18:09Z

common/sketch/src/test/java/org/apache/spark/util/sketch/TestSparkBloomFilter.java

+                optimalNumOfBits / Byte.SIZE / 1024 / 1024
+        );
+        Assumptions.assumeTrue(
+                2 * optimalNumOfBits / Byte.SIZE < 4 * ONE_GB,


I guess 4 * ONE_GB is a reasoable limit, can we extract it to a constant and add some comment to it.

…eckstyle errors, renaming test vars

…ward compatible with previously serialized streams

peter-toth · 2025-05-19T14:56:27Z

common/sketch/src/test/java/org/apache/spark/util/sketch/TestSparkBloomFilter.java

+                "mightContainLong must return true for all inserted numbers"
+        );
+
+        double actualFpp = (double) mightContainOddIndexed / numItems;


/ numItems doesn't seem correct here as you don't test numItems number of numbers that were surely not added into the filter.

indeed, it should probably be very close to the proper value, but this calculation doesn't account for the odd indexes ignored based on the secondary's result.

let me try to address that somehow.

… in random test + test formatting

…ic in random test

peter-toth · 2025-05-20T09:39:34Z

Can you please post the output of the new TestSparkBloomFilter here when the 4GB limit of REQUIRED_HEAP_UPPER_BOUND_IN_BYTES is lifted?
And summarize the actual false positive rate (FPP) before and after this fix when numItems = {1000000, 1000000000, 5000000000} and expected FPP is the default 3%?

ishnagy · 2025-05-20T17:36:19Z

the tests with the 4GB limit are still running, I'll post a summary from the results tomorrow, and start a new run that can cover all of the 5G element count cases.

ishnagy · 2025-05-21T16:27:01Z

The filter-from-hex-constant test started to make me worry about compatibility with serialized instances created with the older logic. Even if we can deserialize the buffer and the seed properly, the actual bits will be set in completely different positions. That is, there's no point in trying to use an old (serialized) buffer with the new logic.

Should we create a dedicated BloomFilterImplV2 class for the fixed logic, just so we can keep the old V1 implementation for deserializing old byte streams?

peter-toth · 2025-05-22T09:38:55Z

Should we create a dedicated BloomFilterImplV2 class for the fixed logic, just so we can keep the old V1 implementation for deserializing old byte streams?

I don't think we need to keep the old implementation just to support old serialized versions. It seems we use our bloom filter implementation only in BloomFilterAggregate.

cc @cloud-fan

ishnagy · 2025-05-23T07:00:14Z

I ran into some trouble with generating the test results (running on a single thread, the whole batch takes ~10h on my machine). I'll try to make an update on Monday.

…t output capture

…t output capture - 2nd take

…t output capture - 3rd take

ishnagy · 2025-05-26T22:58:52Z

version	testName	n	fpp	allocatedBitCount	setBitCount	saturation	expectedFpp%	actualFpp%	runningTime
OLD	testAccuracyEvenOdd	1000000	0.05	6235264 (0 MB)	2952137	0.473458	5.000000 %	5.025400 %	PT19.267149499S
OLD	testAccuracyEvenOdd	1000000	0.03	7298496 (0 MB)	3618475	0.495784	3.000000 %	3.022900 %	PT19.628671953S
OLD	testAccuracyEvenOdd	1000000	0.01	9585088 (1 MB)	4968111	0.518317	1.000000 %	0.994700 %	PT19.476457289S
OLD	testAccuracyEvenOdd	1000000	0.001	14377600 (1 MB)	7203887	0.501049	0.100000 %	0.102200 %	PT19.944492903S
OLD	testAccuracyEvenOdd	1000000000	0.05	6235224256 (743 MB)	1814052150	0.290936	5.000000 %	50.920521 %	PT28M6.091484671S
OLD	testAccuracyEvenOdd	1000000000	0.03	7298440896 (870 MB)	1938187323	0.265562	3.000000 %	59.888499 %	PT30M26.383544378S
OLD	testAccuracyEvenOdd	1000000000	0.01	9585058432 (1142 MB)	2065015223	0.215441	1.000000 %	76.025548 %	PT36M30.827858084S
OLD	testAccuracyEvenOdd	1000000000	0.001	14377587584 (1713 MB)	2127081112	0.147944	0.100000 %	90.896130 %	PT45M58.403282401S
OLD	testAccuracyEvenOdd	5000000000	0.05	31176121152 (3716 MB)	2147290054	0.068876	5.000000 %	99.963940 %	PT1H28M39.598973373S
OLD	testAccuracyEvenOdd	5000000000	0.03	36492204224 (4350 MB)	2147464804	0.058847	3.000000 %	99.995623 %	PT1H41M22.171084285S
OLD	testAccuracyEvenOdd	5000000000	0.01	47925291904 (5713 MB)	2147483464	0.044809	1.000000 %	99.999939 %	PT1H59M42.481346242S
OLD	testAccuracyEvenOdd	5000000000	0.001	71887937856 (8569 MB)	2147483648	0.029873	0.100000 %	100.000000 %	PT2H32M41.743734635S

ishnagy · 2025-05-26T23:20:34Z

version	testName	n	fpp	allocatedBitCount	setBitCount	saturation	expectedFpp%	actualFpp%	runningTime
NEW	testAccuracyEvenOdd	1000000	0.05	6235264 (0 MB)	2952282	0.473481	5.000000 %	5.046800 %	PT13.599525353S
NEW	testAccuracyEvenOdd	1000000	0.03	7298496 (0 MB)	3619967	0.495988	3.000000 %	3.018000 %	PT14.086955381S
NEW	testAccuracyEvenOdd	1000000	0.01	9585088 (1 MB)	4968081	0.518314	1.000000 %	1.013400 %	PT14.300125629S
NEW	testAccuracyEvenOdd	1000000	0.001	14377600 (1 MB)	7205256	0.501145	0.100000 %	0.095100 %	PT14.746387272S
NEW	testAccuracyEvenOdd	1000000000	0.05	6235224256 (743 MB)	2963568196	0.475295	5.000000 %	4.889721 %	PT35M6.22696009S
NEW	testAccuracyEvenOdd	1000000000	0.03	7298440896 (870 MB)	3628684972	0.497186	3.000000 %	2.963030 %	PT37M31.833552669S
NEW	testAccuracyEvenOdd	1000000000	0.01	9585058432 (1142 MB)	4973807865	0.518913	1.000000 %	1.001407 %	PT43M23.782325058S
NEW	testAccuracyEvenOdd	1000000000	0.001	14377587584 (1713 MB)	7210348423	0.501499	0.100000 %	0.100803 %	PT57M35.474342424S
NEW	testAccuracyEvenOdd	5000000000	0.05	31176121152 (3716 MB)	14360939834	0.460639	5.000000 %	6.727508 %	PT2H21M2.643592951S
NEW	testAccuracyEvenOdd	5000000000	0.03	36492204224 (4350 MB)	17711039216	0.485338	3.000000 %	3.806971 %	PT2H29M18.334864292S
NEW	testAccuracyEvenOdd	5000000000	0.01	47925291904 (5713 MB)	24462662240	0.510433	1.000000 %	1.321482 %	PT2H56M51.935983408S
NEW	testAccuracyEvenOdd	5000000000	0.001	71887937856 (8569 MB)	35637830341	0.495741	0.100000 %	0.176216 %	PT3H38M21.888031962S

peter-toth

Yeah, actualFpp% seems to be much better when the number of inserted items (n) is huge (~1B).
I'm not sure that the bug actually caused any issues in the injected runtime filters due to the much lower default values of spark.sql.optimizer.runtime.bloomFilter.max... configs, but it is also possible to build a bloom filter manually so it is better to fix it.

BTW, this issue seems to have been observed in Spark: https://stackoverflow.com/questions/78162973/why-is-observed-false-positive-rate-in-spark-bloom-filter-higher-than-expected and was tried to fix with #46370 before.
That old PR was similar to how the issue was fixed in Guava with adding a new strategy / Murmur implementation while this PR fixes the root cause in the current Bloom filter implementation.

peter-toth · 2025-05-27T10:54:30Z

@cloud-fan, as you added the original bloom filter implementation to Spark, could you please take a look at this PR?

ishnagy · 2025-05-27T14:11:17Z

the only relevant difference between the OLD and the NEW versions is in the logic to derive the k hash bits:

OLD

    for (int i = 1; i <= numHashFunctions; i++) {
      int combinedHash = h1 + (i * h2);
      // ...
    }

NEW

    long combinedHash = (long) h1 * Integer.MAX_VALUE;
    for (long i = 0; i < numHashFunctions; i++) {
      combinedHash += h2;
      // ...
    }

LuciferYang · 2025-05-30T07:13:02Z

common/sketch/src/test/java/org/apache/spark/util/sketch/TestSparkBloomFilter.java

+import java.util.concurrent.ConcurrentHashMap;
+
+
+@Disabled


Submitting a test case and directly disabling it is not an ideal approach. Why can't it undergo regular validation through GitHub Actions?

Additionally, I suggest first creating a micro-benchmark relevant to this scenario and recording the results without this pr. Then, update the code and the new benchmark results in this pr to demonstrate the optimization effect.

Or can the scenarios in org.apache.spark.sql.execution.benchmark.BloomFilterBenchmark reflect the optimizations brought about by the current pr?

Submitting a test case and directly disabling it is not an ideal approach. Why can't it undergo regular validation through GitHub Actions?

I agree, in spirit, this test code I submitted is much more close to a benchmark (measurement rather than validation) than to an actual test case, with the emphasis on expectations and assertions.

The reason I disabled it by default, because on a single thread, it takes 10+ hours to run the all the cases, and I didn't want to interfere with running time of the regular test suites.

I wasn't aware of the benchmark workflow, I will have a look whether I can fit this logic in there. Not sure, if it will be a snap in fit, because the code focuses on obtaining a Bloom filter specific measure, the false pos rate, not some more usual or generic measures like running time or resource consumption.

Moreover, the performance gains won't be directly apparent on the sketch level. If anything, it will have a slightly worse running time (but shouldn't consume more mem than the previous logic). The gains should only be measurable in client code (like sql) that uses the sketch implementation. E.g. with a reasonably low error rate in the implementation won't force almost any queried element (in a client) on the slow path when the filter is saturated.

Or can the scenarios in org.apache.spark.sql.execution.benchmark.BloomFilterBenchmark reflect the optimizations brought about by the current pr?

This may or may not be measurable with the current benchmarks, I haven't looked into that yet. As a rule of thumb, in the current implementation, after a few hundred million elements the false pos rate gets noticeably (a few percents) higher than expected, around about a billion (2^30) it diverges significantly (a few tens of percents), and above 2G (2^31) items, it gets prohibitively high (90%+). With the proposed new logic the error rate remains within a few percents off of the expected on all scales.

If the current benchmarks already use Bloom filters with more than a few hundred million items inserted, then the performance improvements should be visible there.

I'll try to adapt the current tests into the benchmark workflow.

Yeah, unfortunately Spark benchmarks can measure only time, but can't measure qualities like the false positive rate of a bloom filter.

I wonder shall we remove TestSparkBloomFilter from this PR or add some comments to it to explain why it is disabled?

If there are other test cases that already cover the changes in the current pull request? If so, I agree to remove TestSparkBloomFilter (as per Spark's coding conventions, it should actually be named SparkBloomFilterSuite). There's no point in adding a test case that cannot be continuously verified by GitHub Actions, as it's likely that no one will remember to execute it later on.

add some comments to it to explain why it is disabled?

@peter-toth I suspect the reason it was marked with @Disabled is that the execution time was too long. I tried running it using GitHub Actions, but it was eventually terminated because it exceeded the two-hour execution limit...

https://github.com/LuciferYang/spark/actions/runs/15495958232/job/43632855094

Yeah. Another option is to reduce the test cases to still validate the improvement of the PR, but with reasoable runtimes.

It seems like the degradation of false positive rate is not yet visible at n = 1M. But when n = 1000M the actual FPP is much higher than the expected. (Actuals are 50.920521%, 59.888499% and 76.025548% when expecteds are 5%, 3% and 1%). Unfortunately it seems those test cases took 30-40 mins to complete each.
So how about testing only 1 n between those 2 where the improvement of the PR is visible but the test completes in let's say 5-10 mins. It shoule be enough to test the 3% default FPP case.

It would be excellent if the verification could be carried out within a relatively concise case.
Moreover, it would be even better if the test case could be rewritten in Scala.

Sorry for the late reply, I got a bit lost in the test configuration of the project, it took a while until I could come up with something reasonable to address the concerns.

Submitting a test case and directly disabling it is not an ideal approach. Why can't it undergo regular validation through GitHub Actions?

I think I have already mentioned why I disabled the test in the first place, just for the sake of completeness repeating it here, indeed, the main reason is the impractical running time. If not parallelized properly, running all the slower testcases one after the other, the total running time could easily end up at dozens of hours.

The intention wasn't removing it from regular runs altogether, but to err on the safe side, and not to add an extra 10+ hours of runtime accidentally to e.g. pre-merge runs (supposedly on a fast path).

Fortunately, the individual cases can be run concurrently, so if there are enough threads to run the suite, even the slowest cases can complete in ~2.5h

Or can the scenarios in org.apache.spark.sql.execution.benchmark.BloomFilterBenchmark reflect the optimizations brought about by the current pr?

possibly, yes, but I haven't managed to run the sql benchmarks, and we would still have to solve the problem of capturing a custom measure (error rate) in the benchmarks, instead of the currently supported (e.g. running time).

[...] (as per Spark's coding conventions, it should actually be named SparkBloomFilterSuite). [...]
I have renamed the testclass, Idea now complains about:

Test class name 'SparkBloomFilterSuite' doesn't match regex '[A-Z][A-Za-z\d]*Test(s|Case)?|Test[A-Z][A-Za-z\d]*|IT(.*)|(.*) IT(Case)?'

other than that, everything seems functional.

I would rather not remove the new tests, in the end, at the moment these are the only piece of logic that can demonstrate the error with the current implementation. Rewriting the tests in scala may be an option, but I'm not that comfortable with my scala skills, to confidently jump into that.

LuciferYang · 2025-06-06T17:01:20Z

common/sketch/pom.xml

+    <dependency>
+      <groupId>org.junit-pioneer</groupId>
+      <artifactId>junit-pioneer</artifactId>
+      <version>2.3.0</version>


For the management of dependency versions, they should be placed in the parent pom.xml. However, if TestSparkBloomFilter can be removed from the current pr, then it seems that this dependency is no longer needed either.

I'll defer addressing this, until we decide what should happen with TestSparkBloomFilter.
(remove & move the versions under managed dependencies)

…isabled test

ishnagy · 2025-06-16T12:49:13Z

Now I have added a new tag org.apache.spark.tags.SlowTest which can be used in inclusion/exclusion rules.
(*) none of the currently available test tags seemed to be a great match for a slow test in the sketch module

% find common/tags/src/test -type f | sort
common/tags/src/test/java/org/apache/spark/tags/AmmoniteTest.java
common/tags/src/test/java/org/apache/spark/tags/ChromeUITest.java
common/tags/src/test/java/org/apache/spark/tags/DockerTest.java
common/tags/src/test/java/org/apache/spark/tags/ExtendedHiveTest.java
common/tags/src/test/java/org/apache/spark/tags/ExtendedLevelDBTest.java
common/tags/src/test/java/org/apache/spark/tags/ExtendedSQLTest.java
common/tags/src/test/java/org/apache/spark/tags/ExtendedYarnTest.java
common/tags/src/test/java/org/apache/spark/tags/SlowHiveTest.java
common/tags/src/test/java/org/apache/spark/tags/SlowSQLTest.java
common/tags/src/test/java/org/apache/spark/tags/WebBrowserTest.java

The problem is, I don't quite know where those inclusion exclusions should happen exactly.
I found

.github/workflows/maven_test.yml
.github/workflows/build_and_test.yml

but I'm not quite sure, how to add the new configuration to them for the sketch project only.

ishnagy · 2025-06-16T12:49:41Z

@LuciferYang
@peter-toth
please have a look at my updates / comments.

ishnagy · 2025-06-16T12:51:21Z

but I'm not quite sure, how to add the new configuration to them for the sketch project only.

... so, I expect the running time of the pre merge build to blow up on the next run.

LuciferYang · 2025-06-16T13:35:54Z

Those tags are only used for grouping the tests. It doesn't imply that tests labeled as @SlowTest won't be executed. Moreover, since there are originally not many test cases for the sketch module, there's no need for grouping them. I think it's enough as long as we can ensure that the test can be finished within a few minutes.

LuciferYang · 2025-06-16T13:37:09Z

common/tags/src/test/java/org/apache/spark/tags/SlowTest.java

+@TagAnnotation
+@Retention(RetentionPolicy.RUNTIME)
+@Target({ElementType.METHOD, ElementType.TYPE})
+public @interface SlowTest { }


We don't need to add a new TagAnnotation here.

ishnagy · 2025-06-16T14:03:57Z

Those tags are only used for grouping the tests. It doesn't imply that tests labeled as @SlowTest won't be executed.

I totally get that. @Disabled guaranteed non-execution, these new grouping options just make it possible to configure some workflows to pick up these tests and run them regularly, even if we happen to exclude them from the fast path.

The workflows still have to be adjusted to consider the new tag, but (I think) this can be the cleanest way to isolate this very specialized and very slow test from the rest of the regular tests. Perhaps we can configure the benchmark workflow to pick this up, and run among the other long-running benchmark cases?

[...] I think it's enough as long as we can ensure that the test can be finished within a few minutes.

Although I do have some performance improvement ideas already in mind, I don't think the slowest testcase (5G elements, 0.1% fpp) can be completed under an hour.

If we could somehow guarantee (code or documentation) that the impl class wouldn't be instantiated with a lower error rate than 3%, then we could get rid of the 1% and the 0.1% cases, which constitute the bulk of the runtime. IIRC, one round of the 3% case takes around half an hour, so with the other parameter combinations, we can bring down the total runtime to around 3h.

If we can run the suite on multiple cores (and we can configure maven to use them), we should be able to fit into the 2h GH actions execution limit conveniently. Do we know anything about the number of cores in the runners?

LuciferYang · 2025-06-17T02:38:19Z

Github-hosted runners generally have 4 vcores. Currently, the submit pipeline uses sbt for testing. Additionally, I would like to reiterate my viewpoint: we should strive to have this test completed within a few minutes(5~10 mins), rather than taking 2 hours. Otherwise, we ought to optimize it or temporarily remove this test case.

…ase running time

…st tag

…default

… fpp problems by default

ishnagy · 2025-06-24T15:26:18Z

cutting down the testcases to the bare minimum (3%fpp and 1G items), the test now completes in little over 10minutes on my machine. would this be an acceptable running time?

if the testing concerns are adequately addressed, can we please have a look on the serialization/compatibility questions that came up earlier? in hindsight, it feels really sketchy to deserialize old bytestreams into the updated implementation without any errors or warnings (query results from the inconsistently deserialized object won't make any sense).

adding a new version enum feels like the clean solution, but I'm not sure if it is not an overkill. (e.g. if a serialized bloomfilter never gets shared between different application runs)

peter-toth · 2025-06-26T12:18:13Z

Yeah, I believe 10 minutes runtime is acceptable, but if I were you I would test with 100M, maybe the improvement is visible there as well and 1 minute is just enough.

can we please have a look on the serialization/compatibility questions that came up earlier? in hindsight, it feels really sketchy to deserialize old bytestreams into the updated implementation without any errors or warnings (query results from the inconsistently deserialized object won't make any sense).

Bloom filter functions (BloomFilterAggregate / BloomFilterMightContain) seem like internal ones so I wouldn't expect serialized filters to be shared between different application runs, but it seems some native accelerators reimplemented Spark's logic (including its flaws) so we should probably use a new version with this improvement.

ishnagy · 2025-07-02T08:32:06Z

Bloom filter functions (BloomFilterAggregate / BloomFilterMightContain) seem like internal ones so I wouldn't expect serialized filters to be shared between different application runs, but it seems some native accelerators reimplemented Spark's logic (including its flaws) so we should probably use a new version with this improvement.

ok, I'll try to provide an update by EOW.

ishnagy added 4 commits May 15, 2025 01:20

SPARK-47547 BloomFilter fpp degradation: addressing the int32 truncation

3c5a843

SPARK-47547 BloomFilter fpp degradation: fixing test data repetition …

08cbfeb

…errors in scala suite

SPARK-47547 BloomFilter fpp degradation: scrambling the high 32bytes …

e3cb08e

…of the combined hash

SPARK-47547 BloomFilter fpp degradation: random distribution fpp test

c4e3f58

github-actions bot added SQL BUILD labels May 19, 2025

peter-toth reviewed May 19, 2025

View reviewed changes

ishnagy added 2 commits May 19, 2025 16:24

SPARK-47547 BloomFilter fpp degradation: javadoc for test methods, ch…

1a0b66f

…eckstyle errors, renaming test vars

SPARK-47547 BloomFilter fpp degradation: make seed serialization back…

d912b66

…ward compatible with previously serialized streams

peter-toth reviewed May 19, 2025

View reviewed changes

ishnagy force-pushed the SPARK-47547_bloomfilter_fpp_degradation branch from 8edf4dd to 57298f0 Compare May 19, 2025 16:07

SPARK-47547 BloomFilter fpp degradation: counting discarded odd items…

f589e2c

… in random test + test formatting

ishnagy force-pushed the SPARK-47547_bloomfilter_fpp_degradation branch from 57298f0 to f589e2c Compare May 19, 2025 16:11

ishnagy added 2 commits May 19, 2025 19:03

SPARK-47547 BloomFilter fpp degradation: refactoring FPP counting log…

f597c76

…ic in random test

SPARK-47547 BloomFilter fpp degradation: checkstyle fix

4ea633d

SPARK-47547 BloomFilter fpp degradation: fix test bug

6696106

ishnagy added 3 commits May 26, 2025 17:51

SPARK-47547 BloomFilter fpp degradation: parallelization friendly tes…

b75e187

…t output capture

SPARK-47547 BloomFilter fpp degradation: parallelization friendly tes…

2d8a9f1

…t output capture - 2nd take

SPARK-47547 BloomFilter fpp degradation: parallelization friendly tes…

4a30794

…t output capture - 3rd take

ishnagy changed the title ~~[WIP] [SPARK-47547] BloomFilter fpp degradation~~ [SPARK-47547] BloomFilter fpp degradation May 27, 2025

peter-toth approved these changes May 27, 2025

View reviewed changes

LuciferYang reviewed May 30, 2025

View reviewed changes

LuciferYang reviewed Jun 6, 2025

View reviewed changes

SPARK-47547 BloomFilter fpp degradation: addressing concerns around d…

d9d6980

…isabled test

ishnagy force-pushed the SPARK-47547_bloomfilter_fpp_degradation branch from 70f7c5e to d9d6980 Compare June 16, 2025 12:08

LuciferYang reviewed Jun 16, 2025

View reviewed changes

ishnagy added 5 commits June 17, 2025 10:36

SPARK-47547 BloomFilter fpp degradation: cut down test cases to decre…

39a46c9

…ase running time

Merge branch 'master' into SPARK-47547_bloomfilter_fpp_degradation

7f235e7

SPARK-47547 BloomFilter fpp degradation: revert creating a new SlowTe…

16be3a9

…st tag

SPARK-47547 BloomFilter fpp degradation: disable progress logging by …

e91b5ca

…default

SPARK-47547 BloomFilter fpp degradation: adjust tolerance and fail on…

897c1d4

… fpp problems by default

		import java.util.concurrent.ConcurrentHashMap;


		@Disabled

[SPARK-47547] BloomFilter fpp degradation #50933

Are you sure you want to change the base?

[SPARK-47547] BloomFilter fpp degradation #50933

Uh oh!

Conversation

ishnagy commented May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

new test

patched

removed from the default runs

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

peter-toth May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

peter-toth May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

peter-toth commented May 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ishnagy commented May 20, 2025

Uh oh!

ishnagy commented May 21, 2025

Uh oh!

peter-toth commented May 22, 2025

Uh oh!

ishnagy commented May 23, 2025

Uh oh!

ishnagy commented May 26, 2025

Uh oh!

ishnagy commented May 26, 2025

Uh oh!

peter-toth left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

peter-toth commented May 27, 2025

Uh oh!

ishnagy commented May 27, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ishnagy May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

peter-toth Jun 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ishnagy commented Jun 16, 2025

ishnagy commented May 19, 2025 •

edited

Loading

peter-toth May 19, 2025 •

edited

Loading

peter-toth May 19, 2025 •

edited

Loading

peter-toth commented May 20, 2025 •

edited

Loading

peter-toth left a comment •

edited

Loading

ishnagy May 30, 2025 •

edited

Loading

peter-toth Jun 8, 2025 •

edited

Loading

ishnagy commented Jun 16, 2025 •

edited

Loading

LuciferYang commented Jun 17, 2025 •

edited

Loading