
[SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStreamSuite #4597


Closed
wants to merge 2 commits into from

Conversation

tdas
Contributor

@tdas tdas commented Feb 13, 2015

The test was incorrect: instead of counting the number of records, it counted the number of partitions of the RDDs generated by the DStream, which is not what it was meant to check. I will run this patch through the test builds multiple times to understand its flakiness.
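
To make the distinction concrete, here is a minimal sketch in plain Python. It is not the Spark API and not the actual code in DirectKafkaStreamSuite: it models one batch RDD as a list of partitions, each holding a list of records. The function names and the sample batch are hypothetical, chosen only for illustration.

```python
# Plain-Python model of one batch RDD: a list of partitions,
# each partition being a list of records.
# Illustration only; this is neither the Spark API nor the real test code.

def num_partitions(rdd_partitions):
    """What the incorrect test effectively measured: how many partitions exist."""
    return len(rdd_partitions)


def num_records(rdd_partitions):
    """What the test was meant to measure: total records across all partitions."""
    return sum(len(partition) for partition in rdd_partitions)


# Hypothetical batch: two partitions carrying 3 and 4 messages respectively.
batch = [["a", "a", "a"], ["b", "b", "b", "b"]]

print(num_partitions(batch))  # 2
print(num_records(batch))     # 7
```

With the numbers above, an assertion written against the partition count would compare against 2 while the data actually holds 7 records, so it could pass or fail for reasons unrelated to how many records were received.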

PS: This was caused by my refactoring in #4384

@koeninger check it out.

@tdas tdas changed the title [SPARK-5731] Fix incorrect test in DirectKafkaStreamSuite [SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStreamSuite Feb 13, 2015
@SparkQA

SparkQA commented Feb 13, 2015

Test build #603 has started for PR 4597 at commit e9a1820.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #604 has started for PR 4597 at commit e9a1820.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #605 has started for PR 4597 at commit e9a1820.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #606 has started for PR 4597 at commit e9a1820.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #27455 has started for PR 4597 at commit e9a1820.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #27458 has started for PR 4597 at commit d236235.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #27455 has finished for PR 4597 at commit e9a1820.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27455/

@SparkQA

SparkQA commented Feb 13, 2015

Test build #604 has finished for PR 4597 at commit e9a1820.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • case class Params(

@SparkQA

SparkQA commented Feb 13, 2015

Test build #606 has finished for PR 4597 at commit e9a1820.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@koeninger
Contributor

LGTM

@SparkQA

SparkQA commented Feb 13, 2015

Test build #603 has finished for PR 4597 at commit e9a1820.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #605 has finished for PR 4597 at commit e9a1820.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #607 has started for PR 4597 at commit d236235.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #608 has started for PR 4597 at commit d236235.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #609 has started for PR 4597 at commit d236235.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Feb 13, 2015

Test build #27458 has finished for PR 4597 at commit d236235.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27458/

@SparkQA

SparkQA commented Feb 13, 2015

Test build #609 has finished for PR 4597 at commit d236235.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Feb 14, 2015

Test build #607 has finished for PR 4597 at commit d236235.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Feb 14, 2015

Test build #608 has finished for PR 4597 at commit d236235.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Contributor

rxin commented Feb 14, 2015

Seems like StreamingContext itself is a bit flaky ...

@tdas
Contributor Author

tdas commented Feb 14, 2015

Why? Even the failure of the StreamingContext.stop() unit test in the failed Test build #609 above was caused by the same broadcast piece failure.

[info] - stop(stopSparkContext=true) after stop(stopSparkContext=false) *** FAILED *** (65 milliseconds)
[info]   org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 1 times, most recent failure: Lost task 0.0 in stage 0.0 (TID 0, localhost): java.io.IOException: org.apache.spark.SparkException: Failed to get broadcast_0_piece0 of broadcast_0

This is indeed weird. Anyway, the Kafka test seems pretty solid now. If it's okay with you, I am merging this, and we can continue investigating the broadcast problem independently.

@rxin
Contributor

rxin commented Feb 14, 2015

Please go ahead and merge.

asfgit pushed a commit that referenced this pull request Feb 18, 2015
…Suite

The test was incorrect: instead of counting the number of records, it counted the number of partitions of the RDDs generated by the DStream, which is not what it was meant to check. I will run this patch through the test builds multiple times to understand its flakiness.

PS: This was caused by my refactoring in #4384

koeninger check it out.

Author: Tathagata Das <tathagata.das1565@gmail.com>

Closes #4597 from tdas/kafka-flaky-test and squashes the following commits:

d236235 [Tathagata Das] Unignored last test.
e9a1820 [Tathagata Das] fix test

(cherry picked from commit 3912d33)
Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>
@asfgit asfgit closed this in 3912d33 Feb 18, 2015