[ML Data Frame] Start directly data frame rather than via the scheduler #42067

davidkyle · 2019-05-10T09:45:36Z

Start triggers the indexer directly, the indexer runs in another thread so this is safe to do from a API request network thread. Moves the scheduled next() method and starting the scheduler from the task executor into the DF task.

This small change means the scheduler does not have to run at a high frequency to reduce latency in starting the data frame.

There is a question about whether the scheduler should be stopped on finish when a checkpoint is reached

elasticmachine · 2019-05-10T09:45:39Z

Pinging @elastic/ml-core

hendrikmuhs

To your question: If the persistent task gets removed, the scheduler gets removed as well, so having autostop after the checkpoint is reached we indirectly remove the scheduler afaik or "that's the plan".

hendrikmuhs · 2019-05-10T11:44:53Z

...frame/src/main/java/org/elasticsearch/xpack/dataframe/transforms/DataFrameTransformTask.java

+        return DataFrameTransformTask.SCHEDULE_NAME + "_" + getTransformId();
+    }
+
+    static SchedulerEngine.Schedule next() {


Maybe for a followup PR: can be non-static? to access members and change the interval dynamically.

Depending on how far the DF has progressed

davidkyle · 2019-05-10T13:10:46Z

I hit a problem in the tests which assert on the indexer state.

When the indexer is started it's state goes from STOPPED -> STARTED
The indexer is then triggered and state transitions STARTED -> INDEXING
When the indexer is finished (all docs processed) state transitions INDEXING -> STARTED

With this change because the indexer is triggered it should be in state = INDEXING when the start request returns, unless it has finished really quickly.

We have a bunch of yml tests that start a data feed then assert state = STARTED. Those tests aren't processing much data so we expect to go from STARTED -> INDEXING -> STARTED pretty quickly, when we check the state it could be STARTED or INDEXING.

I pushed a change to account for this but what I don't understand is why the tests haven't failed in CI previously given the indexer could be either STARTED or INDEXING

davidkyle · 2019-05-10T15:35:47Z

run elasticsearch-ci/packaging-sample

benwtrent · 2019-05-10T15:55:11Z

...frame/src/main/java/org/elasticsearch/xpack/dataframe/transforms/DataFrameTransformTask.java

+
+    private SchedulerEngine.Schedule next() {
+        return (startTime, now) -> {
+            return now + 1000; // to be fixed, hardcode something


For sure future work will be needed here so that we can adjust for failures, etc.

benwtrent · 2019-05-10T15:58:34Z

...frame/src/main/java/org/elasticsearch/xpack/dataframe/transforms/DataFrameTransformTask.java

@@ -207,6 +207,10 @@ public synchronized void start(ActionListener<Response> listener) {
        persistStateToClusterState(state, ActionListener.wrap(
            task -> {
                auditor.info(transform.getId(), "Updated state to [" + state.getTaskState() + "]");
+                long now = System.currentTimeMillis();
+                // kick off the indexer
+                triggered(new Event(schedulerJobName(), now, now));


It seems to me, once we add a tad more logic to the SchedulerEngine.Schedule this won't be necessary because it will know if it has to be triggered right now, later, or not at all.

But this is a good stop-gap until we fix the created schedule.

…er (elastic#42067) Trigger indexer start directly to put the indexer in INDEXING state immediately

Trigger start

25eb2f2

davidkyle added >refactoring v8.0.0 v7.2.0 :ml/Transform Transform labels May 10, 2019

hendrikmuhs approved these changes May 10, 2019

View reviewed changes

davidkyle added 2 commits May 10, 2019 13:11

Make method non-static

5485064

State could be started or indexing

3712875

Depending on how far the DF has progressed

benwtrent self-requested a review May 10, 2019 15:42

benwtrent approved these changes May 10, 2019

View reviewed changes

davidkyle merged commit cc988ce into elastic:master May 20, 2019

davidkyle deleted the start-without-delay branch May 20, 2019 12:29

davidkyle mentioned this pull request May 20, 2019

[ML Data Frame] Start directly data frame rather than via the scheduler #42224

Merged

davidkyle added a commit to davidkyle/elasticsearch that referenced this pull request May 21, 2019

[ML Data Frame] Start directly data frame rather than via the schedul…

5e33a99

…er (elastic#42067) Trigger indexer start directly to put the indexer in INDEXING state immediately

$@polyfractal$ polyfractal mentioned this pull request May 21, 2019

transforms_stats#"Test Get Transform Stats" fails because pages_processed/checkpoint don't match #42309

Closed

gurkankaymak pushed a commit to gurkankaymak/elasticsearch that referenced this pull request May 27, 2019

[ML Data Frame] Start directly data frame rather than via the schedul…

273a305

…er (elastic#42067) Trigger indexer start directly to put the indexer in INDEXING state immediately

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML Data Frame] Start directly data frame rather than via the scheduler #42067

[ML Data Frame] Start directly data frame rather than via the scheduler #42067

Uh oh!

davidkyle commented May 10, 2019 •

edited

Loading

Uh oh!

elasticmachine commented May 10, 2019

Uh oh!

hendrikmuhs left a comment

Uh oh!

hendrikmuhs May 10, 2019 •

edited

Loading

Uh oh!

davidkyle commented May 10, 2019

Uh oh!

davidkyle commented May 10, 2019

Uh oh!

benwtrent May 10, 2019

Uh oh!

benwtrent May 10, 2019

Uh oh!

Uh oh!

[ML Data Frame] Start directly data frame rather than via the scheduler #42067

[ML Data Frame] Start directly data frame rather than via the scheduler #42067

Uh oh!

Conversation

davidkyle commented May 10, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticmachine commented May 10, 2019

Uh oh!

hendrikmuhs left a comment

Choose a reason for hiding this comment

Uh oh!

hendrikmuhs May 10, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

davidkyle commented May 10, 2019

Uh oh!

davidkyle commented May 10, 2019

Uh oh!

benwtrent May 10, 2019

Choose a reason for hiding this comment

Uh oh!

benwtrent May 10, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

davidkyle commented May 10, 2019 •

edited

Loading

hendrikmuhs May 10, 2019 •

edited

Loading