Skip to content

[ML] Metadata Migration Meta issue #32905

Closed
@davidkyle

Description

@davidkyle

Anomaly detector jobs and datafeed configurations are currently stored in the cluster state, the initial design decision was based on the need to distribute work across the cluster. After persistent tasks were invented -and with a little hindsight- it is not necessary to store config in the cluster state if the persistent task parameters contain enough information to open a job and start a datafeed.

Placing config in the cluster state has caused a number of issues:

Proposal

Starting in 6.last new job and datafeed configurations will be stored in a new internal index .ml-config. Jobs created prior to 6.last will be migrated to index documents and removed from the cluster state with the goal of removing all ml data from the cluster state. 7.x will retain the ability to read jobs from the cluster state to support full cluster upgrades and restoring a snapshot containing a global cluster state. The preferred solution is to automatically migrate extant cluster state jobs on allocation. In the 8 series all the code to handle ml config in cluster state will be dropped.

Work Plan

Work will be done on the feature branch feature/feature-jindex-6x and feature/feature-jindex-master with regular merges. The first stage is the changes required to create and run a job with its configuration stored in an index. Once that is stable and passing the testing gate the migration of existing jobs on upgrade will be tackled.

Phase 1: Run Jobs with their configuration defined in an index

Phase 1a: 6.6 & 6.7 Jobs can be defined in the clusterstate or an index document

Phase 2: Migrate Job and Datafeed Configuration

Issues

Nice to haves

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions