Skip to content

[Logs UI] Research partitioning of log entries for categorization jobs #46610

@weltenwort

Description

@weltenwort

Summary

The goal of this research effort is to determine how the quality of the categories derived by the ML algorithms can be improved. In particular, it should be investigated how the knowledge about the log entries belonging the distinct log types (via event.dataset) can be utilized in the job configurations.

Challenges

  • The set of log entry types can vary dynamically. The user could, for example, add a new type of logs to their centralized logging setup. Similarly, the use-case might have changed such that a certain type of log entries is no longer ingested. We would ideally be able to accommodate these kinds of changes without requiring the user to take action and without losing the trained model.
  • The wide variety of log ingestion setups can lead to a larger number of log types falling into the "other" type because they don't have a proper dataset field. Would that "other" partition still make sense? Would that field become a requirement?
  • If there are separate categorization jobs for separate types of logs, can their anomalies be mixed in a visualization without misrepresenting the data?

Acceptance criteria

We have learned...

  • how to configure ML jobs to take advantage of the ecs.dataset field in log entries.
  • whether a "catch-all" partition makes sense.
  • what the implications are in terms of storage and compute resource usage.
  • how to dynamically include new datasets in the analysis as the use-cases evolve.
  • what the semantics of the category anomalies are and whether they can be compared across jobs.

Metadata

Metadata

Assignees

Labels

Feature:Logs UILogs UI featureR&DResearch and development ticket (not meant to produce code, but to make a decision)Team:Infra Monitoring UI - DEPRECATEDDEPRECATED - Label for the Infra Monitoring UI team. Use Team:obs-ux-infra_servicesv7.5.0

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions