Skip to content

[ML] Degrees of freedom argument errors while running the "gallery" dataset #20

Closed
@dolaru

Description

@dolaru

Spotted in 6.3.0

When the gallery dataset is analysed, there's several hundred error messages being outputted in Elasticsearch's logs with the following messages:

[2018-03-19T17:24:26,319][ERROR][o.e.x.m.j.p.l.CppLogMessageHandler] [gallery1_20180319-1719_000_0] [autodetect/8041] [CStatisticalTests.cc@103] Failed to compute significance Error in function fisher_f_distribution<double>::fisher_f_distribution: Degrees of freedom argument is 0, but must be > 0 ! d1 = 0, d2 = 8, x = 0
[2018-03-19T17:24:26,327][ERROR][o.e.x.m.j.p.l.CppLogMessageHandler] [gallery1_20180319-1719_000_0] [autodetect/8041] [CPeriodicityHypothesisTests.cc@251] Bad input: Error in function boost::math::chi_squared_distribution<double>::chi_squared_distribution: Degrees of freedom argument is 0, but must be > 0 !, df = 0, percentage = 90

Analysis config:

{
        "bucket_span" : "1h",
        "detectors" : [
          {
            "detector_description" : "rare by status over clientip",
            "function" : "rare",
            "by_field_name" : "status",
            "over_field_name" : "clientip"
          },
          {
            "detector_description" : "freq_rare by uri over clientip",
            "function" : "freq_rare",
            "by_field_name" : "uri",
            "over_field_name" : "clientip"
          },
          {
            "detector_description" : "high_count by status over clientip",
            "function" : "high_count",
            "by_field_name" : "status",
            "over_field_name" : "clientip"
          },
          {
            "detector_description" : "high_count by uri over clientip",
            "function" : "high_count",
            "by_field_name" : "uri",
            "over_field_name" : "clientip"
          },
          {
            "detector_description" : "sum(bytes) by method over clientip",
            "function" : "sum",
            "field_name" : "bytes",
            "by_field_name" : "method",
            "over_field_name" : "clientip"
          }
        ],
        "influencers" : [
          "clientip"
        ]
      }

Full log:
elasticsearch.log

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions