Skip to content

Memory estimation endpoint returns "0" for non-empty dataset. #49140

Closed
@przemekwitek

Description

@przemekwitek

Dataset: barcelona_accidents

With the following request:

{
  "source": {
    "index": "barcelona_accidents"
  },
  "analysis": {
    "outlier_detection": {}
  }
}

'_estimate_memory_usage' endpoint returns the following response:

{
  "expected_memory_without_disk" : "0",
  "expected_memory_with_disk" : "0"
}

Apparently problem lies in data extraction, as the following search query produced by data extractor yields no results:

{
  "size": 0,
  "query": {
    "bool": {
      "filter": [
        {
          "exists": {
            "field": "day",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.day",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.hour",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.location.lat",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.location.lon",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.mild_injuries",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.serious_injuries",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.vehicles_involved",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "doc.victims",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "hour",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "mild_injuries",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "serious_injuries",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "vehicles_involved",
            "boost": 1
          }
        },
        {
          "exists": {
            "field": "victims",
            "boost": 1
          }
        }
      ],
      "adjust_pure_negative": true,
      "boost": 1
    }
  },
  "track_total_hits": 2147483647
}

It starts working fine, however if fields without a doc. prefix are removed from the query.

Metadata

Metadata

Assignees

No one assigned

    Labels

    :mlMachine learning

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions