Improving documentation about es.read.fields.include, and fixing a related bug #1822

masseyke · 2021-12-14T20:48:18Z

The documentation for es.read.fields.include left room for confusion. Also, the behavior differed between Spark 1,
Spark 2, and Spark 3: setting "es.read.fields.include" to part of a hierarchy caused a NullPointerException in Spark 1
and Spark 2.
Closes #1784

masseyke · 2021-12-14T20:55:17Z

This is confusing, and it is easiest for me to think about with concrete examples. I added them to the integration tests in this PR, but it's probably easier to see without code. Using the schema from #1784:

root
 |-- features: struct (nullable = true)
 |    |-- hashtags: array (nullable = true)
 |    |    |-- element: struct (containsNull = true)
 |    |    |    |-- count: long (nullable = true)
 |    |    |    |-- text: string (nullable = true)

Say I have this data in Elasticsearch:

features:
    hashtags:
        [
            {"text": "foo"},
            {"count":"bar"}
        ]

If I leave "es.read.fields.include" unset, or if I set it to "features.hashtags.*", I get:

features:
    hashtags:
        [
            {"text": "foo"},
            {"count":"bar"}
        ]

If I set "es.read.fields.include" to "features.hashtags", I get:

features:
    hashtags:
        [
            {},
            {}
        ]

If I set "es.read.fields.include" to "features.hashtags.text", I get:

features:
    hashtags:
        [
            {"text": "foo"},
            {}
        ]
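
For anyone who does prefer code, here is a rough Python sketch of the three cases above. It is a toy model only, not the connector's implementation: `apply_include` and `is_ancestor` are hypothetical helpers, and the matching rule (keep a leaf only when its full dotted path matches the pattern, keep a container when it matches or is an ancestor of the pattern) is an assumption chosen because it reproduces the outputs listed in this comment.

```python
from fnmatch import fnmatchcase

# Sample document from this comment.
DOC = {"features": {"hashtags": [{"text": "foo"}, {"count": "bar"}]}}


def is_ancestor(path, pattern):
    """True if `path` is a strict dotted prefix of `pattern`."""
    return pattern.startswith(path + ".")


def apply_include(value, pattern, path=""):
    """Toy model of an include filter (hypothetical, for illustration only).

    Leaves are kept only if their full dotted path matches `pattern`.
    Containers are kept if they match or lie on the way to the pattern,
    and their contents are then filtered recursively.
    """
    if pattern is None:  # setting left unset: nothing is filtered
        return value
    if isinstance(value, dict):
        kept = {}
        for key, child in value.items():
            child_path = f"{path}.{key}" if path else key
            matches = fnmatchcase(child_path, pattern)
            if isinstance(child, (dict, list)):
                if matches or is_ancestor(child_path, pattern):
                    kept[key] = apply_include(child, pattern, child_path)
            elif matches:
                kept[key] = child
        return kept
    if isinstance(value, list):
        # Array elements share the path of the array field itself.
        return [apply_include(item, pattern, path) for item in value]
    return value


if __name__ == "__main__":
    for pattern in (None, "features.hashtags.*", "features.hashtags",
                    "features.hashtags.text"):
        print(pattern, "->", apply_include(DOC, pattern))
```

The point of the model is that naming a struct ("features.hashtags") selects the container but none of its children, which is why the elements come back empty unless a wildcard or a specific child is given.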

jbaiera

LGTM, thanks for clarifying these.

…lated bug (#1822) (#1833) The documentation for es.read.fields.include left room for confusion. Also, the behavior differed between Spark 1, Spark 2, and Spark 3: setting "es.read.fields.include" to part of a hierarchy caused a NullPointerException in Spark 1 and Spark 2. Closes #1784

…lated bug (#1822) (#1834) The documentation for es.read.fields.include left room for confusion. Also, the behavior differed between Spark 1, Spark 2, and Spark 3: setting "es.read.fields.include" to part of a hierarchy caused a NullPointerException in Spark 1 and Spark 2. Closes #1784

Improving documentation about es.read.fields.include, and fixing a related bug

2bf5a9f

masseyke requested a review from jbaiera December 14, 2021 20:48

masseyke added >docs bug labels Dec 14, 2021

jbaiera approved these changes Dec 14, 2021

masseyke merged commit 4a14860 into elastic:master Dec 14, 2021

masseyke deleted the fix/read-fields-include branch December 14, 2021 23:09

masseyke added the v8.1.0 label Dec 14, 2021

masseyke mentioned this pull request Dec 14, 2021

Elasticsearch return empty list of objects to spark #1784

Closed

masseyke added v7.17.0 v8.0.0-rc2 labels Dec 20, 2021

masseyke added a commit to masseyke/elasticsearch-hadoop that referenced this pull request Dec 20, 2021

Improving documentation about es.read.fields.include, and fixing a related bug (elastic#1822)

3fd53f7

masseyke added a commit to masseyke/elasticsearch-hadoop that referenced this pull request Dec 20, 2021

Improving documentation about es.read.fields.include, and fixing a related bug (elastic#1822)

2cead85

masseyke mentioned this pull request Jan 12, 2022

[DOCS] Add 8.0.0-rc1 release notes #1859

Merged

jrodewig removed the >docs label Jan 28, 2022

jrodewig mentioned this pull request Jan 28, 2022

[DOCS] Add 7.17.0 release notes #1891

Merged

jrodewig added :Spark v8.0.0-rc1 and removed v8.0.0-rc2 labels Jan 28, 2022

masseyke mentioned this pull request Feb 8, 2022

[DOCS] Add 8.0.0 GA release notes #1902

Merged
