[SPARK-30505][DOCS] Deprecate Avro option ignoreExtension in sql-data-sources-avro.md #27194
This follows up #27174 (comment)
docs/sql-data-sources-avro.md
Outdated
@@ -230,7 +230,7 @@ Data source options of Avro can be set via:
<tr>
<td><code>ignoreExtension</code></td>
<td>true</td>
<td>The option controls ignoring of files without <code>.avro</code> extensions in read.<br> If the option is enabled, all files (with and without <code>.avro</code> extension) are loaded.</td>
<td>The option controls ignoring of files without <code>.avro</code> extensions in read.<br> If the option is enabled, all files (with and without <code>.avro</code> extension) are loaded.<br> The option has been already deprecated, and it will be removed in the future releases. Please use the general data source option <code>pathGlobFilter</code> for filtering file names.</td>
Thanks for the work.
Could you also create a doc entry for the option pathGlobFilter and link to it here in this PR?
I added it here
Test build #116659 has finished for PR 27194 at commit
Test build #116660 has finished for PR 27194 at commit
Hi @MaxGekk, sorry, I meant a new document for the option.
I have come up with a good idea: https://issues.apache.org/jira/browse/SPARK-30506
Are you interested in writing the new doc? It should be very helpful for users.
Yeah, discussed with @gengliangwang offline. I think that's a better option.
I described the global option because it has already been described for each load method (json, text, parquet ...) in DataFrameReader, for example
No, I am not. If you think, description of the option
@MaxGekk I see. I didn't know that the option I think reverting the description of the option pathGlobFilter in Still, we need to document the options in SPARK-30506. I will find someone else or do it myself.
This reverts commit 91a1dee.
Test build #116687 has finished for PR 27194 at commit
Please update the PR description. The following looks incorrect. It's an ORC-related PR which is already closed.
Closes #27179
@dongjoon-hyun I have fixed the PR number related to Avro
docs/sql-data-sources-avro.md
Outdated
@@ -230,7 +230,7 @@ Data source options of Avro can be set via:
<tr>
<td><code>ignoreExtension</code></td>
<td>true</td>
<td>The option controls ignoring of files without <code>.avro</code> extensions in read.<br> If the option is enabled, all files (with and without <code>.avro</code> extension) are loaded.</td>
<td>The option controls ignoring of files without <code>.avro</code> extensions in read.<br> If the option is enabled, all files (with and without <code>.avro</code> extension) are loaded.<br> The option has been already deprecated, and it will be removed in the future releases. Please use the general data source option <code>pathGlobFilter</code> for filtering file names.</td>
been already deprecated -> been deprecated.
Also, please provide a new screenshot of the updated page in the PR description.
I have attached the screenshot.
Test build #116757 has finished for PR 27194 at commit
LGTM
I am just going to merge this - for other file-based source options, we could do it separately.
+1, late LGTM.
What changes were proposed in this pull request?
Updated docs/sql-data-sources-avro.md, and added a few sentences about the Avro option ignoreExtension, which is already deprecated in code.
Closes #27174
Why are the changes needed?
To make the user docs consistent with the code, where ignoreExtension has already been deprecated; see spark/external/avro/src/main/scala/org/apache/spark/sql/avro/AvroUtils.scala, lines 46 to 47 in 3663dbe.
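For readers migrating off ignoreExtension, here is a rough sketch of the file-name filtering that the updated doc text describes. It is a pure-Python model, not Spark code: the helper select_files and the sample paths are hypothetical, and fnmatch is only a stand-in for the glob matching Spark performs. In Spark itself the suggested replacement would presumably be to set the reader option pathGlobFilter to a pattern such as "*.avro".

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath


def select_files(paths, ignore_extension=True, path_glob_filter=None):
    """Hypothetical model of which files an Avro read would pick up.

    ignore_extension=True  -> load every file, with or without .avro
    ignore_extension=False -> load only files whose name ends in .avro
    path_glob_filter       -> additionally keep only names matching the glob
    """
    selected = []
    for path in paths:
        # Only the file name (last path component) is matched.
        name = PurePosixPath(path).name
        if not ignore_extension and not name.endswith(".avro"):
            continue
        if path_glob_filter is not None and not fnmatchcase(name, path_glob_filter):
            continue
        selected.append(path)
    return selected


# Sample paths are made up for illustration.
files = ["data/part-0000.avro", "data/part-0001", "data/notes.txt"]

# Deprecated style: turn ignoreExtension off to skip non-.avro files.
legacy = select_files(files, ignore_extension=False)

# Suggested style: leave ignoreExtension alone and filter by file name.
replacement = select_files(files, path_glob_filter="*.avro")

# Default behavior: everything is loaded.
everything = select_files(files)
```

In this model both styles select the same single .avro file, which is why the general glob option can take over from the Avro-specific one.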
Does this PR introduce any user-facing change?
No
How was this patch tested?
By building the docs.