-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AWS: Change GlueCatalog skip archive default to true #6916
AWS: Change GlueCatalog skip archive default to true #6916
Conversation
+1 for the changing default, also linking previous ticket #5965 (comment) should we update the doc as well in this PR. |
@singhpk234 Ah yes good point we should update the AWS integration docs in this PR, will update |
221f64e
to
ce863fc
Compare
ce863fc
to
ab5a7a1
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks for raising this, this has been a complaint for a long time
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, Thanks @amogh-jahagirdar !
Thanks for the fix @amogh-jahagirdar , and thanks for @singhpk234 and @yyanyy for reviews! |
Currently GLUE_CATALOG_SKIP_ARCHIVE_DEFAULT is set to false. This means every glue commit will archive older versions of table, which quickly hits the Glue limit (a max of 100k versions gets maintained) for streaming use cases. After discussing with users of Glue Catalog, came to the conclusion that it's better to change this default behavior.
CC: @jackye1995 @singhpk234