-
Notifications
You must be signed in to change notification settings - Fork 2.5k
fix(ingest): Repair affected logical timestamp milli tables #14161
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
84 commits
Select commit
Hold shift + click to select a range
cf502e3
current progress
c5e6421
seems to be working for spark non vectorized and avro
1f28581
filters working
19fdf52
prevent overflow
8a8194f
use read support instead of mapping function
a957db1
use repaired schema instead of doing operations after we read the data
9e660fb
add spark log support
290f90f
remove find cols to multiply class
4f6277f
log file changes as requested and set up the read supports for spark …
acf3ac9
hive working
2cb2fda
add individual test and fix issue with dropping messagetype logical
e325e68
revert calls for rewrite avro with extra param
a9a3958
revert config to prevent timestamp evolutions
22c2cd9
A few fixes
1552ff9
fix bug with field reuse in avro schema repair
1e68dd2
fix read parquet log block timestamp ntz
8e7454f
allow long to timestampntz without cast
aff83fa
refactor AvroSchemaRepair for performance and add unit tests
c82f3bc
refactor schema repair for performance and add testing
1a95e25
try fix other spark versions
3dc2aac
fix spark 3.3 build
1e9b7e4
fix spark3.4 build
b99c1a0
hopefully fix spark4
a4d08ef
fix issue with union schema, add table schema to cdc in missing place
d53c4e5
add spark cow read testing for repair
77b336d
building, and add spark mor tests
f8bdd66
forgot to add the zips
47afa1d
cow write testing
11f08d5
add mor testing
bc46e51
disable tests for spark 3.3
61eabe6
fix for spark 3.4
9b37bd4
fix spark33 for real
47ce682
remove fg reader test
2ce59e1
remove unneeded avro utils change
7009727
fix spark 4
f638cd8
fix timestamps in deltastreamer test
045130f
fix failing test
c05a361
vectorized fallback for 3.3 and 3.4
459cba7
fix vectorized fallback
c3cab96
add testing, and also fallback for local timestamp millis
a2b609f
add tests from java-parquet
536baf1
replace import with hardcode
2f9f582
fix long import names
8ed13d0
address most review comments
e9c2540
Fixing spark3.3 reads
nsivabalan 206fab4
minor renames
nsivabalan 9e06e70
Adding java docs
nsivabalan 43661f7
java docs
nsivabalan fefa7d8
refactor repairFooterSchema to a common module
20c217f
rename resolveNullableSchema method
dfaf9df
add check for no compactions and clustering on test table
36b1c60
Add checks for enableLogicalTimestampFieldRepair
f857017
Revert pom changes
yihua 3a07c44
Revert "refactor repairFooterSchema to a common module"
yihua 6422a6e
Fix scalastyle
yihua cd0bca5
Fix build on Spark 3.3 and 3.4
yihua 50ebfd4
Fix TestHoodieDeltaStreamer
yihua 294337d
Fix licences
yihua f3b52ae
Fix Spark 4
yihua 9ce9c57
Fix Spark 4 and renames
yihua 5fe65e6
Fix Spark40LegacyHoodieParquetFileFormat
yihua 1171713
Add flag for repairing loogical timestamp in avro log file reader
7304a96
Fix checkstyle
yihua 0ad14ab
Fix bug in config passing
yihua 38ba9b3
Disable TestAvroSchemaConverter to validate other tests
yihua 152d849
Update testNestedTypeVectorizedReadWithTypeChange test behavior
yihua 8ebc4d5
Another fix
yihua 5c4d16d
Fix the testLogicalTypesReadRepair
linliu-code 4034a51
Make schema fetching and parsing lazy for adjusting column and partit…
yihua 37ab24c
Fix testReadIndexDefFromStorage
yihua 3df5906
Enhance test
yihua efbfa7c
Disable failed tests for investigation
yihua 6db7a16
Fix test logic
yihua fddab79
Fix naming of configs
yihua 5e8bb52
fix col stats test failures
4446be8
Addressing Siva's feedback on fixing col stats for v1 index defn
nsivabalan 9f96890
Fixing tests in TestHoodieTableMetadataUtil
nsivabalan a04d3ca
Fix tests to follow out-of-the-box behavior
yihua 08f332a
Fix testReadIndexDefFromStorage
yihua 886029a
Remove test comment
yihua 3da1946
Fix typo
yihua 9b1c057
Fix ShowColumnStatsOverlapProcedure
yihua d57cede
Fix AvroSchemaRepair for Avro compatibility
yihua ea6eca9
Skip timestamp millis column for initial table version < 9
linliu-code File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.