-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Open
Labels
enhancementAny new improvement worthy of a entry in the changelogAny new improvement worthy of a entry in the changelog
Description
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
The arrow-testing repository has several data files that caused issues with the C/C++ implementation
- https://github.com/apache/arrow-testing/tree/master/data/arrow-ipc-stream
- https://github.com/apache/arrow-testing/tree/master/data/arrow-ipc-file
- https://github.com/apache/arrow-testing/tree/master/data/parquet/fuzzing
- https://github.com/apache/arrow-testing/tree/master/data/csv/fuzzing
It would be nice to add tests in this repository that ensure the parquet/arrow/csv readers behave "nicely" when reading such files.
The definition of "nice" would be "no panics, and errors when appropriate"
Describe the solution you'd like
- Add tests to try and read all the above mentioned invalid files
- If any cause panics, temporarily skip them in the tests and file a ticket to track fixing the panics
Describe alternatives you've considered
Additional context
This was inspired while reviewing this doc from @pitrou on arrow security guidelines:
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementAny new improvement worthy of a entry in the changelogAny new improvement worthy of a entry in the changelog