Is your feature request related to a problem or challenge? Please describe what you are trying to do.
The arrow-testing repository has several data files that caused issues with the C/C++ implementation
It would be nice to add tests in this repository that ensure the parquet/arrow/csv readers behave "nicely" when reading such files.
The definition of "nice" would be "no panics, and errors when appropriate"
Describe the solution you'd like
- Add tests to try and read all the above mentioned invalid files
- If any cause panics, temporarily skip them in the tests and file a ticket to track fixing the panics
Describe alternatives you've considered
Additional context
This was inspired while reviewing this doc from @pitrou on arrow security guidelines:
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
The arrow-testing repository has several data files that caused issues with the C/C++ implementation
It would be nice to add tests in this repository that ensure the parquet/arrow/csv readers behave "nicely" when reading such files.
The definition of "nice" would be "no panics, and errors when appropriate"
Describe the solution you'd like
Describe alternatives you've considered
Additional context
This was inspired while reviewing this doc from @pitrou on arrow security guidelines: