You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GH-47027: [C++][Parquet] Fix repeated column pages not being written when reaching page size limit (#47032)
### Rationale for this change
Ensures Parquet pages are written when the buffered data reaches the configured page size, while also ensuring pages are only split on record boundaries when required.
Without this fix, page sizes can grow unbounded until the row group is closed.
### What changes are included in this PR?
Fixes off-by-one error in logic to control when pages can be written.
### Are these changes tested?
Yes, added a new unit test.
### Are there any user-facing changes?
**This PR contains a "Critical Fix".**
This bug could cause a crash when writing a large number of rows of a repeated column and reaching a page size > max int32.
* GitHub Issue: #47027
Authored-by: Adam Reeve <adreeve@gmail.com>
Signed-off-by: Adam Reeve <adreeve@gmail.com>
0 commit comments