Skip to content

python: export exact arrow reader schema#9

Draft
abnobdoss wants to merge 1 commit into
aba-142-arrow-reader-pyarrow-streamfrom
aba-162-arrow-reader-exact-schema
Draft

python: export exact arrow reader schema#9
abnobdoss wants to merge 1 commit into
aba-142-arrow-reader-pyarrow-streamfrom
aba-162-arrow-reader-exact-schema

Conversation

@abnobdoss
Copy link
Copy Markdown
Owner

Stack position: Rust PR after #8 (ABA-162).

Exports the Arrow schema used by pyiceberg_core ArrowReader from the scan task/record batch transformer path instead of naively mapping the requested Iceberg schema. Intended to support metadata and partition constant projections without PyArrow C stream schema mismatches.

Validation reported by implementer:

  • cargo check --manifest-path bindings/python/Cargo.toml
  • cd bindings/python && uv run maturin develop
  • cd bindings/python && uv run --no-sync pytest tests/test_scan.py

Current status: draft, under Codex red-team review before upstream consideration. Known red-team focus: first-task schema derivation across multi-task scans, public API exposure in record_batch_transformer, empty stream schema behavior.

@abnobdoss abnobdoss force-pushed the aba-162-arrow-reader-exact-schema branch 2 times, most recently from 8432ca9 to 0844a9e Compare May 25, 2026 02:08
@abnobdoss abnobdoss force-pushed the aba-162-arrow-reader-exact-schema branch from 0844a9e to 78787c0 Compare May 25, 2026 02:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant