python: expose pyarrow arrow scan reader #8

Draft
abnobdoss wants to merge 1 commit into aba-155-file-scan-task-input-context from aba-142-arrow-reader-pyarrow-stream

@abnobdoss
Owner

Stack position: Rust PR after #7 (ABA-142).

Adds pyiceberg_core.scan.ArrowReader, which returns a pyarrow.RecordBatchReader exported through arrow-rs IntoPyArrow and is backed by the iceberg-rust ArrowReader. Scope is intentionally narrow: the reader rejects metadata projections and partition-data tasks until the binding can export the exact Arrow schema the reader produces.
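
For reviewers who want a concrete picture of the narrowed scope, here is a minimal pure-Python sketch of a "reject before export" guard. It is not the binding's implementation: TaskSketch, check_exportable, and the has_partition_data flag are hypothetical names for illustration, and the set of metadata columns the binding actually checks is an assumption (the names below are reserved metadata columns from the Iceberg table spec).

```python
from dataclasses import dataclass
from typing import Iterable, Sequence

# A few reserved metadata column names from the Iceberg table spec.
# Assumption: the real binding may check a different set, or field IDs.
METADATA_COLUMNS = frozenset({"_file", "_pos", "_deleted", "_spec_id", "_partition"})


@dataclass
class TaskSketch:
    """Hypothetical stand-in for a file scan task; not the binding's type."""

    data_file_path: str
    projected_columns: Sequence[str]
    has_partition_data: bool = False  # hypothetical flag for partition constants


def check_exportable(tasks: Iterable[TaskSketch]) -> None:
    """Raise ValueError for any task whose output schema could not be exported exactly."""
    for task in tasks:
        # Metadata projections add columns that the exported schema would not describe.
        metadata = sorted(METADATA_COLUMNS.intersection(task.projected_columns))
        if metadata:
            raise ValueError(
                f"{task.data_file_path}: metadata projections are not supported: {metadata}"
            )
        # Partition constants are filled in by the reader, so they are rejected as well.
        if task.has_partition_data:
            raise ValueError(
                f"{task.data_file_path}: tasks carrying partition data are not supported"
            )
```

Running the check over every task before any stream is created means an unsupported scan fails up front with a clear error, rather than partway through consuming batches whose schema disagrees with the exported one.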

Validation:

  • cargo check --manifest-path bindings/python/Cargo.toml
  • uv run maturin develop
  • uv run --no-sync pytest tests/test_scan.py

Challenger loop: red-team found schema export mismatches for metadata and partition constants; both are now rejected before pyarrow stream export. Final review green.
