Skip to content

NestedExtensionArray __get_item__ loses pyarrow dtypes #321

@dougbrn

Description

@dougbrn

Bug report
NestedExtensionArray __get_item__ appears to not preserve pyarrow dtypes for doubly nested datasets, where the deepest layer still has the pyarrow dtypes, but any middle layers revert to numpy dtypes, see the failed test in #320 for a reproducer. The failing test is specifically designed to catch this, but it was passing previously because prior to #320, generate_data wasn't fully using pyarrow dtypes. In #320, I started trying to fix this, but it had cascading changes that were causing more and more tests to fail, so instead I just want to capture this as an issue for now.

Before submitting
Please check the following:

  • I have described the situation in which the bug arose, including what code was executed, information about my environment, and any applicable data others will need to reproduce the problem.
  • I have included available evidence of the unexpected behavior (including error messages, screenshots, and/or plots) as well as a description of what I expected instead.
  • If I have a solution in mind, I have provided an explanation and/or pseudocode and/or task list.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions