[Epic] Implement RunArray (Run Length Encoding (RLE) / Run End Encoding (REE) support) #3520

Open
@alamb

Description

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Arrow has added REE support (apache/arrow#14176). Like dictionary arrays, REE arrays allow repeated values to be encoded in a space-efficient manner that also allows fast processing.
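
For readers new to the encoding, here is a minimal sketch of the idea in plain Rust. It is not the arrow-rs API: `RunEndEncoded`, `encode`, `len`, and `get` are hypothetical names chosen for illustration, and the use of `i32` run ends is an assumption. Each run of equal values is stored once, together with the exclusive logical index at which the run ends, so a lookup by logical index becomes a binary search over the run ends.

```rust
/// Hypothetical, simplified run-end encoded column (illustration only,
/// not the arrow-rs implementation).
#[derive(Debug, PartialEq)]
struct RunEndEncoded<T> {
    /// Exclusive logical end index of each run; strictly increasing.
    run_ends: Vec<i32>,
    /// One value per run, parallel to `run_ends`.
    values: Vec<T>,
}

/// Collapse consecutive equal values into (run_end, value) pairs.
fn encode<T: PartialEq + Clone>(input: &[T]) -> RunEndEncoded<T> {
    let mut run_ends: Vec<i32> = Vec::new();
    let mut values: Vec<T> = Vec::new();
    for (i, v) in input.iter().enumerate() {
        if values.last() == Some(v) {
            // Same value as the current run: move its end forward.
            *run_ends.last_mut().unwrap() = (i + 1) as i32;
        } else {
            // A new value starts a new run ending just after index `i`.
            run_ends.push((i + 1) as i32);
            values.push(v.clone());
        }
    }
    RunEndEncoded { run_ends, values }
}

impl<T> RunEndEncoded<T> {
    /// Logical (decoded) length: the end of the last run, or 0 if empty.
    fn len(&self) -> usize {
        self.run_ends.last().map_or(0, |&end| end as usize)
    }

    /// Value at a logical index, found by binary search over the run ends.
    fn get(&self, index: usize) -> Option<&T> {
        if index >= self.len() {
            return None;
        }
        // Index of the first run whose end is strictly greater than `index`.
        let run = self.run_ends.partition_point(|&end| (end as usize) <= index);
        self.values.get(run)
    }
}
```

Under these assumptions, encoding `["a", "a", "a", "b", "c", "c"]` stores three values with run ends `[3, 4, 6]` instead of six values, and `get(3)` resolves to `"b"` without decoding the column.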

Describe the solution you'd like
Implement REE in arrow-rs. Some likely candidate tasks:

  • Support in DataType
  • Support in ArrayData
  • New REE array
  • Support REE in IPC
  • Support REE in cast kernels
  • Support REE in compute kernels

Describe alternatives you've considered
Remaining tasks:

Additional context

Among other things, @brancz is working to improve aggregation performance in DataFusion using RunArrays, see

Labels: enhancement (any new improvement worthy of an entry in the changelog)
