Skip to content

[SPARK-40337][PS] Implement SeriesGroupBy.describe in pandas API on Spark#55686

Closed
devin-petersohn wants to merge 3 commits intoapache:masterfrom
devin-petersohn:SPARK-40337-series-groupby-describe
Closed

[SPARK-40337][PS] Implement SeriesGroupBy.describe in pandas API on Spark#55686
devin-petersohn wants to merge 3 commits intoapache:masterfrom
devin-petersohn:SPARK-40337-series-groupby-describe

Conversation

@devin-petersohn
Copy link
Copy Markdown
Contributor

What changes were proposed in this pull request?

Implement SeriesGroupBy.describe() in the pandas API on Spark

Why are the changes needed?

Missing API coverage

Does this PR introduce any user-facing change?

Yes

How was this patch tested?

Unit tests

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude Code (claude-opus-4-6)

…on Spark

Co-Authored-By: Claude <noreply@anthropic.com>
@devin-petersohn devin-petersohn changed the title [SPARK-40337][PYTHON] Implement SeriesGroupBy.describe in pandas API on Spark [SPARK-40337][PS] Implement SeriesGroupBy.describe in pandas API on Spark May 5, 2026
The copy() call inherited 2-level column_label_names from the groupby
aggregation result, but the new column_labels are single-level. Pass
column_label_names=None to reset it.

Co-Authored-By: Claude <noreply@anthropic.com>
Comment thread python/pyspark/pandas/groupby.py Outdated
dispersion and shape of a dataset's distribution, excluding
``NaN`` values.

.. versionadded:: 4.2.0
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same 4.3.0.

Co-Authored-By: Claude <noreply@anthropic.com>
@HyukjinKwon
Copy link
Copy Markdown
Member

Merged to master and branch-4.x.

HyukjinKwon pushed a commit that referenced this pull request May 6, 2026
…park

### What changes were proposed in this pull request?

Implement `SeriesGroupBy.describe()` in the pandas API on Spark

### Why are the changes needed?

Missing API coverage

### Does this PR introduce _any_ user-facing change?

Yes

### How was this patch tested?

Unit tests

### Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude Code (claude-opus-4-6)

Closes #55686 from devin-petersohn/SPARK-40337-series-groupby-describe.

Authored-by: Devin Petersohn <devin.petersohn@gmail.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 11d51c7)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants