[GLUTEN-11539][VL] Improve error message when spark.io.compression.codec=none #12360

Open
brijrajk wants to merge 1 commit into apache:main from brijrajk:fix/11539-none-codec-message

brijrajk (Contributor) commented Jun 24, 2026

What changes are proposed in this pull request?

Follow-up to #12333.

When spark.io.compression.codec=none, the error message from #12333 told users to configure spark.gluten.sql.columnar.shuffle.codec, which is misleading if their intent is to disable shuffle compression.

This PR special-cases none so that the error message also points users to spark.shuffle.compress=false, which is the correct setting for disabling Gluten native shuffle compression (already handled in ColumnarShuffleWriter and ColumnarBatchSerializer).

Before (from #12333):

Gluten shuffle only supports lz4, zstd. none is not supported.
You may configure spark.gluten.sql.columnar.shuffle.codec to lz4 or zstd.

After:

Gluten shuffle does not support codec 'none'. To disable shuffle compression,
set spark.shuffle.compress=false. To use a supported codec, set
spark.gluten.sql.columnar.shuffle.codec to lz4 or zstd.
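
For readers who want to see the shape of the change, here is a minimal, self-contained sketch of the special-casing. It is not the code in GlutenShuffleUtils.scala: the object name ShuffleCodecCheck, the method checkCodec, the lowercase normalization, and the assumption that other unsupported codecs keep the wording from #12333 are all hypothetical. Only the config keys and the message text are taken from this description.

```scala
// Hypothetical sketch; names are illustrative, not taken from GlutenShuffleUtils.scala.
object ShuffleCodecCheck {
  // Codecs named as supported in the error messages above.
  val supportedCodecs: Set[String] = Set("lz4", "zstd")

  // Config keys copied from the messages in this PR.
  val glutenCodecKey = "spark.gluten.sql.columnar.shuffle.codec"
  val shuffleCompressKey = "spark.shuffle.compress"

  /** Returns the normalized codec name, or throws with an actionable message. */
  def checkCodec(codec: String): String = {
    // Assumption: codec names are compared case-insensitively.
    val normalized = codec.toLowerCase(java.util.Locale.ROOT)
    val supported = supportedCodecs.toSeq.sorted.mkString(", ") // "lz4, zstd"
    val choices = supportedCodecs.toSeq.sorted.mkString(" or ") // "lz4 or zstd"
    normalized match {
      case c if supportedCodecs.contains(c) => c
      case "none" =>
        // Special case: the user most likely wants compression turned off.
        throw new IllegalArgumentException(
          "Gluten shuffle does not support codec 'none'. To disable shuffle compression, " +
            s"set $shuffleCompressKey=false. To use a supported codec, set " +
            s"$glutenCodecKey to $choices.")
      case other =>
        // Assumption: other unsupported codecs keep the generic wording from #12333.
        throw new IllegalArgumentException(
          s"Gluten shuffle only supports $supported. $other is not supported. " +
            s"You may configure $glutenCodecKey to $choices.")
    }
  }
}
```

Under these assumptions, ShuffleCodecCheck.checkCodec("none") throws with the new message, while an unsupported value such as "snappy" still gets the generic one.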

Files changed

  • GlutenShuffleUtils.scala — special-cases none with a more actionable error message
  • MiscOperatorSuite.scala — updates the none regression test assertion to match

How was this patch tested?

MiscOperatorSuite — 97/97 passed locally (Spark 4.0, Velox backend).


Was this patch authored or co-authored using generative AI tooling?

Yes. Claude Code (claude-sonnet-4-6) was used as an AI assistant during development.

Related issue: #11539

…dec=none

Follow-up to apache#12333. When spark.io.compression.codec is set to none,
the previous error message told users to configure
spark.gluten.sql.columnar.shuffle.codec, which is misleading if the
intent is to disable compression.

Special-case none to also point users to spark.shuffle.compress=false,
which is the correct knob to disable Gluten native shuffle compression
(handled in ColumnarShuffleWriter and ColumnarBatchSerializer).
github-actions bot added the CORE (works for Gluten Core) and VELOX labels Jun 24, 2026
github-actions bot commented:

Run Gluten Clickhouse CI on x86

