Fix bug in prefill_chunk_size that ignores disable_compile flag #38067
This PR fixes a bug in the `prefill_chunking` function where the compilation check is not performed before calling `get_compiled_call()`.

When using `prefill_chunk_size > 0` in a `GenerationConfig`, the model's forward function is always compiled, even if `disable_compile=True` is specified. This happens because `prefill_chunking` calls `get_compiled_call()` directly, without checking whether compilation should occur. The fix adds that check, so models aren't compiled when `disable_compile=True` is set.

Also fixed a typo in the error message ("chunkink" -> "chunking").
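The pattern of the fix can be sketched as follows. This is a simplified, illustrative stand-in, not the actual transformers code: `FakeModel` and `prefill_chunking_forward` are hypothetical names, and `get_compiled_call()` / `disable_compile` mirror the identifiers mentioned above.

```python
# Illustrative sketch of the fix (simplified; not the real transformers source).
# The real get_compiled_call() wraps the model's forward with torch.compile;
# here we just record that compilation happened.

class FakeModel:
    def __init__(self):
        self.compiled = False

    def forward(self, x):
        return x * 2

    def get_compiled_call(self):
        # Stand-in for the real method that returns a compiled forward.
        self.compiled = True
        return self.forward


def prefill_chunking_forward(model, generation_config):
    """Pick the forward callable, honoring disable_compile (the fix)."""
    if getattr(generation_config, "disable_compile", False):
        # Fixed path: skip compilation entirely.
        return model.forward
    # Original (buggy) path: compilation always happened here.
    return model.get_compiled_call()
```

With `disable_compile=True`, the chunked-prefill loop now uses the plain forward and the model is never compiled; without it, behavior is unchanged.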
Before submitting
Yes, though this is my first PR here, so I hope it's ok.
As far as I understand it, there's no need to change the documentation
No, tested with a simple generation script using prefill_chunk_size=8 and disable_compile=True. Before the fix, torch.compile was being called despite disable_compile=True. After the fix, no compilation occurs as expected.
Who can review?
@gante