
fix: preserve speaker_encoder in checkpoints and improve sample rate error#222

Open
haosenwang1018 wants to merge 1 commit into QwenLM:main from haosenwang1018:fix/preserve-speaker-encoder-in-checkpoints

Conversation

@haosenwang1018

Problem

Two issues in the finetuning script (sft_12hz.py / dataset.py):

1. speaker_encoder deleted from checkpoints — resume impossible

Lines 150-153 of sft_12hz.py explicitly delete speaker_encoder weights before saving:

drop_prefix = "speaker_encoder"
keys_to_drop = [k for k in state_dict.keys() if k.startswith(drop_prefix)]
for k in keys_to_drop:
    del state_dict[k]

When resuming from such a checkpoint, the missing weights leave model.speaker_encoder as None, so the next forward pass crashes with a NoneType error.
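The failure mode can be reproduced in a small standalone sketch. Plain dicts stand in for the real state_dict, and the function name save_checkpoint is hypothetical, but the key-dropping logic mirrors the quoted lines of sft_12hz.py:

```python
# Hypothetical minimal reproduction: the checkpoint logic strips every
# speaker_encoder.* entry, so a resumed model has no encoder weights.
def save_checkpoint(state_dict):
    drop_prefix = "speaker_encoder"
    # The problematic logic from sft_12hz.py lines 150-153:
    keys_to_drop = [k for k in state_dict if k.startswith(drop_prefix)]
    for k in keys_to_drop:
        del state_dict[k]
    return state_dict

full = {"speaker_encoder.weight": [1.0], "decoder.weight": [2.0]}
ckpt = save_checkpoint(dict(full))

print("speaker_encoder.weight" in ckpt)  # False
```

Because the saved checkpoint has no speaker_encoder.* keys, a resume has nothing to load for that module, and the forward pass then calls into an uninitialized (None) encoder.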

2. Unhelpful sample rate error message

dataset.py:105 asserts sr == 24000 but doesn't show the detected sample rate or suggest a fix, leaving users confused.

Fix

  1. Remove the speaker_encoder deletion — keep weights in checkpoints so resume works. A comment explains the rationale.
  2. Improve the assertion to show the actual sample rate and suggest the ffmpeg conversion command.
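One possible shape for the improved check, as a hedged sketch (TARGET_SR, check_sample_rate, and the exact error wording are assumptions for illustration, not the PR's literal diff):

```python
# Sketch of a friendlier sample-rate check replacing a bare
# `assert sr == 24000` (names and wording are hypothetical).
TARGET_SR = 24000

def check_sample_rate(sr: int, path: str = "<audio file>") -> None:
    """Raise a descriptive error showing the detected rate and a fix."""
    if sr != TARGET_SR:
        raise ValueError(
            f"Expected {TARGET_SR} Hz audio but got {sr} Hz for {path}. "
            f"Convert it first, e.g.: "
            f"ffmpeg -i {path} -ar {TARGET_SR} output_24k.wav"
        )

check_sample_rate(24000)  # passes silently
```

The error now tells the user both what was detected and how to fix it, instead of a bare AssertionError.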

Fixes #204

…error

Two fixes for the finetuning script:

1. Stop deleting speaker_encoder weights from checkpoints.
   The deletion makes resuming training impossible: the model
   crashes on the next forward pass with a NoneType error.
   A comment explains the rationale for keeping them.

2. Improve the sample rate assertion error message to show the
   actual detected sample rate and suggest the ffmpeg conversion
   command, reducing user confusion.

Fixes QwenLM#204

Signed-off-by: haosenwang1018 <haosenwang1018@users.noreply.github.com>
