Another thing that may need to be added to an eventual FAQ, some information from the code base:

FAQ: Context Length and Lyrics Cutoff on Long Duration Songs

Question
Answer

Yes, there is a hard-coded context length, and yes, it's likely what's causing the lyrics cutoff. Here's the technical breakdown:

The 4096 Token Context Window

The 5Hz Language Model (LM) has a hard-coded limit of 4096 tokens. This is the total context window: it must fit both the input prompt AND the generated output.

How Duration Maps to Tokens

The model is called "5Hz" because it generates 5 audio semantic tokens per second. So a song of N seconds needs about 5 × N audio tokens.
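As a rough illustration of the duration-to-token mapping, here is a minimal sketch that assumes only the 5 tokens per second rate stated above. The function name `audio_tokens_for_duration` is made up for this example and is not taken from the code base. It counts audio semantic tokens only; any metadata the model also emits comes on top of this.

```python
import math

# "5Hz": 5 audio semantic tokens per second of audio (rate taken from the post).
TOKENS_PER_SECOND = 5


def audio_tokens_for_duration(duration_s: float) -> int:
    """Estimate the audio semantic tokens needed for a song of this length.

    Hypothetical helper for illustration; not part of the project's API.
    """
    if duration_s < 0:
        raise ValueError("duration must be non-negative")
    # Round up so a fractional second still gets its tokens.
    return math.ceil(duration_s * TOKENS_PER_SECOND)


if __name__ == "__main__":
    for seconds in (100, 200, 204):
        print(f"{seconds}s -> {audio_tokens_for_duration(seconds)} audio tokens")
```

Going from 200s to 204s only changes the estimate by 20 tokens, which is why a small bump in duration makes little difference to the overall budget.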
The Budget Breakdown

The code at […] So for a 200s song: […]
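The budget idea can be sketched as simple arithmetic: prompt, metadata, and audio codes all have to fit inside one context window. This is a minimal sketch, not the project's actual code. The function names and the prompt and metadata token counts in the usage example are hypothetical, and the window size is a parameter because the effective limit may differ from 4096 on some setups.

```python
# Values taken from the post: 4096-token window, 5 audio tokens per second.
DEFAULT_CONTEXT_WINDOW = 4096
TOKENS_PER_SECOND = 5


def remaining_budget(prompt_tokens: int, metadata_tokens: int, duration_s: int,
                     context_window: int = DEFAULT_CONTEXT_WINDOW) -> int:
    """Tokens left after prompt + metadata + audio codes (negative = overflow).

    Hypothetical helper; the token counts passed in are the caller's estimates.
    """
    audio_tokens = duration_s * TOKENS_PER_SECOND
    return context_window - (prompt_tokens + metadata_tokens + audio_tokens)


def max_duration_s(prompt_tokens: int, metadata_tokens: int,
                   context_window: int = DEFAULT_CONTEXT_WINDOW) -> int:
    """Longest whole-second duration whose audio codes still fit in the window."""
    free = context_window - prompt_tokens - metadata_tokens
    return max(free, 0) // TOKENS_PER_SECOND


if __name__ == "__main__":
    # Illustrative numbers only: a 500-token prompt and 100 metadata tokens.
    print(remaining_budget(500, 100, 200))  # tokens left for a 200s song
    print(max_duration_s(500, 100))         # longest song that would still fit
```

A negative result from `remaining_budget` means generation would run out of context before the last audio codes, which matches the symptom of a song ending a few words early.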
At first glance, 4,032 seems like enough for 1,500 tokens of output. But here's the catch: this is the total sequence length, including the input prompt. The vLLM engine treats […]

Why Lyrics Cut Off

The constrained logits processor ([…]
But if the model hits the […]

Increasing from 200s to 204s doesn't help because the problem isn't being a few tokens short; the context window is being exhausted by the combination of prompt + metadata + audio codes.

The Real Limits

Roughly speaking, with […]
If you're on a low-VRAM GPU where […]

Suggestions
How can I use "init_llm=false" when I am using ComfyUI and Ace-Step in the Pinokio setup?
Version 1.5 seems like a huge downgrade in my opinion. Version 1 was producing far better songs / music.
Does the model have a hard-coded context length to work with?
Is this why, with the duration set to 200 seconds, it still runs out of time and cuts the lyrics off abruptly (maybe 3 words) before the end of the song?
If you then increase the duration to 204 seconds, it still misses the last few words.
It feels like the model context is completely running out by the end of the song.