Live - "client-content" without end of turn blocks voice response. #682

tiagoefreitas · 2025-02-16T13:15:18Z

Description of the bug:

I was trying to add hidden context to Gemini Live audio, so that I can give the model instructions during an audio conversation that stay hidden from the user (not only at the beginning). The Gemini docs say we can add previous context with clientContent, but the model always responds, even if I add the model response as a turn with turnComplete=true, like this:

{
  "clientContent": {
    "turns": [
      {
        "role": "user",
        "parts": [
          {
            "text": "Context xxx"
          }
        ]
      },
      {
        "role": "model",
        "parts": [
          {
            "text": "ok."
          }
        ]
      }
    ],
    "turnComplete": true
  }
}

The model will still reply in audio with "ok" again.

And if turncomplete is false, the model will not reply to audio content until I send a text message to end the turn.
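
For anyone reproducing this, here is a minimal sketch of how the two variants of the message above can be built. It only produces the JSON payload; the helper name build_client_content and the (role, text) tuple shape are my own illustration, not part of any SDK, and sending the payload over the Live WebSocket is left out.

```python
import json


def build_client_content(turns, turn_complete):
    """Build a clientContent message as a JSON string.

    `turns` is a list of (role, text) pairs. This helper is hypothetical
    and only mirrors the JSON structure shown in this issue.
    """
    message = {
        "clientContent": {
            "turns": [
                {"role": role, "parts": [{"text": text}]}
                for role, text in turns
            ],
            "turnComplete": turn_complete,
        }
    }
    return json.dumps(message)


# Hidden context followed by a canned model acknowledgement.
context = [("user", "Context xxx"), ("model", "ok.")]

# Variant 1: turn closed. Observed: the model still answers "ok" in audio.
closed = build_client_content(context, turn_complete=True)

# Variant 2: turn left open. Observed: audio input gets no reply until
# a later message ends the turn.
open_turn = build_client_content(context, turn_complete=False)
```

The only difference between the two payloads is the turnComplete flag, which makes it easy to toggle between the two behaviors described here.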

If I instruct the model in the prompt not to reply, it still says "ok" or something similar about 90% of the time; it seems the model was trained to always reply.

Actual vs expected behavior:

No response

Any other information you'd like to share?

No response

MarkDaoust · 2025-02-18T15:38:01Z

Is this something you can't do with the system instructions at the start of the conversation?

> And if turncomplete is false, the model will not reply to audio content until I send a text message to end the turn.

I haven't tried mixing in client content without an end of turn. But this is not the behavior I would expect.

tiagoefreitas · 2025-02-18T16:10:18Z

@MarkDaoust it doesn't follow the instructions 90% of the time; it still replies.
This is kind of expected, as the model was likely trained on request/reply pairs.

But with turnComplete=true and a model-role turn, it should not reply again, yet it does.

MarkDaoust · 2025-02-20T17:50:51Z

Thanks for the feedback, I've raised this with the internal API team.

MarkDaoust · 2025-03-19T19:53:12Z

We're going to make turn_complete the default so this doesn't happen by accident.

gmKeshari added the labels type:bug, status:triaged, and component:python sdk · Feb 17, 2025

gmKeshari assigned pamorgan · Feb 17, 2025

MarkDaoust added the label component:api and removed the label component:python sdk · Feb 18, 2025

MarkDaoust changed the title from "Live context" to "Live - "client-content" without end of turn blocks voice response." · Feb 20, 2025

MarkDaoust assigned MarkDaoust and unassigned pamorgan · Mar 19, 2025

MarkDaoust added the label type:feature request and removed the label type:bug · Apr 1, 2025
