
Conversation

@OwenDavisBC commented on Oct 17, 2025:

Fixes #510: Gemini thoughts not correctly accumulated when streaming enabled.

@OwenDavisBC marked this pull request as ready for review on October 17, 2025 16:21
@OwenDavisBC force-pushed the ISSUE-510 branch 2 times, most recently from bbf6d0c to 33ac74f on October 20, 2025 17:01
@Poggecci self-requested a review on October 21, 2025 17:15
@Poggecci (Contributor) left a comment:

Hey Owen. Thank you for the contribution!

This PR seems to both resolve the thoughts accumulation issue and change what content we send in our requests to the model. Is there a reason you've coupled these two changes? The original rationale behind stripping thoughts was a naive form of context management (although the relevance of this depends entirely on how the GenAI SDK handles the thoughts we provide), but since we've had this behavior since release, a change to it merits its own PR.

Happy to approve if just the accumulation is kept in this PR or if we have some further discussion on the matter.
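
For context on what "stripping thoughts" means here: each part of a content entry can be flagged as model thought output, and stripping removes those flagged parts before the request is sent. Below is a minimal, self-contained sketch of that idea; SimplePart and SimpleContent are hypothetical stand-ins for the GenAI SDK's Part and Content types, not code from this repository.

import java.util.List;
import java.util.stream.Collectors;

public class StripThoughtsSketch {

  // Stand-in for a Part: some text plus a flag saying whether it is model "thought" output.
  record SimplePart(String text, boolean thought) {}

  // Stand-in for a Content: a role plus its parts.
  record SimpleContent(String role, List<SimplePart> parts) {}

  // Drop every thought part from every content entry. A content whose parts were all
  // thoughts is left with an empty part list, which is the situation discussed below.
  static List<SimpleContent> stripThoughts(List<SimpleContent> contents) {
    return contents.stream()
        .map(c -> new SimpleContent(
            c.role(),
            c.parts().stream().filter(p -> !p.thought()).collect(Collectors.toList())))
        .collect(Collectors.toList());
  }

  public static void main(String[] args) {
    List<SimpleContent> history = List.of(
        new SimpleContent("user", List.of(new SimplePart("What is 2 + 2?", false))),
        new SimpleContent("model", List.of(
            new SimplePart("Let me add the numbers.", true),  // thought-only part
            new SimplePart("2 + 2 = 4", false))));
    System.out.println(stripThoughts(history));
  }
}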

    } else {
      if (accumulatedThoughtText.length() > 0
          && GeminiUtil.shouldEmitAccumulatedText(currentProcessedLlmResponse)) {
        LlmResponse aggregatedTextResponse =
@Poggecci (inline review comment): aggregatedThoughtResponse here?
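
For readers following along, here is a rough sketch of the accumulation pattern the snippet above implements, reduced to plain strings: thought text from streamed chunks is buffered and flushed as one aggregated response once non-thought output arrives. The Chunk type and the flush condition are simplified assumptions, not the PR's actual LlmResponse handling.

import java.util.ArrayList;
import java.util.List;

public class ThoughtAccumulationSketch {

  // Simplified stand-in for a streamed partial response.
  record Chunk(String text, boolean thought) {}

  static List<String> process(List<Chunk> chunks) {
    List<String> emitted = new ArrayList<>();
    StringBuilder accumulatedThoughtText = new StringBuilder();
    for (Chunk chunk : chunks) {
      if (chunk.thought()) {
        accumulatedThoughtText.append(chunk.text());  // keep buffering thought text
      } else {
        if (accumulatedThoughtText.length() > 0) {
          // flush the buffered thoughts as a single aggregated response
          emitted.add("[thought] " + accumulatedThoughtText);
          accumulatedThoughtText.setLength(0);
        }
        emitted.add(chunk.text());
      }
    }
    if (accumulatedThoughtText.length() > 0) {
      emitted.add("[thought] " + accumulatedThoughtText);  // flush trailing thoughts at stream end
    }
    return emitted;
  }

  public static void main(String[] args) {
    System.out.println(process(List.of(
        new Chunk("Thinking about", true),
        new Chunk(" the answer...", true),
        new Chunk("The answer is 4.", false))));
  }
}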

@OwenDavisBC (Author) replied:

@Poggecci thank you for taking a look. The issue with stripping thoughts is that it will ultimately lead to empty parts being sent to the Gemini API now that we are accumulating thought-only parts. I can add some more processing to remove those from the LLM request here:

  public Flowable<LlmResponse> generateContent(LlmRequest llmRequest, boolean stream) {
    llmRequest = GeminiUtil.prepareGenenerateContentRequest(llmRequest, !apiClient.vertexAI());
    GenerateContentConfig config = llmRequest.config().orElse(null);
    String effectiveModelName = llmRequest.model().orElse(model());
    logger.trace("Request Contents: {}", llmRequest.contents());
    logger.trace("Request Config: {}", config);
    if (stream) {
      logger.debug("Sending streaming generateContent request to model {}", effectiveModelName);
      CompletableFuture<ResponseStream<GenerateContentResponse>> streamFuture =
          apiClient.async.models.generateContentStream(
              effectiveModelName, llmRequest.contents(), config);
      return Flowable.defer(
          () ->
              processRawResponses(
                  Flowable.fromFuture(streamFuture).flatMapIterable(iterable -> iterable)));
    } else {
      logger.debug("Sending generateContent request to model {}", effectiveModelName);
      return Flowable.fromFuture(
          apiClient
              .async
              .models
              .generateContent(effectiveModelName, llmRequest.contents(), config)
              .thenApplyAsync(LlmResponse::create));
    }
  }
However, that would not match what I see in adk-python.
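
As a sketch of the extra processing mentioned above (again with hypothetical stand-in types rather than the SDK's Content and Part), the idea would be to drop any content entry whose part list ends up empty after thought parts are stripped, so that no empty parts reach the API:

import java.util.List;
import java.util.stream.Collectors;

public class DropEmptyContentsSketch {

  record SimplePart(String text, boolean thought) {}
  record SimpleContent(String role, List<SimplePart> parts) {}

  // Remove content entries that no longer carry any parts after thought stripping.
  static List<SimpleContent> dropEmptyContents(List<SimpleContent> contents) {
    return contents.stream()
        .filter(c -> !c.parts().isEmpty())
        .collect(Collectors.toList());
  }

  public static void main(String[] args) {
    List<SimpleContent> afterStripping = List.of(
        new SimpleContent("user", List.of(new SimplePart("What is 2 + 2?", false))),
        new SimpleContent("model", List.of()));  // all parts were thoughts and got stripped
    System.out.println(dropEmptyContents(afterStripping));
  }
}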

In adk-python, if I look at the pre-processing that happens before the generate_content_stream call (https://github.com/google/adk-python/blob/4a842c5a1334c3ee01406f796651299589fe12ab/src/google/adk/models/google_llm.py#L149-L154):

    if stream:
      responses = await self.api_client.aio.models.generate_content_stream(
          model=llm_request.model,
          contents=llm_request.contents,
          config=llm_request.config,
      )

I see nothing removing thoughts in _preprocess_request (https://github.com/google/adk-python/blob/4a842c5a1334c3ee01406f796651299589fe12ab/src/google/adk/models/google_llm.py#L300-L326):

  async def _preprocess_request(self, llm_request: LlmRequest) -> None:

    if self._api_backend == GoogleLLMVariant.GEMINI_API:
      # Using API key from Google AI Studio to call model doesn't support labels.
      if llm_request.config:
        llm_request.config.labels = None

      if llm_request.contents:
        for content in llm_request.contents:
          if not content.parts:
            continue
          for part in content.parts:
            # Create copies to avoid mutating the original objects
            if part.inline_data:
              part.inline_data = copy.copy(part.inline_data)
              _remove_display_name_if_present(part.inline_data)
            if part.file_data:
              part.file_data = copy.copy(part.file_data)
              _remove_display_name_if_present(part.file_data)

    # Initialize config if needed
    if llm_request.config and llm_request.config.tools:
      # Check if computer use is configured
      for tool in llm_request.config.tools:
        if isinstance(tool, types.Tool) and tool.computer_use:
          llm_request.config.system_instruction = None
          await self._adapt_computer_use_tool(llm_request)

@OwenDavisBC force-pushed the ISSUE-510 branch 7 times, most recently from 516d136 to a3f0560 on October 27, 2025 17:12
@OwenDavisBC requested a review from @Poggecci on October 28, 2025 21:55
@OwenDavisBC force-pushed the ISSUE-510 branch 4 times, most recently from c1754f1 to eb26d88 on November 4, 2025 16:28