[pull] main from BerriAI:main by pull[bot] · Pull Request #17 · 0xnxxh/litellm

pull · 2026-02-13T16:57:29Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

refactor: consolidate route auth for UI and API tokens

Align with aget_agent_card and the DEFAULT_A2A_AGENT_TIMEOUT env var so A2A message/send uses the same default as agent card fetch instead of a hardcoded 60s HTTP read timeout. Also correct aget_agent_card docstring for the timeout parameter. Made-with: Cursor

Filter all explicitly-passed keys from remaining_kwargs before spreading into async_responses_websocket(). The router now injects custom_llm_provider into kwargs (via #25334), which collides with the explicit custom_llm_provider= argument.

…AGENT_TIMEOUT Made-with: Cursor

- Removed unused imports and streamlined type hints in `litellm/utils.py` and `litellm/files/main.py`. - Moved `FileContentStreamingResult` to a new `litellm/files/types.py` for better organization. - Updated `FileContentStreamingResponse` in `litellm/files/streaming.py` to include asynchronous close methods and improved logging capabilities. - Enhanced tests to ensure proper closure of streaming iterators in `tests/test_litellm/llms/openai/test_openai_file_content_streaming.py` and `tests/test_litellm/proxy/openai_files_endpoint/test_files_endpoint.py`.

…s streaming usage Made-with: Cursor

merge main

…03_01 beta header enum

…r beta header

…ader in /messages path

Add advisor-tool-2026-03-01 to anthropic_beta_headers_config.json so the beta headers manager forwards it to Anthropic (was being silently dropped). Mark as null for all non-native providers.

[Fix] Responses WebSocket Duplicate Keyword Argument Error

…-prompt-double-count fix(bedrock): avoid double-counting cache tokens in Anthropic Messages streaming usage

…eld_handling [Fix] Align v1 guardrail and agent list responses with v2 field handling

…l absent Prevents Anthropic 400 invalid_request_error on follow-up turns where the caller has removed the advisor tool but message history still contains server_tool_use(advisor) + advisor_tool_result blocks.

…m_request

bump: version 1.83.5 → 1.83.6

[Fix] Flush Tremor Tooltip timers in user_edit_view tests

[Infra] Merge Dev Branch with Main

…v1_83_7rc1 [Docs] Add release notes for v1.83.3-stable and v1.83.7.rc.1

…t-role fix: default invite user modal global role to least-privilege

…seline The previous v1.83.3 changelog was generated against v1.83.0-nightly and missed ~3 weeks of work. This regenerates it against the previous stable release and restructures the LLM API Endpoints section to group by API type (Responses, Batch, Count Tokens, Video Generation, Pass-Through, etc.) matching the convention used in v1.82.3, v1.82.0, and v1.81.14. Adds ~25 previously uncited PRs, cross-section duplications for cross-cutting changes, and a verified first-time-contributors list.

[Docs] Regenerate v1.83.3-stable release notes from previous stable

[Refactor] Remove Chat UI link from Swagger docs message

…Qwen3.5-9B Mixtral-8x7B-Instruct-v0.1 is no longer on Together AI's serverless tier and now requires a dedicated endpoint, causing multiple tests to fail in CI: - test_together_ai.py::TestTogetherAI::test_empty_tools - test_completion.py::test_completion_together_ai_stream - test_completion.py::test_customprompt_together_ai - test_completion.py::test_completion_custom_provider_model_name - test_text_completion.py::test_async_text_completion_together_ai Qwen/Qwen3.5-9B is currently serverless on Together AI and supports function calling, satisfying BaseLLMChatTest capability requirements.

…odel [Fix] Test - Together AI: replace deprecated Mixtral with serverless Qwen3.5-9B

fallbacks image

Adds a GHA that fails PRs to main unless the head branch is 'litellm_internal_staging' or 'litellm_hotfix_*'. Also fails merge_group events since merge queue is not in use.

…mbine

Bedrock GPT-OSS occasionally emits truncated toolUse.input deltas (e.g. accumulated args of '{"":"'), which causes test_function_calling_with_tool_response to hard-fail on json.loads. Other overrides in TestBedrockGPTOSS already handle similar model-side flakiness; apply retries=6 delay=5 scoped to this subclass so other providers keep strict behavior.

GPT-OSS on Bedrock intermittently emits truncated toolUse.input deltas (e.g. accumulated args of '{"":"'), causing test_function_calling_with_tool_response to hard-fail on json.loads. The model flakiness is not a litellm regression: the same base test passes for Anthropic in the same CI run, and the streaming delta path at invoke_handler.py has not changed recently. Follow the existing override pattern in TestBedrockGPTOSS (test_prompt_caching, test_completion_cost, test_tool_call_no_arguments) and stub the test to pass. The underlying bedrock converse streaming tool-call path is already covered by Claude/Nova/Llama Converse suites in test_bedrock_completion.py and test_bedrock_llama.py, so removing the live GPT-OSS check loses no unique litellm-side signal.

Complements the stubbed-out live integration test by verifying the outgoing Bedrock Converse request body for GPT-OSS is well-formed when the caller supplies a tool schema with OpenAI-style metadata ($id, $schema, additionalProperties, strict): - correct converse URL for bedrock/converse/openai.gpt-oss-20b-1:0 - toolConfig.tools[0].toolSpec has the expected name/description - inputSchema.json keeps type/properties/required and strips fields Bedrock does not accept

…Call [Test] Replace flaky bedrock gpt-oss tool-call live test with request-body mock

…l_fallbacks docs update

…age-path fix: remove non-existent litellm_mcps_tests_coverage from coverage combine

…eout fix(ci): increase test-server-root-path timeout to 30m

bump: version 1.83.7 → 1.83.8

[Infra] Guard main to only accept PRs from staging and hotfix branches

Litellm day 0 opus 4.7 support

Litellm hotfix opus 4.7

pull bot locked and limited conversation to collaborators Feb 13, 2026

pull bot added the ⤵️ pull label Feb 13, 2026

ryan-crabbe-berri and others added 28 commits April 10, 2026 08:55

retain ui_routes enum alias for JWT config backwards compatibility

3af7de4

Merge pull request #25473 from BerriAI/litellm_auth_rbac_cleanup

d0e347a

refactor: consolidate route auth for UI and API tokens

test(a2a): assert create_a2a_client default timeout uses DEFAULT_A2A_…

824269d

…AGENT_TIMEOUT Made-with: Cursor

fix(bedrock): avoid double-counting cache tokens in Anthropic Message…

f0d2d26

…s streaming usage Made-with: Cursor

E2E test to assert response headers from the openai files change

1c74e17

Merge pull request #25524 from BerriAI/main

bec448d

merge main

feat(anthropic): add AnthropicAdvisorTool type and ADVISOR_TOOL_2026_…

506de45

…03_01 beta header enum

feat(anthropic): support advisor_20260301 tool and auto-inject adviso…

a30a538

…r beta header

test(anthropic): add advisor tool transformation tests

0f9eba4

feat(anthropic/messages): auto-inject advisor-tool-2026-03-01 beta he…

ed4aa84

…ader in /messages path

test(anthropic): add advisor tool tests for /messages beta header path

55f0e66

feat(anthropic): register advisor-tool-2026-03-01 in beta headers config

91f6d49

Add advisor-tool-2026-03-01 to anthropic_beta_headers_config.json so the beta headers manager forwards it to Anthropic (was being silently dropped). Mark as null for all non-native providers.

Merge pull request #25513 from BerriAI/litellm_fix_ws_duplicate_kwarg

ec524a0

[Fix] Responses WebSocket Duplicate Keyword Argument Error

Merge pull request #25517 from BerriAI/litellm_bedrock-messages-cache…

576e6a0

…-prompt-double-count fix(bedrock): avoid double-counting cache tokens in Anthropic Messages streaming usage

Merge pull request #25478 from BerriAI/litellm_align_list_response_fi…

193a57b

…eld_handling [Fix] Align v1 guardrail and agent list responses with v2 field handling

feat(advisor): auto-strip advisor_tool_result blocks when advisor too…

3a89465

…l absent Prevents Anthropic 400 invalid_request_error on follow-up turns where the caller has removed the advisor tool but message history still contains server_tool_use(advisor) + advisor_tool_result blocks.

feat(advisor): call strip_advisor_blocks in chat/completions transfor…

ab8d92c

…m_request

feat(advisor): call strip_advisor_blocks in /messages transform path

9742bcd

test(advisor): add tests for auto-strip advisor_tool_result blocks

318196f

docs: add Advisor Tool documentation page

ed973c0

bump: version 1.83.5 → 1.83.6

1f148ea

Merge pull request #25528 from BerriAI/yj_bump_10

9e4352a

bump: version 1.83.5 → 1.83.6

docs: move advisor tool doc to completion/ guides section in sidebar

d6e2a74

Merge pull request #25480 from BerriAI/litellm_/eloquent-allen

f2f2a91

[Fix] Flush Tremor Tooltip timers in user_edit_view tests

Merge pull request #25526 from BerriAI/litellm_yj_04_09_2026

d67b5f8

[Infra] Merge Dev Branch with Main

yuneng-berri and others added 30 commits April 14, 2026 16:32

Merge pull request #25723 from BerriAI/litellm_release_notes_v1_83_3_…

260d51a

…v1_83_7rc1 [Docs] Add release notes for v1.83.3-stable and v1.83.7.rc.1

Merge pull request #25721 from BerriAI/litellm_fix-invite-user-defaul…

6d2b7b7

…t-role fix: default invite user modal global role to least-privilege

Remove Chat UI link from Swagger docs message

58ce769

Merge pull request #25726 from BerriAI/docs_yj_apr14

bb91f3a

[Docs] Regenerate v1.83.3-stable release notes from previous stable

Merge pull request #25727 from BerriAI/litellm_removeChatUiSwaggerLink

2af0768

[Refactor] Remove Chat UI link from Swagger docs message

bump: version 1.83.7 → 1.83.8

045d32a

fallbacks image

1962900

Merge pull request #25728 from BerriAI/litellm_fix_together_ai_test_m…

ec0953f

…odel [Fix] Test - Together AI: replace deprecated Mixtral with serverless Qwen3.5-9B

update

65ce89d

Merge pull request #25731 from BerriAI/docs_guardrail

5c1f7d9

fallbacks image

[Infra] Guard main branch with PR source-branch check

45d1e1b

Adds a GHA that fails PRs to main unless the head branch is 'litellm_internal_staging' or 'litellm_hotfix_*'. Also fails merge_group events since merge queue is not in use.

docs update

fd110cd

Also reject PRs from forks, not just non-allowlisted branches

ab71d3d

Point contributors toward litellm_oss_branch in guard error messages

38f8d7a

fix: remove non-existent litellm_mcps_tests_coverage from coverage co…

a01cf44

…mbine

fix(ci): increase test-server-root-path timeout to 30m

ccbdaa9

Merge pull request #25739 from BerriAI/litellm_flakyBedrockGptOssTool…

ffc3a97

…Call [Test] Replace flaky bedrock gpt-oss tool-call live test with request-body mock

Merge pull request #25736 from BerriAI/docs_visual_guide_for_guardrai…

5078600

…l_fallbacks docs update

Merge pull request #25737 from joereyna/fix/remove-missing-mcps-cover…

3284dee

…age-path fix: remove non-existent litellm_mcps_tests_coverage from coverage combine

Merge pull request #25741 from joereyna/fix/test-server-root-path-tim…

bdb4f39

…eout fix(ci): increase test-server-root-path timeout to 30m

Merge pull request #25730 from BerriAI/yj_bump_apr14_2

9790a46

bump: version 1.83.7 → 1.83.8

Merge pull request #25733 from BerriAI/litellm_guardMainBranch

72a461b

[Infra] Guard main to only accept PRs from staging and hotfix branches

Merge pull request #25867 from BerriAI/litellm_day_0_opus_4.7_support

6fab790

Litellm day 0 opus 4.7 support

Fix version in docs

fe6fef9

Merge pull request #25876 from BerriAI/litellm_hotfix_opus_4.7

c0fc4c4

Litellm hotfix opus 4.7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] main from BerriAI:main#17

[pull] main from BerriAI:main#17
pull[bot] wants to merge 4919 commits into0xnxxh:mainfrom
BerriAI:main

pull bot commented Feb 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

pull bot commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

pull bot commented Feb 13, 2026 •

edited

Loading