test(W-18772941): test against LLM Gateway #60

mdonnalley · 2025-06-16T21:42:59Z

What does this PR do?

Adds a script (yarn test:llmg) to test that tools are successfully invoked across multiple LLMs

Developers will need to set SF_LLMG_API_KEY to the api key, which will be stored in 1password

Developers will also need a llmg-test.yml (which is gitignored) to tell the test which models and prompts to use. For example,

models:
  - llmgateway__OpenAIGPT35Turbo_01_25
  - llmgateway__OpenAIGPT4OmniMini
  - llmgateway__BedrockAnthropicClaude4Sonnet

prompts:
  - What's my salesforce username?
  - List all my orgs
  # send multiple messages to the LLM
  - - I am a Salesforce developer. My current working directory is ~/my-project. My current org is [email protected].
      - Deploy my project

At this point the script is meant to be used during development to help write tool descriptions and parameters that are effective across multiple LLMs.

In the future we could possibly leverage this to write regression tests (although that would be expensive!)

What issues does this PR fix or reference?

@W-18772941@

…o mdonnalley/logging

mdonnalley and others added 30 commits June 10, 2025 14:30

feat: add telemetry

cc4eba1

Merge branch 'main' into mdonnalley/telemetry

1e8e9a2

fix: ensure all events have same props

e15d1e8

chore: clean up

5ba0e32

feat: add runtimeMs

4df91ce

fix: handle failed connection to appinsights

e5c062a

feat: logging

8e393fa

chore: extract method signatures

fe2d52b

chore: code review

5e85f75

Merge branch 'mdonnalley/telemetry' into mdonnalley/logging

35aad85

feat: add client info to telemetry events

c6d7f79

fix: init telemetry in catch

7eda5d1

fix: consolidate TOOL_CALLED and TOOL_ERROR

05d188b

Merge branch 'mdonnalley/telemetry' into mdonnalley/logging

6a2c895

feat: count tokens of each tool

024fa77

chore: clean up token count logging

a909d69

Merge branch 'main' into mdonnalley/logging

14ce216

Merge remote-tracking branch 'origin/main' into mdonnalley/logging

b7dc943

chore: bump core

c5bb910

fix: remove faulty token counting

e93b915

Merge branch 'mdonnalley/logging' of github.com:salesforcecli/mcp int…

fd6d746

…o mdonnalley/logging

fix: set SF_LOG_LEVEL when using --debug

91523e2

test: add testing against LLM Gateway

03b7a3d

Merge branch 'main' into mdonnalley/llmg

394cb50

chore: clean up

5cefe3d

test: list token counts

15aa440

test: make more developer friendly

d18028f

chore: clean up

bb3754a

chore: clean up

0b534d1

chore: clean up

4856d5b

mdonnalley added 4 commits June 17, 2025 13:31

chore: make tables prettier

3ce1fc8

test: allow longer chats

f66b793

test: clean up implementation

a6b3705

test: add --entry-point flag

6f86505

mdonnalley mentioned this pull request Jun 20, 2025

feat: add apex/agent testing capabilities to MCP @W-18609330@ #62

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

test(W-18772941): test against LLM Gateway #60

test(W-18772941): test against LLM Gateway #60

Uh oh!

mdonnalley commented Jun 16, 2025 •

edited

Loading

Uh oh!

Uh oh!

test(W-18772941): test against LLM Gateway #60

Are you sure you want to change the base?

test(W-18772941): test against LLM Gateway #60

Uh oh!

Conversation

mdonnalley commented Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

What issues does this PR fix or reference?

Uh oh!

Uh oh!

mdonnalley commented Jun 16, 2025 •

edited

Loading