Minimal edit to enable Deepseek prefilling #96
base: main
Conversation
```python
api_func = self.aclient.chat.completions.create
if model_id in {"deepseek-chat", "deepseek-reasoner"}:
    if prompt.is_last_message_assistant():
        self.aclient.base_url = "https://api.deepseek.com/beta"
```
Should you have an else here to swap back to the non-beta URL, or revert after the call succeeds, or something?
Good catch! Added in 71fe0d4
```python
original_base_url = self.aclient.base_url
try:
    if model_id in {"deepseek-chat", "deepseek-reasoner"}:
```
Do we have a DEEPSEEK_MODELS dict somewhere?
(or list)
We have one in safety-tooling/safetytooling/apis/inference/api.py, but importing it here would cause a circular import. Maybe we can create a constants file somewhere?
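If a shared constants module is introduced to break the import cycle, it could be as small as a frozenset plus a membership helper. A sketch; the module path and helper name here are hypothetical, not part of the PR:

```python
# e.g. safetytooling/constants.py (hypothetical module path)
DEEPSEEK_MODELS = frozenset({"deepseek-chat", "deepseek-reasoner"})

def is_deepseek_model(model_id: str) -> bool:
    """True when model_id is a known DeepSeek chat/reasoner model."""
    return model_id in DEEPSEEK_MODELS

print(is_deepseek_model("deepseek-chat"))  # True
print(is_deepseek_model("gpt-4o"))         # False
```

Both api.py and chat.py could then import from this module without depending on each other.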
```python
finally:
    # Always revert the base_url after the call
    self.aclient.base_url = original_base_url
```
one thought - could this have strange async race conditions :/
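The concern is real under asyncio: the swap and the revert straddle an await, so another task can observe (or clobber) the temporary base_url on the shared client. A minimal self-contained sketch with a stand-in client (no real network calls; the stub and model names are illustrative):

```python
import asyncio

BASE_URL = "https://api.deepseek.com"
BETA_URL = "https://api.deepseek.com/beta"

class StubClient:
    """Stand-in for a shared AsyncOpenAI client (hypothetical)."""
    def __init__(self):
        self.base_url = BASE_URL

async def request(client, model_id, needs_beta, seen):
    original = client.base_url
    if needs_beta:
        client.base_url = BETA_URL
    await asyncio.sleep(0)  # in a real client the URL is read after awaits
    seen.append((model_id, client.base_url))  # URL the request actually sees
    client.base_url = original  # the "revert" can clobber concurrent state

async def main():
    client = StubClient()
    seen = []
    await asyncio.gather(
        request(client, "deepseek-chat", False, seen),
        request(client, "deepseek-reasoner", True, seen),
    )
    return seen

seen = asyncio.run(main())
print(seen)  # both requests can end up seeing the wrong URL
```

Here the non-beta request observes the beta URL set by its neighbor, and the beta request observes the already-reverted base URL.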
Hmm, maybe the base URL should be passed directly to the api_func on a per-call basis (rather than set as an attribute of the entire class), since the class could be handling many requests with different models (and even different providers, if it were set up differently).
ah that's true, should we have an asyncio lock maybe?
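For reference, an asyncio.Lock held around the swap-call-revert sequence would make the mutation safe, but it forces every request that takes the lock to run one after another. A sketch with a stand-in client and a simulated network delay (names are illustrative):

```python
import asyncio
import time

BETA_URL = "https://api.deepseek.com/beta"

class StubClient:
    """Stand-in for a shared async client (hypothetical)."""
    def __init__(self):
        self.base_url = "https://api.deepseek.com"

async def call_with_beta(client, lock):
    # Holding the lock for the whole request makes the swap/revert safe,
    # but concurrent requests are now strictly serialized.
    async with lock:
        original = client.base_url
        client.base_url = BETA_URL
        try:
            await asyncio.sleep(0.05)  # simulated network latency
            return client.base_url     # URL the request actually used
        finally:
            client.base_url = original

async def main():
    client = StubClient()
    lock = asyncio.Lock()
    start = time.monotonic()
    urls = await asyncio.gather(*(call_with_beta(client, lock) for _ in range(3)))
    elapsed = time.monotonic() - start
    return urls, elapsed

urls, elapsed = asyncio.run(main())
print(urls)     # every call correctly saw the beta URL
print(elapsed)  # roughly 3 x 0.05s: the lock serialized the calls
```

Correctness is preserved, but three 0.05s calls take about 0.15s instead of 0.05s, which is the throughput cost raised below.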
> base url should be passed directly to the api_func on a call-wise basis

Unfortunately, the api_func doesn't accept a base URL. And I guess locking would harm concurrency.
Another (naive) approach is to re-instantiate the client (openai.AsyncClient) on every call.
Would you be against locking?
I think locking would hurt throughput, since the lock would be held until the async call completes, which would be bad. Perhaps you can override the URL via "extra_headers"?
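Another per-call option, if the SDK version in use supports it: openai-python clients expose with_options(), which returns a copy of the client with overrides (including base_url) rather than mutating shared state. The pattern, sketched with a stand-in client so it runs without the SDK installed (the stub mirrors the copy-with-overrides behavior; names are illustrative):

```python
import asyncio
from dataclasses import dataclass, replace

BETA_URL = "https://api.deepseek.com/beta"

@dataclass(frozen=True)
class StubClient:
    """Stand-in for an async API client (hypothetical)."""
    base_url: str = "https://api.deepseek.com"

    def with_options(self, **changes):
        # Mirrors openai-python's client.with_options(): return a copy
        # with overrides instead of mutating shared state.
        return replace(self, **changes)

    async def create(self, model_id):
        await asyncio.sleep(0)  # simulated network latency
        return (model_id, self.base_url)

async def main():
    shared = StubClient()
    prefill_client = shared.with_options(base_url=BETA_URL)
    return await asyncio.gather(
        prefill_client.create("deepseek-chat"),   # prefill request
        shared.create("deepseek-reasoner"),       # regular request
    )

results = asyncio.run(main())
print(results)
```

Because each request holds its own client copy, there is no shared base_url to race on and no lock is needed.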
I'm still a little worried about this. Can we just use "https://api.deepseek.com/beta" always? Then we can set it in api.py and remove all this internal logic for swapping between URLs.
what ended up happening here?
Summary
Reincorporating the edits made in #75 to allow prefilling deepseek models.
This was not handled in the latest update: running a request with a prefilled assistant message would throw an error asking to use is_prefix=True, and once that parameter was set, the code would ask for the beta URL.
Changes Introduced
In safetytooling/apis/inference/openai/chat.py: if the model is deepseek-reasoner or deepseek-chat, use the prompt.deepseek_format function to prepare the prompt.