-
-
Notifications
You must be signed in to change notification settings - Fork 353
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update model fields immediately on save #1125
Conversation
906887b
to
27116fc
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Tested the
deepcopy
change by removing the code for it and failingpytest
. Reinstating it works as intended. - Tested that the changes to model fields are saved and there is no need to restart Jupyter AI. The system logs report the change as intended. Resolves Issue Provider text fields values saved from Chat UI are only applied after Jupyter Lab is restarted. #1118 .
- Issue
profile_name
field ignored for Amazon Bedrock Chat Models in jupyter-ai Extension #1007 : used blankprofile_name
and new region and the saved model works well. - Issue Missing Request Schema for SageMaker Endpoint in jupyter-ai Extension #1006 (SageMaker endpoint request schema error). This still needs to be tried though there may not be JAI users who are calling SageMaker for the models.
- Issue api_version in GUI not considered, need to set OPENAI_API_VERSION #807 to be tested but no access to Azure. Hopefully @OliverKleinBST can test this.
I think this should go in, but I suspect that I do not have a way to test it (no access to Amazon models nor Azure which are the ones that are parametrized). Maybe it would be good to expose some model parameters (temperature) as fields so that this could be tested by folks easily? |
To clarify, I think this is not a result of this PR but since #711. Yet, it might be that fixing these together could be easier than separately. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
All LGTM.
@krassowski Thank you for the callout. I agree it would be good to also include a fix for completion fields before the next patch release. However, I think it should be done in a separate PR since it can be done independently. I can help with these changes. It shouldn't be too much work; the Proceeding to merge. Thank you @krassowski and @srdas for the testing & feedback! |
27116fc
to
5697888
Compare
5697888
to
61a4de7
Compare
I'm unlikely to find time this week, if you see how to fix it easily feel free to go ahead. Also, agree a separate PR would be better. |
@krassowski No worries! Take care. |
@meeseeksdev please backport to v3-dev |
Co-authored-by: david qiu <[email protected]>
Confirm that it solves also the problem reported in #807 |
* Backport PR #1049: Added new Anthropic Sonnet3.5 v2 models (#1050) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1051: Added Developer documentation for streaming responses (#1058) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1048: Implement streaming for `/fix` (#1059) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1057: [pre-commit.ci] pre-commit autoupdate (#1060) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Backport PR #1064: Added Ollama to the providers table in user docs (#1066) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1056: Add examples of using Fields and EnvAuthStrategy to developer documentation (#1073) Co-authored-by: Alan Meeson <[email protected]> * Backport PR #1069: Merge Anthropic language model providers (#1076) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1068: Allow `$` to literally denote quantities of USD in chat (#1079) Co-authored-by: david qiu <[email protected]> * Backport PR #1075: Fix magic commands when using non-chat providers w/ history (#1080) Co-authored-by: Alan Meeson <[email protected]> * Backport PR #1077: Fix `/export` by including streamed agent messages (#1081) Co-authored-by: Mahmut CAVDAR <[email protected]> * Backport PR #1072: Reduced padding in cell around code icons in code toolbar (#1084) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1087: Improve installation documentation and clarify provider dependencies (#1091) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1092: Remove retired models and add new `Haiku-3.5` model in Anthropic (#1093) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1094: Continue to allow `$` symbols to delimit inline math in human messages (#1095) Co-authored-by: david qiu <[email protected]> * Backport PR #1097: Update `faiss-cpu` version range (#1101) Co-authored-by: david qiu <[email protected]> * Backport PR #1104: Fix rendering of code blocks in JupyterLab 4.3.0+ (#1105) Co-authored-by: david qiu <[email protected]> * Backport PR #1106: Catch error on non plaintext files in `@file` and reply gracefully in chat (#1110) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1109: Bump LangChain minimum versions (#1112) Co-authored-by: david qiu <[email protected]> * Backport PR #1119: Downgrade spurious 'error' logs (#1124) Co-authored-by: ctcjab <[email protected]> * Backport PR #1127: Removes outdated OpenAI models and adds new ones (#1130) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1131: [pre-commit.ci] pre-commit autoupdate (#1132) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Backport PR #1125: Update model fields immediately on save (#1133) Co-authored-by: david qiu <[email protected]> * Backport PR #1139: Fix install step in CI (#1140) Co-authored-by: david qiu <[email protected]> * Backport PR #1129: Fix JSON serialization error in Ollama models (#1141) Co-authored-by: Mr.W <[email protected]> * Backport PR #1137: Update completion model fields immediately on save (#1142) Co-authored-by: david qiu <[email protected]> * [v3-dev] Initial migration to `jupyterlab-chat` (#1043) * Very first version of the AI working in jupyterlab_collaborative_chat * Allows both collaborative and regular chat to work with AI * handle the help message in the chat too * Autocompletion (#2) * Fix handler methods' parameters * Add slash commands (autocompletion) to the chat input * Stream messages (#3) * Allow for stream messages * update jupyter collaborative chat dependency * AI settings (#4) * Add a menu option to open the AI settings * Remove the input option from the setting widget * pre-commit * linting * Homogeneize typing for optional arguments * Fix import * Showing that the bot is writing (answering) (#5) * Show that the bot is writing (answering) * Update jupyter chat dependency * Some typing * Update extension to jupyterlab_chat (0.6.0) (#8) * Fix linting * Remove try/except to import jupyterlab_chat (not optional anymore), and fix typing * linter * Python unit tests * Fix typing * lint * Fix lint and mypy all together * Fix web_app settings accessor * Fix jupyter_collaboration version Co-authored-by: david qiu <[email protected]> * Remove unecessary try/except * Dedicate one set of chat handlers per room (#9) * create new set of chat handlers per room * make YChat an instance attribute on BaseChatHandler * revert changes to chat handlers * pre-commit * use room_id local var Co-authored-by: Nicolas Brichet <[email protected]> --------- Co-authored-by: Nicolas Brichet <[email protected]> --------- Co-authored-by: david qiu <[email protected]> Co-authored-by: david qiu <[email protected]> * Backport PR #1134: Improve user messaging and documentation for Cross-Region Inference on Amazon Bedrock (#1143) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR #1136: Add base API URL field for Ollama and OpenAI embedding models (#1149) Co-authored-by: Sanjiv Das <[email protected]> * [v3-dev] Remove `/export`, `/clear`, and `/fix` (#1148) * remove /export * remove /clear * remove /fix * Fix CI in `v3-dev` branch (#1154) * fix check release by bumping to impossible version * fix types * Update Playwright Snapshots --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * [v3-dev] Dedicate one LangChain history object per chat (#1151) * dedicate a separate LangChain history object per chat * pre-commit * fix mypy * Backport PR #1160: Trigger update snapshots based on commenter's role (#1161) Co-authored-by: david qiu <[email protected]> * Backport PR #1155: Fix code output format in IPython (#1162) Co-authored-by: Divyansh Choudhary <[email protected]> * Backport PR #1158: Update `/generate` to not split classes & functions across cells (#1164) Co-authored-by: Sanjiv Das <[email protected]> * Remove v2 frontend components (#1156) * First pass to remove the front end chat * Remove code-toolbar by using a simplified markdown renderer in settings * Remove chat-message-menu (should be ported in jupyter-chat) * Remove chat handler * Follow up 'Remove chat-message-menu (should be ported in jupyter-chat)' commit * Clean package.json * Remove UI tests * Remove the generative AI menu * Remove unused components * run yarn dedupe --------- Co-authored-by: David L. Qiu <[email protected]> * Upgrade to `jupyterlab-chat>=0.7.0` (#1166) * upgrade to jupyterlab-chat 0.7.0 * pre-commit * upgrade to @jupyter/chat ^0.7.0 in frontend * Remove v2 backend components (#1168) * remove v2 llm memory, implement ReplyStream * remove v2 websockets & REST handlers * remove unused v2 data models * fix slash command autocomplete * fix unit tests * remove unused _learned context provider * fix mypy * pre-commit * fix optional k arg in YChatHistory * bump jupyter chat to 0.7.1 to fix Python 3.9 tests * revert accidentally breaking /learn --------- Co-authored-by: Lumberbot (aka Jack) <[email protected]> Co-authored-by: Sanjiv Das <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Alan Meeson <[email protected]> Co-authored-by: Mahmut CAVDAR <[email protected]> Co-authored-by: ctcjab <[email protected]> Co-authored-by: Mr.W <[email protected]> Co-authored-by: Nicolas Brichet <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Divyansh Choudhary <[email protected]>
* add failing test that asserts fields are included in lm_provider_params * fix lm_provider_params prop to include fields * fix bug that writes to `self.settings["model_parameters"]` * add test capturing bug introduced by jupyterlab#421 * pre-commit
* Backport PR jupyterlab#1049: Added new Anthropic Sonnet3.5 v2 models (jupyterlab#1050) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1051: Added Developer documentation for streaming responses (jupyterlab#1058) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1048: Implement streaming for `/fix` (jupyterlab#1059) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1057: [pre-commit.ci] pre-commit autoupdate (jupyterlab#1060) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Backport PR jupyterlab#1064: Added Ollama to the providers table in user docs (jupyterlab#1066) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1056: Add examples of using Fields and EnvAuthStrategy to developer documentation (jupyterlab#1073) Co-authored-by: Alan Meeson <[email protected]> * Backport PR jupyterlab#1069: Merge Anthropic language model providers (jupyterlab#1076) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1068: Allow `$` to literally denote quantities of USD in chat (jupyterlab#1079) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1075: Fix magic commands when using non-chat providers w/ history (jupyterlab#1080) Co-authored-by: Alan Meeson <[email protected]> * Backport PR jupyterlab#1077: Fix `/export` by including streamed agent messages (jupyterlab#1081) Co-authored-by: Mahmut CAVDAR <[email protected]> * Backport PR jupyterlab#1072: Reduced padding in cell around code icons in code toolbar (jupyterlab#1084) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1087: Improve installation documentation and clarify provider dependencies (jupyterlab#1091) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1092: Remove retired models and add new `Haiku-3.5` model in Anthropic (jupyterlab#1093) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1094: Continue to allow `$` symbols to delimit inline math in human messages (jupyterlab#1095) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1097: Update `faiss-cpu` version range (jupyterlab#1101) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1104: Fix rendering of code blocks in JupyterLab 4.3.0+ (jupyterlab#1105) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1106: Catch error on non plaintext files in `@file` and reply gracefully in chat (jupyterlab#1110) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1109: Bump LangChain minimum versions (jupyterlab#1112) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1119: Downgrade spurious 'error' logs (jupyterlab#1124) Co-authored-by: ctcjab <[email protected]> * Backport PR jupyterlab#1127: Removes outdated OpenAI models and adds new ones (jupyterlab#1130) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1131: [pre-commit.ci] pre-commit autoupdate (jupyterlab#1132) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Backport PR jupyterlab#1125: Update model fields immediately on save (jupyterlab#1133) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1139: Fix install step in CI (jupyterlab#1140) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1129: Fix JSON serialization error in Ollama models (jupyterlab#1141) Co-authored-by: Mr.W <[email protected]> * Backport PR jupyterlab#1137: Update completion model fields immediately on save (jupyterlab#1142) Co-authored-by: david qiu <[email protected]> * [v3-dev] Initial migration to `jupyterlab-chat` (jupyterlab#1043) * Very first version of the AI working in jupyterlab_collaborative_chat * Allows both collaborative and regular chat to work with AI * handle the help message in the chat too * Autocompletion (jupyterlab#2) * Fix handler methods' parameters * Add slash commands (autocompletion) to the chat input * Stream messages (jupyterlab#3) * Allow for stream messages * update jupyter collaborative chat dependency * AI settings (jupyterlab#4) * Add a menu option to open the AI settings * Remove the input option from the setting widget * pre-commit * linting * Homogeneize typing for optional arguments * Fix import * Showing that the bot is writing (answering) (jupyterlab#5) * Show that the bot is writing (answering) * Update jupyter chat dependency * Some typing * Update extension to jupyterlab_chat (0.6.0) (jupyterlab#8) * Fix linting * Remove try/except to import jupyterlab_chat (not optional anymore), and fix typing * linter * Python unit tests * Fix typing * lint * Fix lint and mypy all together * Fix web_app settings accessor * Fix jupyter_collaboration version Co-authored-by: david qiu <[email protected]> * Remove unecessary try/except * Dedicate one set of chat handlers per room (jupyterlab#9) * create new set of chat handlers per room * make YChat an instance attribute on BaseChatHandler * revert changes to chat handlers * pre-commit * use room_id local var Co-authored-by: Nicolas Brichet <[email protected]> --------- Co-authored-by: Nicolas Brichet <[email protected]> --------- Co-authored-by: david qiu <[email protected]> Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1134: Improve user messaging and documentation for Cross-Region Inference on Amazon Bedrock (jupyterlab#1143) Co-authored-by: Sanjiv Das <[email protected]> * Backport PR jupyterlab#1136: Add base API URL field for Ollama and OpenAI embedding models (jupyterlab#1149) Co-authored-by: Sanjiv Das <[email protected]> * [v3-dev] Remove `/export`, `/clear`, and `/fix` (jupyterlab#1148) * remove /export * remove /clear * remove /fix * Fix CI in `v3-dev` branch (jupyterlab#1154) * fix check release by bumping to impossible version * fix types * Update Playwright Snapshots --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * [v3-dev] Dedicate one LangChain history object per chat (jupyterlab#1151) * dedicate a separate LangChain history object per chat * pre-commit * fix mypy * Backport PR jupyterlab#1160: Trigger update snapshots based on commenter's role (jupyterlab#1161) Co-authored-by: david qiu <[email protected]> * Backport PR jupyterlab#1155: Fix code output format in IPython (jupyterlab#1162) Co-authored-by: Divyansh Choudhary <[email protected]> * Backport PR jupyterlab#1158: Update `/generate` to not split classes & functions across cells (jupyterlab#1164) Co-authored-by: Sanjiv Das <[email protected]> * Remove v2 frontend components (jupyterlab#1156) * First pass to remove the front end chat * Remove code-toolbar by using a simplified markdown renderer in settings * Remove chat-message-menu (should be ported in jupyter-chat) * Remove chat handler * Follow up 'Remove chat-message-menu (should be ported in jupyter-chat)' commit * Clean package.json * Remove UI tests * Remove the generative AI menu * Remove unused components * run yarn dedupe --------- Co-authored-by: David L. Qiu <[email protected]> * Upgrade to `jupyterlab-chat>=0.7.0` (jupyterlab#1166) * upgrade to jupyterlab-chat 0.7.0 * pre-commit * upgrade to @jupyter/chat ^0.7.0 in frontend * Remove v2 backend components (jupyterlab#1168) * remove v2 llm memory, implement ReplyStream * remove v2 websockets & REST handlers * remove unused v2 data models * fix slash command autocomplete * fix unit tests * remove unused _learned context provider * fix mypy * pre-commit * fix optional k arg in YChatHistory * bump jupyter chat to 0.7.1 to fix Python 3.9 tests * revert accidentally breaking /learn --------- Co-authored-by: Lumberbot (aka Jack) <[email protected]> Co-authored-by: Sanjiv Das <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Alan Meeson <[email protected]> Co-authored-by: Mahmut CAVDAR <[email protected]> Co-authored-by: ctcjab <[email protected]> Co-authored-by: Mr.W <[email protected]> Co-authored-by: Nicolas Brichet <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Divyansh Choudhary <[email protected]>
Description
profile_name
field ignored for Amazon Bedrock Chat Models in jupyter-ai Extension #1007.Fixes usage of model fields by respecting the updated values immediately on save (without requiring a restart).
Model fields are used to specify provider-specific keyword arguments directly from Jupyter AI's Settings UI. These include
Base API URL
for OpenAI,Region name
for Amazon Bedrock, etc.Demo
Screen.Recording.2024-11-27.at.10.33.12.AM.mov
Details for contributors
Fixes a bug introduced by Setting default model providers #421:
BaseProvider.get_model_parameters()
would return the fields set by the existingconfig.json
on init, instead of the user-specified overrides in theAiExtension.model_parameters
trait.ConfigManager
accidentally merging the existing config intoself.settings["model_parameters"]
instead of a deep copy of that dictionary.Fixes a bug introduced by Distinguish between completion and chat models #711: Model fields were not being returned at all from the dictionary returned by
ConfigManager.lm_provider_params
.config.json
on init still got passed to the LLM due to the bug introduced by Setting default model providers #421. Essentially, the bug introduced by Setting default model providers #421 "covered up" the bug introduced by Distinguish between completion and chat models #711. This explains why the issue was difficult to reproduce and why a restart fixed it.Makes the
allowed_providers
,blocked_providers
,allowed_models
,blocked_models
arguments toConfigManager
optional and default toNone
, as indicated by their type signatures.Adds unit test coverage for both aforementioned bugs.
Testing instructions
config.py
:deepcopy(self._defaults.get(config_key))
toself._defaults.get(config_key)
near line 240.**fields,
near line 475.pytest
, and verify that the 2 added tests fail. This asserts that the tests actually capture the bugs introduced by Setting default model providers #421 and Distinguish between completion and chat models #711.pytest
, and verify the 2 added tests now pass.