Add execution frequency or delay when executing workflow in batches #3327
Replies: 2 comments 4 replies
-
To add to this: Generally i would want to set a rate limit per api call. I played around with embedding model from Cohere, where i would give it a longer file. Cohere allows you 100 api calls per minute with a trial key. In this case the embedding slammed the api, till it hit the limit and then lost all progress in the process. And no, this is not a case of pay for more. |
Beta Was this translation helpful? Give feedback.
-
Adding a +1 here, this is absolutely necessary to prevent overloading APIs. |
Beta Was this translation helpful? Give feedback.
-
Self Checks
Dify version
0.6.1
Cloud or Self Hosted
Self Hosted (Docker)
Steps to reproduce
When run workflow in batches, sometime the LLM query will reach the frequency usage if I put multi LLM node in workflow.
The idea is if can add a config for batch run and setting up the delay in every run.
✔️ Expected Behavior
The workflow run can support sequence or parallel mode , and support delay if the LLM api is lower query frequency.
❌ Actual Behavior
Currently , the run mode will reach limitation of query frequency on some LLM , like 01Yi, etc.
Beta Was this translation helpful? Give feedback.
All reactions