Add execution frequency or delay when executing workflow in batches #3327

Semonxue · 2024-04-10T14:12:59Z

Self Checks

This is only for bug reports; if you would like to ask a question, please head to Discussions.
I have searched for existing issues, including closed ones.
I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
Please do not modify this template :) and fill in all the required fields.

Dify version

0.6.1

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

When running a workflow in batches, the LLM queries sometimes hit the provider's rate limit if the workflow contains multiple LLM nodes.
The idea is to add a batch-run setting that configures a delay between runs.

✔️ Expected Behavior

Batch workflow runs should support a sequential or parallel mode, plus a configurable delay for LLM APIs that allow only a low query rate.

❌ Actual Behavior

Currently, batch runs hit the query rate limit on some LLM providers, such as 01Yi.
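
For illustration, the requested behavior could look roughly like the sketch below. This is not Dify code: `run_batch`, `run_one`, `max_concurrency`, and `delay_seconds` are hypothetical names, and `run_one` stands in for whatever executes a single workflow run.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def run_batch(
    items: Iterable[T],
    run_one: Callable[[T], R],  # hypothetical: executes one workflow run
    max_concurrency: int = 1,
    delay_seconds: float = 0.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[R]:
    """Run `run_one` on every item and return results in input order.

    max_concurrency=1 gives sequential mode; larger values allow that many
    runs in parallel. delay_seconds is the pause before each submission
    after the first, so run starts are spaced out.
    """
    if max_concurrency < 1:
        raise ValueError("max_concurrency must be at least 1")
    futures = []
    with ThreadPoolExecutor(max_workers=max_concurrency) as pool:
        for index, item in enumerate(items):
            if index > 0 and delay_seconds > 0:
                sleep(delay_seconds)  # throttle how fast runs are started
            futures.append(pool.submit(run_one, item))
    # Leaving the `with` block waits for all runs to finish.
    return [future.result() for future in futures]
```

With `max_concurrency=1` and `delay_seconds=1.0`, runs execute one at a time and start at least a second apart; raising `max_concurrency` keeps the spacing but lets slow runs overlap.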

Selene29 · 2024-04-14T20:05:06Z

To add to this:

Generally, I would want to set a rate limit per API call (globally and/or locally).

I played around with an embedding model from Cohere and gave it a longer file.

Cohere allows 100 API calls per minute with a trial key.

In this case, the embedding job hammered the API until it hit the limit, and then lost all progress.

And no, this is not a case of "pay for more".
I don't need the results instantly; I can wait a while for the task to finish.
Even a delay of 1 second would already solve the problem.
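
A minimal sketch of that idea, assuming a per-call limiter plus a results dictionary that survives a failure. `MinIntervalLimiter`, `embed_all`, and `embed_one` are hypothetical names; `embed_one` stands in for the real embedding request and no actual Cohere or Dify API is shown.

```python
import time
from typing import Callable, Dict, Optional, Sequence


class MinIntervalLimiter:
    """Space calls so consecutive calls start at least 60/calls_per_minute seconds apart."""

    def __init__(
        self,
        calls_per_minute: float,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        if calls_per_minute <= 0:
            raise ValueError("calls_per_minute must be positive")
        self.interval = 60.0 / calls_per_minute
        self._clock = clock
        self._sleep = sleep
        self._next_allowed: Optional[float] = None

    def wait(self) -> None:
        """Block until the next call is allowed, then reserve the following slot."""
        now = self._clock()
        if self._next_allowed is not None and now < self._next_allowed:
            self._sleep(self._next_allowed - now)
            now = self._clock()  # re-read in case the sleep overshot
        self._next_allowed = now + self.interval


def embed_all(
    chunks: Sequence[str],
    embed_one: Callable[[str], object],  # hypothetical: one embedding API call
    limiter: MinIntervalLimiter,
    done: Optional[Dict[int, object]] = None,
) -> Dict[int, object]:
    """Embed chunks one at a time, skipping indexes already present in `done`."""
    done = {} if done is None else done
    for index, chunk in enumerate(chunks):
        if index in done:
            continue  # finished in an earlier attempt
        limiter.wait()
        done[index] = embed_one(chunk)
    return done
```

For the 100 calls per minute mentioned above, `MinIntervalLimiter(100)` spaces calls 0.6 seconds apart. If `embed_one` raises partway through, the caller's `done` dictionary still holds the finished chunks, so a retry with the same dictionary resumes instead of starting over.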

4 replies

FarVision2 Apr 14, 2024

As an aside, embeddings take almost no processing power. I use nomic-embed-text as well as mxbai-embed-large, with Ollama. The embeds pass through practically faster than you can read the logs. It is much wiser to spend your API calls on generation and tooling.

Selene29 Apr 15, 2024

Yeah, no contest from me.
Except Cohere is also free as long as you don't hammer their API.
That is the main point.

And currently I can't control this.

Semonxue Apr 15, 2024
Author

A rate limit is a good idea, but it would be even better if it also applied to batch runs. I mean a limit on how many jobs run at the same time in batch mode.

StreamlinedStartup Aug 16, 2024

Many API calls have rate limits, and the Dify workflow tool is just way too fast when running in batch mode. I couldn't find any setting anywhere to limit this.

StreamlinedStartup · 2025-02-21T17:11:58Z

Adding a +1 here, this is absolutely necessary to prevent overloading APIs.

0 replies
