feat: add workflow schedule trigger support #24428

ACAne0320 · 2025-08-24T23:20:02Z

Summary

This PR implements workflow schedule trigger support for Dify, enabling users to execute workflows automatically based on cron expressions with timezone support.

Features Implemented

1. Schedule Trigger Node

New trigger node type trigger-schedule for workflow configuration
Support for standard cron expressions (e.g., 0 9 * * 1-5 for weekdays at 9 AM)
Passes current time as input to triggered workflows
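
To make the cron example concrete, here is a minimal stdlib-only sketch of how a five-field expression such as `0 9 * * 1-5` can be checked against a point in time. It is not the PR's implementation: the helper names `cron_matches` and `_field_matches` are hypothetical, only `*`, single values, ranges, and comma lists are handled, and all five fields are simply AND-ed together.

```python
from datetime import datetime


def _field_matches(field: str, value: int) -> bool:
    """Check one cron field; supports '*', 'a', 'a-b' and comma lists."""
    for part in field.split(","):
        if part == "*":
            return True
        low_text, _, high_text = part.partition("-")
        low = int(low_text)
        high = int(high_text) if high_text else low
        if low <= value <= high:
            return True
    return False


def cron_matches(expr: str, moment: datetime) -> bool:
    """Return True if `moment` falls on the schedule described by `expr`."""
    minute, hour, day, month, weekday = expr.split()
    # Cron counts Sunday as 0, while isoweekday() counts Sunday as 7.
    cron_weekday = moment.isoweekday() % 7
    return (
        _field_matches(minute, moment.minute)
        and _field_matches(hour, moment.hour)
        and _field_matches(day, moment.day)
        and _field_matches(month, moment.month)
        and _field_matches(weekday, cron_weekday)
    )


# 2025-08-25 is a Monday, 2025-08-24 a Sunday.
print(cron_matches("0 9 * * 1-5", datetime(2025, 8, 25, 9, 0)))
print(cron_matches("0 9 * * 1-5", datetime(2025, 8, 24, 9, 0)))
```

A production scheduler would normally rely on a dedicated cron library and apply the schedule's timezone before matching.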

2. Custom Celery Beat Scheduler

Database-driven dynamic scheduler replacing default Beat scheduler
High availability support using PostgreSQL's SELECT FOR UPDATE SKIP LOCKED
Configurable tick intervals and batch processing
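
The idea behind `SELECT FOR UPDATE SKIP LOCKED` is that several scheduler instances can each claim a batch of due rows without blocking on, or double-processing, rows another instance already holds. The sketch below is only an in-memory analogy using non-blocking locks, not PostgreSQL and not the PR's code. `ScheduleRow` and `claim_due_rows` are hypothetical names.

```python
import threading


class ScheduleRow:
    """A due schedule plan; the lock stands in for a database row lock."""

    def __init__(self, schedule_id: str) -> None:
        self.schedule_id = schedule_id
        self.lock = threading.Lock()


def claim_due_rows(rows: list, batch_size: int) -> list:
    """Claim up to `batch_size` rows, skipping any row that is already locked."""
    claimed = []
    for row in rows:
        if len(claimed) == batch_size:
            break
        # Non-blocking acquire: a held row is skipped rather than waited on.
        if row.lock.acquire(blocking=False):
            claimed.append(row)
    return claimed


rows = [ScheduleRow(f"plan-{i}") for i in range(4)]
first = claim_due_rows(rows, batch_size=2)   # first scheduler instance
second = claim_due_rows(rows, batch_size=2)  # second instance, rows still held
print([r.schedule_id for r in first], [r.schedule_id for r in second])
```

In the database version the locks are released when each instance commits its transaction, after it has updated the rows it claimed.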

3. Automatic Schedule Management

Schedule plan is automatically created/updated when a workflow is published
Schedule plan is deleted when the workflow has no schedule trigger node
One-to-one mapping between app and schedule plan

4. Performance Optimizations

Batch database updates to reduce round trips

How to Test

1. Migrate database

 uv run flask db upgrade

2. Setup Schedule Trigger in Workflow:

Create a new workflow
Add a "Schedule Trigger" node as the start node
Configure cron expression
Add workflow logic after the trigger
Publish the workflow

3. Start the Scheduler Service:

# Start celery worker
dev/start-worker

# Start celery beat
dev/start-beat

4. Verify Execution:

Check workflow run history in the UI

TODO

Schedule Trigger

Use UUIDv7 as PK for workflow_schedule_plans table
Use beat_schedule instead of Custom Scheduler
Adjust workflow_schedule_plans table structure
Stop dispatching tasks when the daily limit is reached
Add a check when exporting DSL files
Add a failure log when the daily limit is reached

Test

Unit Tests
Integration Tests

Checklist

This change requires a documentation update, included: Dify Document
I understand that this PR may be closed in case there was no previous discussion or issues. (This doesn't apply to typos!)
I've added a test for each change that was introduced, and I tried as much as possible to make a single atomic change.
I've updated the documentation accordingly.
I ran dev/reformat(backend) and cd web && npx lint-staged(frontend) to appease the lint gods

Copilot

Pull Request Overview

This PR implements workflow schedule trigger support for Dify, enabling automatic workflow execution based on cron expressions with timezone support. The implementation includes a new trigger node type, custom Celery Beat scheduler, and automatic schedule management.

Database-driven dynamic scheduler using PostgreSQL for real-time schedule updates and high availability
New schedule trigger node supporting both visual and cron-based scheduling configurations
Automatic schedule lifecycle management tied to workflow publishing

Reviewed Changes

Copilot reviewed 18 out of 19 changed files in this pull request and generated 6 comments.

File	Description
docker/docker-compose.yaml	Adds workflow_scheduler service configuration
dev/start-workflow-scheduler	Development script for starting the workflow scheduler
dev/start-worker	Updates worker queues to include schedule queue
api/tasks/workflow_schedule_tasks.py	Celery task for executing scheduled workflow triggers
api/services/workflow_service.py	Integrates schedule sync during workflow publishing
api/services/workflow/schedule_sync.py	Service for syncing schedule configuration from workflow graph
api/services/workflow/schedule_manager.py	Core schedule management service with CRUD operations
api/schedule/schedule_dispatch.py	Custom Celery Beat scheduler with database-driven scheduling
api/models/workflow.py	WorkflowSchedulePlan model definition
api/migrations/versions/2025_08_24_1313-1e06b2654c6c_add_workflow_schedule_plan.py	Database migration for schedule plan table
api/core/workflow/nodes/trigger_schedule/	Trigger schedule node implementation
api/core/workflow/nodes/node_mapping.py	Registers new trigger schedule node type
api/core/workflow/nodes/enums.py	Adds TRIGGER_SCHEDULE node type enum

Yeuoly · 2025-09-01T10:14:00Z

dev/start-beat

+  echo "Examples:"
+  echo "  $0"
+  echo "  $0 --loglevel DEBUG"
+  echo "  $0 --scheduler django_celery_beat.schedulers:DatabaseScheduler"


What is django_celery?

This script doesn't use django_celery.
This example follows the official Celery documentation's instructions for custom scheduler classes:
https://docs.celeryq.dev/en/stable/userguide/periodic-tasks.html#using-custom-scheduler-classes

vincentx11 · 2025-09-01T06:09:21Z

api/schedule/workflow_schedule_task.py

+        )
+
+        if next_run_at:
+            schedule.next_run_at = next_run_at


What if we take this operation after the execution of the scheduled async task?

Consider the following scenario for reference:

The schedule poller runs every minute.

The schedule trigger of a certain workflow is set to every 5 minutes.

For some unspecified reason, the workflow task takes 15 minutes to complete during a certain period.

There will be two different outcomes depending on where this code is placed:

If we reset next_run_at before the task execution, the task will run at least 3 times within 15 minutes, with 1 run completing successfully and the other 2 remaining in a running state.

If we reset next_run_at after the task execution, the task will only run once within 15 minutes.

Given the business scenario, I recommend adopting the second outcome. By the way, the above case isn't the worst-case scenario. The workflow task might also hang for an extended period. If this happens, the more times the task runs, the higher the load on the service will be, which could eventually make the whole service unhealthy.

Thank you for raising this question.

Under the current architecture, regardless of where next_run_at is updated, the task will execute 3 times within 15 minutes.
The reason for this is that the execution of trigger_workflow_async() is non-blocking.

Our scheduling system is intentionally designed to be fully decoupled:

Scheduler: Responsible for triggering tasks based on cron expressions

Executor: Handles workflow execution asynchronously via the Unified Trigger Entry

Non-blocking: trigger_workflow_async() immediately returns after successfully entering the queue without waiting for completion
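
The "3 times within 15 minutes" figure follows from the polling arithmetic alone. The sketch below is a simplified simulation under assumed parameters (a 1-minute poller, a fixed 5-minute interval instead of a real cron expression); `simulate_dispatches` is a hypothetical helper, not code from this PR. Task duration is deliberately not a parameter, because enqueueing returns immediately.

```python
def simulate_dispatches(poll_every: int, run_every: int, window: int) -> list:
    """Return the minutes at which the poller enqueues a run within the window."""
    dispatched_at = []
    next_run_at = 0
    for now in range(0, window, poll_every):
        if now >= next_run_at:
            # Enqueueing is non-blocking, so a slow workflow never delays this loop.
            dispatched_at.append(now)
            next_run_at = now + run_every
    return dispatched_at


print(simulate_dispatches(poll_every=1, run_every=5, window=15))  # [0, 5, 10]
```

Whether next_run_at is written before or after the enqueue call does not change the result in this model, since neither position waits for the workflow to finish.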

While your concern about multiple executions is valid from a scheduling perspective, our architecture has a critical safety mechanism that prevents the system overload you're worried about: the workflow execution queue.

When trigger_workflow_async() is called, it doesn't immediately execute the workflow. Instead:

Creates a trigger log entry with status QUEUED

Dispatches to tier-specific queues (professional/team/sandbox)

Returns immediately without waiting

The actual workflow execution happens in dedicated worker pools with concurrency limits per workspace.

The flow looks like this:
Schedule Trigger → Unified Trigger Entry → Workflow Queue → [Concurrency Control] → Actual Execution

That sounds interesting. With this solution, whether tasks of a specified workflow can run in parallel (with multiple instances over a period) is controlled by the workflow itself—specifically through the "Parallel Setting" in the trigger, which wasn't mentioned in this feature scenario.

This is acceptable for now. However, looking ahead, I strongly recommend adding a Parallel Setting to the trigger node with three modes:

Parallel – this is the current solution

Serial Waiting – only one task instance exists per workflow; new tasks wait if another is already running

Serial Rejection – new tasks get rejected if one is already running, ensuring only one instance at a time

Following the design approach you described, if we considered this solution, this check could be implemented before trigger_workflow_async within run_schedule_trigger.
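
As a rough illustration of the three proposed modes, the check could reduce to a small decision function evaluated before dispatch. Everything here is hypothetical: `ParallelMode`, `Decision`, and `decide` do not exist in the PR, and how "an instance is already running" would be detected is left open.

```python
from enum import Enum


class ParallelMode(Enum):
    PARALLEL = "parallel"                  # current behaviour: always dispatch
    SERIAL_WAITING = "serial_waiting"      # queue behind the running instance
    SERIAL_REJECTION = "serial_rejection"  # drop the run if one is in flight


class Decision(Enum):
    DISPATCH = "dispatch"
    WAIT = "wait"
    REJECT = "reject"


def decide(mode: ParallelMode, has_running_instance: bool) -> Decision:
    """Decide what to do with a newly due run under the given mode."""
    if mode is ParallelMode.PARALLEL or not has_running_instance:
        return Decision.DISPATCH
    if mode is ParallelMode.SERIAL_WAITING:
        return Decision.WAIT
    return Decision.REJECT


print(decide(ParallelMode.SERIAL_REJECTION, has_running_instance=True))
```

Under this sketch, only a `DISPATCH` decision would proceed to the enqueue call. `WAIT` and `REJECT` would need their own handling, such as deferring the run or recording a skipped run in the trigger log.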

I'm glad you were able to provide this solution.
Maybe we need to discuss with the Dify team whether we should add this feature to the schedule trigger.
I'll get back to you if there's any result.

vincentx11 · 2025-09-08T03:04:48Z

api/tasks/workflow_schedule_tasks.py

+            current_utc = datetime.now(UTC)
+            schedule_tz = ZoneInfo(schedule.timezone) if schedule.timezone else UTC
+            current_in_tz = current_utc.astimezone(schedule_tz)
+            inputs = {"current_time": current_in_tz.isoformat()}
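
For readers who want to try the quoted lines in isolation, here is a self-contained sketch of the same conversion. The wrapper `build_inputs` and its `now_utc` parameter are assumptions added for testability, and `timezone.utc` is used in place of `UTC` so the snippet also runs on Python versions before 3.11.

```python
from datetime import datetime, timezone
from typing import Optional
from zoneinfo import ZoneInfo


def build_inputs(schedule_timezone: Optional[str], now_utc: Optional[datetime] = None) -> dict:
    """Build the workflow inputs, expressing the current time in the schedule's timezone."""
    current_utc = now_utc or datetime.now(timezone.utc)
    # Fall back to UTC when the schedule has no timezone configured.
    schedule_tz = ZoneInfo(schedule_timezone) if schedule_timezone else timezone.utc
    current_in_tz = current_utc.astimezone(schedule_tz)
    return {"current_time": current_in_tz.isoformat()}


fixed = datetime(2025, 9, 8, 3, 4, 48, tzinfo=timezone.utc)
print(build_inputs(None, fixed))  # {'current_time': '2025-09-08T03:04:48+00:00'}
```

Passing an IANA name such as `"Asia/Shanghai"` requires time zone data to be available on the host.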


Additionally, I have a question regarding current_time—the only parameter that the trigger node passes to the workflow. This seems more like a discussion at the requirement level rather than a technical communication. If this is not the appropriate place to discuss this, please point me to the right channel.

I’m curious about how we handle the input variables defined in the original Start node. Given that the trigger does not add these user inputs to the variable_pool, should the subsequent nodes that depend on these inputs for execution throw an error, or should they not depend on these input variables in the first place?

Furthermore, if the business scenario requires default values for the variables defined in the "Start" node during scheduled runs, how should I implement this under the current requirement design? Are there any practices or solutions I can refer to?

The schedule trigger is an entry-type node rather than a time-based call to the Start node. Each entry branch can have only a single entry node. We now call these nodes "start kind" nodes, and the original Start node has been renamed to the User Input node.

The variables you define in User Input will never appear in the same entry branch, so never mind them.

Thank you for your explanation, and sorry for the late reply.

This is acceptable from the perspective of decoupled design. However, what if I want to reuse existing workflows that depend on the variables from the User Input node? Specifically, how can I enable the schedule trigger feature for existing workflows created prior to this version? What is the recommended solution for this scenario?

I think you might be able to achieve this by adding some Environment Variables.

Yeuoly

LGTM

feat: add workflow schedule trigger support

e0dc4be

Copilot AI review requested due to automatic review settings August 24, 2025 23:20

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. 💪 enhancement New feature or request 📚 documentation Improvements or additions to documentation labels Aug 24, 2025

[autofix.ci] apply automated fixes

081c61c

Copilot AI reviewed Aug 24, 2025

Copilot AI reviewed Aug 25, 2025

Yeuoly reviewed Aug 25, 2025

Merge remote-tracking branch 'upstream/feat/trigger' into feat/schedu…

b7a3393

…le-trigger

ACAne0320 marked this pull request as draft August 27, 2025 16:25

ACAne0320 added 11 commits August 28, 2025 01:42

refactor: use beat_schedule instead of custom schedule

55272f1

Merge remote-tracking branch 'upstream/feat/trigger' into feat/schedu…

3be64ac

…le-trigger

refactor: update workflow_schedule_plans table structrue & add daily …

9ff8318

…limit checked before dispatching tasks

feat: sync workflow schedule params when publishing workflow

57502a4

chore: internal func move to ScheduleService

aaa2c74

feat(test): add ScheduleService unit tests

3b9d3a3

chore: update schedule node default config & remove enable param

29fa80f

chore: remove unused import and clean up comments in ScheduleService

5b49cda

refactor: simplify schedule polling logic and reduce code complexity

7f4c403

chore: reformat

2bab93b

feat: add start-beat

f0ac008

ACAne0320 marked this pull request as ready for review August 29, 2025 14:53

ACAne0320 requested a review from Yeuoly August 29, 2025 14:53

dosubot bot added the 🌊 feat:workflow Workflow related stuff. label Aug 29, 2025

ACAne0320 added 2 commits August 30, 2025 02:25

Merge remote-tracking branch 'upstream/feat/trigger' into feat/schedu…

9bb0646

…le-trigger

Merge remote-tracking branch 'upstream/feat/trigger' into feat/schedu…

5ccca1e

…le-trigger

Yeuoly requested changes Sep 1, 2025

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Sep 2, 2025

ACAne0320 force-pushed the feat/schedule-trigger branch from b1ce7cc to 5ccca1e Compare September 2, 2025 04:13

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Sep 2, 2025

ACAne0320 added 7 commits September 3, 2025 00:45

refactor: use Pydantic.Basemodel handling schedule plan

66591bc

feat: add ScheduleNodeError handling & convert def that do not refere…

8882df7

…nce ScheduleService to utils

refactor: simplify workflow schedule processing and remove pre-check …

3a3178a

…rate limit logic

refactor: convert def that do not reference ScheduleService to utils …

3e0414e

…& use Pydantic.Basemodel handling schedule plan

chore: update schedule_service unit_tests

04994d6

Merge remote-tracking branch 'upstream/feat/trigger' into feat/schedu…

d77cb4c

…le-trigger

refactor: update convert_12h_to_24h to raise ValueErrors for invalid …

38ffd95

…inputs

ACAne0320 requested a review from Yeuoly September 2, 2025 17:20

vincentx11 reviewed Sep 3, 2025

ACAne0320 added 7 commits September 3, 2025 23:14

Merge remote-tracking branch 'upstream/feat/trigger' into feat/schedu…

8040580

…le-trigger

feat: add VisualConfig & ScheduleExecutionError class

a7fbc9f

chore: replace datetime.now() with naive_utc_now()

80b28e7

chore: enhance schedule update logic and error handling

42fa1c4

chore: improve error handling and logging in run_schedule_trigger func

bce2674

chore: enhance visual_to_cron method

30bb55d

feat: enhance visual_to_cron method with VisualConfig integration and…

22f3f2a

… additional test cases

Yeuoly reviewed Sep 5, 2025

vincentx11 reviewed Sep 8, 2025

ACAne0320 added 3 commits September 9, 2025 03:11

Merge remote-tracking branch 'upstream/feat/trigger' into feat/schedu…

a00435c

…le-trigger

feat: improve schedule dispatch with streaming and parallel processing

30e562d

chore: remove redundant return statements in account retrieval and sc…

3500d47

…hedule config extraction

ACAne0320 requested a review from Yeuoly September 8, 2025 19:17

Yeuoly approved these changes Sep 10, 2025

dosubot bot added the lgtm This PR has been approved by a maintainer label Sep 10, 2025

Yeuoly merged commit 4a743e6 into langgenius:feat/trigger Sep 10, 2025
