Feat/exp backend comparison by catebros · Pull Request #3 · stride-research/flowgentic_benchmarking

catebros · 2026-03-28T00:07:27Z

Depends on Flowgentic Hotfix/benchmarking-redesign PR (For Academy and Autogen implementation and event model)


gemini-code-assist

Code Review

This pull request introduces a new BackendComparison experiment to benchmark agent orchestration frameworks, including LangGraph, AutoGen, and Academy, against backend engines like AsyncFlow and Parsl. The update adds new workload implementations, a comprehensive plotting utility, and refined dependency management. Review feedback identifies several issues: a potential crash in the Discord notification function due to missing environment variable validation, unreachable code in the notification utility, a logic error in invocation count reporting that could lead to misleading metrics, and the inclusion of an unused schema file.

gemini-code-assist · 2026-03-28T00:10:19Z

benchmark/data_generation/experiments/backend_comparison/main.py

+	webhook_url = os.getenv("DISCORD_WEBHOOK")
+	data = {"content": msg}
+	requests.post(webhook_url, json=data)


The send_discord_notifaction function does not handle the case where the DISCORD_WEBHOOK environment variable is not set. If os.getenv("DISCORD_WEBHOOK") returns None, requests.post(None, ...) will raise an exception, crashing the program. Please add a check for the webhook URL.

This function also duplicates logic from data_generation/utils/io_utils.py. Consider refactoring to use a single notification utility.

	webhook_url = os.getenv("DISCORD_WEBHOOK")
	if webhook_url:
		data = {"content": msg}
		try:
			requests.post(webhook_url, json=data, timeout=5).raise_for_status()
		except requests.exceptions.RequestException as e:
			logger.error(f"Failed to send Discord notification: {e}")
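The refactoring suggested above (one notification utility shared by main.py and io_utils.py) could look roughly like the sketch below. The function name, the boolean return value, and the injectable `post` parameter are assumptions made for illustration and are not taken from the repository. The `post` parameter exists only so the helper can be exercised without network access.

```python
import logging
import os
from typing import Callable, Optional

logger = logging.getLogger(__name__)


def send_discord_notification(msg: str, post: Optional[Callable] = None) -> bool:
    """Post msg to the webhook named by DISCORD_WEBHOOK; return True on success."""
    webhook_url = os.getenv("DISCORD_WEBHOOK")
    if not webhook_url:
        # Missing or empty variable: skip quietly instead of crashing the run.
        logger.warning("DISCORD_WEBHOOK is not set; skipping notification")
        return False
    if post is None:
        # Imported lazily so the guard above works even without requests installed.
        import requests
        post = requests.post
    try:
        post(webhook_url, json={"content": msg}, timeout=5).raise_for_status()
    except Exception as e:
        # Deliberately broad: a failed notification should not abort a benchmark.
        logger.error("Failed to send Discord notification: %s", e)
        return False
    return True
```

Returning a boolean rather than raising lets callers decide whether a missed notification matters, which seems to fit a long-running benchmark where the notification is a convenience.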

gemini-code-assist · 2026-03-28T00:10:19Z

benchmark/data_generation/utils/io_utils.py

 		self.webhook_url = os.getenv("DISCORD_WEBHOOK")

 	def send_discord_notification(self, msg: str, image_path: str = None):
+		return None


This return None statement makes the rest of the function unreachable. It appears to be a temporary debugging change. Please remove it before merging so that Discord notifications work as intended.
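If the early return was added to silence notifications during local runs, an explicit switch is one alternative to editing the function body. The sketch below is an assumption, not the repository's code: the class name `DiscordNotifier`, the `enabled` argument, and the `DISCORD_NOTIFY` environment variable are all hypothetical.

```python
import os
from typing import Optional


class DiscordNotifier:
    """Hypothetical wrapper with an explicit on/off switch."""

    def __init__(self, enabled: Optional[bool] = None):
        self.webhook_url = os.getenv("DISCORD_WEBHOOK")
        if enabled is None:
            # Hypothetical variable: DISCORD_NOTIFY=0 turns notifications off.
            enabled = os.getenv("DISCORD_NOTIFY", "1") != "0"
        self.enabled = enabled

    def should_send(self) -> bool:
        # Send only when notifications are enabled and a webhook is configured.
        return self.enabled and bool(self.webhook_url)
```

A send method would check `should_send()` first and return early only in that case, so the posting logic stays reachable in normal runs.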

gemini-code-assist · 2026-03-28T00:10:19Z

benchmark/data_generation/experiments/backend_comparison/utils/plots.py

+	for engine_id, record in data.items():
+		d = _extract_event_durations(record["events"])
+
+		n_inv = len(d["d_resolve"]) or 1


Using or 1 here can lead to incorrect reporting. If there are zero invocations, n_inv will be set to 1, which is misleading when this metric is displayed in the results table. It's better to correctly report 0 and handle potential division-by-zero errors where the division actually occurs, if any.

Suggested change

-		n_inv = len(d["d_resolve"]) or 1
+		n_inv = len(d["d_resolve"])
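With the count reported as 0, any per-invocation average needs its own guard at the point of division. A minimal sketch of that idea follows; the helper name `per_invocation` and the choice of NaN for the empty case are assumptions, since the review does not show where (or whether) plots.py divides by `n_inv`.

```python
import math


def per_invocation(total_seconds: float, n_inv: int) -> float:
    """Average time per invocation; NaN when there were no invocations."""
    return total_seconds / n_inv if n_inv else math.nan


# Stand-in for the dict returned by _extract_event_durations (assumed shape).
d = {"d_resolve": []}
n_inv = len(d["d_resolve"])  # reported as-is, no "or 1" fallback
avg = per_invocation(0.0, n_inv)  # NaN rather than a misleading 0.0
```

NaN keeps an empty engine visibly distinct from one whose invocations really averaged zero seconds; a table renderer can then print it as a dash or "n/a".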

gemini-code-assist · 2026-03-28T00:10:19Z

benchmark/data_generation/experiments/backend_comparison/utils/schemas.py

@@ -0,0 +1 @@
+from pydantic import BaseModel


This file appears to be unused. The main.py for this experiment imports schemas from data_generation/utils/schemas. If this file is not needed, it should be removed to avoid confusion. Also, it's missing a newline at the end of the file, which is a style violation.

coderabbitai · 2026-03-28T00:10:37Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.


catebros added 3 commits March 18, 2026 18:09

feat: migrate experiment 3

b8fcf6e

feat: add new boxplots and remove old (uninformative) plots

f4768dc

feat: add orchestrator comparison and drop cold-start from overhead metrics

d6812d5

catebros wants to merge 3 commits into main from feat/exp-backend-comparison
