Replies: 1 comment
-
|
Today I tested ORCH on a real task — building a full-stack service with a user-facing app and an admin panel. For context:
I fed that architecture into ORCH. Together with Claude Code, we assembled a team of agents based on the required tech stack. A couple of shamanic drum beats and the magic machine started working. The team ended up with 8 agents: CTO, Python Backend, Svelte Frontend, Bot Developer, DevOps, QA, Reviewer, and a CEO on top. They completed 87 runs across 86 tasks — all succeeded, zero failures. Wall-clock time: ~4 hours, while total agent-time was 7.5 hours (~2x parallelism). The whole thing cost about $31 in tokens. The CTO agent consumed the most resources (28% of tokens) — it was constantly monitoring progress and spawning new tasks. QA came in second with 252K tokens across 11 test tasks. The Reviewer did 13 code reviews. Even the DevOps agent showed up, set up Docker Compose, and went quiet — just like in real life. Overall, it felt like actually dispatching a task to a real dev team — things happen in the background, and then you get a nearly working solution. As Dario said, a country of geniuses in a data center — except here it's a dev team in your terminal. Fully autonomous, only needed a bit of polishing at the end with Claude Code. I really enjoyed it. It also made me think that the team composition can be anything — content factory, research pipeline, you name it. Highly recommend giving it a try. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
👋 Welcome!
We'd love to hear from you — whether you're just trying ORCH for the first time or already running multi-agent workflows in production.
Share your experience
Quick feedback template (optional)
Every comment helps us prioritize what to build next. No feedback is too small — even "it just works" or "the TUI crashed after 10 min" is useful.
Thanks for being an early adopter! ⭐
Beta Was this translation helpful? Give feedback.
All reactions