Replies: 3 comments 2 replies
-
Can you provide the server logs for both clusters? Setting the log level to There must be something different between the 2 clusters, whether it is in the software stack or in the infrastructure, or somewhere else. Could you provide more information about the 2 clusters?
There will be some unavailability at some point. Whether you notice it or not, this is unavoidable. The duration of the unavailability depends on many factors. I suspect there is some unavailability, or at least some disruption, even on the cluster that allegedly has no problem, because restarting a stream changes its topology, which affects the connections that use it.
-
@yurahaid you haven't provided any evidence of a bug. A stream restart will inevitably result in a short unavailability window. Your clients should be ready to retry the operations. It's up to you to investigate the difference between the clusters.
-
I am adding logs from the clusters. Both clusters are very similar and are hosted on VMs in GCP, in different projects. The problem cluster has 8 CPUs and the stable one has 2 CPUs in each node.
Attachment: healthy cluster with successful reconnect on the first attempt..log
-
Describe the bug
We are experiencing a temporary unavailability of a stream after performing a restart operation in one of our RabbitMQ clusters. 
Environment:
• RabbitMQ version: 4.1.0
• Cluster configuration: 3 nodes
• Client: Go RabbitMQ Stream client, latest version (v1.5.6)
Issue Details:
In one of our RabbitMQ clusters, when we execute a restart_stream operation on a stream, our producers and consumers receive a Stream Not Available error for approximately 5 to 20 seconds. This issue does not occur in another cluster with an identical configuration.
Both clusters are running RabbitMQ version 4.1.0 with three nodes each. The consumers are implemented using the latest version of the Go RabbitMQ Stream client.
Reproduction steps
{"severity":"ALERT","ts":"2025-05-27T12:03:18.897Z","caller":"/src/cmd/test-rabbitmq/main.go:92","message":"exit with error","context":{"error":"create consumer: Stream Not Available"},"stacktrace":"main.main\n\t/src/cmd/test-rabbitmq/main.go:92\nruntime.main\n\t/usr/local/go/src/runtime/proc.go:283"}
Expected behavior
The stream should remain available to consumers and publishers, without any errors, during and after operations on the stream.
Additional context
This makes it challenging to work with RabbitMQ because we cannot add or remove a node from the cluster without affecting the system's operation.