-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: Kafka operator becomes dysfunctional after setting controller.quorum.fetch.timeout.ms
to -1
#11084
Comments
I guess you can unset it? I'm not sure there is much Strimzi can do with a configuration like this. We can block it from being configurable. But that might not allow you to tune this option. |
@scholzj Thanks for the prompt response. Yes we were able to recover it by reverting, and the operator recovered after 10 mins. In terms of rejecting the invalid valid, is there a way for the kafka-operator to reject the configuration when it is set to negative value, instead of completely blocking it? |
I think there are two separate issues. First one that you see here is that when you set it to The second is whether having In general, I'm not sure Strimzi can easily handle something like this and whether it should handle it. I think there are many ways how you can break your Kafka cluster through these options. The expectation is that when go into configuring options like these, you know what you are doing and you test your setting properly. But let's see what others think when the issue is triaged. |
Thanks for the nice breakdown. the points accurately describe the issues.
Since Strimzi is not only propagating this configuration Kafka, but also relying on this value to manage Kafka, |
As I said, it will be triaged ... but I think the complexity of the implementation is much higher than the benefit of it. |
Bug Description
Kafka operator is stuck with an erroneous configuration value of
controller.quorum.fetch.timeout.ms=-1
. The root cause might be that this configuration value is used by operator to check the health of the nodes, and since it is set to-1
, it immediately times out.Steps to reproduce
spec.kafka.config
withcontroller.quorum.fetch.timeout.ms
set as-1
Expected behavior
The Kafka operator should reject the invalid configuration value of
controller.quorum.fetch.timeout.ms
and remain functionalStrimzi version
quay.io/strimzi/operator:0.45.0
Kubernetes version
v1.28.0
Installation method
YAML
Infrastructure
kind v0.21.0 go1.22.6 linux/amd64
Configuration files and logs
Additional context
The text was updated successfully, but these errors were encountered: