-
Notifications
You must be signed in to change notification settings - Fork 384
MON-4386,MON-4387: telemetry Collection Profile
#2694
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
This profile only caters to metrics that telemetry rules rely upon. Signed-off-by: Pranshu Srivastava <[email protected]>
|
@rexagod: This pull request references MON-4386 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the task to target the "4.21.0" version, but no target version was set. This pull request references MON-4387 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the task to target the "4.21.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
Skipping CI for Draft Pull Request. |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: rexagod The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
9476550 to
9c9b657
Compare
Adds in-cluster monitoring stack's telemetry metrics under the `telemetry` profile. Signed-off-by: Pranshu Srivastava <[email protected]>
9c9b657 to
d5f9e7e
Compare
| @@ -1,5 +1,9 @@ | |||
| # Note: This CHANGELOG is only for the monitoring team to track all monitoring related changes. Please see OpenShift release notes for official changes. | |||
|
|
|||
| ## 4.XX | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since adding external support may take a while.
|
@rexagod: This pull request references MON-4386 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the task to target the "4.21.0" version, but no target version was set. This pull request references MON-4387 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the task to target the "4.21.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
@rexagod: This pull request references MON-4386 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the task to target the "4.21.0" version, but no target version was set. This pull request references MON-4387 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the task to target the "4.21.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
telemetry profile and add support for in-house componentstelemetry Collection Profile
|
@rexagod: The following test failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
PR needs rebase. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
|
I wanted to talk a bit regarding the The The If CMO runs with remote-write capability (no telemeter-client), targets may be injected with the same relabellings that are done for the telemetry one. This may have performance implications since every target would have it's metrics filtered through the same relabelling rules' regexes, but it'd be more dynamic than keeping a record of monitor keys for the ones that are needed for telemetry, since that wouldn't be immune to breaking changes (renaming or migration). However, as it is right now we'd need an approach that not only covers this but also the legacy (federated) telemeter-client-based approach, since users should also see the cut-down in metric volumes in not just their LTS storage systems, but also the PVs hooked up for data retention, which brings us back to constraining at the monitor level. A more manageable and fitting way would be to adhere to the same pattern as the @simonpasquier Curious to hear your thoughts on this. I believe the last passage makes the most sense going forward for this collection profile, but PLMK if I missed something. |
|
⬆️ This was resolved over Slack. We'll move forward with the same approach as we did for the |
Both of these changes need to be merged together. Additionally, MON-4788 also needs to be merged with these (once it's pushed).
Marked as draft for now as this needs all non-in-cluster-monitoring-stack recording rules ported to monitors as well.
Not 100% sure right now without investigating more on this, but I believe we can keep the existing machinery, i.e., the whitelisted machinery in CMO and everything from that point on as is, as that would serve as (a) a safeguard to make sure we pipe out the same rules and (b) a check to make sure the metrics (rules) being collected match the rules being sent out. But this will be worked on later once we start adding tests for this in origin (after the aforementioned porting effort is done).