The change from rate() to irate() is a breaking change #670

rmak-cpi · 2021-09-13T21:38:32Z

I'm submitting this issue so that folks who hit the same problem can find out what happened more easily. I recently upgraded to kube-prometheus-stack and discovered that CPU data no longer shows on an old copy of the Grafana "Kubernetes / Compute Resources / Workload" dashboard that is hosted somewhere else. It turns out that the following change from rate() to irate() was the cause:

e996e00

In particular, renaming node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate to node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate can break any dashboards, rules, etc. that reference the old name. As a (temporary) workaround, I think I can just recreate the old node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate recording rule.
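Before recreating the rule, it can help to see which panels are affected. The following is a minimal Python sketch, not part of kube-prometheus or the mixin: it walks a dashboard's JSON model and reports every string that still mentions the old rule name. The `dashboard` dict and the helper `find_old_references` are hypothetical and only for illustration; a real dashboard would be loaded from its exported JSON file.

```python
OLD_RULE = "node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate"


def find_old_references(node, path="$"):
    """Yield a JSON-style path for every string value that mentions OLD_RULE."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from find_old_references(value, f"{path}.{key}")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from find_old_references(value, f"{path}[{index}]")
    elif isinstance(node, str) and OLD_RULE in node:
        yield path


# Hypothetical, heavily trimmed dashboard model (assumption, not the real dashboard).
dashboard = {
    "panels": [
        {
            "title": "CPU Usage",
            "targets": [{"expr": "sum(" + OLD_RULE + '{namespace="$namespace"})'}],
        },
        {
            "title": "Memory Usage",
            "targets": [{"expr": 'sum(container_memory_working_set_bytes{namespace="$namespace"})'}],
        },
    ]
}

hits = list(find_old_references(dashboard))
print(hits)  # ['$.panels[0].targets[0].expr']
```

Each reported path is a query that returns no data after the upgrade unless the old recording rule is recreated or the query is switched to the sum_irate name.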

Please feel free to add brief comments confirming the observation and close it. Thanks!

tahajahangir · 2021-10-01T16:07:54Z

Using irate also does not give better detail. It only makes graphs more random and noisy (it uses only a sample of the points), and the final graph may completely discard spikes (while rate always shows the average rate over the interval).
https://valyala.medium.com/why-irate-from-prometheus-doesnt-capture-spikes-45f9896d7832

Original bad PR: #619

paulfantom · 2021-10-05T08:37:52Z

> while rate always show average rate over the interval

Averaging removes spikes and thus removes important data. In a lot of cases you want those spikes to be present on graphs. This is in contrast to alerts, where you probably don't want spikes, and rate (or another statistical method) is a better choice.

More in https://www.robustperception.io/irate-graphs-are-better-graphs

tahajahangir · 2021-10-12T04:25:11Z

> Average removes spikes and thus removes important data.

rate does not remove spikes; it flattens them over a time period. In contrast, irate may completely remove them (as if they had never happened at all).

irate uses only the last two data points in a time range (ignoring all other points), which results in noisier graphs. rate considers the first and last data points in the range.
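To make the difference concrete, here is a simplified Python sketch of the two calculations. It is an illustration only, not Prometheus's implementation: it ignores counter resets and the extrapolation that rate() applies at the window edges, and the sample values are made up.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (timestamp in seconds, counter value)


def simple_rate(window: List[Sample]) -> float:
    """Per-second increase between the first and last sample in the window."""
    (t_first, v_first), (t_last, v_last) = window[0], window[-1]
    return (v_last - v_first) / (t_last - t_first)


def simple_irate(window: List[Sample]) -> float:
    """Per-second increase between the last two samples in the window."""
    (t_prev, v_prev), (t_last, v_last) = window[-2], window[-1]
    return (v_last - v_prev) / (t_last - t_prev)


# Invented counter scraped every 15 s, with a burst of 30 between t=15 and t=30.
samples = [(0, 0.0), (15, 1.0), (30, 31.0), (45, 32.0), (60, 33.0)]

rate_value = simple_rate(samples)    # (33 - 0) / 60 = 0.55
irate_value = simple_irate(samples)  # (33 - 32) / 15, roughly 0.067

print(f"rate:  {rate_value:.3f}")
print(f"irate: {irate_value:.3f}")
```

In this made-up window the burst raises the rate() result above the baseline of about 0.067 per second, while the irate() result reflects only the final, quiet interval.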

paulfantom · 2021-10-12T08:01:17Z

> rate does not remove spikes, it makes them flat over a time period

When it comes to graphs, "removal" and "making spikes flat" are the same. In both cases, you are losing data from the visualization.

bboreham · 2021-10-18T18:20:05Z

It's unpredictable whether the important spikes are at the end of the window, in which case irate can see them, or earlier in the window, in which case irate will ignore them.
I prefer consistent behaviour, and no discarding of points, so rate() is better.
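The window-position effect can be illustrated with a small, self-contained Python sketch. The two helper functions are simplified stand-ins (no counter-reset handling, no extrapolation), and both series are invented: each gains 33 over 60 seconds, with a burst of 30 landing either in the first or in the final scrape interval.

```python
def simple_rate(window):
    """Simplified rate(): slope between the first and last sample."""
    (t_first, v_first), (t_last, v_last) = window[0], window[-1]
    return (v_last - v_first) / (t_last - t_first)


def simple_irate(window):
    """Simplified irate(): slope between the last two samples."""
    (t_prev, v_prev), (t_last, v_last) = window[-2], window[-1]
    return (v_last - v_prev) / (t_last - t_prev)


# (timestamp in seconds, counter value); same total increase, different burst position.
early_spike = [(0, 0.0), (15, 30.0), (30, 31.0), (45, 32.0), (60, 33.0)]
late_spike = [(0, 0.0), (15, 1.0), (30, 2.0), (45, 3.0), (60, 33.0)]

results = {
    name: (simple_rate(series), simple_irate(series))
    for name, series in (("early", early_spike), ("late", late_spike))
}

for name, (rate_value, irate_value) in results.items():
    print(f"{name:5s} rate={rate_value:.3f} irate={irate_value:.3f}")
```

Under these assumptions the simplified rate is identical for both series, while the simplified irate differs by a factor of 30 depending only on where the burst falls relative to the end of the window.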

skl · 2024-10-16T09:51:01Z

Duplicate of #679

bboreham · 2024-10-16T10:57:39Z

This one is about the name changing, while #679 is about the behaviour.

skl · 2024-10-16T15:48:07Z

@bboreham ah ok, I was thinking about contributing a rate version of all the existing irate recording rules - to maintain backwards compatibility and give users the choice. I thought that might address both tickets, hence marking as dupe. wdyt?

bboreham · 2024-10-16T16:52:47Z

Duplicating the recording rules would indeed address the breaking change, relating to other uses of the data.
To fix #679 would require changing the dashboards.

bboreham mentioned this issue Nov 26, 2021

irate is bad #679

Open

austincunningham mentioned this issue May 4, 2022

change sum_rate to sum_irate in dashboards and prometheusrules 3scale/3scale-operator#744

Closed

github-actions bot added the stale label Oct 15, 2024

skl marked this as a duplicate of #679 Oct 16, 2024

skl closed this as not planned Oct 16, 2024

skl reopened this Oct 17, 2024

skl added keepalive and removed stale labels Oct 17, 2024

skl self-assigned this Oct 17, 2024
