Introduce materialization for operations #247
base: develop
Conversation
This commit introduces a materialization to record the parent-child relation between operations. Namely, for a span of operation P that is the parent of a span with operation C, we record the 10-minute bucketed relation:
- bucket time
- id of operation P
- id of operation C
- count()

This gives us a way to quickly aggregate the service map.

The materialization is done like a continuous aggregate, but we couldn't use continuous aggregates since they don't support self-joins.

This materialization supports real-time aggregation, where the materialization is combined with a freshly calculated aggregation of the not-yet-materialized head of the data. This is available in the _ps_trace.operation_stats view.

A limitation of this approach is that we don't support invalidations, so if a new span comes in for a time period that has already been materialized, it is never corrected. Currently we materialize data that is no newer than 10 minutes prior to the most recent data.

A few assumptions are made in the materialization:
- A child span's start_time is >= the parent span's start_time. This seems obvious, but I couldn't find it documented in the spec.
- A child span's start_time is < the parent span's start_time + 1 hour. This is more questionable, but I think it's ok if we document the limitation.

The background job to materialize this data is set up to be separate from the rest of the jobs, for isolation and also because we always want to run this every 10 minutes and don't want parallelism.

Fixed parameters we may want to make configurable later:
- Size of bucket (currently 10 minutes)
- Amount aggregated in one loop/txn (1 bucket)
- Invalidation window (10 minutes)
- Chunk size of the materialized hypertable (7 days)
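The bucketed aggregation described above can be sketched roughly as follows. This is a simplified sketch assembled from the fragments in the diff below; the actual query in the PR additionally bounds the scan by the materialization watermark and runs one bucket per loop iteration:

```sql
-- Simplified sketch of the bucketed parent-child aggregation.
-- Table/column names follow the PR; watermark and window
-- predicates from the real procedure are omitted here.
INSERT INTO _ps_trace.operation_materialization
SELECT
    public.time_bucket('10 minutes', parent.start_time) AS parent_start_time,
    parent.operation_id AS parent_operation_id,
    child.operation_id  AS child_operation_id,
    count(*)            AS cnt
FROM _ps_trace.span AS parent
INNER JOIN _ps_trace.span AS child
    ON parent.span_id = child.parent_span_id
   AND parent.trace_id = child.trace_id
GROUP BY 1, 2, 3;
```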
Perhaps adding the max parent age to the configurable parameters is worth it as well. We never know what the end system is going to be like. I reckon that on a system where a delta of >1h is common, it won't be as significant of a performance penalty either.
--correctness check
SELECT * FROM real_view
EXCEPT
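The check is truncated here. A one-directional EXCEPT only catches rows missing from one side, so a full symmetric-difference check along these lines (the materialized view's name is assumed for illustration) would be:

```sql
-- Correctness check: rows present in either view but not the other.
-- An empty result means the materialization matches the freshly
-- computed aggregation. "materialized_view" is a placeholder name.
(SELECT * FROM real_view
 EXCEPT
 SELECT * FROM materialized_view)
UNION ALL
(SELECT * FROM materialized_view
 EXCEPT
 SELECT * FROM real_view);
```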
Neat :-)
_query_start := clock_timestamp();
INSERT INTO _ps_trace.operation_materialization
SELECT
    public.time_bucket(_bucket, parent.start_time) bucket,
Suggested change:
- public.time_bucket(_bucket, parent.start_time) bucket,
+ public.time_bucket(_bucket, parent.start_time) as bucket,
    child.operation_id as child_operation_id,
    count(*) as cnt
FROM
    _ps_trace.span parent
Suggested change:
- _ps_trace.span parent
+ _ps_trace.span as parent
FROM
    _ps_trace.span parent
INNER JOIN
    _ps_trace.span child ON (parent.span_id = child.parent_span_id AND parent.trace_id = child.trace_id)
Suggested change:
- _ps_trace.span child ON (parent.span_id = child.parent_span_id AND parent.trace_id = child.trace_id)
+ _ps_trace.span as child ON (parent.span_id = child.parent_span_id AND parent.trace_id = child.trace_id)
--we materialize one bucket at a time until we are done. We do this so that
--we can make steady progress even if we are pretty far behind.
WHILE (_start + _bucket) <= _global_end LOOP
I'm not certain this should be <= and not just <. Can you please elaborate on this?
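For context, one way to reason about the boundary, assuming each bucket covers the half-open interval [_start, _start + _bucket), which matches time_bucket semantics:

```sql
-- Illustration (assumed semantics): with _start = 12:00, _bucket = 10 min,
-- and _global_end = 12:10, the condition (_start + _bucket) <= _global_end
-- holds, so the complete bucket [12:00, 12:10) gets materialized; a strict
-- < would leave that fully closed bucket unmaterialized until _global_end
-- advanced past 12:10.
SELECT ('2023-01-01 12:00'::timestamptz + interval '10 minutes')
       <= '2023-01-01 12:10'::timestamptz AS materialize_last_bucket;  -- true
```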
INNER JOIN
    _ps_trace.span child ON (parent.span_id = child.parent_span_id AND parent.trace_id = child.trace_id)
WHERE
    parent.start_time >= COALESCE(_ps_trace.materialize_watermark(), '-infinity'::timestamp with time zone)
Suggested change:
- parent.start_time >= COALESCE(_ps_trace.materialize_watermark(), '-infinity'::timestamp with time zone)
+ parent.start_time >= COALESCE(_ps_trace.materialize_watermark(), '-infinity'::pg_catalog.timestamptz)
WHERE
    parent.start_time >= COALESCE(_ps_trace.materialize_watermark(), '-infinity'::timestamp with time zone)
    --assumes the child span starts on or after the parent starts
    AND child.start_time >= COALESCE(_ps_trace.materialize_watermark(), '-infinity'::timestamp with time zone)
Suggested change:
- AND child.start_time >= COALESCE(_ps_trace.materialize_watermark(), '-infinity'::timestamp with time zone)
+ AND child.start_time >= COALESCE(_ps_trace.materialize_watermark(), '-infinity'::pg_catalog.timestamptz)
WHERE parent_start_time < COALESCE(_ps_trace.materialize_watermark(), '-infinity'::timestamp with time zone)
UNION ALL
SELECT
    public.time_bucket('10 minute', parent.start_time) parent_start_time,
Perhaps worth putting the interval into an inlinable config function.
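A sketch of what such a config function might look like (the function name is hypothetical; a single-statement SQL function marked IMMUTABLE and PARALLEL SAFE can be inlined and constant-folded by the planner):

```sql
-- Hypothetical config function for the bucket width.
CREATE OR REPLACE FUNCTION _ps_trace.operation_materialization_bucket_width()
RETURNS interval
LANGUAGE sql IMMUTABLE PARALLEL SAFE
AS $$
    SELECT interval '10 minutes'
$$;

-- Usage in the materialization query:
--   public.time_bucket(
--       _ps_trace.operation_materialization_bucket_width(),
--       parent.start_time)
```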
    child_operation_id bigint,
    cnt bigint,
    --todo what else,
    UNIQUE(parent_start_time, parent_operation_id, child_operation_id)
Perhaps worth upgrading to a primary key to enforce NOT NULLs.
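A sketch of the suggested change (column names follow the diff above; the type of parent_start_time is assumed to be timestamptz). A primary key implicitly makes all of its columns NOT NULL:

```sql
-- Hypothetical sketch: PRIMARY KEY replaces the UNIQUE constraint
-- and forces the key columns to be NOT NULL.
CREATE TABLE _ps_trace.operation_materialization (
    parent_start_time   timestamptz NOT NULL,
    parent_operation_id bigint      NOT NULL,
    child_operation_id  bigint      NOT NULL,
    cnt                 bigint      NOT NULL,
    PRIMARY KEY (parent_start_time, parent_operation_id, child_operation_id)
);
```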
--keep this separate from the other jobs for independence and also since the right
--schedule here is always "every 10 min". We never want this to run in parallel so
--it's always just one job making progress. This job is usually pretty quiet so enable
--logging by default (should chirp once every 10 min).
Not sure I agree with this. It is quiet indeed, but what benefit does it provide? Perhaps it's better to expose the duration (or the end_time) in the job table (_ps_trace.operation_materialization) so that the user can assess the performance change of this over time and consider changing the bucket size.
If this is a file log entry then I'd see it as yet another line I wouldn't care about, since a single entry doesn't give you any information at all.
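One way the suggestion could look. This is only a sketch; the job-history table and column names here are assumptions, not part of the PR:

```sql
-- Hypothetical sketch: record per-run timing so materialization
-- performance can be observed over time instead of logged per run.
ALTER TABLE _ps_trace.operation_materialization_job_history
    ADD COLUMN last_run_started_at timestamptz,
    ADD COLUMN last_run_duration   interval;

-- The job would then record, per run, something like:
--   UPDATE ... SET last_run_started_at = _query_start,
--                  last_run_duration   = clock_timestamp() - _query_start;
```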