Txm traces by GabrielMartinezRodriguez · Pull Request #521 · HappyChainDevs/happychain

GabrielMartinezRodriguez · 2025-03-13T13:53:40Z

Description

This PR includes a proof of concept on how to implement traces in the transaction manager. Traces are useful because they allow us to answer questions such as:

What happened in block X that caused the absence of a drand value in that block?
Why is a transaction in the "interrupted" status?
Why are we not including transactions fast enough to reveal randomness?
Metrics, combined with alerts, provide a generic overview that helps us understand if everything is working correctly, while traces offer more concrete data to understand why something is happening. This allows us to debug and fix production issues much faster.

To simplify the implementation, I created a special decorator that allows us to propagate traces without having to use the startActiveSpan method. This method is somewhat clunky because it requires implementing the span code inside the span itself to propagate the async local storage, which is how OpenTelemetry establishes the hierarchical relationship between multiple spans.

Since we are using OpenTelemetry, we can leverage this feature:
Grafana Exemplars

This feature allows us to correlate traces with metrics. For example, if we notice in Grafana that some transactions are taking longer than expected, we can directly jump from Grafana to the specific trace where the delay occurred and quickly understand the root cause.

To visualize the traces, I deployed Tempo locally. It works well and is natively integrated with Grafana, making it the best option for hosting traces.

Include all relevant context (but no need to repeat the issue's content).
Draw attention to new, noteworthy & unintuitive elements.

Toggle Checklist

Checklist

Basics

B1. I have applied the proper label & proper branch name (e.g. norswap/build-system-caching).
B2. This PR is not so big that it should be split & addresses only one concern.
B3. The PR targets the lowest branch it can (ideally master).

Reminder: PR review guidelines

Correctness

C1. Builds and passes tests.
C2. The code is properly parameterized & compatible with different environments (e.g. local,
testnet, mainnet, standalone wallet, ...).
C3. I have manually tested my changes & connected features.

< INDICATE BROWSER, DEMO APP & OTHER ENV DETAILS USED FOR TESTING HERE >

< INDICATE TESTED SCENARIOS (USER INTERFACE INTERACTION, CODE FLOWS) HERE >

C4. I have performed a thorough self-review of my code after submitting the PR,
and have updated the code & comments accordingly.

Architecture & Documentation

D1. I made it easy to reason locally about the code, by (1) using proper abstraction boundaries,
(2) commenting these boundaries correctly, (3) adding inline comments for context when needed.
D2. All public-facing APIs & meaningful (non-local) internal APIs are properly documented in code
comments.
D3. If appropriate, the general architecture of the code is documented in a code comment or
in a Markdown document.
D4. An appropriate Changeset has been generated (and committed) for changes that touch npm published packages (currently pacakges/core and packages/react), see here for more info.

cloudflare-workers-and-pages · 2025-03-13T13:53:43Z

Deploying happychain with Cloudflare Pages

Latest commit:	`b9091a6`
Status:	✅ Deploy successful!
Preview URL:	https://6b2f72e5.happychain.pages.dev
Branch Preview URL:	https://gabriel-txm-traces.happychain.pages.dev

View logs

GabrielMartinezRodriguez · 2025-03-13T13:53:53Z

Randomness monitor service #584
Txm traces #521 👈 (View in Graphite)
fix(txm): returned nonce queue order #578
fix(txm): not process the same block multiple times #577
Fix: Viem sends random undefined blocks #575
fix(txm): heap out of memory #573
Avoid initiating new attempts if the gas conditions persist unchanged #571
Dynamic priority fee #570
TXM: Rpc liveness #562
fix(txm): nonce gap #561
PoC: Add metrics to TXM #503
master

This stack of pull requests is managed by Graphite. Learn more about stacking.

norswap · 2025-03-14T19:15:42Z

packages/txm/lib/telemetry/traces.ts

I'm not familiar with opentelemetry at all, but what's the purposes of spans vs events. Is it useful to group events emitted in a method in spans (or maybe we have to?) — vs just having the events in a single top-level span?

I think I remember that spans could be used for stuff that happens on different services (different proceses or servers), where it would make sense to have one span per service.

A span inside a trace represents a process within the trace. Each span can have attributes and events. I think the right approach is to have a span for every method, because it's the easiest way, and it allows us to clearly see the stack trace followed by a transaction.

Every horizontal line is a stack, and you can click on it to view its events

norswap

Seems great, left a question!

bun.lockb

GabrielMartinezRodriguez marked this pull request as ready for review March 13, 2025 13:53

GabrielMartinezRodriguez mentioned this pull request Mar 13, 2025

PoC: Add metrics to TXM #503

Merged

11 tasks

GabrielMartinezRodriguez changed the title ~~feat(txm): traces~~ PoC: Txm traces Mar 13, 2025

GabrielMartinezRodriguez self-assigned this Mar 13, 2025

GabrielMartinezRodriguez added the no-merge For showcase, not to be merged label Mar 13, 2025

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 994302a to 6f1ef4c Compare March 13, 2025 14:59

norswap reviewed Mar 14, 2025

View reviewed changes

norswap added the question Something has to be cleared up after review label Mar 14, 2025

GabrielMartinezRodriguez force-pushed the gabriel/txm-telemetry branch 2 times, most recently from ddfe09e to cf5bf43 Compare March 17, 2025 16:03

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 6f1ef4c to 0760c4f Compare March 17, 2025 16:03

GabrielMartinezRodriguez force-pushed the gabriel/txm-telemetry branch from cf5bf43 to 96a7504 Compare March 18, 2025 14:17

aodhgan reviewed Mar 24, 2025

View reviewed changes

bun.lockb Outdated Show resolved Hide resolved

This was referenced Mar 26, 2025

fix(txm): nonce gap #561

Merged

TXM: Rpc liveness #562

Merged

GabrielMartinezRodriguez force-pushed the gabriel/txm-telemetry branch from 1727040 to 4303d66 Compare March 27, 2025 10:24

This was referenced Mar 27, 2025

Dynamic priority fee #570

Merged

Avoid initiating new attempts if the gas conditions persist unchanged #571

Merged

fix(txm): heap out of memory #573

Merged

GabrielMartinezRodriguez force-pushed the gabriel/txm-telemetry branch from 4365362 to e783a93 Compare March 31, 2025 09:54

GabrielMartinezRodriguez mentioned this pull request Mar 31, 2025

Fix: Viem sends random undefined blocks #575

Merged

11 tasks

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 0760c4f to a9689ea Compare March 31, 2025 11:55

GabrielMartinezRodriguez changed the base branch from gabriel/txm-telemetry to gabriel/fix-block-undefined March 31, 2025 11:55

This was referenced Apr 1, 2025

fix(txm): not process the same block multiple times #577

Merged

fix(txm): returned nonce queue order #578

Merged

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from a9689ea to ada0e33 Compare April 1, 2025 10:00

GabrielMartinezRodriguez changed the base branch from gabriel/fix-block-undefined to gabriel/fix-returned-nonce-order April 1, 2025 10:01

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from fc2dc4f to 161217c Compare April 10, 2025 08:38

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 37754db to 98636d5 Compare April 10, 2025 08:38

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from 161217c to be24f08 Compare April 10, 2025 08:51

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 98636d5 to 6efc98d Compare April 10, 2025 08:51

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from be24f08 to 81ed24a Compare April 10, 2025 09:13

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 6efc98d to 335232d Compare April 10, 2025 09:13

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from 81ed24a to 694721d Compare April 10, 2025 09:56

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 335232d to 2b1bca0 Compare April 10, 2025 09:56

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from 694721d to 85f1522 Compare April 10, 2025 11:51

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 2b1bca0 to 3a6255a Compare April 10, 2025 11:51

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from 85f1522 to fe19633 Compare April 10, 2025 11:56

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 3a6255a to 970ff6d Compare April 10, 2025 11:56

GabrielMartinezRodriguez mentioned this pull request Apr 10, 2025

Txm transactions support arbitrary calldata #596

Merged

11 tasks

norswap approved these changes Apr 10, 2025

View reviewed changes

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from fe19633 to 03b78a8 Compare April 13, 2025 21:30

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from 3345d89 to b9091a6 Compare April 13, 2025 21:31

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from 03b78a8 to f9905ee Compare April 13, 2025 21:36

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from b9091a6 to c1e4a23 Compare April 13, 2025 21:36

GabrielMartinezRodriguez force-pushed the gabriel/fix-returned-nonce-order branch from f9905ee to 988fc7f Compare April 13, 2025 21:38

GabrielMartinezRodriguez force-pushed the gabriel/txm-traces branch from c1e4a23 to c293051 Compare April 13, 2025 21:38

GabrielMartinezRodriguez added 6 commits April 13, 2025 23:41

feat(txm): traces

9b62ebd

feat(txm): complete txm instrumentation

fed85a9

chore(txm): format

117dc79

fix(txm): fixed infinite traces

4a3b06e

chore(txm): pr review

c38b1cd

chore(txm): added comments

393edd4

This was referenced Apr 21, 2025

feat(txm): transactions with value #647

Merged

Faucet service & Iframe integration #661

Merged

feat: deploy faucet #668

Merged

feat(randomness): drand prune #693

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Txm traces#521

Txm traces#521
GabrielMartinezRodriguez merged 6 commits intomasterfrom
gabriel/txm-traces

GabrielMartinezRodriguez commented Mar 13, 2025 •

edited

Loading

Uh oh!

cloudflare-workers-and-pages bot commented Mar 13, 2025 •

edited

Loading

Uh oh!

GabrielMartinezRodriguez commented Mar 13, 2025 •

edited

Loading

Uh oh!

norswap Mar 14, 2025 •

edited

Loading

Uh oh!

GabrielMartinezRodriguez Apr 2, 2025

Uh oh!

norswap left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

GabrielMartinezRodriguez commented Mar 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Basics

Correctness

Architecture & Documentation

Uh oh!

cloudflare-workers-and-pages bot commented Mar 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying happychain with Cloudflare Pages

Uh oh!

GabrielMartinezRodriguez commented Mar 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

norswap Mar 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

GabrielMartinezRodriguez Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

norswap left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

GabrielMartinezRodriguez commented Mar 13, 2025 •

edited

Loading

cloudflare-workers-and-pages bot commented Mar 13, 2025 •

edited

Loading

GabrielMartinezRodriguez commented Mar 13, 2025 •

edited

Loading

norswap Mar 14, 2025 •

edited

Loading