-
@stevana Let me break down an answer into small points.

### 1. The workload

Ideally, we would use the Haskell node as a baseline for an answer here. "At least as good" is what comes to mind, though I am not sure it has been properly formulated for the Haskell node either. So that is perhaps something worth spending some time on, possibly using the simulation from the DeltaQ framework that was developed as part of the Haskell development?

One particular constraint that keeps recurring is that when a new block is produced, it must be able to propagate to 95% of the network in less than 5s. I recall that the Haskell team even uses a stricter bound (less than 2s, I reckon, but don't quote me on that).

Another way to look at it is that we want to be able to adopt a block before the next block comes in. While the Praos distribution puts the average block production time around 20s, in practice the only strong guarantee we have is that there cannot be two blocks less than 1s from one another (discarding slot-battle-induced forks here, where we could potentially receive two blocks within a single second, but from different peers/chains). So, in the worst-case scenario, we must be able to adopt blocks at a rate of 1 block per second.

Now, not all blocks are born equal, but blocks are bounded on all dimensions. More specifically, a block is sized along three axes:

- its raw size, in bytes;
- its execution steps (abstract CPU units consumed by scripts);
- its execution memory (abstract memory units consumed by scripts).

Each dimension comes with a maximum value, defined by (updatable) protocol parameters. Since we have to optimize for the worst-case scenario (to ensure we can survive adversarial behaviours), the workload has to be considered given those bounds. We have to be careful here, though, because execution steps and memory are abstract units, measured and costed according to a model; they do not necessarily reflect the actual performance cost on the system. More so, they are modeled after the Haskell node, which has a vastly different runtime execution model than Rust.

One practical way to approach this workload question could be to sample the mainnet chain and run a simulation where blocks from mainnet are applied every second. This would not necessarily reflect a worst-case situation, but would at least model a first "realistic" workload. From there, we can search for scenarios that maximise, if not all, then at least one of the block's dimensions.
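Purely as an illustration of such a worst-case driver, here is a sketch in Rust. The struct and field names are invented for the example, the magnitudes are mainnet-like but not authoritative, and the validation step is a stub:

```rust
use std::time::{Duration, Instant};

/// The three axes a block is bounded on. Values are illustrative,
/// mainnet-like magnitudes, not authoritative protocol parameters.
struct BlockBounds {
    max_body_size: usize, // raw size, in bytes
    max_ex_steps: u64,    // abstract execution steps (script CPU budget)
    max_ex_mem: u64,      // abstract execution memory units
}

/// Stand-in for block validation/adoption: a real harness would feed a
/// sampled (or maxed-out) mainnet block into the node here.
fn adopt_block(_bounds: &BlockBounds) {
    // ... ledger validation work ...
}

fn main() {
    let worst_case = BlockBounds {
        max_body_size: 90_112,        // ~88 KiB
        max_ex_steps: 20_000_000_000, // per-block script CPU budget
        max_ex_mem: 62_000_000,       // per-block script memory budget
    };
    let budget = Duration::from_secs(1); // worst case: 1 block per second

    for slot in 0..10 {
        let start = Instant::now();
        adopt_block(&worst_case);
        assert!(
            start.elapsed() < budget,
            "slot {slot}: block adoption exceeded the 1s budget"
        );
    }
}
```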
### 2. Liveness & safety properties

#### Consensus

I can think of a few safety properties in the system. The one you stated already is, I believe, one of them:

> All honest nodes in the network ultimately converge to the same chain in no more than k blocks given there's less than 50% adversarial stake.
The reason I would argue it is a safety property is that any system that would break it at any point in time would be considered invalid. In fact, the only "liveness" property I can truly think of in terms of the protocol is that a valid transaction that is continuously re-submitted to at least one honest node with non-zero stake will eventually land in a block (unless it becomes invalid due to an external factor, e.g. a double-spend).

#### Ledger

Since the simulation is focusing on the whole system, I think it also makes sense to look at properties of the ledger. There is, in fact, one particularly interesting one: the sum of all money pots (UTxO balances, reward account balances, deposits, reserves, treasury and fees) always, at any point in time, equals the max supply (45B Ada on Mainnet). Said differently: the ledger isn't losing (or creating!) money; everything is always accounted for (a minimal sketch of this invariant follows at the end of this comment).

Interestingly, there are liveness properties that the ledger does not have, such as the guarantee that it is always possible to spend Ada. It is very much possible to end up in a situation where every single Ada is either locked in smart contracts or stuck in the treasury. I don't think there's anything we can do about it, though.
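To make the "preservation of value" property concrete, here is a minimal sketch; the pot names are hypothetical, not Amaru's actual types:

```rust
/// Illustrative sketch of the "preservation of value" invariant; the
/// pot names are assumptions for this example, not Amaru's actual types.
const MAX_SUPPLY_LOVELACE: u64 = 45_000_000_000_000_000; // 45B Ada

struct Pots {
    utxo: u64,
    rewards: u64,
    deposits: u64,
    reserves: u64,
    treasury: u64,
    fees: u64,
}

impl Pots {
    /// Safety property: at any point in time, the pots sum to max supply.
    fn preserves_value(&self) -> bool {
        self.utxo + self.rewards + self.deposits + self.reserves + self.treasury + self.fees
            == MAX_SUPPLY_LOVELACE
    }
}
```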
-
I would frame the problem in terms of observable behaviour. In classical DB systems, whether key-value, relational, or whatever, the observable behaviour of the system is expressed in terms of some operations and queries one can enact on the system, e.g. write some key/value, read the value at some key. What we care about from a distributed-systems perspective is then how distribution, concurrency, failures, etc. impact (or not) this observable behaviour. How the system implements distribution, consensus, and replication is basically irrelevant, but it is typically carried out through a Replicated State Machine based on some consensus algorithm.

In Cardano, and even more so in Amaru at this stage, the main observable behaviour of the system is the command log that each node constructs out of all the logs it observes from other nodes, which is precisely the role of the Nakamoto consensus algorithm we are implementing. So in my opinion it makes sense to define workloads in terms of this observable behaviour, given that our main purpose is to build a node that correctly implements the Ouroboros Praos algorithm.
For those scenarios, we largely don't care about the content of the blocks and the details of the ledger behaviour, beyond its ability to provide a stake distribution. There are interesting things we might want to test that somehow depend on the ledger, like what happens if a node receives a block that's not valid, but this is easy to make happen. In a way, we could perfectly imagine running those tests with a "mock ledger", which is worth considering as it could make it easier to simulate things like lengthy block validations or issues with the stake distribution. The workloads that @KtorZ describes are interesting to assess the performance of the system, but are not that relevant to the correctness of the consensus per se. But of course, this is something we'll need to do.
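As a sketch of an oracle over this observable behaviour (types are hypothetical): each node's "command log" is the chain of block hashes it has adopted, and we check that all honest nodes agree once the last k, possibly still contested, blocks are set aside:

```rust
/// Hypothetical oracle: a node's observable "command log" is modelled
/// here as the chain of block hashes it has adopted.
type BlockHash = u64;

/// True when `a` is a prefix of `b`.
fn is_prefix(a: &[BlockHash], b: &[BlockHash]) -> bool {
    a.len() <= b.len() && a == &b[..a.len()]
}

/// Check that all honest nodes agree on their chains once the last `k`
/// (possibly still contested) blocks are discarded.
fn converged(chains: &[Vec<BlockHash>], k: usize) -> bool {
    let truncated: Vec<&[BlockHash]> = chains
        .iter()
        .map(|c| &c[..c.len().saturating_sub(k)])
        .collect();
    truncated
        .iter()
        .all(|&a| truncated.iter().all(|&b| is_prefix(a, b) || is_prefix(b, a)))
}
```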
-
@abailly Yes, I agree with framing the problem in terms of observable behaviour. In particular, if we consider the following picture:

*(diagram: a "Client" exchanging requests and responses with a "System" box)*
Observable behaviour should be from the point of view of the client. The first question to ask is: who's the "Client"? What's inside the "System" box, and what are the requests and responses (where are these API calls documented)? How do you express the properties you described in terms of a concurrent trace of request/response pairs? For example, if the system is a linearisable distributed key-value store, then the client is the user of the store, and the requests are "write key value" and "read key". The property is that we can take the concurrent trace and find a sequential interleaving (i.e. an execution that could be performed by one CPU) that respects a sequential model, where the sequential model is defined by the following state machine:
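A minimal sketch of that sequential model, assuming string keys and values for illustration:

```rust
use std::collections::HashMap;

/// Requests and responses observable at the client boundary.
enum Request {
    Write { key: String, value: String },
    Read { key: String },
}

#[derive(PartialEq, Debug)]
enum Response {
    Ack,
    Value(Option<String>),
}

/// The sequential model: what a single-CPU execution would answer.
/// Linearisability then asks whether the concurrent trace admits an
/// interleaving whose responses all match this model.
fn step(state: &mut HashMap<String, String>, request: &Request) -> Response {
    match request {
        Request::Write { key, value } => {
            state.insert(key.clone(), value.clone());
            Response::Ack
        }
        Request::Read { key } => Response::Value(state.get(key).cloned()),
    }
}
```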
@KtorZ Yes, I think we should focus on whole-system properties, because observable behaviour from the client's point of view typically does care about whole-system behaviour. This "[..] the ledger isn't losing (or creating!) money; everything is always accounted for." sounds like a good safety property to me (as a client/user of the system, this is something I'd like to be reassured of). It would be good if we can make this more precise over time (it doesn't have to be now), where by precise I mean something at least as detailed as my key-value store example above.

Regarding "All honest nodes in the network ultimately converge to the same chain in no more than k blocks given there's less than 50% adversarial stake." (or variants of it): it's still not clear to me if this is a safety or a liveness property, and you two don't seem to agree either. It would be good to come to a consensus on this ;-). But let me throw in a question, pardon my ignorance: why should a client/user care about this property? If the property didn't hold, what would the consequences be? Would it be possible to break the "ledger isn't losing or creating money" property?

Both of you mention things like "sample the mainnet chain" or "simulate block forgers". Sure, we can fake things until they are actually implemented, and I think those can be valuable tests to have, but I think we should keep our sights on observable behaviour from a client's point of view (rather than from the point of view of a developer of a subsystem), because ultimately it's a client/user of the system that needs to be convinced of the system's usefulness (and whole-system tests from the client's point of view should cover all the properties that a developer of a subsystem cares about).
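One standard lens that may help settle the safety-vs-liveness question is the classical Alpern-Schneider characterisation: a safety property is one whose violations are witnessed by a finite prefix of the trace, while a liveness property is one that no finite prefix can ever refute. For a property $P$ over infinite traces $\sigma$ with finite prefixes $\tau$:

$$
\begin{aligned}
P \text{ is safety} &\iff \forall \sigma \notin P,\ \exists \text{ finite } \tau \preceq \sigma,\ \forall \sigma' \succeq \tau,\ \sigma' \notin P\\
P \text{ is liveness} &\iff \forall \text{ finite } \tau,\ \exists \sigma \succeq \tau,\ \sigma \in P
\end{aligned}
$$

Under this reading, the answer seems to depend on whether the k-block bound is part of the property: with the bound, two honest nodes still disagreeing k blocks later is a finite counterexample (safety flavour); without it, "eventually converge" can never be refuted by a finite prefix (liveness flavour).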
-
Work on the amaru simulator has slowed down to a crawl on my side because of other duties, but here is a proposal for a format we could use as a starting point to test Amaru's consensus using a predefined block tree. This is a JSON format describing the block tree to be served.
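Purely as a hypothetical illustration (all field names below are invented for the example, not the actual proposal), such a file could look like:

```json
{
  "stakeDistribution": { "pool-a": 0.6, "pool-b": 0.4 },
  "blockTree": {
    "slot": 0,
    "hash": "genesis",
    "children": [
      {
        "slot": 12,
        "hash": "h1",
        "issuer": "pool-a",
        "children": [
          { "slot": 31, "hash": "h2a", "issuer": "pool-b", "children": [] },
          { "slot": 31, "hash": "h2b", "issuer": "pool-a", "children": [] }
        ]
      }
    ]
  }
}
```

The two children at slot 31 illustrate a fork no longer than 1 block, matching the generated chain described next.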
I think the tester/simulator should be able to read such a file and then serve the headers "following" the block tree, simulating how different upstream peers can serve different headers, possibly introducing delays and, hopefully, forks. The generated chain is very small and does not have forks longer than 1 block, but I think this is a good starting point. My plan is to reuse what the Consensus folks have done for Genesis, which is much more involved and thorough, but of course requires a bit more work to be used in the Amaru context.
-
For more context see: https://github.com/pragma-org/amaru/wiki/log#2025-02-20