
rebis-dev: Make AtomTable fully safe and prepare for garbage collection #2736

Draft · wants to merge 20 commits into base: rebis-dev
Conversation

@adri326 (Contributor) commented Dec 31, 2024

This is a follow-up to #2727 that gives AtomTable the ability to verify that Atoms are safe to dereference, and to shuffle their layout for garbage collection purposes, by adding another layer of indirection.


Prior to this change, Atoms would contain the offset of their AtomHeader within a buffer. This lets us resize the buffer to add new atoms, but it is prone to issues caused by atoms with invalid offsets and it doesn't let us defragment the AtomTable once garbage collection is introduced.

                 |   buffer   |
                 +------------+
                 |    ...     |
Atom(0x12580) -> | AtomHeader | @12580
                 | "Hello wo" | @12588
                 | "rld!"____ | @12590
                 |    ...     |

With this change, the Atoms now store an index into an array of indices, meaning that verifying the validity of an atom is as simple as checking that it is within this array:

            | offsets |    |   buffer   |
            +---------+    +------------+
            |   ...   |    |    ...     |
Atom(13) -> | 0x12580 | -> | AtomHeader | @12580
            |   ...   |    | "Hello wo" | @12588
            |   ...   |    | "rld!"____ | @12590
            |   ...   |    |    ...     |

This also lets us safely change the order of atoms within the buffer, by changing the offsets, without needing to modify the existing Atoms, which paves the way towards garbage collection within AtomTable.
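To illustrate the new layout, here is a minimal, single-threaded sketch of the indexed scheme. All names are illustrative (the real AtomTable stores a richer AtomHeader in a raw buffer and uses atomics); the point is that an `Atom` is an index into `offsets`, so validating it is a plain bounds check:

```rust
const HDR: usize = std::mem::size_of::<usize>();

struct AtomTable {
    buffer: Vec<u8>,     // headers followed by their string data
    offsets: Vec<usize>, // atom index -> byte offset of its header
}

#[derive(Clone, Copy, Debug)]
struct Atom(usize); // an index into `offsets`, not a raw byte offset

impl AtomTable {
    fn build_with(&mut self, s: &str) -> Atom {
        let offset = self.buffer.len();
        // Here the "header" is just the string length; the real AtomHeader
        // carries more metadata.
        self.buffer.extend_from_slice(&s.len().to_ne_bytes());
        self.buffer.extend_from_slice(s.as_bytes());
        self.offsets.push(offset);
        Atom(self.offsets.len() - 1)
    }

    fn resolve(&self, atom: Atom) -> Option<&str> {
        // Verifying an atom is just a bounds check on the offsets array.
        let offset = *self.offsets.get(atom.0)?;
        let len =
            usize::from_ne_bytes(self.buffer.get(offset..offset + HDR)?.try_into().ok()?);
        std::str::from_utf8(self.buffer.get(offset + HDR..offset + HDR + len)?).ok()
    }
}

fn main() {
    let mut table = AtomTable { buffer: Vec::new(), offsets: Vec::new() };
    let atom = table.build_with("Hello world!");
    assert_eq!(table.resolve(atom), Some("Hello world!"));
    // An out-of-range index is rejected up front.
    assert_eq!(table.resolve(Atom(999)), None);
}
```

Defragmentation then only needs to rewrite `offsets`; every outstanding `Atom` keeps its index.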

One important detail with my implementation is that the offsets array is stored within an Arcu, which means that the read from the buffer in Atom::as_ptr now depends on atom_table.inner.offsets.read(), which is an acquire atomic operation. When a new atom is added to the table with AtomTable::build_with, the write to the buffer is sequenced before atom_table.inner.offsets.replace(new_offsets): Atom::as_ptr is now guaranteed (by my limited experience with atomics) to observe the changes as expected.

Instances where the lack of this property causes actual issues are exceedingly rare: a thread would need to somehow gain access to an atom offset created on another thread, without synchronization, and immediately try to read its data.
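The ordering argument above is the standard release/acquire publication pattern. Here is a self-contained sketch of it using `AtomicPtr` from the standard library (not the Arcu API, whose internals may differ): the writes that build the new buffer happen-before the release store, so a reader that does an acquire load is guaranteed to see a fully initialized buffer:

```rust
use std::sync::atomic::{AtomicPtr, Ordering};
use std::sync::Arc;
use std::thread;

fn publish_and_read() -> Vec<u8> {
    let slot = Arc::new(AtomicPtr::new(Box::into_raw(Box::new(vec![1u8]))));

    let writer = {
        let slot = Arc::clone(&slot);
        thread::spawn(move || {
            // The writes that build the new buffer...
            let new = Box::into_raw(Box::new(vec![1u8, 2, 3]));
            // ...are sequenced before this release store.
            let old = slot.swap(new, Ordering::Release);
            // Freeing `old` is safe here only because no reader still holds it;
            // this is the reclamation problem that Arcu handles in general.
            drop(unsafe { Box::from_raw(old) });
        })
    };
    writer.join().unwrap();

    // The acquire load synchronizes with the release store, so the reader
    // observes the fully initialized new buffer, never a half-written one.
    let ptr = slot.load(Ordering::Acquire);
    let data = unsafe { (*ptr).clone() };
    // Clean up the final buffer (the AtomicPtr itself does not free it).
    drop(unsafe { Box::from_raw(ptr) });
    data
}

fn main() {
    assert_eq!(publish_and_read(), vec![1, 2, 3]);
}
```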

The Sync-ness of AtomTable is now proved, making it (hopefully) safe :)


Please don't be scared by the number of commits and the number of lines changed, they will reduce once #2727 gets merged.

@triska (Contributor) commented Jan 5, 2025

Thank you a lot, very interesting!

One conceptual question I have about this: What exactly is it that makes, for example, defragmentation possible in this representation but not in the other? It seems that if one is able to do it in one of them, then it should also be possible in the other (possibly by temporarily building such an index, though with the benefit of avoiding the indirection every time an atom is referenced).

@adri326 (Contributor, Author) commented Jan 5, 2025

It's a compromise between performance and safety, mainly. Defragmentation was already possible beforehand, but it required going through the heap to modify any affected Atom, and ensuring that any operation that could lead to garbage collection of the AtomTable doesn't accidentally invalidate a copy of an Atom on the stack.

The current issue with AtomTable is that if someone has the ability to craft their own Atom, then they can trigger UB as follows:

  1. First, allocate a regular atom which contains the bitpattern of an AtomHeader
  2. Then, craft an Atom whose offset points at the start of that crafted AtomHeader
  3. Reading from that Atom can now read uninitialized memory or cause a data race if another thread allocates a new atom in the meantime
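To make the exploit concrete, here is a hypothetical safe simulation of the old raw-offset scheme (names illustrative). Because `resolve` trusts the offset, a crafted `RawAtom` pointing into another atom's payload is indistinguishable from a real one; in this safe sketch it merely exposes bytes belonging to other atoms, whereas in the real `unsafe` code the same confusion is undefined behavior:

```rust
const HDR: usize = std::mem::size_of::<usize>();

struct RawAtomTable {
    buffer: Vec<u8>, // headers followed by their string data
}

#[derive(Clone, Copy)]
struct RawAtom(usize); // a raw byte offset: nothing stops crafting one

impl RawAtomTable {
    fn build_with(&mut self, s: &str) -> RawAtom {
        let offset = self.buffer.len();
        self.buffer.extend_from_slice(&s.len().to_ne_bytes()); // the "header"
        self.buffer.extend_from_slice(s.as_bytes());
        RawAtom(offset)
    }

    // Trusts the offset, like the pre-change code: there is no table of
    // valid offsets to check against.
    fn resolve(&self, atom: RawAtom) -> Option<&[u8]> {
        let len =
            usize::from_ne_bytes(self.buffer.get(atom.0..atom.0 + HDR)?.try_into().ok()?);
        self.buffer.get(atom.0 + HDR..atom.0 + HDR + len)
    }
}

fn main() {
    let mut table = RawAtomTable { buffer: Vec::new() };
    // Step 1: a regular atom whose *payload* is the bitpattern of a header.
    let payload = String::from_utf8(3usize.to_ne_bytes().to_vec()).unwrap();
    let real = table.build_with(&payload);
    table.build_with("abc");
    // Step 2: craft an atom whose offset points at the payload, not a header.
    let forged = RawAtom(real.0 + HDR);
    // Step 3: the table cannot tell it apart from a real atom and "resolves"
    // it, reading bytes that straddle other atoms.
    assert_eq!(table.resolve(real).unwrap(), payload.as_bytes());
    assert!(table.resolve(forged).is_some());
}
```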

I have thought about multiple ways to ensure that that doesn't happen, which fall into two categories:

  • Make step 2 impossible, by validating the offset, so that the rest of the code can rely on invariants of AtomHeader. I don't know of any method that is faster than indexing into a contiguous array of offsets, and since we are indexing into an array containing the offset already, then we might as well lean into the added indirection.
  • Make step 3 impossible, by validating the AtomHeader. This would require adding atomics to prevent data races and zero-initializing allocated memory.

I personally believe that choosing to make step 3 impossible will lead to code that is harder to reason about, which, given the fuzzy nature of the human mind, can increase the number of bugs that could sneak through.

There is, of course, a performance penalty to making step 2 impossible. I've measured my method to increase the benchmark times by between 3% and 5%.

@triska (Contributor) commented Jan 6, 2025

Thank you a lot, this seems very well thought through!

In my opinion, we need unbreakable software, even if it is 10 times slower than other systems. A 5% overhead is completely acceptable.

@adri326 adri326 changed the title Make AtomTable fully safe and prepare for garbage collection rebis-dev: Make AtomTable fully safe and prepare for garbage collection Feb 4, 2025