Buffer obj_free candidates. by eightbitraptor · Pull Request #71 · ruby/mmtk

eightbitraptor · 2026-02-12T15:28:36Z

Previously, every object allocation in rb_gc_impl_new_obj made a per-object FFI call into Rust (mmtk_add_obj_free_candidate), which acquired a mutex on one of the WeakProcessor's candidate vecs, pushed a single element, and released the mutex. That's an FFI crossing + mutex lock/unlock on every single allocation.

Now, each MMTk_ractor_cache has two local buffers (parallel-freeable and non-parallel-freeable, 128 entries each). On allocation, we just store the pointer into the local buffer. When a buffer fills up, we flush the entire batch in one FFI call using mmtk_add_obj_free_candidates, which does a single mutex acquisition and flushes the batch into the work buckets. The objects are still distributed in teh same way, but now we only take a lock once per queue buffer, rather than per-object.

We picked 128 as our buffer size at random. We should probably investigate further what an optimum size for this is

Previously, every object allocation in rb_gc_impl_new_obj made a per-object FFI call into Rust (mmtk_add_obj_free_candidate), which acquired a mutex on one of the WeakProcessor's candidate vecs, pushed a single element, and released the mutex. That's an FFI crossing + mutex lock/unlock on every single allocation. Now, each MMTk_ractor_cache has two local buffers (parallel-freeable and non-parallel-freeable, 128 entries each). On allocation, we just store the pointer into the local buffer. When a buffer fills up, we flush the entire batch in one FFI call using mmtk_add_obj_free_candidates, which does a single mutex acquisition and extend_from_slice for the whole batch. We picked 128 as our buffer size at random. We should probably investigate further what an optimum size for this is

shutdown_call_finalizer reads candidates from the Rust-side WeakProcessor, but the main ractor's C-side buffer may not have been flushed yet (ractor_cache_free runs later). Flush all remaining buffers before reading candidates.

Instead of sending all 128 buffered objects to one bucket, round-robin distribute them across all worker buckets so parallel obj_free work stays balanced.

eightbitraptor force-pushed the mvh-batch-obj-free-candidates branch from e444c58 to 23c4a9a Compare February 12, 2026 15:30

eightbitraptor added 4 commits February 12, 2026 15:40

Fix Cargo format issues

26ec9f7

Flush obj_free buffers before shutdown finalizers

7e01232

shutdown_call_finalizer reads candidates from the Rust-side WeakProcessor, but the main ractor's C-side buffer may not have been flushed yet (ractor_cache_free runs later). Flush all remaining buffers before reading candidates.

Distribute batch candidates across parallel buckets

e1f926c

Instead of sending all 128 buffered objects to one bucket, round-robin distribute them across all worker buckets so parallel obj_free work stays balanced.

Cargo format

7889da7

eightbitraptor requested a review from peterzhu2118 February 13, 2026 00:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Buffer obj_free candidates.#71

Buffer obj_free candidates.#71
eightbitraptor wants to merge 5 commits intomainfrom
mvh-batch-obj-free-candidates

eightbitraptor commented Feb 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

eightbitraptor commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

eightbitraptor commented Feb 12, 2026 •

edited

Loading