Revert "Reserve 2 bits for expressing object layout (#17139)"

2026-05-29T23:41:31+00:00

This reverts commit 63d9f090b5d9461cf0b9446e0039d9c56156b826.

Reserve 2 bits for expressing object layout (#17139)

2026-05-29T22:09:34+00:00

* Reserve 2 bits for expressing object layout

We would like to make instance variable reads in the JIT compiler faster
(as well as simplify the JIT implementation).  Currently, in order to
read an instance variable, we have to:

1. Test for heap object
2. Load object to a 64 bit register
3. Mask the object header
4. Bit test against the masked header
5. JNE
6. Load field

We would like to:

1. Test for heap object
2. Load object shape to a 32 bit register
3. Bit test against the shape
4. JNE
5. Load field

The way we fetch instance variables is not consistent across objects.
In order to realize our goal, we need to encode object layout inside the
shape.  If we encode object layout inside the shape, then the shape
itself will guarantee that the access pattern generated by the JIT
compiler is correct.

We should encode the following load patterns into the shape tag bits.
This way we can share shapes on transitions, but be able to
differentiate the access patterns for the JIT compiler.  In other words,
two objects can have an `@a -> @b -> @c` transition and share the same
shape, but the tag bits can differentiate the access pattern so that the
JIT compiler can be confident that the machine code is correct.

Here are the patterns:

1. Embedded/Extended T_OBJECT Instance Variables

Objects with direct references to instance variables or via malloc
buffer

2. Objects with fields_objects fields

These are Data and TypedData objects.  They have an associated axillary
imemo/fields object that stores the instance variables.  The access
pattern is `object[2] + 2`.  The fields object is the 3rd field, and the
instance variables start at +2 inside the fields object.  The fields
object itself is a Ruby object, so it contains the usual header bits +
class headers.

3. Non Boxable Classes / Modules

This is similar to Objects with fields_objects, but the fields object is
stored at a different offset.  We’re differentiating this from boxable
classes and modules because those are harder to support.

4. Other

"Other" pattern is for objects that are rare, or have
difficult-to-implement access patterns.  This includes:

* Boxable classes and modules
* Structs (for now)
* Objects that use the geniv table

Proposed shape bit layout:

```
  Current shape_id_t is 32 bits:
  31        28 27 26 25 24 23 22        19 18                         0
  +-----------+--+--+--+--+--+------------+----------------------------+
  | unused    |L1|L0|OI|FR|CX| heap index | shape tree offset          |
  +-----------+--+--+--+--+--+------------+----------------------------+
               |  |  |  |  |  |            |
               |  |  |  |  |  |            +-- bits 0-18: SHAPE_ID_OFFSET_MASK
               |  |  |  |  |  +--------------- bits 19-22: SHAPE_ID_HEAP_INDEX_MASK
               |  |  |  |  +------------------ bit 23: SHAPE_ID_FL_COMPLEX
               |  |  |  +--------------------- bit 24: SHAPE_ID_FL_FROZEN
               |  |  +------------------------ bit 25: SHAPE_ID_FL_HAS_OBJECT_ID
               +--+--------------------------- bits 26-27: SHAPE_ID_LAYOUT_MASK
```

The important part about these layout patterns is that they do not
reflect the _type_ of object, only how the object is laid out in memory.
For example, we currently treat structs as "other", but we can refactor
them to have the same layout as "Objects with fields_objects", and when
we do that they should get a different bit in the shape header.

This commit only reserves the two bits, it doesn't use them in the JIT
compiler yet.

Co-Authored-By: John Hawthorn 
Co-Authored-By: Max Bernstein 

* Update gc.c

Co-authored-by: Nobuyoshi Nakada 

* Update shape.h

Co-authored-by: Jean Boussier 

* fix function name

* Update shape.c

Co-authored-by: Jean Boussier 

* fix function name

* Revert "Update shape.c"

This reverts commit 900711defc6c541a93f3393a350819ae88cf87f1.

* add comment

---------

Co-authored-by: John Hawthorn 
Co-authored-by: Max Bernstein 
Co-authored-by: Nobuyoshi Nakada 
Co-authored-by: Jean Boussier

Rename RUBY_FL_USERPRIV0 into RUBY_FL_UNUSED6

2026-05-29T05:25:03+00:00

Last usage was removed in a26f528b3bf1eaecff18520f6ba8083c9c0cbf73
https://github.com/ruby/ruby/pull/15447

Deprecate `struct RData` in favor of `struct RTypedData`

2026-05-28T10:29:31+00:00

For the backward compatibility for some wrapper generator gems, keep
only `RData` definition.

Treat all T_DATA the same in zjit

2026-05-27T23:18:43+00:00

ZJIT: Delete binding for unused rb_reg_new_ary()

2026-05-26T17:04:22+00:00

ZJIT: Call only one function for newhash/toregexp (#17092)

2026-05-23T00:08:49+00:00

Use atomics for kwargs reference count

2026-05-20T16:48:55+00:00

Fixes [Bug #22075]

Use IMEMO to store `cdhash`

2026-05-18T05:58:32+00:00

RHash isn't a good fit for storing `cdhash` as this force to allow
arbitrary hash types into RHash, which doesn't work with AR tables.

It also cause the cdhash to be larger than needed.

ZJIT: Share a single JITFrame across all C method frames (#16988)

2026-05-15T19:44:42+00:00

* ZJIT: Share a single JITFrame across all C method frames

Replace the per-call JITFrame::new_cfunc allocation in gen_push_frame
with a sentinel value (ZJIT_JIT_RETURN_C_FRAME) stored in cfp->jit_return.
CFP_ZJIT_FRAME now returns a pointer to a single static rb_zjit_c_frame
(pc/iseq both NULL) when it sees the sentinel.

The CFP accessor is split into CFP_ZJIT_FRAME_P (predicate) and
CFP_ZJIT_FRAME (typed accessor that dereferences safely) so callers don't
have to know about the sentinel encoding. ISEQ frames are unchanged: they
still hand cfp->jit_return a heap-allocated JITFrame pointer.

* Move a pointer location for CFP_ZJIT_FRAME

ruby.git/zjit/src/cruby_bindings.inc.rs, branch master