Compact code object marshaled form and pre-quicken bytecode when unmarshaling #462

Closed
Tracked by #465
markshannon opened this issue Sep 19, 2022 · 8 comments

Comments

@markshannon
Member

Currently we go through a two stage process to get to the adaptive form of instructions.
The explanation below uses CALL, but it applies to all specializable instructions.

  1. When unmarshaling the code object we create the bytecode CALL, oparg, 0, 0, 0 ... where the zeros are the cache.
  2. When executing, check a counter and quicken when that counter reaches zero.
  3. When quickening, replace all CALLs with CALL_ADAPTIVEs (see the sketch after this list).
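
A rough sketch of that counter-driven second stage, using toy types and opcode numbers rather than CPython's real internals:

```c
#include <stddef.h>
#include <stdint.h>

/* Toy stand-ins; none of these names are real CPython identifiers. */
enum { TOY_CALL = 1, TOY_CALL_ADAPTIVE = 2 };

typedef struct {
    int warmup;          /* counts down toward quickening */
    uint16_t *code;      /* 16-bit code units, opcode in the low byte */
    size_t n_units;
} toy_code;

/* Steps 2 and 3 above: once the counter reaches zero, rewrite every
 * CALL into CALL_ADAPTIVE in place. (The real loop also has to step
 * over each instruction's inline cache entries.) */
static void
toy_maybe_quicken(toy_code *co)
{
    if (co->warmup == 0) {
        return;                      /* already quickened */
    }
    if (--co->warmup != 0) {
        return;                      /* not warm enough yet */
    }
    for (size_t i = 0; i < co->n_units; i++) {
        if ((co->code[i] & 0xFF) == TOY_CALL) {
            co->code[i] = (uint16_t)((co->code[i] & 0xFF00) | TOY_CALL_ADAPTIVE);
        }
    }
}
```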

Instead we should quicken when unmarshaling, so that we create:

(CALL_ADAPTIVE oparg)
7 # uint16_t not byte pair
...

instead of

(CALL oparg)
(0 0)

The marshaled form only needs one byte for instructions without an oparg and two for other instructions. No space is needed for the cache.
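
As a concrete illustration of the unmarshal-time expansion, here is a minimal sketch. It assumes the per-opcode metadata (adaptive counterpart, cache size in code units, whether an oparg is present) is available as tables; every identifier is made up for the example and is not CPython's:

```c
#include <stddef.h>
#include <stdint.h>

/* Toy opcodes and metadata tables (hypothetical, covering only the
 * opcodes used in this example). */
enum { TOY_NOP = 1, TOY_CALL = 2, TOY_CALL_ADAPTIVE = 3 };

static const uint8_t adaptive_form[256] = {
    [TOY_NOP]  = TOY_NOP,
    [TOY_CALL] = TOY_CALL_ADAPTIVE,   /* pre-quicken on expansion */
};
static const uint8_t cache_units[256] = { [TOY_CALL] = 4 };
static const uint8_t has_oparg[256]   = { [TOY_CALL] = 1 };

/* Expand the compact marshaled stream (one byte per opcode, plus one
 * oparg byte where needed, no cache) into 16-bit code units, with the
 * adaptive opcode substituted and the inline cache zero-filled.
 * Returns the number of code units written. */
static size_t
expand_compact(const uint8_t *src, size_t src_len, uint16_t *dst)
{
    size_t n = 0;
    for (size_t i = 0; i < src_len; ) {
        uint8_t opcode = src[i++];
        uint8_t oparg = has_oparg[opcode] ? src[i++] : 0;
        dst[n++] = (uint16_t)(adaptive_form[opcode] | (oparg << 8));
        for (int c = 0; c < cache_units[opcode]; c++) {
            dst[n++] = 0;             /* zeroed cache entry */
        }
    }
    return n;
}
```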

@ericsnowcurrently
Collaborator

Presumably the same idea would apply to deep-frozen code objects?

@gvanrossum
Collaborator

I see loading marshalled code as equivalent to compiling it. So if we want unmarshalling to put the adaptive bytecodes in, we should do the same for the compiler -- IOW we should quicken everything immediately.

Unmarshalling results in exactly the same tree of code objects (module -> classes -> methods etc.) as compiling does.

@markshannon
Member Author

Yes, it is the same, provided that [un]marshaling handles the bytecode as an array of code units which it compresses, not just an array of bytes.

@gvanrossum
Collaborator

Yes, it is the same

Was this in response to Eric's question about deepfreeze, or mine about unmarshal vs. compiler?

@markshannon
Member Author

Yes, it is the same

Was this in response to Eric's question about deepfreeze, or mine about unmarshal vs. compiler?

Yours.
The compiler would emit the adaptive instructions directly (we can drop the non-adaptive forms), then marshal would store them in the compact form, and unmarshal would expand them.

@markshannon
Member Author

But we need to change marshaling first, so that it understands that code is made up of 16-bit code units, not just bytes.
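
Concretely, the writing side would then walk the code as 16-bit units rather than bytes, drop the inline cache, and emit the compact one-or-two-byte form per instruction. A sketch mirroring the expansion example above, again with made-up names:

```c
#include <stddef.h>
#include <stdint.h>

/* Toy opcodes and metadata tables (hypothetical). */
enum { TOY_NOP = 1, TOY_CALL = 2, TOY_CALL_ADAPTIVE = 3 };

static const uint8_t base_form[256] = {
    [TOY_NOP]           = TOY_NOP,
    [TOY_CALL]          = TOY_CALL,
    [TOY_CALL_ADAPTIVE] = TOY_CALL,   /* store the canonical form */
};
static const uint8_t cache_units[256] = {
    [TOY_CALL] = 4, [TOY_CALL_ADAPTIVE] = 4,
};
static const uint8_t has_oparg[256] = {
    [TOY_CALL] = 1, [TOY_CALL_ADAPTIVE] = 1,
};

/* Compress 16-bit code units into the compact byte form: canonical
 * opcode, optional oparg byte, nothing for the cache.
 * Returns the number of bytes written. */
static size_t
compress_code(const uint16_t *code, size_t n_units, uint8_t *out)
{
    size_t n = 0;
    for (size_t i = 0; i < n_units; ) {
        uint8_t opcode = (uint8_t)(code[i] & 0xFF);
        uint8_t oparg  = (uint8_t)(code[i] >> 8);
        out[n++] = base_form[opcode];
        if (has_oparg[opcode]) {
            out[n++] = oparg;
        }
        i += 1 + cache_units[opcode];  /* skip the inline cache */
    }
    return n;
}
```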

@gvanrossum
Collaborator

gvanrossum commented Sep 20, 2022

But we need to change marshaling first, so that it understands that code is made up of 16-bit code units, not just bytes.

To marshal a code object, we get the bytecode as a bytes object by calling _PyCode_GetCode(co). Any compression that requires knowledge of the code format can be placed in that function, as long as we also update the corresponding unmarshalling code, which calls _PyCode_Validate() and _PyCode_New().

(Likely we would design a new, slightly different API, but my point is that we can implement and test the bytecode compression first, and then using it from marshal.c would be straightforward.)
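
A very rough shape of how that could be hooked in on the writing side; _PyCode_GetCode() is the internal entry point mentioned above, while the compression helper and this wrapper are purely hypothetical:

```c
/* Sketch only: assumes a core build, as in marshal.c. */
#include "Python.h"
#include "pycore_code.h"   /* _PyCode_GetCode() */

/* Hypothetical helper implementing the compact encoding sketched
 * earlier (one or two bytes per instruction, no cache). */
extern PyObject *compact_from_code_units(PyObject *bytecode);

/* What marshal's writer might call when emitting the bytecode field:
 * fetch the code as a bytes object, then compress it. */
static PyObject *
write_form_of_bytecode(PyCodeObject *co)
{
    PyObject *bytecode = _PyCode_GetCode(co);   /* new reference */
    if (bytecode == NULL) {
        return NULL;
    }
    PyObject *compact = compact_from_code_units(bytecode);
    Py_DECREF(bytecode);
    return compact;
}
```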

@markshannon
Member Author

Obsolete. See #566

github-project-automation bot moved this from In Progress to Done in Fancy CPython Board on Aug 3, 2023