Specialized eval loops for categories of functions #17
Here's an idea I had during discussions with Guido about his "add-opcodes" (super-instructions) branch (#16). The key observation is that many functions (i.e. code objects) could be grouped by different sets of common characteristics. Then, for each of those groups, we could derive an eval loop implementation that is optimized for that group of code objects.
(FYI, this relates to @markshannon's idea about generated code for the eval loop.)
The approach would look something like the following.
During core development:
- identify categories of code objects that share common characteristics
- derive an eval loop implementation optimized for each category
At runtime:
- in _PyEval_EvalFrameDefault() (or maybe in _PyEval_EvalFrame()) we pick the eval loop that corresponds to the code object's flag
There are other factors to consider (e.g. the cache-level impact of switching between multiple eval loop implementations), but let's start with the high-level idea. A rough sketch of the runtime dispatch follows.
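To make the dispatch concrete, here is a minimal sketch in C. Everything in it is a hypothetical stand-in (CodeObject, co_category, eval_frame, the particular categories), not real CPython API; each stub loop represents a generated variant of _PyEval_EvalFrameDefault() specialized for its category.

```c
/* Minimal sketch of per-category eval loop dispatch.  All names are
 * hypothetical stand-ins, not real CPython API. */
#include <stdio.h>

typedef struct {
    unsigned int co_category;   /* set when the code object is created */
    /* ... bytecode, constants, etc. ... */
} CodeObject;

typedef struct {
    CodeObject *f_code;
    /* ... locals, value stack, etc. ... */
} Frame;

typedef int (*EvalLoop)(Frame *);

enum {
    CAT_DEFAULT = 0,    /* generic loop: handles every opcode */
    CAT_NO_EXCEPT,      /* e.g. no exception-handling opcodes compiled in */
    NUM_CATEGORIES
};

/* Stub loops; in the real design each would be a generated variant of
 * _PyEval_EvalFrameDefault() stripped down to its category's needs. */
static int eval_default(Frame *f)   { (void)f; puts("default loop");   return 0; }
static int eval_no_except(Frame *f) { (void)f; puts("no-except loop"); return 0; }

static const EvalLoop eval_loops[NUM_CATEGORIES] = {
    [CAT_DEFAULT]   = eval_default,
    [CAT_NO_EXCEPT] = eval_no_except,
};

/* Analogue of the _PyEval_EvalFrame() step described above: one flag
 * read, one table lookup, one indirect call. */
static int eval_frame(Frame *frame)
{
    unsigned int cat = frame->f_code->co_category;
    if (cat >= NUM_CATEGORIES)   /* fall back if the flag is unknown */
        cat = CAT_DEFAULT;
    return eval_loops[cat](frame);
}

int main(void)
{
    CodeObject code = { CAT_NO_EXCEPT };
    Frame frame = { &code };
    return eval_frame(&frame);
}
```

The per-call cost of this dispatch is one field read and an indirect call; whether the duplicated loop bodies pay for themselves given icache pressure is exactly the concern raised in the comments below.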
Comments
This seems rather vague. What exactly are the "targeted optimizations"? I really don't think we should be doing "blue sky" thinking. There is plenty of existing research to build on.
I don't know, and I'm not pushing to pursue this, but one thought would be that some VMs I've read about have a "profiling phase" during which they count things that might be relevant to the optimization. Instead of an "if profiling:" flag check we could have a separate profiling eval loop.
Type profiling is usually continuous at lower tiers.
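As an illustration of the profiling-loop idea above, here is a minimal sketch using a toy instruction set and entirely hypothetical names (this is not CPython's eval loop). The same loop body is stamped out twice via a macro, once with per-opcode counters and once without, so the non-profiling variant never tests a flag at runtime.

```c
/* Toy sketch: two eval loops generated from one body.  Every name here is
 * hypothetical; the point is only the "separate profiling loop" mechanism. */
#include <stdio.h>

enum { OP_LOAD, OP_ADD, OP_RETURN, NUM_OPCODES };

static long opcode_counts[NUM_OPCODES];  /* filled only by the profiling loop */

#define DEFINE_EVAL_LOOP(name, PROFILE)                          \
    static int name(const unsigned char *code)                   \
    {                                                            \
        int acc = 0;                                             \
        for (const unsigned char *pc = code;; pc++) {            \
            if (PROFILE) /* folds away when PROFILE is 0 */      \
                opcode_counts[*pc]++;                            \
            switch (*pc) {                                       \
            case OP_LOAD:   acc = *++pc;  break;                 \
            case OP_ADD:    acc += *++pc; break;                 \
            case OP_RETURN: return acc;                          \
            default:        return -1;                           \
            }                                                    \
        }                                                        \
    }

DEFINE_EVAL_LOOP(eval_loop_default, 0)    /* hot path: no flag check at all */
DEFINE_EVAL_LOOP(eval_loop_profiling, 1)  /* counts every opcode executed */

int main(void)
{
    const unsigned char code[] = { OP_LOAD, 2, OP_ADD, 3, OP_RETURN };
    printf("default:   %d\n", eval_loop_default(code));
    printf("profiling: %d\n", eval_loop_profiling(code));
    printf("OP_ADD executed %ld time(s)\n", opcode_counts[OP_ADD]);
    return 0;
}
```

In a real tiered setup the counters would be type or call-site profiles rather than a toy opcode histogram, but the mechanism of selecting the loop once, up front, is the same.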
FWIW, the specific idea here would be more appropriate to revisit later, if at all, when/if it becomes more practical to develop multiple eval loop implementations (e.g. using generated code). Extra discussion on this isn't worth the time right now.
For the most part I agree. My intent here was to capture some thoughts that came to mind as Guido and I discussed possible improvements to explore. Part of the challenge, at least for me, is an effective lack of familiarity with "existing research" to use as a guide. I'm definitely in favor of both relying on the efforts of those many smart people and becoming more familiar with that research. At the same time, I still plan on sharing the ideas I have. That isn't so frequent that it's a distraction, and at the least the ensuing discussion helps me learn more about this space.
Multiple interpreter loops are going to be very unfriendly to the icache, and almost certainly slower.
How certain are you that a typical function execution doesn't completely void the icache whenever an excursion in the runtime (maybe as simple as PyObject_GetAttr) is made? IOW, do we care about the icache at the scale of function executions?
I'm not certain about anything regarding what CPUs do with their caches.
I'm closing this as there is nothing actually to be done here. |