Speed up the Tier 2 interpreter #112287

gvanrossum · 2023-11-20T18:16:17Z

The Tier 2 interpreter hasn't really been optimized carefully. While the "optimizer" pass is intended to make the Tier 2 micro-code faster through things like guard elimination or constantification, we should also look into just making the Tier 2 interpreter itself faster -- possibly by changing the representation of executable traces held in the executor (the current format is identical to the IR, which is rather verbose, using 16 bytes per uop!), and possibly by just carefully tuning the interpreter. (For example, if the space of micro-opcode ordinals could overlap the space of Tier 1 bytecode ordinals, we could fit the Tier 2 opcode in one byte.)

Linked PRs

gh-112287: Speed up Tier 2 (uop) interpreter a little #112286

This makes the Tier 2 interpreter a little faster. I calculated by about 3%, though I hesitate to claim an exact number. This starts by doubling the trace size limit (to 512), making it more likely that loops fit in a trace. The rest of the approach is to only load `oparg` and `operand` in cases that use them. The code generator know when these are used. For `oparg`, it will conditionally emit ``` oparg = CURRENT_OPARG(); ``` at the top of the case block. (The `oparg` variable may be referenced multiple times by the instructions code block, so it must be in a variable.) For `operand`, it will use `CURRENT_OPERAND()` directly instead of referencing the `operand` variable, which no longer exists. (There is only one place where this will be used.)

…12286) This makes the Tier 2 interpreter a little faster. I calculated by about 3%, though I hesitate to claim an exact number. This starts by doubling the trace size limit (to 512), making it more likely that loops fit in a trace. The rest of the approach is to only load `oparg` and `operand` in cases that use them. The code generator know when these are used. For `oparg`, it will conditionally emit ``` oparg = CURRENT_OPARG(); ``` at the top of the case block. (The `oparg` variable may be referenced multiple times by the instructions code block, so it must be in a variable.) For `operand`, it will use `CURRENT_OPERAND()` directly instead of referencing the `operand` variable, which no longer exists. (There is only one place where this will be used.)

hugovk · 2024-03-15T16:54:21Z

Closing because the PR has been merged. Please re-open if there's more needed here.

gvanrossum · 2024-03-15T19:06:16Z

Thanks for the ping! Arguably the issue was wider, but we've decided to focus on JIT performance, and the Tier 2 interpreter's speed is no longer of great concern (we keep it because it's easier to debug the rest of the Tier 2 machinery this way). So let's keep it closed but mark as "not planned", which is closer to the truth.

…12286) This makes the Tier 2 interpreter a little faster. I calculated by about 3%, though I hesitate to claim an exact number. This starts by doubling the trace size limit (to 512), making it more likely that loops fit in a trace. The rest of the approach is to only load `oparg` and `operand` in cases that use them. The code generator know when these are used. For `oparg`, it will conditionally emit ``` oparg = CURRENT_OPARG(); ``` at the top of the case block. (The `oparg` variable may be referenced multiple times by the instructions code block, so it must be in a variable.) For `operand`, it will use `CURRENT_OPERAND()` directly instead of referencing the `operand` variable, which no longer exists. (There is only one place where this will be used.)

bedevere-app bot mentioned this issue Nov 20, 2023

gh-112287: Speed up Tier 2 (uop) interpreter a little #112286

Merged

iritkatriel added the interpreter-core (Objects, Python, Grammar, and Parser dirs) label Nov 27, 2023

hugovk closed this as completed Mar 15, 2024

gvanrossum closed this as not planned Won't fix, can't repro, duplicate, stale Mar 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up the Tier 2 interpreter #112287

Speed up the Tier 2 interpreter #112287

gvanrossum commented Nov 20, 2023 •

edited by bedevere-app bot

Loading

hugovk commented Mar 15, 2024

gvanrossum commented Mar 15, 2024

Speed up the Tier 2 interpreter #112287

Speed up the Tier 2 interpreter #112287

Comments

gvanrossum commented Nov 20, 2023 • edited by bedevere-app bot Loading

Linked PRs

hugovk commented Mar 15, 2024

gvanrossum commented Mar 15, 2024

gvanrossum commented Nov 20, 2023 •

edited by bedevere-app bot

Loading