Proposal for a different strategy to merging the new back-end #467

zerbina · 2022-11-15T15:05:50Z

zerbina
Nov 15, 2022
Collaborator

Introduction

First, a small summary of the referenced bits from the two relevant PRs:

The injectdestructors PR (#450) adds an IR for program code (i.e. the content of a procedure's body), currently named Mid-end Intermediate Representation (abbreviated as MIR). It's derived from PNode AST as it looks like past transf and is bi-directonal, in the sense that it supports the translation from and to post-transf PNode AST.

The procedure-local control-flow primitives are: case, while true, block, single-branch if, and try/except/finally -- expressions are modeled as operations on values, of which the latter are either named by entities such as locals, globals, etc. or produced by other operations.

Data-structure-wise, the MIR is a flat seq of MirNodes, the latter being a variant object. The MIR might currently have a small bias towards the needs of the move-analyser, as both are being developed together.

The back-end PR (#424) adds:

an IR for code
a dedicated representation for types, procedures, literal data, and symbols
a transformation pipeline for code, types, and literal data, plus transformation passes making use of said pipeline
other parts that are not relevant for this proposal

Generally speaking, the IRs used in the new-back-end are referred to as "back-end IR", although for the rest of this write-up, I'll refer to each of the separate IRs as their own entity, and am also going to abbreviate back-end IR as BIR.

The code BIR is based around instructions, with some of them being declarative annotations. Procedure-local control-flow is encoded via goto, branch, join, and goto-link/resume, the latter two being used to "call" into a span of code inside the same procedure (goto-link) and later "return" to where the earlier call happened (resume). They're used to implement finally and allow for an arbitrary amount of nesting. Each non-control-flow, non-annotation instruction yields a value which is referenced/named via the instruction's position.

The current plan

After finishing and merging the injectdestuctors rewrite, make irgen operate on MirTree instead of PNode AST as it does now. Then make all tests succeed, document everything, put the PR through review, and after refinement, merge it as a whole. Until the back-end PR is merged, the MIR is only used for code that requires the injectdestructors pass to run.

A proposal for a different plan

The summary: instead of developing the new back-end in isolation and merging everything at once, incrementally merge pieces of it over time.

A core part of the new back-end are the transformation passes. Most of them work by iterating over the input instructions searching for a specific magic and then expanding it into a new instruction sequence. This works okay, but I now consider the BIR to be a bit too low-level for many of the transformations -- a slightly higher level IR would work better for them. The MIR fits this description, and it also allows for the same search-and-expand approach.

The first step would be to get a working version of the MIR merged. There are still some parts of it that need adjustments in order for the move-analyser/injectdestructors to be able to efficiently make use of it, but once those things are figured out, everything MIR-related could be moved from the injecdestructors PR to a separate PR.

Instead of only activating the MIR tranlsation for procedures that require the injectdestructors pass, all alive code (for all code-generators, i.e. cgen, jsgen, and vmgen) would be translated to MIR first and then, after all passes are applied, back to PNode AST. The resulting flow would look like:
... -> sem -> transf -> PNode-to-MIR -> transform/lowering passes -> MIR-to-PNode -> cgen/jsgen/vmgen.

Note: The to-and-from MIR tranlsation introduces a small amount of time¹ and memory overhead. Extra memory is required for both the MirNodes and the new PNodes generated by the MIR -> PNode step. It could also be the case that cgen produces less efficient code, as some of the information in the AST that cgen uses to optimize is lost.

Using MIR as an intermediate step is important for the next steps to work. It also acts as a good test for the translation layers and makes sure that changes to PNode that would break them won't go unnoticed.

The next step is to transplant/port each BIR pass, for which it makes sense, to operate on the MIR. This would roughly work as follows:

port the respective procedure from irpasses.nim
if an extra type lowering pass is employed, port it too
remove the logic from cgen that previously implemented the respective transformation
make a PR and merge it

The above steps are repated until all relevant BIR passes are transplanted. Once this is done, the only logic remaining in cgen is that of the actual code-generator, the RTTI generation (the new back-end has its own implementation of it), and all the transformations that the new back-end doesn't yet implement. For the latter, we can then decide on what to do with them.

irgen.nim also contains some transformations/lowerings that should be moved into MIR passes. There also two important passes not yet implemented as BIR passes: the seqsv2 type and code lowering passes. Both are similar to their seqsv1 counterpart and simple to implement -- they would be directly implemented to operate on the MIR.

Note: In the meantime, others could start with porting lambdalifting and closureiters to operate on the MIR. The MIR will require some adjustments to support this, however. I can either try to implement the features I believe are required into the MIR from the get-go, or assist where needed.

One thing to note is that all BIR passes operate on irtypes.Type and not PType. irtypes.Type is designed to be efficient for the type-related tasks, such as the type lowering and querying, previously performed by the new back-end. However, translating from irtypes.Type back to PType is not possible because relevant information is lost, and due to the canonicalization (which the BIR passes depend on), there also doesn't exist a one-to-one mapping between them anymore.

To still make the transplantation work, the BIR passes would be (temporarily) adjusted to use PType. This works, as PType has all information also available with irtypes.Type - it's just not as time and memory efficient.

Note: there is a different approach possible in regards to the type aspect. While not yet implemented, the plan is for the MIR to be environment agnostic. That is, entities such as types are named by transparent IDs that users of the MIR are free to reinterpret as they see fit. This would allow the use of a special tranisition type representation that is essentially a tuple storing both the PType and the ID of the corresponding irtypes.Type, making the required adjustments far less invasive as the passes can still operate on irtypes.Types.

After all passes are transplanted, the back-end PR would be cleaned up and the remaing missing bits implemented. The new back-end is then in a state where it should support the same (remaining) features as the current cgen, and when merging the PR, should thus be able to directly replace it. As part of merging it, the previously-BIR-now-MIR passes which were adjusted to use PType are changed back to work with irtypes.Type.

Note: changing the passes back to use irtypes.Type is possible because jsgen and vmgen wouldn't have the transplanted passes enabled (yet).

Post-merge

The flow through the compiler would look like: ... sem -> transf -> PNode-to-MIR -> lowering/transformation passes -> cgen.

With the new back-end (which will only have a C code-generator making use of it at first) merged, the other code-generators can be rewritten to make use of the BIR and its facilities. This will unblock further progress on the VM, as vmgen is one of the main blockers in that area.

Moving vmgen to use the back-end IR will also not only significantly reduce its code in terms of complexity and size, we will also get .closure iterator and, to some degree, method support essentially "for free".

a test with the current unfinished version showed that the 2nd and 3rd compiler bootstrapping iteration take ~2 seconds longer each (when booting with --gc:refc --d:danger --exceptions:goto) ↩

zerbina · 2022-11-15T15:07:47Z

zerbina
Nov 15, 2022
Collaborator Author

Thanks @saem for providing feedback on the draft of this proposal!

0 replies

saem · 2022-11-17T07:32:22Z

saem
Nov 17, 2022
Maintainer

A+, would review and agree with again.

0 replies

zerbina · 2023-07-04T22:02:05Z

zerbina
Jul 4, 2023
Collaborator Author

So, many things the proposal talks about were implemented, and there were also some changes of plan.

The original proposal/plan had a strong focus on the C backend, neglecting the other backends. There was a high risk of the implementation becoming over-fitted to the needs of the C target and code generator, and thus the focus shifted towards unifying first rather than unifying at the end.

Here, the introduction of code generation orchestrators (#712), lowering modules into procedures (#714), and moving discovery of alive procedures out of the code generators (#777) were big steps towards achieving a unified architecture.

Another important change that happened in the meantime was the removal of the legacy GCs -- a change that made large parts of the PR the proposal is based around (#424) obsolete. In general, the focus is now more on porting over the (good) ideas and concepts from #424, rather than on porting over the implementation.

To summarize: many parts of #424 are outdated or obsolete, and unifying the code generator pre-processing is in full progress. One of the next steps, at the time of writing, is decoupling the evolution of the code generators and the MIR from PNode AST, by introducing a dedicated intermediate representation (=IR) for the code generators (#551).

At first, said IR is going to be very similar to PNode, but the current goal is to incrementally evolve it towards the BIR (IrNode3) from #424.

Moving vmgen to use the back-end IR will also not only significantly reduce its code in terms of complexity and size, we will also get .closure iterator and, to some degree, method support essentially "for free".

Using the back-end IR was a hard-requirement for neither closure iterator nor method support, and in the end not necessary. Applying the lambda-lifting pass for all targets (#586) and a simplification of the closureiters pass (#607) were enough to support closure iterators with all backends, and generating method dispatcher prior to code generation (#734) plus some changes to method support in the runtime (#775) were enough to support methods with the VM backend.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal for a different strategy to merging the new back-end #467

{{title}}

Replies: 3 comments

{{title}}

{{title}}

{{title}}

Select a reply

Proposal for a different strategy to merging the new back-end #467

zerbina Nov 15, 2022 Collaborator

Introduction

The current plan

A proposal for a different plan

Post-merge

Footnotes

Replies: 3 comments

zerbina Nov 15, 2022 Collaborator Author

saem Nov 17, 2022 Maintainer

zerbina Jul 4, 2023 Collaborator Author

zerbina
Nov 15, 2022
Collaborator

zerbina
Nov 15, 2022
Collaborator Author

saem
Nov 17, 2022
Maintainer

zerbina
Jul 4, 2023
Collaborator Author