Profile adoption tool #1110

troels-im · 2021-08-04T13:17:04Z

Profile adoption tool

Note PR is WiP.

Goal

The overall goal of this PR is to create a first version of a tool that allows transformation of a general QIR to a base profile.

Quick start

Once the project is built (see next sections), you can generate a new QIR as follows:

./Source/Apps/qat --generate --profile baseProfile -S ../examples/QubitAllocationAnalysis/analysis-example.ll

Likewise, you can validate that a QIR follows a specification by running:

./Source/Apps/qat --validate --profile baseProfile -S ../examples/QubitAllocationAnalysis/analysis-example.ll

Example

In this example, we start with a QIR generated by the Q# frontend. Rather than giving the full 3445 lines of QIR, we instead give the frontend code:

namespace TeleportChain {
    open Microsoft.Quantum.Intrinsic;
    open Microsoft.Quantum.Canon;
    open Microsoft.Quantum.Arrays;
    open Microsoft.Quantum.Measurement;
    open Microsoft.Quantum.Preparation;

    operation PrepareEntangledPair(left : Qubit, right : Qubit) : Unit is Adj + Ctl {
        H(left);
        CNOT(left, right);
    }

    operation ApplyCorrection(src : Qubit, intermediary : Qubit, dest : Qubit) : Unit {
        if (MResetZ(src) == One) { Z(dest); }
        if (MResetZ(intermediary) == One) { X(dest); }
    }

    operation TeleportQubitUsingPresharedEntanglement(src : Qubit, intermediary : Qubit, dest : Qubit) : Unit {
        Adjoint PrepareEntangledPair(src, intermediary);
        ApplyCorrection(src, intermediary, dest);
    }

    operation TeleportQubit(src : Qubit, dest : Qubit) : Unit {
        use intermediary = Qubit();
        PrepareEntangledPair(intermediary, dest);
        TeleportQubitUsingPresharedEntanglement(src, intermediary, dest);
    }

    operation DemonstrateEntanglementSwapping() : (Result, Result) {
        use (reference, src, intermediary, dest) = (Qubit(), Qubit(), Qubit(), Qubit());
        PrepareEntangledPair(reference, src);
        TeleportQubit(src, dest);
        return (MResetZ(reference), MResetZ(dest));
    }

    @EntryPoint()
    operation DemonstrateTeleportationUsingPresharedEntanglement() : Unit {
        let nPairs = 2;
        use (leftMessage, rightMessage, leftPreshared, rightPreshared) = (Qubit(), Qubit(), Qubit[nPairs], Qubit[nPairs]);
        PrepareEntangledPair(leftMessage, rightMessage);
        for i in 0..nPairs-1 {
            PrepareEntangledPair(leftPreshared[i], rightPreshared[i]);
        }

        TeleportQubitUsingPresharedEntanglement(rightMessage, leftPreshared[0], rightPreshared[0]);
        for i in 1..nPairs-1 {
            TeleportQubitUsingPresharedEntanglement(rightPreshared[i-1], leftPreshared[i], rightPreshared[i]);
        }

        let _ = MResetZ(leftMessage);
        let _ =  MResetZ(rightPreshared[nPairs-1]);
    }
}

Once compiled and the initial QIR is generated and save in the file analysis-example.ll, we execute the command

./Source/Apps/qat --generate --profile baseProfile ./analysis-example.ll

The QAT tool will now attempt to map the QIR in analysis-example.ll into a QIR which is compatible with the base format. Removing type and function declarations, the correspoding code reads:

; ModuleID = './analysis-example.ll'
source_filename = "./analysis-example.ll"

define internal fastcc void @TeleportChain__DemonstrateTeleportationUsingPresharedEntanglement__body() unnamed_addr {
entry:
  call void @__quantum__qis__h(%Qubit* null)
  call void @__quantum__qis__cnot(%Qubit* null, %Qubit* nonnull inttoptr (i64 1 to %Qubit*))
  call void @__quantum__qis__h(%Qubit* null)
  call void @__quantum__qis__cnot(%Qubit* null, %Qubit* nonnull inttoptr (i64 2 to %Qubit*))
  call void @__quantum__qis__h(%Qubit* nonnull inttoptr (i64 1 to %Qubit*))
  call void @__quantum__qis__cnot(%Qubit* nonnull inttoptr (i64 1 to %Qubit*), %Qubit* nonnull inttoptr (i64 3 to %Qubit*))
  call void @__quantum__qis__cnot(%Qubit* nonnull inttoptr (i64 1 to %Qubit*), %Qubit* null)
  call void @__quantum__qis__h(%Qubit* nonnull inttoptr (i64 1 to %Qubit*))
  %0 = call i1 @__quantum__qir__read_result(%Result* nonnull inttoptr (i64 4 to %Result*))
  call void @__quantum__qis__mz__body(%Qubit* nonnull inttoptr (i64 1 to %Qubit*), %Result* nonnull inttoptr (i64 4 to %Result*))
  call void @__quantum__qis__reset__body(%Qubit* nonnull inttoptr (i64 1 to %Qubit*))
  br i1 %0, label %then0__1.i.i, label %continue__1.i.i

then0__1.i.i:                                     ; preds = %entry
  call void @__quantum__qis__z(%Qubit* nonnull inttoptr (i64 2 to %Qubit*))
  br label %continue__1.i.i

continue__1.i.i:                                  ; preds = %then0__1.i.i, %entry
  %1 = call i1 @__quantum__qir__read_result(%Result* nonnull inttoptr (i64 5 to %Result*))
  call void @__quantum__qis__mz__body(%Qubit* null, %Result* nonnull inttoptr (i64 5 to %Result*))
  call void @__quantum__qis__reset__body(%Qubit* null)
  br i1 %1, label %then0__2.i.i, label %TeleportChain__TeleportQubitUsingPresharedEntanglement__body.1.exit

then0__2.i.i:                                     ; preds = %continue__1.i.i
  call void @__quantum__qis__x(%Qubit* nonnull inttoptr (i64 2 to %Qubit*))
  br label %TeleportChain__TeleportQubitUsingPresharedEntanglement__body.1.exit

TeleportChain__TeleportQubitUsingPresharedEntanglement__body.1.exit: ; preds = %continue__1.i.i, %then0__2.i.i
  call void @__quantum__qis__cnot(%Qubit* nonnull inttoptr (i64 2 to %Qubit*), %Qubit* nonnull inttoptr (i64 1 to %Qubit*))
  call void @__quantum__qis__h(%Qubit* nonnull inttoptr (i64 2 to %Qubit*))
  %2 = call i1 @__quantum__qir__read_result(%Result* nonnull inttoptr (i64 6 to %Result*))
  call void @__quantum__qis__mz__body(%Qubit* nonnull inttoptr (i64 2 to %Qubit*), %Result* nonnull inttoptr (i64 6 to %Result*))
  call void @__quantum__qis__reset__body(%Qubit* nonnull inttoptr (i64 2 to %Qubit*))
  br i1 %2, label %then0__1.i.i2, label %continue__1.i.i3

then0__1.i.i2:                                    ; preds = %TeleportChain__TeleportQubitUsingPresharedEntanglement__body.1.exit
  call void @__quantum__qis__z(%Qubit* nonnull inttoptr (i64 3 to %Qubit*))
  br label %continue__1.i.i3

continue__1.i.i3:                                 ; preds = %then0__1.i.i2, %TeleportChain__TeleportQubitUsingPresharedEntanglement__body.1.exit
  %3 = call i1 @__quantum__qir__read_result(%Result* nonnull inttoptr (i64 7 to %Result*))
  call void @__quantum__qis__mz__body(%Qubit* nonnull inttoptr (i64 1 to %Qubit*), %Result* nonnull inttoptr (i64 7 to %Result*))
  call void @__quantum__qis__reset__body(%Qubit* nonnull inttoptr (i64 1 to %Qubit*))
  br i1 %3, label %then0__2.i.i4, label %TeleportChain__TeleportQubitUsingPresharedEntanglement__body.2.exit

then0__2.i.i4:                                    ; preds = %continue__1.i.i3
  call void @__quantum__qis__x(%Qubit* nonnull inttoptr (i64 3 to %Qubit*))
  br label %TeleportChain__TeleportQubitUsingPresharedEntanglement__body.2.exit

TeleportChain__TeleportQubitUsingPresharedEntanglement__body.2.exit: ; preds = %continue__1.i.i3, %then0__2.i.i4
  call void @__quantum__qis__mz__body(%Qubit* null, %Result* null)
  call void @__quantum__qis__reset__body(%Qubit* null)
  call void @__quantum__qis__mz__body(%Qubit* nonnull inttoptr (i64 3 to %Qubit*), %Result* nonnull inttoptr (i64 1 to %Result*))
  call void @__quantum__qis__reset__body(%Qubit* nonnull inttoptr (i64 3 to %Qubit*))
  ret void
}

We note the absence of loops, and that quantum registers are "allocated" at compile time meaning that each qubit instance is assigned a unique ID. As some code may be dead and optimised away, the qubit allocation is not garantueed to be sequential at this point in time. Future work will include writing a qubit ID remapper which will allow qubits.

We also note that the function TeleportChain__TeleportQubitUsingPresharedEntanglement__body was cloned twice. This is due to the allocation of qubits and the function being called twice. At present, the analyser does not take qubit release into account and just assumes that it will never be released due to the complicated nature for dealing with nested functions at compile time.

Current TODOs include getting LLVM to remove dead code, do better constant folding and function inlining. Once this is performed correctly, next steps is the remapper and finally a better analysis on what call paths potentially create problems in terms of qubit allocation.

Dependencies

This library is written in C++ and depends on:

LLVM

Additional development dependencies include:

CMake
clang-format
clang-tidy

Building the passes

To build the passes, create a new build directory and switch to that directory:

mkdir Debug
cd Debug/

To build the library, first configure CMake from the build directory

cmake ..

and then make your target

make [target]

The default target is all. Other valid targets are the name of the folders in libs/ found in the passes root.

Profile adoption tool

Building QAT

First

cd Debug
make qat

then

./Source/Apps/qat

Implementing a profile pass

As an example of how one can implement a new profile pass, we here show the implementational details of our example pass which allows mapping the teleportation code to the base profile:

        pb.registerPipelineParsingCallback([](StringRef name, FunctionPassManager &fpm,
                                              ArrayRef<PassBuilder::PipelineElement> /*unused*/) {
          // Base profile
          if (name == "restrict-qir<base-profile>")
          {
            RuleSet rule_set;

            // Defining the mapping
            auto factory = RuleFactory(rule_set);

            factory.useStaticQuantumArrayAllocation();
            factory.useStaticQuantumAllocation();
            factory.useStaticResultAllocation();

            factory.optimiseBranchQuatumOne();
            //  factory.optimiseBranchQuatumZero();

            factory.disableReferenceCounting();
            factory.disableAliasCounting();
            factory.disableStringSupport();

            fpm.addPass(TransformationRulePass(std::move(rule_set)));
            return true;
          }

          return false;
        });
      }};

Transformations of the IR will happen on the basis of what rules are added to the rule set. The purpose of the factory is to make easy to add rules that serve a single purpose as well as making a basis for making rules unit testable.

Implementing new rules

Implementing new rules consists of two steps: Defining a pattern that one wish to replace and implementing the corresponding replacement logic. Inside a factory member function, this look as follows:

  auto get_element =
      Call("__quantum__rt__array_get_element_ptr_1d", "arrayName"_cap = _, "index"_cap = _);
  auto cast_pattern = BitCast("getElement"_cap = get_element);
  auto load_pattern = Load("cast"_cap = cast_pattern);

  addRule({std::move(load_pattern), access_replacer});

where addRule adds the rule to the current rule set.

Capturing patterns

The pattern defined in this snippet matches IR like:

  %0 = call i8* @__quantum__rt__array_get_element_ptr_1d(%Array* %leftPreshared, i64 0)
  %1 = bitcast i8* %0 to %Qubit**
  %2 = load %Qubit*, %Qubit** %1, align 8

In the above rule, the first and a second argument of __quantum__rt__array_get_element_ptr_1d is captured as arrayName and index, respectively. Likewise, the bitcast instruction is captured as cast. Each of these captures will be available inside the replacement function access_replacer.

Implementing replacement logic

After a positive match is found, the lead instruction alongside a IRBuilder, a capture table and a replacement table is passed to the replacement function. Here is an example on how one can access the captured variables to perform a transformation of the IR:

  auto access_replacer = [qubit_alloc_manager](Builder &builder, Value *val, Captures &cap,
                                               Replacements &replacements) {
    // ...
    auto cst = llvm::dyn_cast<llvm::ConstantInt>(cap["index"]);
    // ...
    auto llvm_size = cst->getValue();
    auto offset    = qubit_alloc_manager->getOffset(cap["arrayName"]->getName().str());

    auto idx = llvm::APInt(llvm_size.getBitWidth(), llvm_size.getZExtValue() + offset);
    auto new_index = llvm::ConstantInt::get(builder.getContext(), idx);
    auto instr = new llvm::IntToPtrInst(new_index, ptr_type);
    instr->takeName(val);

    // Replacing the lead instruction with a the new instruction
    replacements.push_back({llvm::dyn_cast<Instruction>(val), instr});

    // Deleting the getelement and cast operations
    replacements.push_back({llvm::dyn_cast<Instruction>(cap["getElement"]), nullptr});
    replacements.push_back({llvm::dyn_cast<Instruction>(cap["cast"]), nullptr});

    return true;
  };

… structure

swernli · 2021-08-15T22:40:11Z

@troelsfr I'm about halfway through the review, and I should be able to finish it before Monday morning your time so you have the chance to respond and we can get this ready to merge by EOD!

swernli

The changes mostly look good, just a few questions before I'm ready to sign off! There are also some suggestions about minor updates (file renames or comments) that you can either address here or in a follow up PR that we work on Monday to get the feature branch ready for sharing.

src/Passes/README.md

src/Passes/Source/Passes/ResourceRemapper/ResourceRemapper.cpp

src/Passes/Source/Rules/Factory.cpp

src/Passes/Source/Rules/Notation/Notation.cpp

src/Passes/Source/Rules/OperandPrototype.cpp

src/Passes/examples/QubitAllocationAnalysis/ConstSizeArray/ConstSizeArray.qs

Co-authored-by: Stefan J. Wernli <[email protected]>

swernli

Thanks for addressing my questions! We should be ready to merge this once the builds pass.

src/Passes/Source/AllocationManager/AllocationManager.cpp

src/Passes/Source/AllocationManager/AllocationManager.hpp

bettinaheim · 2021-08-16T08:43:48Z

src/Passes/Source/Rules/OperandPrototype.hpp

+        /// are matched in order and by size.
+        void addChild(Child const& child);
+
+        /// Flags that this operand should be captured. This function ensures


The comment alone here is not sufficient to understand what the purpose is of this flag; what operands can and should be captured? What happens with captured operands? A see also reference to the piece of code that uses this information might help to clarify as well.

I think what you possible need is some high level description of the OperandPrototype, its relation to creating rules and in that context. what captures are?

src/Passes/Source/Rules/Operands/Instruction.cpp

src/Passes/Source/Rules/ReplacementRule.hpp

src/Passes/examples/ClassicalIrCommandline/README.md

src/Passes/libs/QubitAllocationAnalysis/SPECIFICATION.md

Co-authored-by: bettinaheim <[email protected]>

bettinaheim

I went over the code relatively quick, and I am sure there are places where I didn't look carefully enough. I did a quick read through everything, though.

src/Passes/Source/Passes/QirAllocationAnalysis/QirAllocationAnalysis.hpp

src/Passes/Source/Profiles/BaseProfile.cpp

src/Passes/Source/Rules/Factory.cpp

src/Passes/Source/Passes/ExpandStaticAllocation/ExpandStaticAllocation.cpp

bettinaheim · 2021-08-16T09:20:56Z

src/Passes/README.md

@@ -1,156 +1,157 @@
-# QIR Passes for LLVM


What was the reason for removing the general intro and links to the docs? Also the link in ## Out-of-source Pass might be useful to keep.

As such we can keep it as training material, but it does not belong at the top-level README as this file should contain the "user" documentation such as "What does the tool do", "How to build the tool", "How to run the tool" and "How do I create new profiles". Passes and the usage of passes is an implementation detail that only has relevance for those contributing to the implementation of the library.

I'd keep the info in the same place as any instructions for contributing to the passes.

Co-authored-by: bettinaheim <[email protected]>

troels-im and others added 30 commits July 20, 2021 09:56

Initial proposal for a QsPasses structure

3c76e08

Updating CMake

9eb5c02

Adding CI stuff

6674e24

Making CLI interface for CI tasks

199eed4

Finishing V1 of CI script with updated clang tidy and format

de323ef

Refactoring CI module

cfb4b93

Refactoring

d8949bf

Removing binary IR

cc1a0e6

Updating gitignore

1a98e31

Creating root tool for performing CI tasks

74a4924

Updating documentation

9841c95

Refactoring pass

1a5e95f

Preparing analysis module

f8c7a97

Adding a style proposal

87c08b9

Adding style proposal

f095508

Updating documentation

b0c63d6

Template based pass generator

564f518

Updating template and writing more documentation

c80fd35

Adding introduction on how to create a pass

3078388

Improving code quality

e975b25

Improving code quality

06b09ed

Adding namespaces to passes

fdb465a

Adding comments to the source

dd810af

Merge branch 'main' into feature/llvm-pass-proposal

db6ab19

Small refactor

c40796d

Adding QIR example using opt for optimisation and refactoring library…

a23b6b7

… structure

Adding documentation

327ff7f

Updating linter and formatter

0ee8249

Updating code to meet PR comments

6c3c896

Adding function analysis template

df3e4d2

troels-im added 9 commits August 12, 2021 17:52

Correcting mistake

7c73314

Removing debug output

9f5f5c2

Fixing bug

a7467bf

Documentation and refactoring

d79d0d7

Refactor location of operands

9604ef6

More refactor

370e8a2

Fixing style

bdfc6f5

Fixing CI and style

09e112f

Deprecating LL tests

8a58250

swernli reviewed Aug 15, 2021

View reviewed changes

troels-im and others added 2 commits August 16, 2021 08:12

Update src/Passes/README.md

b83c3d9

Co-authored-by: Stefan J. Wernli <[email protected]>

Update src/Passes/Source/Rules/OperandPrototype.cpp

c3712a9

bettinaheim requested review from alan-geller and swernli August 16, 2021 06:46

troels-im added 5 commits August 16, 2021 09:16

PR revisions

1941b52

Updating README

0b71eb5

Updating with review items

d324ae4

Updating linux build

0f4b921

Fixing yaml file

288abdc

swernli approved these changes Aug 16, 2021

View reviewed changes

Removing LLVM 12

51fb281

bettinaheim reviewed Aug 16, 2021

View reviewed changes

Update src/Passes/Source/Rules/ReplacementRule.hpp

fad944f

Co-authored-by: bettinaheim <[email protected]>

bettinaheim reviewed Aug 16, 2021

View reviewed changes

troels-im and others added 4 commits August 16, 2021 11:26

Update src/Passes/Source/AllocationManager/AllocationManager.hpp

c2cbe9a

Co-authored-by: bettinaheim <[email protected]>

Adding variaous suggestions

b2faa6c

Updating documentation

511baaa

Fixing broken link

ba69f97

troels-im merged commit 700cdec into microsoft:features/llvm-passes Aug 16, 2021

Profile adoption tool #1110

Profile adoption tool #1110

Uh oh!

Conversation

troels-im commented Aug 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Profile adoption tool

Goal

Quick start

Example

Dependencies

Building the passes

Profile adoption tool

Building QAT

Implementing a profile pass

Implementing new rules

Capturing patterns

Implementing replacement logic

Uh oh!

swernli commented Aug 15, 2021

Uh oh!

swernli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

swernli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bettinaheim Aug 16, 2021

Choose a reason for hiding this comment

Uh oh!

troels-im Aug 16, 2021

Choose a reason for hiding this comment

Uh oh!

bettinaheim Aug 16, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bettinaheim left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bettinaheim Aug 16, 2021

Choose a reason for hiding this comment

Uh oh!

troels-im Aug 16, 2021

Choose a reason for hiding this comment

Uh oh!

bettinaheim Aug 16, 2021

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

troels-im commented Aug 4, 2021 •

edited

Loading