[lldb] Implement RISCV function unwinding using instruction emulation #147434

satyajanga · 2025-07-08T00:23:56Z

We noticed this issue when processing a Meta-internal RISCV coredump: the RISCV instruction emulation currently does not handle prologue and epilogue instructions.

Function unwinding using instruction emulation is also not implemented.

This PR handles both of these issues.

NOTE: I am not sure of the historical reason why this was done this way. It was done in https://reviews.llvm.org/D131759. In contrast, MIPS, PPC, and ARM all support this.

Test Plan:
Unfortunately, there is no easy way to add tests for this, since I have no RISCV hardware at my disposal. Suggestions are welcome.

cc: @clayborg

llvmbot · 2025-07-08T16:45:16Z

@llvm/pr-subscribers-backend-risc-v

@llvm/pr-subscribers-lldb

Author: None (satyajanga)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/147434.diff

2 Files Affected:

(modified) lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp (+20)
(modified) lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h (+7-4)

diff --git a/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp b/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
index 2adde02aca3a1..90537587c0b23 100644
--- a/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
+++ b/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
@@ -1899,4 +1899,24 @@ RISCVSingleStepBreakpointLocationsPredictor::HandleAtomicSequence(
   return bp_addrs;
 }
 
+bool EmulateInstructionRISCV::CreateFunctionEntryUnwind(
+    UnwindPlan &unwind_plan) {
+  unwind_plan.Clear();
+  unwind_plan.SetRegisterKind(eRegisterKindLLDB);
+
+  UnwindPlan::Row row;
+
+  // Our previous Call Frame Address is the stack pointer
+  row.GetCFAValue().SetIsRegisterPlusOffset(gpr_sp_riscv, 0);
+  row.SetRegisterLocationToSame(gpr_fp_riscv, /*must_replace=*/false);
+
+  unwind_plan.AppendRow(std::move(row));
+  unwind_plan.SetSourceName("EmulateInstructionRISCV");
+  unwind_plan.SetSourcedFromCompiler(eLazyBoolNo);
+  unwind_plan.SetUnwindPlanValidAtAllInstructions(eLazyBoolYes);
+  unwind_plan.SetUnwindPlanForSignalTrap(eLazyBoolNo);
+  unwind_plan.SetReturnAddressRegister(gpr_ra_riscv);
+  return true;
+}
+
 } // namespace lldb_private
diff --git a/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h b/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
index 3578a4ab03053..f5692efb03bd9 100644
--- a/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
+++ b/lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
@@ -57,11 +57,12 @@ class EmulateInstructionRISCV : public EmulateInstruction {
 
   static bool SupportsThisInstructionType(InstructionType inst_type) {
     switch (inst_type) {
-    case eInstructionTypeAny:
-    case eInstructionTypePCModifying:
+    case lldb_private::eInstructionTypeAny:
+    case lldb_private::eInstructionTypePrologueEpilogue:
       return true;
-    case eInstructionTypePrologueEpilogue:
-    case eInstructionTypeAll:
+
+    case lldb_private::eInstructionTypePCModifying:
+    case lldb_private::eInstructionTypeAll:
       return false;
     }
     llvm_unreachable("Fully covered switch above!");
@@ -94,6 +95,8 @@ class EmulateInstructionRISCV : public EmulateInstruction {
   std::optional<RegisterInfo> GetRegisterInfo(lldb::RegisterKind reg_kind,
                                               uint32_t reg_num) override;
 
+  bool CreateFunctionEntryUnwind(UnwindPlan &unwind_plan) override;
+
   std::optional<DecodeResult> ReadInstructionAt(lldb::addr_t addr);
   std::optional<DecodeResult> Decode(uint32_t inst);
   bool Execute(DecodeResult inst, bool ignore_cond);
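
To make the single row built by CreateFunctionEntryUnwind concrete, here is a minimal sketch of what "CFA = sp + 0, return address in ra" lets an unwinder conclude when a thread is stopped on the first instruction of a function. The `EntryRegs` and `CallerFrame` types and the `UnwindAtFunctionEntry` helper are hypothetical stand-ins, not LLDB types or APIs.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical register snapshot taken at a function's first instruction.
// Not an LLDB type.
struct EntryRegs {
  uint64_t pc;
  uint64_t sp;
  uint64_t ra;
};

// What can be recovered about the caller from that snapshot.
struct CallerFrame {
  uint64_t cfa;       // canonical frame address
  uint64_t return_pc; // where the caller resumes
  uint64_t sp;        // caller's stack pointer
};

// Applies the entry row from the diff: CFA = sp + 0, return address = ra.
// This only holds before the callee has executed any prologue instruction.
inline CallerFrame UnwindAtFunctionEntry(const EntryRegs &regs) {
  CallerFrame caller;
  caller.cfa = regs.sp + 0;   // sp has not been adjusted yet
  caller.return_pc = regs.ra; // ra has not been spilled or clobbered yet
  caller.sp = caller.cfa;     // the caller's sp is the CFA itself
  return caller;
}

// Small usage check with made-up addresses.
inline void ExampleEntryUnwind() {
  const EntryRegs regs{0x10400, 0x7ffff000, 0x10234};
  const CallerFrame caller = UnwindAtFunctionEntry(regs);
  assert(caller.cfa == 0x7ffff000);
  assert(caller.return_pc == 0x10234);
}
```

As soon as the callee runs `addi sp, sp, -N`, this row stops being accurate, which is why marking the plan as valid at all instructions depends on the emulator producing further rows.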

DavidSpickett · 2025-07-09T09:23:25Z

> Unfortunately there is no easy way to add testing for this. No RISCV hardware at the disposal. I welcome the suggestions.

Make a core file that does not include any internal data and then it can be tested on any system. If you have to satisfy internal policies around that, consider obj2yaml-ing the file, and then either using the yaml in the test, or just using the yaml format as a way to hack out any internal data then convert it back to an object.

I know I personally ok'd a RISC-V core file change before that did not include a test, but that was a mistake and I should have pushed harder for a test case.

(for the avoidance of doubt, this next part is from the perspective of an upstream maintainer of LLDB who cares purely about the health of this project overall, and it does not represent the opinion of my employer)

Side note: If your organisation is using RISC-V tools more and more, consider supporting upstream testing of those tools. For example via Alex Bradbury's work on testing clang and llvm using qemu-system. There's only so long RISC-V LLDB can go on with spot testing, eventually it needs something comprehensive.

jasonmolenda

This will not work. The EmulateInstructionRISCV plugin today is capable of tracking where a given instruction will branch, so lldb can do a software-based instruction step operation (put a breakpoint on the target instruction, resume the CPU) for processors that do not have an instruction-step primitive. However, it does not track the information that UnwindAssemblyInstEmulation needs to create an UnwindPlan that accurately reflects when a function is using the stack pointer, when it is using a frame pointer, where it spills registers to the stack, and where it loads them back into registers again. None of those features are present in EmulateInstructionRISCV today, and the UnwindPlans that this produces will not be correct.

A good way to see this is to load a binary in a riscv core with DWARF eh_frame or debug_frame. Then do image show-unwind -n FUNCNAME and lldb will show the UnwindPlan instructions at different offsets in the function for both the eh_frame/debug_frame UnwindPlans AND for the assembly emulation UnwindPlan. If there is any divergence, it is likely a bug in the assembly emulation.
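
The kind of bookkeeping described above can be illustrated with a toy model. This is a rough sketch under stated assumptions, not how UnwindAssemblyInstEmulation is implemented: the `Op`, `Inst`, and `Row` types and the `BuildRows` function are hypothetical, instructions are assumed to be already decoded, and every instruction is assumed to be 4 bytes (no compressed encodings).

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <vector>

// Hypothetical pre-decoded instruction forms seen in a typical prologue.
enum class Op { AddImmToSP, StoreRegAtSP };

struct Inst {
  Op op;
  int reg;     // register being stored (StoreRegAtSP only)
  int64_t imm; // addend for AddImmToSP, or store offset from sp
};

// One unwind row, valid starting at byte `offset` within the function.
struct Row {
  uint64_t offset;
  int64_t cfa_sp_offset;               // CFA = sp + cfa_sp_offset
  std::map<int, int64_t> saved_at_cfa; // reg -> offset from CFA of its spill slot
};

inline std::vector<Row> BuildRows(const std::vector<Inst> &insts) {
  std::vector<Row> rows;
  Row cur{0, 0, {}};
  rows.push_back(cur); // function entry: CFA = sp + 0, nothing saved
  uint64_t offset = 0;
  for (const Inst &inst : insts) {
    offset += 4; // assumption: uncompressed 4-byte instructions
    switch (inst.op) {
    case Op::AddImmToSP:
      // sp moved by imm, so the CFA is now further from sp by -imm.
      cur.cfa_sp_offset -= inst.imm;
      break;
    case Op::StoreRegAtSP:
      // Slot address is sp + imm, which is CFA + (imm - cfa_sp_offset).
      cur.saved_at_cfa[inst.reg] = inst.imm - cur.cfa_sp_offset;
      break;
    }
    cur.offset = offset; // the new state applies after this instruction
    rows.push_back(cur);
  }
  return rows;
}

// Usage check: addi sp,sp,-32 ; sd ra,24(sp) ; sd s0,16(sp)
inline void ExamplePrologue() {
  const std::vector<Inst> prologue = {{Op::AddImmToSP, 0, -32},
                                      {Op::StoreRegAtSP, 1, 24},
                                      {Op::StoreRegAtSP, 8, 16}};
  const std::vector<Row> rows = BuildRows(prologue);
  assert(rows.size() == 4);
  assert(rows[1].cfa_sp_offset == 32);       // after the sp adjustment
  assert(rows[3].saved_at_cfa.at(1) == -8);  // ra at CFA-8
  assert(rows[3].saved_at_cfa.at(8) == -16); // s0 at CFA-16
}
```

Rows of this shape are what one would compare against the eh_frame/debug_frame output when looking for divergence.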

jasonmolenda · 2025-07-09T23:27:09Z

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h

    switch (inst_type) {
-    case eInstructionTypeAny:
-    case eInstructionTypePCModifying:
+    case lldb_private::eInstructionTypeAny:


This method is inside namespace lldb_private; please don't add lldb_private:: specifiers.

jasonmolenda · 2025-07-09T23:28:28Z

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h

-    case eInstructionTypePrologueEpilogue:
-    case eInstructionTypeAll:
+
+    case lldb_private::eInstructionTypePCModifying:


EmulateInstructionRISCV is currently used to correctly predict where a given instruction will branch to next, for instruction stepping on hardware that does not support instruction step. By changing eInstructionTypePCModifying to return false from SupportsThisInstructionType, you're going to break that use of this. Did you copy and paste this from another EmulateInstruction plugin? Modifying this plugin without a much deeper understanding of what it is doing & testing it carefully against a corpus of functions is not going to work; this isn't a minor edit.

clayborg · 2025-07-15

I told Satya to try this, but it seems they must have been already emulating PC modifying instructions. We don't get any backtrace if we don't say we handle eInstructionTypePrologueEpilogue and when we changed it we did get a backtrace. If we don't return true for eInstructionTypePrologueEpilogue then EmulateInstructionRISCV::CreateFunctionEntryUnwind() never gets called. But it sounds like we need to modify the instruction emulator to handle the prologue opcodes for this patch.
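
One way to read the two requirements in this thread together is that the switch would keep eInstructionTypePCModifying enabled while also enabling eInstructionTypePrologueEpilogue. The sketch below shows only that shape. It is an assumption about a possible direction, not the change this PR or any follow-up makes, and the enum is a local stand-in reduced to the four values visible in the diff.

```cpp
#include <cassert>

// Local stand-in for the four InstructionType values shown in the diff.
enum InstructionType {
  eInstructionTypeAny,
  eInstructionTypePrologueEpilogue,
  eInstructionTypePCModifying,
  eInstructionTypeAll
};

// Keeps software single-stepping (PCModifying) working while also opting in
// to prologue/epilogue emulation for unwinding.
inline bool SupportsThisInstructionType(InstructionType inst_type) {
  switch (inst_type) {
  case eInstructionTypeAny:
  case eInstructionTypePCModifying:
  case eInstructionTypePrologueEpilogue:
    return true;
  case eInstructionTypeAll:
    return false;
  }
  return false; // not reached for valid enum values
}

// Usage check: both consumers named in the review stay enabled.
inline void CheckBothConsumersEnabled() {
  assert(SupportsThisInstructionType(eInstructionTypePCModifying));
  assert(SupportsThisInstructionType(eInstructionTypePrologueEpilogue));
}
```

As the review points out, returning true here is only meaningful once the emulator actually reports stack-pointer adjustments and register spills for the prologue and epilogue opcodes.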

jasonmolenda · 2025-07-09T23:49:25Z

If you look at an existing EmulateInstruction target plugin that we live on today, such as ARM64, it returns context types for UnwindAssemblyInstEmulation like

eContextAdjustBaseRegister
eContextAdjustStackPointer
eContextAdvancePC
eContextImmediate
eContextPopRegisterOffStack
eContextPushRegisterOnStack
eContextReadOpcode
eContextRegisterLoad
eContextRegisterStore
eContextRelativeBranchImmediate
eContextRestoreStackPointer
eContextSetFramePointer

as it is emulating the prologue and epilogue instructions on ARM64. I don't think the existing EmulateInstructionRISCV plugin provides any of these, or at least not the important ones that I spot checked quickly.
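
As a rough illustration of what providing these context types could involve for RISC-V, the sketch below classifies a few RV64I encodings that commonly appear in prologues and epilogues. The `ContextKind` enum borrows names from the list above but is a local stand-in, `Classify` is a hypothetical helper, and the mapping from instruction to context is an assumption made for illustration. Compressed instructions and every other opcode are ignored.

```cpp
#include <cassert>
#include <cstdint>

// Local stand-in; names borrowed from the context types listed above.
enum ContextKind {
  eContextOther,
  eContextAdjustStackPointer,
  eContextPushRegisterOnStack,
  eContextPopRegisterOffStack,
  eContextSetFramePointer
};

constexpr uint32_t kSP = 2; // x2
constexpr uint32_t kFP = 8; // x8, also known as s0/fp

// Classifies a 32-bit RV64I instruction word by its stack-related effect.
inline ContextKind Classify(uint32_t inst) {
  const uint32_t opcode = inst & 0x7f;
  const uint32_t rd = (inst >> 7) & 0x1f;
  const uint32_t funct3 = (inst >> 12) & 0x7;
  const uint32_t rs1 = (inst >> 15) & 0x1f;

  if (opcode == 0x13 && funct3 == 0 && rs1 == kSP) { // ADDI rd, sp, imm
    if (rd == kSP)
      return eContextAdjustStackPointer;
    if (rd == kFP)
      return eContextSetFramePointer;
  }
  if (opcode == 0x23 && funct3 == 3 && rs1 == kSP) // SD rs2, imm(sp)
    return eContextPushRegisterOnStack;
  if (opcode == 0x03 && funct3 == 3 && rs1 == kSP) // LD rd, imm(sp)
    return eContextPopRegisterOffStack;
  return eContextOther;
}

// Usage check on a typical prologue/epilogue pair.
inline void ExampleClassify() {
  assert(Classify(0xfe010113) == eContextAdjustStackPointer);  // addi sp,sp,-32
  assert(Classify(0x00113c23) == eContextPushRegisterOnStack); // sd ra,24(sp)
  assert(Classify(0x01813083) == eContextPopRegisterOffStack); // ld ra,24(sp)
}
```

A real plugin would also need to attach the register and offset to each context so that the unwind plan rows can be built from them.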

clayborg · 2025-07-15T16:55:36Z

> > Unfortunately there is no easy way to add testing for this. No RISCV hardware at the disposal. I welcome the suggestions.
>
> Make a core file that does not include any internal data and then it can be tested on any system. If you have to satisfy internal policies around that, consider obj2yaml-ing the file, and then either using the yaml in the test, or just using the yaml format as a way to hack out any internal data then convert it back to an object.

obj2yaml doesn't work well at all for core files. Core files have only program headers, and obj2yaml doesn't allow program headers to specify data. Only sections in the section header can have data. So if you obj2yaml a core file, you end up with a useless file that won't recreate things correctly.

JDevlieghere · 2025-07-15T22:37:27Z

I think we've reached a point where we need to look into first-class support for building and running RISC-V tests on QEMU. I can't sign up to do the work, but I'm happy to help with reviews etc.

@DavidSpickett helpfully created a page about QEMU testing for system emulation. Maybe a first step could be to extend that for RISC-V?

labath · 2025-07-16T07:32:07Z

Just to throw out a couple of other ideas:

- Write a unit test for the instruction emulator (other architectures have one).
- Extend yaml2obj to work on core files.
- Use minidumps, which are supported in yaml2obj. (For unwind plan generation/instruction emulation to work, you don't actually need a core file that contains anything. It just needs to be there so that lldb can construct the register context.)
- Teach lldb to emulate instructions without a core file, by removing the requirement to have a (real) register context for unwind plan construction.

jasonmolenda · 2025-07-16T08:39:54Z

The general discussion of how to test this class of change is interesting, but I don't want to lose sight of the fact that this PR will not work. It does nothing but break existing functionality for RISC-V users (disabling the instruction emulation used for instruction stepping). It's not just a matter of how to automate testing; it was not tested while it was being developed at-desk, let alone in CI. There's nothing here, currently, to test.

Address gaps in RISCV function unwinding

d14ca45

satyajanga changed the title ~~Address gaps in RISCV function unwinding~~ Implement RISCV function unwinding using instruction emulation Jul 8, 2025

satyajanga marked this pull request as ready for review July 8, 2025 16:44

satyajanga requested a review from JDevlieghere as a code owner July 8, 2025 16:44

llvmbot added the lldb label Jul 8, 2025

DavidSpickett changed the title ~~Implement RISCV function unwinding using instruction emulation~~ [lldb] Implement RISCV function unwinding using instruction emulation Jul 9, 2025

clayborg approved these changes Jul 9, 2025

JDevlieghere requested a review from jasonmolenda July 9, 2025 22:56

jasonmolenda requested changes Jul 9, 2025

llvmbot added the backend:RISC-V label Jul 12, 2025

satyajanga force-pushed the main branch from af707a0 to d14ca45 Compare July 12, 2025 19:00

satyajanga marked this pull request as draft July 15, 2025 16:51

satyajanga mentioned this pull request Sep 12, 2025

RISCV unwinding enable #158161

Merged

satyajanga requested a review from jeffreytan81 September 12, 2025 21:31
