Skip to content

Support rpath to locate shared libraries? #22126

@ryanking13

Description

@ryanking13

Proposal

Introduce a new rpath.0 custom section into Emscripten WASM module that stores runtime path for the module and can be used to locate shared libraries per-module (or extend the current dylink.0 section to have such value?)

Background

ELF binaries (or shared libraries) has rpath or runtime path section that is used to locate its dependent shared libraries. In addition to LD_LIBRARY_PATH which is a system-wide path for searching shared libraries, rpath can be used per-module path for searching its dependencies.

For now, -rpath flag is ignored when it is passed to emcc.

emcc: warning: ignoring unsupported linker flag: `-rpath` [-Wlinkflags]

Reason for the proposal

Pyodide loads python packages with shared libraries in runtime. In order to ensure that each Python package is self-contained, Python utilizes rpaths to locate dependent dynamic libraries within a single package.

For instance, numpy Python package for linux has following runtime path:

$ objdump -x lapack_lite.cpython-312-x86_64-linux-gnu.so | grep 'PATH'
  RPATH                $ORIGIN/../../numpy.libs

and it is used to locate shared libraries in numpy.libs, which is not in LD_LIBRARY_PATH.

$ ldd lapack_lite.cpython-312-x86_64-linux-gnu.so
        linux-vdso.so.1 (0x00007ffee37cd000)
        libscipy_openblas64_-99b71e71.so => /mnt/c/Users/siha/Downloads/numpy-2.0.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64/numpy/linalg/./../../numpy.libs/libscipy_openblas64_-99b71e71.so (0x00007ff9b7c08000)
        libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007ff9b786a000)
        libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007ff9b764b000)
        libgfortran-040039e1-0352e75f.so.5.0.0 => /mnt/c/Users/siha/Downloads/numpy-2.0.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64/numpy/linalg/./../../numpy.libs/libgfortran-040039e1-0352e75f.so.5.0.0 (0x00007ff9b719a000)
        libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007ff9b6da9000)
        /lib64/ld-linux-x86-64.so.2 (0x00007ff9b9da5000)
        libquadmath-96973f99-934c22de.so.0.0.0 => /mnt/c/Users/siha/Downloads/numpy-2.0.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64/numpy/linalg/./../../numpy.libs/libquadmath-96973f99-934c22de.so.0.0.0 (0x00007ff9b6b6b000)
        libz.so.1 => /lib/x86_64-linux-gnu/libz.so.1 (0x00007ff9b694e000)
        libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007ff9b6736000)

it would be nice if this is also available in Emscripten.

Alternatives / Current workaround

Currently, Pyodide performs rpath-like behavior in the following ways.

  1. We directly use loadDynamicLibrary JS function instead of using dlopen.
  2. When calling loadDynamicLibrary, we pass fs parameter with custom findObject function.
    1.a. This function was removed in latest emscripten, but we are using a patched-version of it (Move dlopen file operations into native code. NFC #19310)
  3. In out custom findObject function, we search shared libraries by looking at some pre-defined search path per module, which behaves like rpath.

References


Above all, thank you for maintaining such an amazing project! We (Pyodide) benefit a lot from Emscripten :)

cc: @hoodmane @agriyakhetarpal for visibility

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions