Skip to content

opal_fifo segfaults on x86_64 #1460

@adrianreber

Description

@adrianreber

In my MTT runs on x86_64 opal_fifo sometimes segfaults. It works most of the time but sometimes it is possible to get a segfault like this:

$ LD_LIBRARY_PATH=../../opal/.libs ./opal_fifo
Single thread test. Time: 0 s 23315 us 23 nsec/poppush
Atomics thread finished. Time: 0 s 70590 us 70 nsec/poppush
Atomics thread finished. Time: 1 s 939921 us 1939 nsec/poppush
Atomics thread finished. Time: 2 s 662403 us 2662 nsec/poppush
Atomics thread finished. Time: 3 s 323705 us 3323 nsec/poppush
Atomics thread finished. Time: 3 s 847741 us 3847 nsec/poppush
Atomics thread finished. Time: 3 s 918840 us 3918 nsec/poppush
Atomics thread finished. Time: 3 s 942101 us 3942 nsec/poppush
Atomics thread finished. Time: 3 s 997348 us 3997 nsec/poppush
Atomics thread finished. Time: 4 s 12625 us 4012 nsec/poppush
All threads finished. Thread count: 8 Time: 4 s 25370 us 503 nsec/poppush
[rhlx01:03461] *** Process received signal ***
Segmentation fault (core dumped)

GDB results:

Core was generated by `/home/staff/adrian/openmpi-dev-3613-g6d7ada9/test/class/.libs/lt-opal_fifo'.
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x0000000000401222 in opal_fifo_pop_atomic (fifo=0x7fff548f82e0) at ../../opal/class/opal_fifo.h:127
127             next = (opal_list_item_t *) item->opal_list_next;
[Current thread is 1 (Thread 0x7f2665f86700 (LWP 3513))]
(gdb) bt
#0  0x0000000000401222 in opal_fifo_pop_atomic (fifo=0x7fff548f82e0) at ../../opal/class/opal_fifo.h:127
#1  0x0000000000401607 in thread_test_exhaust (arg=0x7fff548f82e0) at opal_fifo.c:79
#2  0x00007f266a2d260a in start_thread () from /lib64/libpthread.so.0
#3  0x00007f266a00ca4d in clone () from /lib64/libc.so.6

See also here: https://mtt.open-mpi.org/index.php?do_redir=2276

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions