-
Notifications
You must be signed in to change notification settings - Fork 925
Closed
open-mpi/ompi-release
#1032Description
In my MTT runs on x86_64 opal_fifo sometimes segfaults. It works most of the time but sometimes it is possible to get a segfault like this:
$ LD_LIBRARY_PATH=../../opal/.libs ./opal_fifo
Single thread test. Time: 0 s 23315 us 23 nsec/poppush
Atomics thread finished. Time: 0 s 70590 us 70 nsec/poppush
Atomics thread finished. Time: 1 s 939921 us 1939 nsec/poppush
Atomics thread finished. Time: 2 s 662403 us 2662 nsec/poppush
Atomics thread finished. Time: 3 s 323705 us 3323 nsec/poppush
Atomics thread finished. Time: 3 s 847741 us 3847 nsec/poppush
Atomics thread finished. Time: 3 s 918840 us 3918 nsec/poppush
Atomics thread finished. Time: 3 s 942101 us 3942 nsec/poppush
Atomics thread finished. Time: 3 s 997348 us 3997 nsec/poppush
Atomics thread finished. Time: 4 s 12625 us 4012 nsec/poppush
All threads finished. Thread count: 8 Time: 4 s 25370 us 503 nsec/poppush
[rhlx01:03461] *** Process received signal ***
Segmentation fault (core dumped)
GDB results:
Core was generated by `/home/staff/adrian/openmpi-dev-3613-g6d7ada9/test/class/.libs/lt-opal_fifo'.
Program terminated with signal SIGSEGV, Segmentation fault.
#0 0x0000000000401222 in opal_fifo_pop_atomic (fifo=0x7fff548f82e0) at ../../opal/class/opal_fifo.h:127
127 next = (opal_list_item_t *) item->opal_list_next;
[Current thread is 1 (Thread 0x7f2665f86700 (LWP 3513))]
(gdb) bt
#0 0x0000000000401222 in opal_fifo_pop_atomic (fifo=0x7fff548f82e0) at ../../opal/class/opal_fifo.h:127
#1 0x0000000000401607 in thread_test_exhaust (arg=0x7fff548f82e0) at opal_fifo.c:79
#2 0x00007f266a2d260a in start_thread () from /lib64/libpthread.so.0
#3 0x00007f266a00ca4d in clone () from /lib64/libc.so.6
See also here: https://mtt.open-mpi.org/index.php?do_redir=2276
Metadata
Metadata
Assignees
Labels
No labels