Skip to content

bug using MPI - Seg Fault #142

@othman-bouizi

Description

@othman-bouizi

Hi,

I want to analyze an application written in C or C++ and running in parallel with MPI.
It is complied with GNU 14. Malt is compiled with GNU 14 as well.
The binary is executed on 112 cores, and generates this error (of course the application does not crash without MALT)

MALT: Start memory instrumentation of pluto_gcc - 338820 by library override.
MALT: Resolving symbols with addr2line...
[cw37053:338799:0:339779] Caught signal 11 (Segmentation fault: address not mapped to object at address (nil))
BFD: BFD version 2.35.2-54.el9 internal error, aborting at ../../bfd/elf.c:9149 in _bfd_elf_get_lineno

BFD: Please report this bug.

However MALT does not crash with 1 MPI and 2 MPI ranks. It crashes with 4 MPI ranks.

Is there a way to intercept the signal 11 and to report it in the gathered data ?

Othman.

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions