Hi,
I've recently noticed that, at our site, the MPI jobs that we submit do
not communicate between nodes, although they are spread to multiple nodes.
For example, I've run a simple "Hello world!" program and the only
output I get is from the last node that was allocated.
We have both shared /home directory and ssh passwordless
authentification between nodes.
The configuration was done according to
https://twiki.cern.ch/twiki/bin/view/EGEE/MpiTools (Yaim, PBS, MPI-Start).
What could the problem be and where should I look for it?
Cheers,
Claudiu Demian
|