Hello,
I've managed to get a reasonable configuration of torque, maui, mpich,
and mpiexec working at LAL. The configuration fixes the problem you
describe below and also adds support for mpiexec. It uses newer
versions of torque and mpich than are standard in the LCG/EGEE
distribution. The newer versions are running at LAL and work well.
There is a short readme on the configuration which needs to be changed here:
http://grid05.lal.in2p3.fr/mpi/readme.txt
and the rpms can be found here:
http://grid05.lal.in2p3.fr/mpi/
MPICH and mpiexec have been compiled to use p4 by default which is
probably what you'll want.
Cheers.
Cal
Klaere Cassirer wrote:
> Hello,
> I have questions about MPICH jobs. Our site runs LCG2.3 on SL3 with the
> maui
> scheduler and pbs.
> - Does anybody know how to specify a NODENUMBER greater than the number of
> WNs on a site, which has more than 1 CPU per WN? I want to use all
> available
> CPUs (here: 2 CPUs per WN).
> - How can I tell maui to reserve x nodes for me, but only one CPU per node?
>
> In LCG2.2 times I specified nodenumber=x, then ran a script which started 2
> mpich-processes on each of the x nodes. But with maui, I get x CPUs (every
> 2nd on the same WN), but not more.
>
> Regards
>
> --
> Klaere Cassirer
> Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI)
> Schloss Birlinghoven
> D-53754 Sankt Augustin
>
> Tel: +49 - 2241 - 14 - 2758
> Fax: +49 - 2241 - 14 - 42758
> E-mail: [log in to unmask]
> Internet: http://www.scai.fraunhofer.de
>
|