Hello all,
After having a test instance running Torque 4.2.10 + Maui from several
months, with any relevant issue, running hundreds of jobs, we decided to
move our production batch system from Torque 2.5.13 to Torque 4.2.10, as
we expected a better performance. Doing some ldapsearch in our bdiis
showed that several sites are in 4.2.10 version already.
However, we have observed now, only in production, that pbs_server
crashes from time to time with this error:
pbs_server[14661]: segfault at 0 ip 00000031bf281301 sp 00007fc6cdff47a8
error 4 in libc-2.12.so[31bf200000+18a000]
Looking in the pbs logs does not show any helpful information... Thus,
before thinking in rollback to version 2.5.13, I would like to ask you
if any site running Torque 4.2.10 and Maui have experienced issues like
this and how you solve it.
We are running SL6.8.
Thank you in advance.
Cheers,
Carles
--
Carles Acosta i Silva
PIC (Port d'Informació Científica)
Campus UAB, Edifici D
E-08193 Bellaterra, Barcelona
Tel: +34 93 581 33 22
Fax: +34 93 581 41 10
http://www.pic.es
Avís - Aviso - Legal Notice: http://www.ifae.es/legal.html
|