Hi Maarten,
> > just to paste you the current server load status after being rebooted an hour ago.
> > we now have more than 537 globus-job-manager processes coming via the WMS at
> > wms017.cnaf.infn.it, is this normal?
>
> No. In the past this could happen for a massive cancellation of jobs,
> but Condor-G on the WMS should now manage that properly.
> It seems there is a bug at least on that WMS.
> Try blocking it in the firewall for now.
Thanks a lot. We did not see the same symptom on the other old CE
box, nor on the new SLC4 lcgCE. Thanks also to input from Jan Astalos, we
added another code snippet to the job manager that forces the process to
sleep if the system load is over 20. This seems able to stabilize the
system load, and we have not seen the same event since.
If the problem remains, we will block the WMS later on. Thanks.
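
For reference, the load-throttling idea can be sketched roughly as below. This is not the actual job manager patch, only a minimal Python illustration of "sleep while the system load is over 20". The function name, the 5-second poll interval, and the 300-second cap are assumptions made for the example; only the threshold of 20 comes from this mail.

```python
import os
import time

LOAD_THRESHOLD = 20.0  # threshold mentioned in the mail; everything else is assumed


def wait_for_low_load(threshold=LOAD_THRESHOLD,
                      poll_seconds=5.0,        # assumed poll interval
                      max_wait_seconds=300.0,  # assumed cap so a job never waits forever
                      get_load=lambda: os.getloadavg()[0],
                      sleep=time.sleep):
    """Sleep in short steps while the 1-minute load average exceeds threshold.

    Returns the total number of seconds spent sleeping.
    """
    waited = 0.0
    while get_load() > threshold and waited < max_wait_seconds:
        sleep(poll_seconds)
        waited += poll_seconds
    return waited


if __name__ == "__main__":
    # Hypothetical usage: call this before doing the expensive work.
    delay = wait_for_low_load()
    print("waited %.0f s before continuing" % delay)
```

The cap on the total wait is a design choice for the sketch: without it, a permanently overloaded box would hold every process asleep indefinitely.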
Br,
J