Hi Carsten,
RB/WMS used for submission of jobs initiates these processes. If the jobs are
not cleanly canceled, obviously RB/WMS would continue to monitor jobs.
However, it should notice that jobs are not running anymore and resubmit them.
This seems not to happen. Restarting the service on RB/WMS should help (only
possible if you are admin of it)...
Best regards, Antun
-----
Antun Balaz
Research Assistant
E-mail: [log in to unmask]
Web: http://scl.phy.bg.ac.yu/
Phone: +381 11 3713152
Fax: +381 11 3162190
Scientific Computing Laboratory
Institute of Physics, Belgrade, Serbia
-----
---------- Original Message -----------
From: Preuss Carsten <[log in to unmask]>
To: [log in to unmask]
Sent: Tue, 21 Aug 2007 16:00:27 +0200
Subject: [LCG-ROLLOUT] Unclean canceled jobs on a LCG-CE
> Hi *,
>
> since a few days we have serveral processes like the ones below running on
our LCG-CE :
>
> alicesgm 1475 1 0 05:34 ? 00:00:52 perl
/tmp/grid_manager_monitor_agent.alicesgm.11971.1000 --delete-self --maxtime=1922s
> alicesgm 21720 1 0 05:45 ? 00:00:50 perl
/tmp/grid_manager_monitor_agent.alicesgm.7385.1000 --delete-self --maxtime=181s
> alicesgm 25484 1 0 05:47 ? 00:00:52 perl
/tmp/grid_manager_monitor_agent.alicesgm.7385.1000 --delete-self --maxtime=61s
> alicesgm 11486 1 0 06:47 ? 00:00:45 perl
/tmp/grid_manager_monitor_agent.alicesgm.29760.1000 --delete-self --maxtime=120s
> alicesgm 20836 1 0 09:07 ? 00:00:33 perl
/tmp/grid_manager_monitor_agent.alicesgm.8708.1000 --delete-self --maxtime=180s
> alicesgm 26820 1 0 09:09 ? 00:00:31 perl
/tmp/grid_manager_monitor_agent.alicesgm.8708.1000 --delete-self --maxtime=60s
>
> These processes are from a bunch of jobs submitted to this CE.
Unfortunatelly the jobs were not cancelled clean and parts of the jobs stayed
> in our CE and the batch farm.
> Rebooting the CE doesn't help to get rid of these processes and they don't
disappear from alone, which is the case under normal circumstances.
>
> Has anyone an idea of how to get rid of these processes?
> Killing them by hand doesn't solve the problem.
>
> Thanks in advantage,
>
> Carsten.
>
> -----------------------------------------
> Carsten Preuss
> Gesellschaft fuer Schwerionenforschung mbH
> IT
> Planckstr. 1, D-64291 Darmstadt, Germany
> phone: +49-6159-71-1339
>
> -----------------------------------------
>
>
------- End of Original Message -------
|