Peter Love wrote:
> We're having problem with jobs submitted by both WMS and condor
> whereby the job runs OK but the WMS/condor state remains in RUNNING
> for a very long time. With the new stuff in lcg-CE 3.1, which
> component should I be looking at? Any technical docs on lcg-CE?
There is some documentation on the updates page:
http://glite.web.cern.ch/glite/packages/R3.1/updates.asp
Some of the options are documented in this ticket:
https://gus.fzk.de/ws/ticket_info.php?ticket=35835
Now, the problem you describe can happen due to various causes.
Some ideas are suggested here (sic):
http://goc.grid.sinica.edu.tw/gocwiki/Jobs_sent_to_some_CE_stay_in_Scheduled_state_forever
Does the problem occur for every user? Multiple WMS nodes?
Look for errors in the globus-gma logs in /opt/globus/var/log;
you can increase the debug level in /opt/globus/etc/globus-gma.conf
and restart the daemon.
|