Hello again,
Thanks to Owen and Steve for their replies. In fact I went for the more
drastic solution of rebooting the CE, which certainly solved THAT problem.
Now I have got back to a similar situation to the one I reached on my last
attempt, back in March. I can submit a "helloworld" job to the new
"external" queue, and I can see it runs and produces stdout and stderr
files, but the output sandbox doesn't get copied back to the RB. The
globus-url-copy process sits there for several minutes without achieving
anything.
dg-job-status says (this is different from what I saw in March):
Status = Done (Failed)
[...]
Status Reason = disappeared from LRMS
and the gram_job_mgr_*.log file says:
[...]
in gram_script_pbs_rm
executing qdel with job id 62.pc55
6/10 11:42:44 JMI: while return_buf = GRAM_SCRIPT_SUCCESS:999
exiting gram_script_pbs_rm\n\n
6/10 11:42:44 JMI: return_buf = GRAM_SCRIPT_SUCCESS:999
6/10 11:42:44 JMI: ret value = 999
6/10 11:42:44 JM: request check returned DONE or FAILED
6/10 11:42:44 JM: we're done. doing cleanup
6/10 11:42:44 JM: No standard output contents found. Waiting for 0 seconds.
6/10 11:42:44 JM: No standard output contents found. Waiting for 10 seconds.
6/10 11:42:54 JM: No standard output contents found. Waiting for 20 seconds.
6/10 11:43:14 JM: No standard output contents found. Waiting for 30 seconds.
6/10 11:43:44 JM: No standard output contents found. Waiting for 40 seconds.
Can anyone help me?
Cheers,
Ben
--
Dr Ben Waugh Tel. +44 (0)20 7679 3783
Dept of Physics and Astronomy Internal: 33783
University College London
London WC1E 6BT