Gustav Wikström wrote:
> I'm having serious problems with running my VO t2k.org jobs, currently
> 95% of them are being cancelled by the WMSs (lcgwms03.gridpp.rl.ac.uk
> and wms02.grid.hep.ic.ac.uk) or the CEs.
I don't know much about that, but we put a new CREAM ce on yesterday
(hepgrid10.ph.liv.ac.uk),
and it has run ~ 923 t2k jobs, all of which ended DONE-OK. How many jobs
in total have you submitted?
We have seen a big surge over the last day or so. Is that about right?
Steve
> As I understand it, when a
> WMS stops a job, it is labeled Aborted, and then Cancelled is when a
> CE stops a job? The bad thing is that there is no information about a
> job after it has been stopped unless it failed.
>
> So, what could cause a job to be cancelled? Is memory usage one of the reasons?
>
> Cheers,
> Gustav
>
--
Steve Jones [log in to unmask]
System Administrator office: 220
High Energy Physics Division tel (int): 42334
Oliver Lodge Laboratory tel (ext): +44 (0)151 794 2334
University of Liverpool http://www.liv.ac.uk/physics/hep/
|