Hi,
There is currently a bug in Savannah about the estimated response
time.
https://savannah.cern.ch/bugs/?func=detailitem&item_id=6213
Could anyone who has comments on this please read the existing
thread and contribute?
This is a non-trivial problem that has been around for some time.
Thanks
Laurence
Rod Walker wrote:
> Hi,
> How does this give anything useful at all? This is the estimated response
> time for pbs:
> $MaxTime=(($TotalJobs * $WallTime) - $UsedTime) / $TCPU;
> where TCPU is the smaller of total number of cpus or the max running
> jobs.
>
> Our cluster has 18 cpus and 2 jobs in the atlas queue, with a combined
> used wall time of around 72 hours. The max walltime is 72 hrs. So
> $MaxTime = ((2*72) - 72) / 18 = 4 hours
> Give or take, this is what lcgce01.triumf.ca is publishing, but it clearly
> should be zero as there are 16 free cpus.
> I know it is difficult to get a consistent estimate of ERT for both full
> and empty grids, but is this really the best estimate?
>
> I know the logic, if freecpus>0 then ERT=0, does not work due to site
> policies. How about setting ERT=0 if there are no jobs queued for that
> CE(queue)? This will only be wrong when the site is exactly full, which
> almost never happens. Otherwise do your stuff with used times.
>
> Or maybe, if no jobs queued and freecpus>0 ...
> Ideally there is some way to extract the estimate from Maui, as this is
> the only truth. MOAB has a 'showstart' command for example.
>
> Cheers,
> Rod.
>
>
>
> --
> Rod Walker +1 6042913051
>
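For anyone joining the thread, here is a rough Python sketch of the calculation quoted above and of Rod's suggested short-circuit. It is only an illustration: the function and parameter names are hypothetical and not taken from the actual information provider, times are in hours as in Rod's example, and the max running jobs limit is assumed to be at least 18 so that TCPU comes out as 18.

```python
def published_ert(total_jobs, max_walltime, used_walltime, total_cpus, max_running):
    """ERT per the quoted formula: ((TotalJobs * WallTime) - UsedTime) / TCPU."""
    # TCPU is the smaller of the total number of cpus and the max running jobs.
    tcpu = min(total_cpus, max_running)
    return (total_jobs * max_walltime - used_walltime) / tcpu


def proposed_ert(queued_jobs, free_cpus, **site):
    """Rod's suggestion: publish 0 if nothing is queued and there are free cpus."""
    if queued_jobs == 0 and free_cpus > 0:
        return 0.0
    # Otherwise fall back to the existing used-time estimate.
    return published_ert(**site)


# Numbers from the TRIUMF example; max_running=18 is an assumption.
site = dict(total_jobs=2, max_walltime=72, used_walltime=72,
            total_cpus=18, max_running=18)

print(published_ert(**site))                              # 4.0
print(proposed_ert(queued_jobs=0, free_cpus=16, **site))  # 0.0
```

With the same inputs the current formula gives 4 hours, while the queued-jobs check gives 0, which matches what the site could actually deliver with 16 cpus free.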