Min Tsai wrote:
> Hi All,
>
> The fix is in for the CPU count. Three other sites had their CPU stats
> change: CCIN2P3-LCG2, INFN-LNL-LCG, INFN-PADOVA. Let me know if these
> number are inaccurate for some reason.
Hi Min,
http://goc.grid.sinica.edu.tw/gstat/filter_help.html#GIISQuery_Usage
Unfortunately for INFN-PADOVA we have queues with different numbers of
WN running on the same cluster. We are committed to provide different
service levels to different VOs, for example we have a contract with a VO to
dedicate always 4 CPUs (2 WN) and in case of need we can change this value.
So the numbers reported for our site are wrong, the right value should be
the one of "biggest" queue, I hope this will not complicated too much the
(already) complex situation...
Another point (but I have to discuss with my ROC, probably there is something
wrong in our GOC DB entries and we can solve the problem ourselves) is that I
found our CE as "gilda-ce-01" instead of "gridit001":
http://lcg-testzone-reports.web.cern.ch/lcg-testzone-reports/cgi-bin/lastreport.cgi
In Padova we have also gilda-ce-01, but is a GILDA CE. Gilda is an indipendent
and dedicated infrastructure for tutorials, integration of new apps from other
sciences, etc... and should not be considered part of the production grid.
Best regards,
>
> Best Regards,
> Min
>
> -----Original Message-----
> From: LHC Computer Grid - Rollout [mailto:[log in to unmask]] On
> Behalf Of Min Tsai
> Sent: Wednesday, January 26, 2005 12:41 PM
> To: [log in to unmask]
> Subject: Re: [LCG-ROLLOUT] TotalCPU count on the GOC Mon
>
> Dear Anar,
>
> Typically CPU stats for queues on a single CE repeat all refer to the same
> set of CPUs. So to prevent recount of CPU Gstat adds up the CPU stats for
> the first queue it encounters for each unique CE. So in your case:
>
> It adds up:
> GlueCEUniqueID=lcg03.gsi.de:2119\/jobmanager-torque-alice 2 CPU
> GlueCEUniqueID=lcg06.gsi.de:2119\/jobmanager-lcglsf-alice 16 CPU
>
> I have not noticed a configuration like yours before, so I will make a
> modification by adding CPU from queues that have different total CPU
> statistics even though they reside on the same CE. The only problem we will
> have if when 2 queues on a single CE has the same total CPU count even
> though they are referring to 2 completely different clusters.
>
> I hope this will correct the CPU problem for you site. Thank you for
> providing this feedback! I will let you know once this I have tested and
> complete this change.
>
> Cheers,
> Min
>
>
>
>
>
> -----Original Message-----
> From: LHC Computer Grid - Rollout [mailto:[log in to unmask]] On
> Behalf Of Anar Manafov
> Sent: Wednesday, January 26, 2005 11:59 AM
> To: [log in to unmask]
> Subject: [LCG-ROLLOUT] TotalCPU count on the GOC Mon
>
> Good day to ALL!
>
> I have mentioned that on the monitoring (http://goc.grid.sinica.edu.tw/
> gstat/lcg03.gsi.de/) we (GSI) publishing only 18 CPU (Total CPU). So, I
> wonder how this number is calculated and why not all of the queues are
> affected.
> We have 2 different CE:
> Torque CE (with 2 CPU).
> LSF CE (more than 300 CPU),
> in LSF we have “dteam” and “alice” queues.
> For “alice” ~ 16 PCU
> For “dteam” ~ 344 CPU or something (Later on, when we finish
> the test of
> our new pool-accounts algorithm we will publish more CPU on the
> “alice”).
>
> So, my question would be which algorithm monitoring uses to calculate Total
> CPU amount?
>
> I would appreciate any comment on this.
>
> Thank you very much in advance.
>
> Best of luck,
>
> Anar
--
Enrico Ferro Ist. Naz. di Fisica Nucleare - Padova
Address: via F. Marzolo, 8 - 35131 Padova - ITALY
Email: [log in to unmask] Phone: +39-0498277154
|