Emanuele
You will see CERN is still orange in the gppmon map today. I think this is
because I misunderstood your latest message (below). I changed the monitor
to submit jobs to adc0015 as you recommended and changed the jobmanager to
jobmanager-pbs-xxx, also as you suggested, but I suspect now that this
latter suggestion only referred to adc0018, which I am now not monitoring.
I'll change back to jobmanager-lcgpbs-xxx for now.
Later I shall experiment with the suggestion from Steve that I try ranking
the job by requesting minimal cpu time and simply directing it at the site.
Could some jdl expert please give me the precise jdl statements which will
do this.
Trevor
.lf n25
Dr Trevor Daniels
c/o CCLRC eSC Department Phone: (+44)|(0) 1235 778093
Rutherford Appleton Laboratory Fax: (+44)|(0) 1235 446626
Chilton, DIDCOT, Oxon, OX11 0QX, UK Email: [log in to unmask]
The contents of this email are sent in confidence for the use of the
intended recipient only. If you are not one of the intended recipients do
not take action on it or show it to anyone else, but return this email to
the sender and delete your copy of it.
> -----Original Message-----
> From: Emanuele LEONARDI [mailto:[log in to unmask]]
> Sent: Monday, September 22, 2003 3:15 PM
> To: [log in to unmask]
> Subject: Re: [LCG-ROLLOUT] GOC monitoring problem
>
>
> Hi Trevor.
>
> I sent a message about it on friday: last week I reinstalled
> the adc0018
> CE so that it now shares the /home directory with its two
> WNs. This also
> mean that its jobmanager is now called jobmanager-pbs-xxx instead of
> jobmanager-lcgpbs-xxx.
>
> I would recommend that you use the CERN main CE, adc0015.cern.ch, for
> the monitoring: its configuration is supposed to stay much more stable
> than that of adc0018.
>
> Cheers
>
> Emanuele
>
> "Daniels, T (Trevor)" wrote:
> >
> > The monitoring failure which affected the gppmon webpage
> this morning was
> > due to the edg UI I was using being updated on Sunday
> afternoon. I don't
> > know what the changes were, but they caused all the
> monitors I was running
> > to fail in a way which suggests some port numbers were
> changed. The person
> > applying the changes is not in today (or tomorrow) so I can't check.
> >
> > In the meantime I've switched to an lcg UI (probably
> sensible to move there
> > anyway now the rate of updating lcg nodes has slowed) and
> the gppmon map
> > should now be updating each hour again.
> >
> > All RBs (including FNAL :-) are now green except for RAL
> and CERN. RAL is
> > still waiting for a change to the firewall. On the last
> attempt CERN failed
> > with a "Cannot plan (a helper failed)" message, which
> usually means the
> > queue name is wrong???
> >
> > Trevor
> > .lf n25
> > Dr Trevor Daniels
> > c/o CCLRC eSC Department Phone: (+44)|(0) 1235 778093
> > Rutherford Appleton Laboratory Fax: (+44)|(0) 1235 446626
> > Chilton, DIDCOT, Oxon, OX11 0QX, UK Email: [log in to unmask]
> > The contents of this email are sent in confidence for the use of the
> > intended recipient only. If you are not one of the
> intended recipients do
> > not take action on it or show it to anyone else, but return
> this email to
> > the sender and delete your copy of it.
>
> --
> /------------------- Emanuele Leonardi -------------------\
> | eMail: [log in to unmask] - Tel.: +41-22-7674066 |
> | IT division - Bat.31 2-012 - CERN - CH-1211 Geneva 23 |
> \---------------------------------------------------------/
>
|