Hi Steve,
you are not the only one:
https://ggus.eu/ws/ticket_info.php?ticket=85029
Cheers,
Daniela
On 9 August 2012 16:35, Stephen Jones <[log in to unmask]> wrote:
> Hi all,
>
> we've got a bit of a problem right now. At the Oxford (and Lancs?) Nagios,
> we're seeing a problem with our CE server at Liverpool, i.e. the
> org.sam.glexec.CE-JobSubmit-/ops/Role=pilot test is showing WARNING: [1/2]
> [Running->Cancelled [timeout/dropped]]
>
> Investigations show that a test script is trying to contact 195.251.55.110
> (broker.afroditi.hellasgrid.gr) on port 6163. It is being Connection
> Refused, then sleeping/retrying until the job dies. Furthermore, another 5
> or six servers within GridPP are having similar problems:
> hepgrid6.ph.liv.ac.uk ,hepgrid10.ph.liv.ac.uk, heplnx206.pp.rl.ac.uk
> ,heplnx207.pp.rl.ac.uk , heplnx208.pp.rl.ac.uk , lcgCE07.gridpp.rl.ac.uk,
> svr009.gla.scotgrid.ac.uk. ...
>
> The problem appears to be flip/flopping - it works on occasion then hangs
> the next. Has anyone seen this problem? Could it be with the monitoring
> servers?
>
> Cheers,
>
> Steve
>
> --
> Steve Jones [log in to unmask]
> System Administrator office: 220
> High Energy Physics Division tel (int): 42334
> Oliver Lodge Laboratory tel (ext): +44 (0)151 794 2334
> University of Liverpool http://www.liv.ac.uk/physics/hep/
--
Sent from the pit of despair
-----------------------------------------------------------
[log in to unmask]
HEP Group/Physics Dep
Imperial College
Tel: +44-(0)20-75947810
http://www.hep.ph.ic.ac.uk/~dbauer/
|