Hi,
Assuming this is your CE
lcgce02.phy.bris.ac.uk
metrics history data can be seen here
http://tinyurl.com/2uq4b7q
Seems like someone tried to execute the passive metric actively. E.g., like this
[root@samnag011 ~]# nagios-run-check -v -H boalice3.bo.infn.it -s
org.sam.WN-CAver-/ops/Role=lcgadmin
Executing command:
su nagios -l -c '/usr/lib64/nagios/plugins/check_dummy 3 "This metric is part of
the org.sam.CE-JobState bundle and cannot be executed independently."'
Output:
UNKNOWN: This metric is part of the org.sam.CE-JobState bundle and cannot be
executed independently.
[root@samnag011 ~]#
Contact admins of gridppnagios.physics.ox.ac.uk
Konstantin
Winnie Lacesso wrote:
> Greetings all,
>
> One SL4 lcg-CE failed *one* CAver test at 05:23:47 on 2010 Aug 2 -
> not a huge surprise, when HPC gpfs gets very sluggish this can happen
> intermittently (although it's been very good for a while)
>
> The next CAver test runs at 09:59:36 2010 Aug 2 and turns grey, with data:
>
> UNKNOWN: This metric is part of the org.sam.CE-JobState bundle and cannot
> be executed independently.
>
> All subsequent SAM CAVer tests have same output. Then the SAM tests
> for this CE, starting today, have no line for the test CE-org.sam.WN-CAver
>
> Where to find any data on what this means?
>
> In gridppnagios.physics.ox.ac.uk, this CE CAver shows status UNKNOWN, the
> last test time is 08-02-2010 12:41:31 (yesterday) and the StatusInfo is
> the same message.
>
> The CE seems perfectly healthy & is passing all other tests (and lhcb &
> cms), just this weird ops CAVer UNKNOWN.
> But for OPS SAM this one ill test makes the whole node FAIL which
> is completely false!
>
> Very grateful for advice/pointers/hints/clues!
|