Hi Frederic,
it's a problem with the cvmfs nagios probe. Those errors show up on our machines
occasionally, but always on the ones that have a high memory usage. What I found
was that the nagios probe uses /usr/bin/attr to get information about the state
of the cvmfs filesystem, e.g. "/usr/bin/attr -q -g version /cvmfs/atlas.cern.ch"
to get the cvmfs version. If most of the memory is used, it's possible that attr
can't allocate enough kernel memory space to operate and fails with the error
message you've posted. I guess you can ignore the error of the cvmfs probe, it
should go away with the next check. If it doesn't you should check the memory
usage of the jobs running on that node.
Cheers,
Robert
On 06/02/14 13:54, SCHAER Frederic wrote:
> Hi
>
> Are there known issues with the atlas cvmfs repositories ?
> Since this morning, we get this kind of monitoring errors, which come and go away on random hosts :
>
> atlas.cern.ch ... attr_get: Cannot allocate memory
> Could not get "version" for .
> SERVICE STATUS: failed to read version attribute - test took 1 s
>
> We see no error for other repos... ?
>
> Our version :
> # rpm -qa|grep cvmfs
> cvmfs-init-scripts-1.0.20-1.noarch
> cvmfs-keys-1.4-1.noarch
> cvmfs-2.1.15-1.el6.x86_64
>
> Thanks
>
|