Hola,
>> Your CE essentially looks OK...
>
> The contents of /var/log/cleanup-grid-accounts.log look a bit unusual,
> there seem to be a lot of unexpected files being left behind:
>
> ----------------------------------------------------------------------
> [...]
> l 0777 1 18595 13991 65 Jul 5 11:55 \
> ./.lcgjm/globus-cache-export.I20194/.emergency-x509
> - 0644 1 18595 13991 0 Jul 5 11:55 \
> ./.lcgjm/globus-cache-export.I20194/file_cleanup.txt
> - 0600 1 18595 13991 0 Jul 5 11:55 \
> ./.lcgjm/globus-cache-export.I20194/batch.out
> l 0777 1 18595 13991 119 Jul 5 11:55 \
> ./.lcgjm/globus-cache-export.I20194/export.2
> - 0644 1 18595 13991 130 Jul 5 11:55 \
> ./.lcgjm/globus-cache-export.I20194/stage_out.txt
> - 0644 1 18595 13991 10240 Jul 5 12:30 \
> ./.lcgjm/globus-cache-export.I20194/gaew1017.27158.import.txt.tar
> [...]
> ----------------------------------------------------------------------
>
> At CERN we occasionally see a few such entries.
> The cleanup should mostly be for the GASS cache:
>
> ----------------------------------------------------------------------
> [...]
> d 0755 2 18595 13991 4096 Jul 22 05:51 \
> ./.globus/.gass_cache/local/md5/0e/5406dd055424ff88e1f57034dde0cf
> - 0644 2 18595 13991 47 Jul 5 11:54 \
> ./.globus/.gass_cache/local/md5/43/02846d87acad1f2e61139c735548cb/tag
> [...]
> ----------------------------------------------------------------------
>
May this have been caused by the problem with the lcg-expiregridmapdir
cron problem I told you about in a different private email?
> Are the grid account home directories on NFS?
No.
> Are there any other suspicious messages in the syslog?
Well, all I see is some different error messages of this type:
-------------------------------------------
Jul 18 04:04:52 lcg02 slapadd: sql_select option missing
Jul 18 04:04:52 lcg02 slapadd: auxpropfunc error no mechanism available
Jul 18 04:04:53 lcg02 slapadd: sql_select option missing
Jul 18 04:04:53 lcg02 slapadd: auxpropfunc error no mechanism available
Jul 18 04:04:54 lcg02 slapd[12652]: sql_select option missing
Jul 18 04:04:54 lcg02 slapd[12652]: auxpropfunc error no mechanism
available
-------------------------------------------
And some (fewer) of this:
-------------------------------------------
Jul 18 04:19:08 lcg02 glite-lb-interlogd[4761]: error reading server
wms209.cern.ch reply: get_reply: er
ror reading server reply
Jul 18 04:19:08 lcg02 glite-lb-interlogd[4761]: queue_thread: get_reply:
error reading server reply
-------------------------------------------
But in both cases, those were already there before the dgas errors
started to appear.
Cheers,
Antonio
|