Hi,

After replacing our LCG-CEs with CreamCEs, we keep having problems where
our CreamCEs get blacklisted by the WMSs that the VO SAM tests use. It has
happened to all the CEs at some point or other - they seem to run fine for
a few days to a week then hit this.

It is not transitory: the SAM tests start failing and continue to fail
until we intervene. Restarting the gLite services does not appear to fix
the bug, but rebooting does.

Other WMSs, including the ones used by the NGI_UK ops SAM tests, continue
to work fine with the CreamCEs.

It does not appear to be load related as the boxes seem to have plenty of
free memory and do not appear to be under heavy load when it happens.

We've increased the innodb_buffer_pool_size and reduced the purge times
for both the Cream and Blah components, but this does not appear to have
fixed the issue.

We're using the UMD release with Argus authentication.

Any ideas what else I should be trying?

Thanks,
Chris.