Hi Marteen,
I'm re-installing the CE with the proper umask in cfengine.
I'm not sure if this is the cause of this problem as I had installed the
CE manually previously and it had run fine for a while until last
Thursday before this occured, and which prompted my fresh install.
But, I agree the CE should be re-installed to eliminate problems I've
introduced in last install.
Thank you,
Yves
On Sat, 3 May 2008, Maarten Litmaath wrote:
> Hi Yves,
> a few comments inline.
>
>> The BDII (replacement of globus-mds and not the site BDII) kept dying.
>> It happened every hour just after the gatekeeper had received a kill
>> signal and restarted itself.
>
> The CE is not configured to restarted any service periodically.
> I would not be surprised if this were a result of the problem described
> in the other thread. Please _reinstall_ your CE with a correct umask,
> so that we may avoid a wild goose chase...
>
>> [...]
>>
>> The status of the marshals is misleading has they're actually running
>> fine as can be seen in the process list above.
>>
>> [root@epgce2 ~]# service globus-gass-cache-marshal status
>> globus-gass-cache-marshal dead but pid file exists
>> [root@epgce2 ~]# service globus-job-manager-marshal status
>> globus-job-manager-marshal dead but pid file exists
>
> https://savannah.cern.ch/bugs/index.php?36224
>
|