Hi Yves,
Our version of yaim is glite-yaim-3.0.0-34
Is this the problem? Should we upgrade? Just yaim, or all of glite?
Dave
Yves Coppens wrote:
> Hi David,
>
> If your version of yaim is glite-yaim-3.0.1-*, then the problem may be in:
>
> /opt/edg/etc/edg-mkgridmap.conf. If you do not have sgm and prd pool
> accounts but single accounts, it should look like:
>
> "/VO=ops/GROUP=/ops/ROLE=lcgadmin/Capability=NULL" opssgm
> "/VO=ops/GROUP=/ops/ROLE=lcgadmin" opssgm
> "/VO=ops/GROUP=/ops/ROLE=production/Capability=NULL" opsprd
> "/VO=ops/GROUP=/ops/ROLE=production" opsprd
> "/VO=ops/GROUP=/ops/Role=NULL/Capability=NULL" .ops
> "/VO=ops/GROUP=/ops" .ops
>
> for the ops VO. If you do have pool accounts then opssgm should be
> replaced by .opssgm (which is what yaim does). The same holds for the prd
> account and other sgm and prd VO accounts.
>
> Yves
>
>
> On Thu, 7 Jun 2007, David Robson wrote:
>
>
>> Yesterday, we were frequently failing the SAM CE tests, although non-OPS
>> VO jobs seemed to be OK.
>>
>> After following the discussions in the news groups, I reached the
>> conclusion that the problem was
>> due to the lack of sgm and prd accounts. Therefore I added lines of the
>> form
>>
>>
>> 50601:opssgm001:1932:ops:ops:sgm:
>> 51301:opsprd001:1932:ops:ops:prd:
>>
>> to users.conf for each VO, and then ran configure_node on the CE and all
>> WNs. Now we are failing ALL our SAM CE tests.
>> I reversed the change by deleting the new accounts from users.conf and
>> running configure_node on the CE and WNs again,
>> but we are still failing ALL the tests. I don't see anything wrong in
>> the globus-gatekeeper logs, I can su to ops001 and prove
>> that ssh between the WN nodes is OK, and I can submit jobs internally
>> with qsub.
>>
>> Any ideas anyone on how to debug this?
>>
>> Thanks in advance
>>
>> Dave
>>
>>
>
>