Hi Mona,
Are you saying that this is a problem that is causing failures at all
dcache sites? If so then this is quite a problem. Can anybody on the
TB-support list offer any advice?
All the best,
david
Mona Aggarwal wrote:
> Hi Stuart,
>
> I had informed the storage group about the
> problem, and it seems all dCache sites are
> getting it. It could be related to some
> certificate upgrade etc.
>
> Cheers,
> Mona
>
> Stuart Wakefield wrote:
>> This is quite annoying, till we fix it could we open standard
>> non-authenticated dcap (i agree this is non-ideal but its either this
>> or our site is broken as far as cms is concerned and we have
>> production to run)
>>
>> Cheers
>> Stuart
>>
>> On Jan 14, 2008 2:00 PM, Mona Aggarwal <[log in to unmask]> wrote:
>>> Hi Stuart,
>>>
>>> I am aware of the problem, and looking
>>> into it.
>>>
>>> Last time, we had the similar problem and
>>> it was due to certificate format and was
>>> fixed after upgrading to the new dcache
>>> release.
>>>
>>> Cheers,
>>> Mona
>>>
>>>
>>> Stuart Wakefield wrote:
>>>> Hi
>>>>
>>>> My jobs are still dieing with this can someone check (also Matts
>>>> problem..)
>>>>
>>>> dccp
>>>> gsidcap://gfe02.hep.ph.ic.ac.uk:22128/pnfs/hep.ph.ic.ac.uk/data/cms/store/unmerged/test/2007/11/15/CSA07-ProdMgrTestLCG6_EWK_Zmumu_2-3582/GEN-SIM/0000/469D5F7E-5EC0-DC11-AFAB-003048898D90.root
>>>>
>>>> .
>>>> Dcap Version version-1-2-41 Oct 16 2006 16:09:04
>>>> Allocated message queues 0, used 0
>>>>
>>>> Allocated message queues 1, used 1
>>>>
>>>> Creating a new control connection to gfe02.hep.ph.ic.ac.uk:22128.
>>>> Activating IO tunnel. Provider: [libgsiTunnel.so].
>>>> Added IO tunneling plugin libgsiTunnel.so for
>>>> gfe02.hep.ph.ic.ac.uk:22128.
>>>> Sending control message: 0 0 client hello 0 0 2 41 -uid=30078
>>>> -pid=10885 -gid=6747
>>>> Error ( POLLIN) (with data) on control line [3]
>>>> Removing [3] form control lines list
>>>> Failed to connect to gfe02.hep.ph.ic.ac.uk:22128
>>>> Failed to create a control line
>>>> [-1] unpluging node
>>>> Removing unneeded queue [1]
>>>> [-1] destroing node
>>>> Using system native stat64 for ..
>>>> Allocated message queues 2, used 1
>>>>
>>>> Allocated message queues 2, used 2
>>>>
>>>> Creating a new control connection to gfe02.hep.ph.ic.ac.uk:22128.
>>>> Activating IO tunnel. Provider: [libgsiTunnel.so].
>>>> Added IO tunneling plugin libgsiTunnel.so for
>>>> gfe02.hep.ph.ic.ac.uk:22128.
>>>> Sending control message: 0 0 client hello 0 0 2 41 -uid=30078
>>>> -pid=10885 -gid=6747
>>>> Error ( POLLIN POLLERR POLLHUP) (with data) on control line [3]
>>>> Removing [3] form control lines list
>>>> Failed to connect to gfe02.hep.ph.ic.ac.uk:22128
>>>> Failed to create a control line
>>>> [-1] unpluging node
>>>> Removing unneeded queue [2]
>>>> [-1] destroing node
>>>> Failed open file in the dCache.
>>>> Can't open source file : Server rejected "hello"
>>>> System error: Input/output error
>>>> -bash-3.00$ voms-proxy-init -voms cms
>>>> Your identity: /C=UK/O=eScience/OU=Imperial/L=Physics/CN=stuart
>>>> wakefield
>>>> Enter GRID pass phrase:
>>>> -bash-3.00$ voms-proxy-info -all
>>>> subject : /C=UK/O=eScience/OU=Imperial/L=Physics/CN=stuart
>>>> wakefield/CN=proxy
>>>> issuer : /C=UK/O=eScience/OU=Imperial/L=Physics/CN=stuart wakefield
>>>> identity : /C=UK/O=eScience/OU=Imperial/L=Physics/CN=stuart wakefield
>>>> type : proxy
>>>> strength : 512 bits
>>>> path : /tmp/x509up_u30078
>>>> timeleft : 11:59:50
>>>> === VO cms extension information ===
>>>> VO : cms
>>>> subject : /C=UK/O=eScience/OU=Imperial/L=Physics/CN=stuart wakefield
>>>> issuer : /DC=ch/DC=cern/OU=computers/CN=voms.cern.ch
>>>> attribute : /cms/Role=NULL/Capability=NULL
>>>> attribute : /cms/analysis/Role=NULL/Capability=NULL
>>>> attribute : /cms/Higgs/Role=NULL/Capability=NULL
>>>> timeleft : 11:59:50
>>>>
>>>> Cheers
>>>> Stuart
>>>>
>>>> On Jan 12, 2008 12:56 PM, Stuart Wakefield
>>>> <[log in to unmask]> wrote:
>>>>> Hi
>>>>>
>>>>> My jobs are now failing to access files via dccp but srm seems fine..
>>>>>
>>>>> [stuartw@gfe03 stuartw]$ dccp
>>>>> dcap://gfe02.hep.ph.ic.ac.uk:22128/pnfs/hep.ph.ic.ac.uk/data/cms/store/unmerged/test/2007/11/15/CSA07-ProdMgrTestLCG6_EWK_Zmumu_2-3582/GEN-SIM/0000/469D5F7E-5EC0-DC11-AFAB-003048898D90.root
>>>>>
>>>>> .
>>>>> Error ( POLLIN) (with data) on control line [3]
>>>>> Failed to create a control line
>>>>> Error ( POLLIN POLLERR POLLHUP) (with data) on control line [3]
>>>>> Failed to create a control line
>>>>> Failed open file in the dCache.
>>>>> Can't open source file : Server rejected "hello"
>>>>> System error: Input/output error
>>>>>
>>>>> Cheers
>>>>> Stuart
>>>>>
>>>>> On Jan 11, 2008 7:06 PM, Wingham, Matthew P
>>>>>
>>>>> <[log in to unmask]> wrote:
>>>>>>
>>>>>>
>>>>>> Hi Mona,
>>>>>>
>>>>>> Unfortunately I still get the same.
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> -----Original Message-----
>>>>>> From: Mona Aggarwal [mailto:[log in to unmask]]
>>>>>> Sent: Fri 1/11/2008 6:08 PM
>>>>>> To: Wakefield, Stuart L
>>>>>> Cc: DGUSER; Wingham, Matthew P
>>>>>> Subject: Re: Fwd: [Hep-cms-computing] dcache from gfe03
>>>>>>
>>>>>> Stuart Wakefield wrote:
>>>>>>> ---------- Forwarded message ----------
>>>>>>> From: Wingham, Matthew P <[log in to unmask]>
>>>>>>> Date: Jan 11, 2008 5:19 PM
>>>>>>> Subject: [Hep-cms-computing] dcache from gfe03
>>>>>>> To: hep-cms-computing <[log in to unmask]>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Hi,
>>>>>>>
>>>>>>> Im trying to remove some old files on dcache from gfe03. As per the
>>>>>>> twiki I try :
>>>>>>>
>>>>>>> bash-2.05b$ uberftp cmsdsk00
>>>>>>> 220 GSI FTP Door ready
>>>>>>> 530 Authorization Service failed:
>>>>>>> diskCacheV111.services.authorization.AuthorizationServiceException:
>>>>>>> authRequestID 1056800682 delegation failed for authentification of
>>>>>>> /C=UK/O=eScience/OU=Imperial/L=Physics/CN=matthew wingham
>>>>>>> java.net.SocketException: Connection reset
>>>>>>>
>>>>>>> Or if i try :
>>>>>>>
>>>>>>> bash-2.05b$ srm-advisory-delete
>>>>>>>
>>>>>> srm://gfe02.hep.ph.ic.ac.uk:8443/pnfs/hep.ph.ic.ac.uk/data/cms/local/users/pwing/tt-ee_1.root
>>>>>>
>>>>>>> WARNING: SRM_PATH is defined, which might cause a wrong version of
>>>>>>> srm client to be executed
>>>>>>> WARNING: SRM_PATH=/opt/d-cache/srm
>>>>>>> srm client error: ; nested exception is:
>>>>>>> java.net.SocketException: Connection reset
>>>>>>>
>>>>>>> This is after voms-proxy-init....
>>>>>>>
>>>>>>> Is there a problem with my certificate? Ive recently re-registered
>>>>>>> with the VO. Though it seems to work properly elsewhere....
>>>>>> I have recently updated voms certificate, could you pls try again?
>>>>>>
>>>>>> Cheers,
>>>>>> Mona
>>>>>>
>>>>>> --
>>>>>> Mona Aggarwal- Imperial College
>>>>>> Tel: +442075947809
>>>>>> Email: [log in to unmask]
>>>
>>> --
>>>
>>> Mona Aggarwal- Imperial College
>>> Tel: +442075947809
>>> Email: [log in to unmask]
>>>
>
>
|