Hi,
I cannot find anything to go on here so any help would be appreciated.
I'm getting occasional groups of SFT RM failures where the copy and
register appears to work but the copy back and replication to a third
party fail. I have only seen this in the SFT's I've run about twenty
tests of rapid lcg-cr, lcg-lr, lcg-cp, lcg-del with no failures. The
problem goes away on it's own.
It looks like it could be some sort of silent failure on write but since
I cannot replictate it I cannot check whether the files that have this
problem really are in PNFS or the pool.
The output from the relavant bit of the SFT is below. Does anyone else
running dCache see these errors, any ideas how I can get rid of them?
I'm running dCache 1.6.6-3, with the PNFS DB in postgres.
Thanks,
Chris.
2006-01-27 09:50:08
Checking lcg-cr command
Create a local file: sft-lcg-rm-cr.txt
Move the file to the default SE (heplnx204.pp.rl.ac.uk) and register it
with the LFN: sft-lcg-rm-cr-heplnx20.pp.rl.ac.uk.0601270949
++ pwd
+ lcg-cr -v --vo dteam -d heplnx204.pp.rl.ac.uk -l
lfn:sft-lcg-rm-cr-heplnx20.pp.rl.ac.uk.0601270949
file:///scratch/WMS_heplnx20_07058_https_3a_2f_2fegee-rb-08.cnaf.infn.it
_3a9000_2fvvejYN44GHWvYleOuzqE3A/sft-lcg-rm-cr.txt
0 bytes 0.00 KB/sec avg 0.00 KB/sec inst
0 bytes 0.00 KB/sec avg 0.00 KB/sec instUsing grid
catalog type: edg
Source URL:
file:///scratch/WMS_heplnx20_07058_https_3a_2f_2fegee-rb-08.cnaf.infn.it
_3a9000_2fvvejYN44GHWvYleOuzqE3A/sft-lcg-rm-cr.txt
File size: 233
VO name: dteam
Destination specified: heplnx204.pp.rl.ac.uk
Destination URL for copy:
gsiftp://heplnx165.pp.rl.ac.uk:2811//pnfs/pp.rl.ac.uk/data/dteam/generat
ed/2006-01-27/filedbceaa97-f04b-40b5-b631-0fb8ea067024
# streams: 1
# set timeout to 0 seconds
Alias registered in Catalog:
lfn:sft-lcg-rm-cr-heplnx20.pp.rl.ac.uk.0601270949
Transfer took 1540 ms
Destination URL registered in Catalog:
srm://heplnx204.pp.rl.ac.uk/pnfs/pp.rl.ac.uk/data/dteam/generated/2006-0
1-27/filedbceaa97-f04b-40b5-b631-0fb8ea067024
guid:5a71529e-3656-4208-9a19-179a6ed0724a
+ result=0
+ set +x
List the replicas:
+ lcg-lr --vo dteam lfn:sft-lcg-rm-cr-heplnx20.pp.rl.ac.uk.0601270949
srm://heplnx204.pp.rl.ac.uk/pnfs/pp.rl.ac.uk/data/dteam/generated/2006-0
1-27/filedbceaa97-f04b-40b5-b631-0fb8ea067024
+ set +x
2006-01-27 09:50:11
Check lcg-cp command - get file back store it in sft-lcg-rm-cp.txt
++ pwd
+ lcg-cp -v --vo dteam lfn:sft-lcg-rm-cr-heplnx20.pp.rl.ac.uk.0601270949
file:///scratch/WMS_heplnx20_07058_https_3a_2f_2fegee-rb-08.cnaf.infn.it
_3a9000_2fvvejYN44GHWvYleOuzqE3A/sft-lcg-rm-cp.txt
the server sent an error response: 553 553 Permission denied, reason:
CacheException(rc=666;msg=can't get pnfsId (not a pnfsfile))
lcg_cp: Permission denied
+ result=1
+ set +x
The contents of file sft-lcg-rm-cp.txt is:
cat: sft-lcg-rm-cp.txt: No such file or directory
|