Hi,
The tests jobs were submited today at 11:40. There is one strange
problem which exists in many sites: Replication from default SE to
CASTOR service at CERN FAILED, with the following message on stderr:
Error: source and destination file sizes are not idential.
I suspect, that the problem is more general but for the time beeing I do
not know what is the reason.
There are also several sites that got timeout when trying to replicate a
file to castorgrid.cern.ch. Network problem? Castor problem? But most of
the sites do not have this kind of error...
Anyway, please try to reproduce the problems, debug them and fix them.
Piotr
Current test is 2004-06-11_11.40.57
SITE grid01.phy.ncu.edu.tw
SITE lcg00125.grid.sinica.edu.tw
SITE lcgce01.triumf.ca
SITE t2-ce-01.lnl.infn.it
SITE wn-04-07-02-a.cr.cnaf.infn.it
Test job still waiting in queue...
SITE atlasce.lnf.infn.it
SITE bigmac-lcg-ce.physics.utoronto.ca
SITE ce.gridpp.shef.ac.uk
SITE gf17.hep.man.ac.uk
SITE lcg2ce.ific.uv.es
SITE lunegw.lancs.ac.uk
SITE tbn18.nikhef.nl
Replication from default SE to CASTOR service at CERN FAILED!
ERROR MESSAGE:
Error: source and destination file sizes are not idential.
SITE atlasgrid04.usatlas.bnl.gov
SITE clrglop195.in2p3.fr
SITE ekp-lcg-ce.physik.uni-karlsruhe.de
3rd party replication from castorgrid.cern.ch to the default SE FAILED!
Copying 3rd party file to the WorkerNode FAILED!
ERROR MESSAGE:
GlobusURLCopy: the server sent an error response: 425 425 Can't open data connection. timed out() failed.
SITE atlasce01.na.infn.it
CopyAndRegisterFile to default SE FAILED!
CopyFile from default SE to WN FAILED
Replication from default SE to CASTOR service at CERN FAILED!
3rd party replication from castorgrid.cern.ch to the default SE FAILED!
Copying 3rd party file to the WorkerNode FAILED!
ERROR MESSAGE:
GridFTP: mkdir operation failed. the server sent an error response: 550 550 /flatfiles/SE00/dteam/generated: Permission denied.
Please check the permissions and ownerships in storage space.
SITE epcf36.ph.bham.ac.uk
JOB SUBMISSION FAILED!!!
ERROR MESSAGE:
Got a job held event, reason: Globus error 155: the job manager could not stage out a file
SITE gw39.hep.ph.ic.ac.uk
JOB SUBMISSION FAILED!!!
ERROR MESSAGE (logging info):
reason = Cannot read JobWrapper output, both from Condor and from Maradona.
SITE lcfgng.cs.tau.ac.il
SITE lcg02.physics.carleton.ca
CopyAndRegisterFile to default SE FAILED!
CopyFile from default SE to WN FAILED
Replication from default SE to CASTOR service at CERN FAILED!
3rd party replication from castorgrid.cern.ch to the default SE FAILED!
Copying 3rd party file to the WorkerNode FAILED!
Removal of replica from the default SE FAILED!
(probably because of 3rd party replication problem)
ERROR MESSAGE:
org.edg.data.reptor.info.InfoServiceException: No Service found edg-replica-metadata-catalog
Something wrong with your BDII?
SITE lcg2-ce.physik.rwth-aachen.de
CopyAndRegisterFile to default SE FAILED!
CopyFile from default SE to WN FAILED
Replication from default SE to CASTOR service at CERN FAILED!
3rd party replication from castorgrid.cern.ch to the default SE FAILED!
Copying 3rd party file to the WorkerNode FAILED!
Removal of replica from the default SE FAILED!
(probably because of 3rd party replication problem)
ERROR MESSAGE:
/opt/edg/var/etc/edg-replica-manager/edg-replica-manager.conf: Permission denied
SITE lcgce02.triumf.ca
JOB SUBMISSION FAILED!!!
ERROR MESSAGE:
reason = 7 authentication with the remote server failed
The following sites are OK:
cclcgceli01.in2p3.fr
ce01.lip.pt
ce01.ph.qmul.ac.uk
dgce0.icepp.jp
farm012.hep.phy.cam.ac.uk
golias25.farm.particle.cz
grid-ce1.desy.de
grid.uibk.ac.at
grid003.ft.uam.es
grid008.to.infn.it
gridkap01.fzk.de
gtbcg12.ifca.unican.es
heplnx131.pp.rl.ac.uk
lcg-ce.lps.umontreal.ca
lcg-ce.usc.cesga.es
lcg02.ciemat.es
lcg06.sinp.msu.ru
lcgce01.nic.ualberta.ca
lcgce02.gridpp.rl.ac.uk
lcgce02.ifae.es
lxn1181.cern.ch
lxn1184.cern.ch
lxt03.jinr.ru
mu6.matrix.sara.nl
pc31.hep.ucl.ac.uk
skurut17.cesnet.cz
t2-ce-01.mi.infn.it
wipp-ce.weizmann.ac.il
zeus02.cyf-kr.edu.pl
|