Print

Print


Dear All,

Sam may already have brought this up in the storage meeting, but if not it is perhaps something to check/discuss in the next one.

Jeremy



Begin forwarded message:

Date: 27 May 2015 08:26:51 BST
From: Oliver Keeble <[log in to unmask]<mailto:[log in to unmask]>>
To: dpm-collaboration <[log in to unmask]<mailto:[log in to unmask]>>
Subject: GridFTP redir


Hi,

Please consider helping out with this testing of Gridftp redirection as part of your collaboration contributions. This is a strategically important part of our plan to simplify DPM and make storage management easier in the long run.

Thanks,

Oliver.


-------- Forwarded Message --------
Subject: Re: Enabling GridFTP redir on pilot prod sites (Was: Re: gsiftp service endpoints)
Date: Thu, 7 May 2015 10:04:31 +0200
From: Fabrizio Furano <[log in to unmask]<mailto:[log in to unmask]>>
To: Alessandra Forti <[log in to unmask]<mailto:[log in to unmask]>>, Fabrizio Furano <[log in to unmask]<mailto:[log in to unmask]>>, Andrea Sartirana <[log in to unmask]<mailto:[log in to unmask]>>, Alessandro Di Girolamo <[log in to unmask]<mailto:[log in to unmask]>>, Alessandro De Salvo <[log in to unmask]<mailto:[log in to unmask]>>
CC: andrea manzi <[log in to unmask]<mailto:[log in to unmask]>>, Tomas Javurek <[log in to unmask]<mailto:[log in to unmask]>>, Oliver Keeble <[log in to unmask]<mailto:[log in to unmask]>>, Andrey Kirianov <[log in to unmask]<mailto:[log in to unmask]>>

Hi all,

I saw this morning that  dpm-dsi-1.9.5-3 is finally available
in epel-testing, so I think that we can technically start the exercise
of enabling the gridftp redirection in a few pilot sites.

Thank you very much again for your availability for this, and
apologies for the delay. This was due to the fact that we found
a few issues with the previous version, which have been fixed and retested.

Please let me know when you are available for starting the setups.
Of course we will stay tuned and do our best to give prompt support.

Thank you
Fabrizio


On 04/20/2015 09:33 PM, Alessandra Forti wrote:
Hi,

for the record I'm on holiday until the end of April and will be unresponsive until then.

cheers
alessandra

On 21/04/2015 00:01, Fabrizio Furano wrote:
Hi Andrea, all,

thank you very much for getting into this.
From our point of view the latest release of dpm-dsi appeared yesterday
in epel-testing, and we are testing it here today... work is in progress.

We'll start testing FTS between today and tomorrow, and if
everything is fine we'll ask you and Alessandra to evaluate it.

Are you using the latest one from epel-testing ? Does it seem to work fine with gfal2 ?

Cheers
Fabrizio


On 20/04/15 14:30, Andrea Sartirana wrote:
Hi,

I'm really sorry for the very late reply at this mail. Here at GRIF_LLR
we were completely overwhelmed by other stuff.

One first feedback:
I've tested srm-less PhEDEx configuration on our Test-bed site.
It is working somehow fine as you can see in [1]  but there are a couple
of things I would like to understand before adopting the same conf in prod.

- FTS transfers from LLR storage are just fine but for incoming
transfers all fail with "DESTINATION file size is 0" . But then the file
IS on disk and with the good size, so PhEDEx manages to validate the
transfer. See [2]. This is not very clean. I'd prefer that the FTS
transfer end up correctly (and that FTS itself checks the checksum and
size). What am I missing?

- I had to deactivate the PhEDEx checksum validation for incoming
transfers as the only method I know for getting the checksum of a DPM
file (w/o re-calculating it each time) was lcg-get-checksum... but this
is SRM based. This is just my ignorance then. What can I use to get the
checksum of a DPM file?


Regards,
a.

[1]
https://cmsweb.cern.ch/phedex/test/Activity::QualityPlots?graph=quality_all&entity=link&src_filter=GRIF&dest_filter=&no_mss=true&period=l72h&upto=



[2]
[phedex@llrphedex ~]$ glite-transfer-submit -s
https://lcgfts3.gridpp.rl.ac.uk:8443
srm://node12.datagrid.cea.fr:8446/srm/managerv2?SFN=/dpm/datagrid.cea.fr/home/cms/trivcat/store/PhEDEx_Debug/LoadTest07_GRIFDAPNIA/LoadTest07_GRIFDAPNIA_8F

gsiftp://llrpp01.in2p3.fr:2811//dpm/in2p3.fr/home/cms/trivcat/store/PhEDEx_LoadTest07/LoadTest07_Debug_T2_FR_GRIF_IRFU/T2_FR_GRIF_LLR/21/LoadTest07_GRIF_DAPNIA_8F_EGJ7yAeOIEb3b0lV_21.tmp.4


fc777684-c018-49dd-8ba7-e13eb17449b7

[phedex@llrphedex ~]$ glite-transfer-status -l -s
https://lcgfts3.gridpp.rl.ac.uk:8443 fc777684-c018-49dd-8ba7-e13eb17449b7
Failed
  Source:
srm://node12.datagrid.cea.fr:8446/srm/managerv2?SFN=/dpm/datagrid.cea.fr/home/cms/trivcat/store/PhEDEx_Debug/LoadTest07_GRIFDAPNIA/LoadTest07_GRIFDAPNIA_8F


  Destination:
gsiftp://llrpp01.in2p3.fr:2811//dpm/in2p3.fr/home/cms/trivcat/store/PhEDEx_LoadTest07/LoadTest07_Debug_T2_FR_GRIF_IRFU/T2_FR_GRIF_LLR/21/LoadTest07_GRIF_DAPNIA_8F_EGJ7yAeOIEb3b0lV_21.tmp.4


  State:       Failed
  Retries:     0
  Reason:      DESTINATION file size is 0
  Duration:    27
[phedex@llrphedex ~]$ rfdir
/dpm/in2p3.fr/home/cms/trivcat/store/PhEDEx_LoadTest07/LoadTest07_Debug_T2_FR_GRIF_IRFU/T2_FR_GRIF_LLR/21/LoadTest07_GRIF_DAPNIA_8F_EGJ7yAeOIEb3b0lV_21.tmp.4


-rw-r--r--   1 102      106                2471769256 Apr 17 17:13
/dpm/in2p3.fr/home/cms/trivcat/store/PhEDEx_LoadTest07/LoadTest07_Debug_T2_FR_GRIF_IRFU/T2_FR_GRIF_LLR/21/LoadTest07_GRIF_DAPNIA_8F_EGJ7yAeOIEb3b0lV_21.tmp.4


[phedex@llrphedex ~]$


On 30/03/2015 16:37, Fabrizio Furano wrote:
Hi Alessandra,

yes, this was discussed, and there was agreement on proceeding. The next
steps are:

- we push to epel-prod the last globus6-proof fix of dpm-dsi
- we refurbish the tutorial to enable the gridftp redir
  that was written a few months ago
- apply it again to our testbed and test
- apply it to two prod sites and test

Andrey, the main responsible for dpm-dsi is here at CERN for
one month, starting today, so this is the timescale for the whole
exercise.

For the prod sites, I understood that you and Andrea Sartirana (added
to CC)
are interested in contributing to these tests, staying in close contact
with us. That will just be great, thank you very much.

The reconfig should be easier if the users of the gridftp service at
the site
can work in SRM-less mode, i.e. pure gridftp. I understood that CMS can,
ATLAS I don't know. Alessandro(Di Salvo+Girolamo), do you know if this is
possible for ATLAS ?

Thank you very much
Fabrizio


On 03/30/2015 11:27 AM, Alessandra Forti wrote:
Apologies Friday I wasn't feeling well was any of this discussed?

thanks

cheers
alessandra

On 24/03/2015 17:47, Fabrizio Furano wrote:
Hi Alessandra,

yes, we have them and we are about to refurbish them these days.
This discussion is a big coincidence because the gridftp redir
is one of the main topics of Friday's meeting, and Andrey is
soon coming for some time to CERN to work on that.

See you on Friday then
Fabrizio

On 24/03/15 18:39, Alessandra Forti wrote:
Hi,

are there instructions to enable the gsiftp redirection? Since I'm
working on DPM I might as well do that too.

cheers
alessandra

On 24/03/2015 16:06, Fabrizio Furano wrote:
Hi,

thank you Andrea for the detailed description.

The bottom line is that:

- DPMs historically have the gsiftp endpoint in the head node
- It will work if the site firewall allows it
- The performance will be poor until the site enables
  the "GridFTP redirection", available since last November

Cheers
Fabrizio


On 24/03/15 16:49, andrea manzi wrote:
Hi Alessandro,
( i put Fabrizio in cc as well)

every DPM has a gsiftp endpoint together with SRM

the endpoint is running both on the Head Node ( which is managing
the
namespace operation ) and on the disk nodes ( where data are stored)

At the moment in production if you contact one of the DPM gsiftp
endpoint running on the headnode the only way to get or write
data is
via RFIO tunnelling to the disk nodes, which means very bad
performances..

but we have a new feature ( named gridftp redirection) that noone is
using in production at the moment which enables the redirection
from the
gsiftp endpoint running from the headnode to the endpoints
running on
the disk nodes, without using RFIO tunnelling.

We are discussing in these days to push for the deployment of this
feature, Fabrizio can also add more details.

Regarding the discovery of the endpoints you can query the BDII,
i’m not
an expert but for instance with this query you get all gsiftp
endpoints
published

ldapsearch -LLL -x -H ldap://lcg-bdii.cern.ch:2170 -b o=grid
'(&(objectclass=GlueSEAccessProtocol)(GlueSEAccessProtocolType=gsiftp))'

| grep GlueSEAccessProtocolEndpoint

then the result should be filtered to get only the DPM ones.

last thing, given that now every operation on the headnode uses
SRM, it
could be that sites have the gsiftp endpoint on the Headnode behind
firewall

cheers
Andrea

On 24 Mar 2015, at 15:58, Alessandro Di Girolamo
<[log in to unmask]
<mailto:[log in to unmask]>> wrote:

Ciao,
one of the steps we need to pursue in moving on on the
"protocols" is
exploring the gsiftp service endpoints instead of the SRM ones.
I think that not all the SRM have behind a gsiftp service endpoint,
but some they do have. I may be wrong, but DPM endpoints have also
gsiftp endpoints which we can use "directly". Is this correct?
In case it is, how can we "discover" these gsiftp endpoints?

Thanks

--- --- --- --- --- --- --- --- --- --- ---
Alessandro Di Girolamo
CERN IT-SDC
+41764870532  (16-0532)
http://www.cern.ch/Diggi
--- --- --- --- --- --- --- --- --- --- ---