-------- Forwarded Message -------- Subject: Data transfer technologies Date: Sat, 20 Jun 2015 15:46:06 +0100 From: David Colling <[log in to unmask]> Reply-To: National E-Infrastructure Project Directors Group <[log in to unmask]> To: [log in to unmask] Dear PDG members, I am not sure if this is the best list to which to send this but it was the best that I can think of at at the moment. If anybody has suggestion for a better list then please suggest. In conversations with members of this group it has become clear that we are in increasing need of a data transfer system capable of handling large amounts of data in a flexible and easy to use manner. The HEP community already has a tool to do this called FTS but it is not a simple standalone tool and generally assumes that quite a lot of the software that our community uses is installed. So I asked Sam Skipsey to look into what we should suggest that other communities with who GridPP works should use. Sam reported back to our fortnightly GridPP technical meeting yesterday. He has only had a chance to do fairly superficial work and he is going to look more deeply but I still think that he report was interesting. His presentation can be found at: https://indico.cern.ch/event/403303/ David F. joined our meeting yesterday as well so if I misrepresent anything he can correct me. The bottom line would appear to be that there are two possibilities for the sort of functionality that I think people are looking for. These are Globus Connect or WebFTS. Both work on the same underlying technology and offer a very similar service however both have pros and cons. Sam has a direct comparison on slide 16 but for those too busy to read Sam's talk I will give a very brief summary below. The Globus service is pretty good and has much (if not quite all) the functionality that people are looking for. It supports a range of third party authentication methods as well as the default X.509 certificates (and even when using certificates it can do so in a hiden way). The downside is that while some of the basic functionality is free Globus have this to make money and to get the full range of functionality it costs $90K/year/grouping. WebFTS is still only a beta release and is currently much less polished than Globus. It's range of supported third authentication is more limited and is not as smooth to set up (yet). However it is open source and is free and will remain so. I am almost ambivalent as to what we choose as HEP will still use FTS for the transfers of large amounts of data (not using any web interface) and from Sam's review both of the leading options seem to be pretty good if at different stages of development. My thoughts were that if the UK communities are going to use Globus en masse then we should probably try to get a UK wide price from Globus rather than each community paying these few $K for the specific range of services and data volume that they want to move. If the UK decides that the free WebFTS is the way forward then I think that it would probably be a good thing to pay somebody to work with the developers (for a year or two) to make sure that we get the functionality that we want in there reasonably soon and that nothing that matters to us is overlooked. I said that I was almost ambivalent and the reason that I am only almost is because I think that if we worked with the WebFTS people we would have a far greater possibility of getting support in there for things like JISC Assent and safe share than we would with Globus. However that is just my feeling and I have no real evidence to back this up. Anyway, I thought it worth disseminating this information as it is a topic that we have often discussed. BTW ... the next GridPP Technical meeting in just under a fortnight will focus on traceability in clouds and anybody interested is welcome to come along. Best, david