Hi Anna - also important to be clear about the different requirements and costs for data storage for active data management on the one hand, and longer term preservation for reuse and discovery on the other. A mixture of internal and external resources likely to be required, as others have identified, potentially varying from use case to use case across an institution. I say use case rather than discipline as there is a huge range of needs just within one discipline - from the big missions to the long tail of individuals and small collaborations including in the sciences. But I think it is becoming more feasible to line up a range of such internal and external services, as needed, to provide for the majority of use cases. Cheers, Jonathan ---------------------------------------------------------------------------------- Dr Jonathan Tedds Tel: +44 (0)116 229 7780<tel:%2B44%20%280%29116%20229%207780> Senior Research Fellow<http://uk.linkedin.com/pub/jonathan-tedds/16/79b/769>, (0)779 504 6277<tel:(0)779%20504%206277> Director: Health And Research Data Informatics (Health Sciences<http://www2.le.ac.uk/departments/health-sciences/research/gen-epi>) @jtedds<http://twitter.com/jtedds> PI, Biomedical Research Infrastructure Software Service (#BRISSKit<http://brisskit.le.ac.uk/>) PI, #PREPARDE<http://www.le.ac.uk/projects/preparde/> & Co-Chair Research Data Alliance Publishing Data Astronomical Surveys & e-Research<http://xmmssc-www.star.le.ac.uk/~jat> (Physics & Astronomy<http://www2.le.ac.uk/departments/physics/research/xroa>) University of Leicester<http://www2.le.ac.uk/> Leicester LE1 7RH, UK Email: [log in to unmask]<mailto:[log in to unmask]> ---------------------------------------------------------------------------------- On 2 Apr 2014, at 19:29, "Anna Clements" <[log in to unmask]<mailto:[log in to unmask]>> wrote: Thanks, both - and I probably agree overall. The biggest issue I face right now is trying to estimate costs - even for such things that should be straightforward such as the data storage infrastructure. I know there has been some work on this and also the 4C project is underway ... But does anyone have a costing model they would be willing to share particularly comparing local I house storage to 3rd party provision. (I am aware of the costing model for Arkivum) Particularly given that we are also dealing with considerable uncertainty on 1. How much data we have and growth rate 2. How much we expect to be able to store in external data centres and 3. How much we think will be open access and therefore easily accessible and shareable cf what we expect to be restricted / by request access - so scalability is a key factor. Anna Anna Clements Head of Research Data and Information Services University of St Andrews St Andrews, Fife,KY16 9AL @annakclements On 2 Apr 2014, at 18:56, "Kevin Ashley" <[log in to unmask]<mailto:[log in to unmask]>> wrote: On 02/04/14 18:21, Anna Clements wrote: Thanks, Rachel and for the summary - will read in detail but initial question is ... Is this report then evidence for the need for (inter)national & subject data centres to be developed where none exist .. and so maybe remove the need for institutions to do this themselves - duplicating costs and effort? My opinion: The answer is yes to the first part, and no to the second. Yes, it is evidence that the existing data centres deliver value and hence can support the case for more of them to exist. But even if funders take that up with enthusiasm, it will take some years for it to happen and there will still be areas of research that aren't covered. There will thus be a role for institutions for the foreseeable future. These questions were examined in the UKRDS report back in 2010 and I don't think the picture has changed substantially since then. As John Milner said, the solutions may involve regional consortia and/or some mix of commercial suppliers for some of the infrastructure, but intellectual custody will still remain the responsibility of institutions themselves.