JiscMail Logo
Email discussion lists for the UK Education and Research communities

Help for DATA-PUBLICATION Archives


DATA-PUBLICATION Archives

DATA-PUBLICATION Archives


DATA-PUBLICATION@JISCMAIL.AC.UK


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

DATA-PUBLICATION Home

DATA-PUBLICATION Home

DATA-PUBLICATION  May 2013

DATA-PUBLICATION May 2013

Options

Subscribe or Unsubscribe

Subscribe or Unsubscribe

Log In

Log In

Get Password

Get Password

Subject:

Re: Registrastion for access to data

From:

Joe Hourcle <[log in to unmask]>

Reply-To:

Joe Hourcle <[log in to unmask]>

Date:

Tue, 21 May 2013 08:30:46 -0400

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (52 lines)

On May 21, 2013, at 7:41 AM, Brian Matthews wrote:

> I think that is important to take into account that there is a process of cultural change going  on here, from a world where data is routinely kept private to one where data is a measurable research output, and in some circumstances the sensitivities of data providers need to be taken into account and reassurance on what they might consider "misuse" of their data. 
> 
> The analogy with publications does not quite work.  With a paper, the content provider is already ready and willing to release the information is a way that reflects their intellectual input, and, most importantly as things are currently constituted, in a way that they are measured for research assessment, affecting status and jobs.    So they encourage anonymous access and citation as much as possible.  With data, researchers may not feel comfortable about releasing it as they may feel that their intellectual input is not complete (with a threat of pre-emption of results from their own data) or get appropriate credit from the reuse of data in the form of citation counts and research evaluation.  
> 
> So registration may be a step along the way in bringing reluctant data providers along.  Do we want empty anonymous access data repositories, or full ones with a low registration barrier?  Ultimately, as researchers acclimatise to this new world and evaluation metrics take into account the value of data reuse, these barriers will hopefully come down.
> 
> I agree with Astrid that this varies on the nature of the data and also considerably between communities - those used to pooling data (e.g. environmental or social science) maybe more comfortable about anonymous open release than say chemistry or materials science.  In my own work, I work with the ISIS neutron facility which has a data policy requesting registration, set up in consultation with data providers - but there is no sense of vetting those registrations beyond checking for spam.    But then ISIS works with a relatively specialised community.

...

So the original question was if the *users* would take issue to having to register.

... and I attempted to couch my initial answer by giving some ways in which you'd be more likely to get people to register by offering them additional services.

In the interest of playing devil's advocate on this, I'm going to try to list some of the reasons why data providers / publishers / archives might want people to register:

1. Tracking how many distinct people are interested in their data.  (IP addresses are a proxy, but inaccurate)

2. Being able to ask people to write letters of support when it comes time for a senior review (or whatever they have to do to justify their funding).

3. Being able to notify people of new data products, particularly those that deprecated those they had downloaded.

4. Being able to notify people of improved or changed documentation.

5. Tracking resource allocation, and limiting a single person from hogging resources.  (this is especially important for brokering, or other situations where you do processing on demand)

6. Being able to notify people that you've had to block them because they were being abusive (see #5).

7. Providing access to restricted data.  This might be stuff that's still embargoed, has privacy concerns, was obtained via some agreement that had restrictive clauses in it, IRB, ITAR, etc.  In some cases, it's the lower-level "raw" data, particularly if there's been fuzzing for the publicly available data.

8. Providing access to 'personal space'.  (a mix of #5 and #7) ... some systems allow you to either process data and share it w/ your colleagues, or to upload data to share, but it's not made fully public.  (I've heard it referred to as 'dropbox for data')



Note that some of these only require the downloader to register once, while others might require them authenticating each time they download.

Some require just giving an e-mail address, others might require a more complex vetting process.

I *know* that I've seen talks on this topic.  I want to say that it was from of the early RDAP meetings, when there were a good number of presentations on iRODS (which *does* do authentication), but it might've also been from the folks at Globus (GridFTP, Globus Online) who mentioned the 'dropbox for data'.

I personally run into the #5 and #6 ... we just had to block two IP addresses last week, because they'd start a command to download the data, but then break the connection and come in again.  (and the software we're using to serve that particular data collection is so brittle that it doesn't handle disconnects well, so they end up DOSing the machine).  Back in the days of FTP, it was easy, as you could just e-mail someone and tell them how to fix the problem.  Now, you have to block them, and wait to see if they e-mail you to complain.

(and in the case of connections from foreign countries, especially China, it's quite unlikely)


-----
Joe Hourcle
Programmer/Analyst
Solar Data Analysis Center
Goddard Space Flight Center

Top of Message | Previous Page | Permalink

JiscMail Tools


RSS Feeds and Sharing


Advanced Options


Archives

April 2024
March 2024
February 2024
January 2024
November 2023
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
March 2023
February 2023
January 2023
November 2022
October 2022
September 2022
August 2022
July 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
December 2021
November 2021
October 2021
September 2021
August 2021
July 2021
June 2021
May 2021
April 2021
March 2021
February 2021
January 2021
November 2020
September 2020
August 2020
July 2020
May 2020
April 2020
March 2020
February 2020
November 2019
October 2019
July 2019
June 2019
April 2019
February 2019
January 2019
December 2018
November 2018
October 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013


JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager