JISCMail - RESEARCH-DATAMAN Archives

Email discussion lists for the UK Education and Research communities

Subscriber's Corner

Email Lists

RESEARCH-DATAMAN Archives

RESEARCH-DATAMAN@JISCMAIL.AC.UK

View:

Message:

[

First

Last

]

By Topic:

[

First

Last

]

By Author:

[

First

Last

]

Font:

Proportional Font

		LISTSERV Archives
		RESEARCH-DATAMAN Home
		RESEARCH-DATAMAN April 2016

Options

Subscribe or Unsubscribe

Get Password

Subject:

Re: How do you handle supplementary info?

From:

Robert Darby <[log in to unmask]>

Reply-To:

Research Data Management discussion list <[log in to unmask]>

Date:

Fri, 22 Apr 2016 10:37:26 +0000

Content-Type:

text/plain

Parts/Attachments:

text/plain (1 lines)

Hello all

We have had a similar discussion here at Reading in recent weeks, prompted by the occasional submission of SI files along with papers uploaded to our publications repository, CentAUR. Our experience so far has been that most authors do not deposit SI, that SI are usually single PDFs, sometimes a handful of files, containing abbreviated representations of data, often reproducing figures and tables included in the body of the article, that they are generally not very usable, and would make little sense as standalone datasets. This is our first pass at a policy and definitions:

Policy

Primary research data that underlie research publications should be submitted to the University’s Research Data Archive (or another suitable data service), where they will be preserved and access will be managed appropriately. 

Data submitted to CentAUR as supplementary information with the deposited publication, where the supplementary information is or will be provided in the same form alongside the published article on the publisher’s website, should be retained in CentAUR.

Any researcher who submits supplementary information to CentAUR should be advised to ensure they have preserved and enabled access to their underlying research data using the University Research Data Archive or another suitable data service.

Definitions

Supplementary information

Supplementary information is defined as one or more files representing data collected or generated in the reported research, which are, or are intended to be published alongside the article on the publisher’s website. For the purpose of this policy supplementary information is considered to form part of the associated article. [We understand the article as a complex digital object, which includes the paper itself, supplementary data or other files, and the publisher’s abstract page and associated metadata. All of these elements are usually identified by a single DOI]. 

Supplementary information will typically have one or more of the following characteristics:

- the information is in the form of a PDF or text document containing text and figures or tables, or a few small image or audiovisual files; 

- the information reproduces and collects together figures, tables, images, videos, etc. that are presented in the article, but adds no new information;

- the amount of information provided is negligible.   

Supplementary information has little or no use-value as a standalone dataset, and would not be suitable for inclusion in the Archive.

Underlying data

Underlying data are primary or raw data relating to a publication or research activity, which constitute a comprehensive, coherent, usable dataset, and which are not, or are not intended to be published alongside the article on the publisher’s website.

Underlying data will have one or more of the following characteristics:

- the data do not reproduce the supplementary information published alongside the article on the publisher’s website, but add new data not available on the publisher’s website;

- the data are underlying or ‘raw’ data: ‘the numbers behind the figures’, i.e. quantitative or qualitative information in a systematic presentation such as a table or a structured format; 

- the data are not presented in a PDF file, but in file formats that enable selection, manipulation and analysis, e.g. spreadsheets, editable text files, database, image, audio and video formats; 

- the data consist of one or more files that appear well-presented and ordered, with interpretive documentation embedded in the file and/or recorded in a separate documentation or readme file;

- they include files containing software code used to generate or interpret the data.

We’ll no doubt find exceptions to the general rule, and revise and refine in the light of experience, but this seems a reasonable starting point. We want to promote the University Archive as a service providing access to substantive, usable, well-documented data, and to concentrate on getting authors who provide SIs to publishers to put the primary underlying data, where these exist and are relevant, in a suitable preservation/sharing service. I’m not sure I see any great value in creating metadata records in our Research Data Archive for SI on publisher’s websites, or in adding the SI files to our Archive – especially given the added admin this would involve. But I’m keeping an open mind…

Regards

Robert

Dr Robert Darby

Research Data Manager

Research and Enterprise Development

The University of Reading 

Tel: 0118 378 6161

-----Original Message-----

From: Research Data Management discussion list [mailto:[log in to unmask]] On Behalf Of Gareth Knight

Sent: 18 April 2016 15:24

To: [log in to unmask]

Subject: Re: How do you handle supplementary info?

Hi Mary, all,

We regularly record details of SI files in the LSHTM data repository. This was motivated by a desire to showcase and maintain an institutional record of researchers' data outputs, including that held & published elsewhere. At first I simply catalogued these resources and directed people to the 3rd party website. However, to address researchers' criticism that many of our metadata records were empty I've started to add CC-licensed content where possible.

This is quite labour-intensive at the moment. I review each new publication in our repository for supplementary files and make a decision on whether it should be catalogued. This isn't particularly systematic, but covers factors such as: 

1. Content type: Is it a survey, processing script, dataset, software, or other output?

2. Size/extent: Is there a substantial amount of data? There needs to be some cut-off limit for content. I'm not convinced we need to have a separate record for a summary table with less than 10 rows, for instance.

3. File type: Is it held in a reusable format (XLS, SPSS, CSV)? PDFs are catalogued, but only if they contain substantial data tables or other data

There are a few questions that I've been struggling with, however:

1. How should we catalogue these files? I'd prefer to describe the SI files as a distinct entity, but it takes a long time to review the paper and data & authors are often uncommunicative. Is it sufficient to reproduce the publication abstract or use a blanket "supplementary info for XX" statement?

2. Should we be applying preservation action or enhancing these files?

3. Should we assign a DOI to these files? I've used the publication DOI in most cases, but is this the best approach?

4. Can we assume that the SI licence is the same as the publication?

More generally, it's be nice to automate the process of identifying and importing SI files relevant to publications.

Gareth

--

Gareth Knight

Research Data Manager,

Library & Archives Service

London School of Hygiene & Tropical Medicine Keppel Street, London WC1E 7HT UK

(+44) 020 7927 2564

[log in to unmask]

http://www.lshtm.ac.uk/research/researchdataman/

-----Original Message-----

From: Research Data Management discussion list [mailto:[log in to unmask]] On Behalf Of Rzepa, Henry S

Sent: 18 April 2016 13:19

To: [log in to unmask]

Subject: Re: How do you handle supplementary info?

Yes, it’s a complex area.  As chemists, around 1994 we set out on a project to define about  50 “media types” as part of what we called a  chemical MIME content type.  Quite a few of our choices still are in use but things have got far more complex since then. It might be worth taking a complete look at all the currently ratified  MIME types for some help http://www.sitepoint.com/web-foundations/mime-types-complete-list/

Dave Martinsen has reviewed more recently; D. P. Martinsen, Supplemental Journal Article Materials in ACS Symposium Series, Special Issues in Data Management, 2012, Chapter 3, pp 31-45, DOI: http://doi.org7r9  and that might contain some more recent pointers in the physical sciences area.

On 18/04/2016, 13:01, "Research Data Management discussion list on behalf of Mary Donaldson" <[log in to unmask] on behalf of [log in to unmask]> wrote:

>Hello,

>

>At Glasgow, we're staring to look at how we handle data that is included in supplementary information files. We're becoming increasingly aware of the broad range of file types that are being included in SI, beyond the usual PDFs and extra figures. Many of these file types contain representations of data rather than the data themselves, but some could be data.

>

>We're planning on having a discussion soon to develop some internal guidelines for when the SI files should go in our publications repository and when they merit a record in the data repository. Has anyone else already visited this territory? If so, we'd love to know what conclusions you came to. We will also be happy to share our ideas once we've given them some thought and testing.

>

>Best wishes,

>Mary

>

>RDM Service Coordinator,

>University of Glasgow.

Top of Message | Previous Page | Permalink

JiscMail Tools

Files Area | help

RSS Feeds and Sharing

Search Archives

Advanced Options

Archives

April 2024
March 2024
February 2024
January 2024
December 2023
November 2023
October 2023
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
December 2022
November 2022
October 2022
September 2022
August 2022
July 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
December 2021
November 2021
October 2021
September 2021
August 2021
July 2021
June 2021
May 2021
April 2021
March 2021
February 2021
January 2021
December 2020
November 2020
October 2020
September 2020
August 2020
July 2020
June 2020
May 2020
April 2020
March 2020
February 2020
January 2020
December 2019
November 2019
October 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
December 2008
November 2008
September 2008

JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk