Hi there,
I am interested to know what advice people are giving researchers about citing and preserving webpages created as part of their research projects. A website's URL can be used to cite the pages in the short term, but of course it will only work for as long as the webpages remain available. If researchers want a persistent link to the content of their websites to include in a report or a data access statement, then it seems to me the content needs to be transferred to our data repository, but how best to do this?
The options I've been looking at are to suggest copying the website content into a PDF and/or uploading any multimedia created and embedded within those pages to our data repository. The downside, however, is that this won't retain the look and feel of the website.
Does anyone recommend using https://webrecorder.io/? I understand that it generates a .warc file of the website, but that each page of the website has to be captured separately.
I also know that there will be questions over copyright, but for now I'm interested in how best to capture and preserve website content.
Any advice gratefully received!
Catharine Bailey
Research Data Manager
Brunel University London