Not all superheros wear capes and even fewer know how to preserve hundreds of terabytes of internet history. But for the revolving cast of digital librarians in Reddit’s data hoarding community, saving as much of our digital detritus from destruction as possible is just another day on the net.

People come to the data hoarding subreddit to learn about storage set ups, how to scrape data, or to float a new archival project, which can often seem like a never ending game of one upmanship in terms of the scope of the proposals. In July, a Redditor called “traal” posted a short note to r/datahoarders suggesting a hoard of all YouTube metadata, such as the title, description, thumbnail image, and subtitles.


http://bit.ly/2NLersS
http://bit.ly/2NLersS+

--
Peterk
Dallas, Tx
[log in to unmask]
Save our in-boxes! http://emailcharter.org
“If only there were a massive entity that I were forced to fund to tell me how I should live my life, since I’m so obviously incapable of deciding for myself.” M. Hashimoto
To view the list archives go to: https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=RECORDS-MANAGEMENT-UK To unsubscribe from this list, send an email to [log in to unmask] with the words UNSUBSCRIBE RECORDS-MANAGEMENT-UK For any technical queries re JISC please email [log in to unmask] For any content based queries, please email [log in to unmask]