Tags: preservation

Building the Great Libraries of the Internet with a DNS time machine by Ben Ward

Ben proposes an alternative to archive.org: changing the fundamental nature of DNS.

Regarding the boo-hooing of how hard companies have it maintaining unprofitable URLs, I think Ben hasn’t considered the possibility of a handover to a cooperative of users—something that might yet happen with MySpace (at least there’s a campaign to that effect; it will probably come to naught). As Ben rightly points on, domain names are leased, not bought, so the idea of handing them over to better caretakers isn’t that crazy.

Own your turf - Austin Kleon

Prescient.

Posthaven is the safe place for all your posts forever

This is a breath of fresh air: a blogging platform that promises to keep its URLs online in perpetuity.

The First Website by Mark Boulton

Mark writes about his work with CERN to help restore the first website to its original URL.

I have two young children and I want them to experience the early web and understand how it came to be. To understand that the early web wasn’t that rudimentary but incredibly advanced in many ways.

Internet Archive on Vimeo

A beautiful short film on the amazing work being done at the Internet Archive, produced on the occasion of their 10 petabyte celebration.

Truly awe-inspiring.

Brewster’s trillions: Internet Archive strives to keep web history alive

A profile in The Guardian of the Internet Archive and my hero, Brewster Kahle (who also pops up in the comments).

The Death of Upcoming.org - Waxy.org

Heartbreaking and angry-making.

Fictive Kin, The Joke’s On Us?

The story of one site’s disgraceful handling of acquisition and shutdown (Punchfork, acquired by Pinterest) and how its owner actively tried to block efforts to preserve user’s data.

Our Incredible Journey

A collection of those appalling doublespeek announcements that sites and services give when they get acquired. You know the ones: they begin with “We’re excited to announce…” and end with people’s data being flushed down the toilet.

Google Keep? It’ll probably be with us until March 2017 - on average

Charles Arthur analyses the data from Google’s woeful history of shutting down its services.

So if you want to know when Google Keep, opened for business on 21 March 2013, will probably shut - again, assuming Google decides it’s just not working - then, the mean suggests the answer is: 18 March 2017. That’s about long enough for you to cram lots of information that you might rely on into it; and also long enough for Google to discover that, well, people aren’t using it to the extent that it hoped.

Ideas of March — All in the head

A wonderful rallying cry from Drew.

The problem:

Ever since the halcyon days of Web 2.0, we’ve been netting our butterflies and pinning them to someone else’s board.

The solution:

Hope that what you’ve created never has to die. Make sure that if something has to die, it’s you that makes that decision. Own your own data, friends, and keep it safe.

Dark Archives in Contents Magazine

A really lovely piece on the repositories of information that aren’t catalogued—a fourth quadrant in the Rumsfeldian taxonomy, these dark archives are the unknown knowns.

Control your own content

Honestly, if you value the content you create and put online, then you need to be in control of your own stuff.

Focusing on our future — some changes to our product line-up

What an Orwellian title for a blog post announcing the wholesale destruction of user’s content. Oh, Yahoo, you sound so proud of your cavalier attitude towards the collective culture that you have harvested.

Vile fuckwits.

Owning your own words – is it important?

A fascinating discussion on sharecropping vs. homesteading. Josh Miller from Branch freely admits that he’s only ever known a web where your content is held by somone else. Gina Trapani’s response is spot-on:

For me, publishing on a platform I have some ownership and control over is a matter of future-proofing my work. If I’m going to spend time making something I really care about on the web—even if it’s a tweet, brevity doesn’t mean it’s not meaningful—I don’t want to do it somewhere that will make it inaccessible after a certain amount of time, or somewhere that might go away, get acquired, or change unrecognizably.

When you get old and your memory is long and you lose parents and start having kids, you value your own and others’ personal archive much more.

The Last Pictures: Contemporary pessimism and hope for the future by Paul Glister

From the cave paintings at Lascaux to the Pioneer plaques and Voyager golden records to Trevor Paglen’s “The Last Pictures” project, Paul Glister examines the passage and preservation of art and information through time. Fascinating.

Or perhaps, as Paglen envisions, those who find a Pioneer Plaque, a Voyager Record, or one of our electromagnetic transmissions will be interested enough to search us out, coming upon a future Earth where all that is left of humanity are our terrestrial ruins and that artificial ring of geosynchronous satellites, with one of them having a particular golden artifact bolted to its pitted hull. In that scenario, about all that would be left for the visiting ETI to do in terms of learning about us would be grand-scale dumpster diving.

Song blogging: Files That Last

I hereby declare that this song is my official anthem.

I want some files that last, data that will not stray.

Files just as fresh tomorrow as they were yesterday.

Interstellar Hard Drive - The Morning News

Investigating the options for off-world backups.

Data is only as safe as the planet it sits on. It only takes one rock, not too big, not moving that fast, to hit the Earth at a certain angle and: WHAM! Most living species are done for.

How the hell is your Twitter archive supposed to survive that?

www-talk

Here’s a treasure trove of web history: an archive of the www-talk list dating back to 1991. Watch as HTML gets hammered out by a small group of early implementors: Tim Berners-Lee, Dave Raggett, Marc Andreessen, Dan Connolly…

Just Solve the Problem Month 2012: Nitty Gritty « ASCII by Jason Scott

Jason goes into detail describing the File Format problem that he and others are going to tackle in the effort known as Just Solve The Problem.

The Charge of the Scan Brigade « ASCII by Jason Scott

Live in or near San Francisco? Interested in preserving computer history? Then you should meet up with Jason this Friday:

This Friday, October 5th, the Internet Archive has an open lunch where there’s tours of the place, including the scanning room, and people get up and talk about what they’re up to. The Internet Archive is at 300 Funston Street. I’m here all week and into next.

Eric’s Archived Thoughts: The Web Behind

This ticks all my boxes: a podcast by Eric and Jen about the history of the web. I can’t wait for this to start!

Special Report #1: Data Protection — Contents Magazine

This is an important subject (and one very close to my heart) so I’m very glad to see these data protection guidelines nailed to the wall of the web over at Contents Magazine.

  1. Treat our data like it matters.
  2. No upload without download.
  3. If you close a system, support data rescue.

Scripting News: How future-safe was the first Harvard blogging site?

A cautionary tale from Dave Winer of not considering digital preservation from the outside. We must learn the past. We must.

oldtweets - Laughing Meme

Kellan explains the tech behind Old Tweets …and also the thinking behind it:

I think our history is what makes us human, and the push to ephemerality and disposability “as a feature” is misguided. And a key piece of our personal histories is becoming “the story we want to remember”, aka what we’ve shared.

Rosetta on Vimeo

A beautiful short film about The Long Now Foundation’s Rosetta Project.

Digital archivists: technological custodians of human history | Ars Technica

An introduction to the important work of digital archivists:

Much like the family member that collects, organizes, and identifies old family photos to preserve one’s heritage, digital archivists seek to do the same for all mankind.

Form letter template for acquired startups — Gist

Just copy and paste.

Dear soon-to-be-former user…

1
2
3
Dear soon-to-be-former user,

We've got some fantastic news! Well, it's great news for us anyway. You, on

Coding Horror: Preserving The Internet… and Everything Else

A love letter to the Internet Archive.

» Long Bets Bet – How Durable Are URLs? - Blog of the Long Now

The Long Now blog is featuring the bet between myself and Matt on URL longevity. Just being mentioned on that site gives me a warm glow.

ARCHIVE TEAM: A Distributed Preservation of Service Attack - YouTube

Jason’s rip-roaring presentation from Defcon last year.

Why I’m building Nilai by Colin Devroe

Now this is some prioritisation I can admire:

I’m going to build valuable, reliable, sustainable web services that will last forever.

What Goes Up, Doesn’t Have To Come Down

A thoughtful—and beautifully illustrated—piece by Geri on memory and digital preservation, prompted by the shut-down of Gowalla.

Webstock ‘12: Jeremy Keith - Of Time and the Network on Vimeo

The video of my talk from Webstock, all about wibbly-wobbly, timey-wimey stuff like networks and memory.

Jeremy Keith - All Our Yesterdays on Vimeo

The video of my presentation on digital preservation at last year’s Build conference.

Our communication methods have improved over time, from stone tablets, papyrus, and vellum through to the printing press and the World Wide Web. But while the web has democratised publishing, allowing anyone to share ideas with a global audience, it doesn’t appear to be the best medium for preserving our cultural resources: websites and documents disappear down the digital memory hole every day. This presentation will look at the scale of the problem and propose methods for tackling our collective data loss.

Internet Archive Search: collection:”thesoundofyoungamerica”

Thanks to Jason Scott, every episode of The Sound Of Young America ever recorded is now stored on the Internet Archive. Get huffduffing!

Where do Websites go to Die? « dpr-barcelona

Burying physical copies of dead websites in a Croatian cave.

Simon Collison | Colly | Journal | My digital preservation utopia

Colly’s thoughts on digital preservation are written in a lighthearted tongue-in-cheek way but at least he’s thinking about it. That alone gives me comfort.

We Are Historians | 1sixty

A beautiful reminder that by publishing on the web, we are all historians.

Every color you choose and line of code you write is a reflection of you; not just as a human being in this world, but as a human being in this time and place in human history. Inside each project is a record of the styles and fashions you value, the technological advancements being made in the industry, the tone of your voice, and even the social and economic trends around you.

The Ruins of Dead Social Networks - Alexis Madrigal - Technology - The Atlantic

Reminiscences of the BBSs of yesteryear that could in time be applied to the social networking sites of today.

Events - The Future of the Past of the Web | Digital Preservation Coalition

I’m going to try to make it along to this event in London next month.

Times Higher Education - Memory failure detected

A worrying report on the state of digital preservation and the web, specifically in the UK. Welcome to the memory hole.

History, our future - Preoccupations

A superb post by David that ties together multiple strands of personal digital preservation through homesteading instead of sharecropping.

Escaping the Digital Dark Age

Stewart Brand wrote this twelve years ago: it’s more relevant than ever in today’s cloud-worshipping climate.

I’d like to think that it’s ironic that I’m linking to The Wayback Machine because the original URL for this essay is dead. But it isn’t ironic, it’s horrific.

The Digital Storage of Analog Memories | Caseorganic Blog

Amber documents her attempt to turn physical objects imbued with meaning into digital artefacts.

Understanding 9/11: A Television News Archive

A truly impressive achievement by Archive.org: all the television footage from September 11th, 2001 gathered in one place on the web.

Archives & Museum Informatics: Museums and the Web 2010: Papers: Cope, A.S., Buckets and Vessels

Here’s one to add to Instapaper or Readability to savour at your leisure: Aaron Straup Cope’s talk at Museums and the Web 2010:

This paper examines the act of association, the art of framing and the participatory nature of robots in creating artifacts and story-telling in projects like Flickr Galleries, the API-based Suggestify project (which provides the ability to suggest locations for other people’s photos) and the increasing number of bespoke (and often paper-based) curatorial productions.

Why Preserve Books? The New Physical Archive of the Internet Archive | Internet Archive Blogs

Brewster Kahle explains how and why the Internet Archive is keeping physical copies of the books it digitises.

Race to save digital art from the rapid pace of technological change | Technology | The Observer

Digital preservation in the art world.

LukeW | An Event Apart: All Our Yesterdays

Luke’s notes from my talk about long-term thinking and online preservation at An Event Apart in Boston.

FamilySearch Shares Plans to Digitize Billions of Records Stored at Granite Mountain Records Vault - LDS Newsroom

How the Mormon Church are storing and preserving genealogical data inside a mountain.

Digital legacy: The fate of your online soul - tech - 02 May 2011 - New Scientist

The editor of New Scientist writes about deletionists and preservationists while adding his own personal poignant perspective.

One Terabyte of Kilobyte Age | Digging through the Geocities Torrent

A blog devoted to sifting through the gems in the Geocities torrent. This is digital archeology.

Google can’t be trusted with our books | Simon Barron | Comment is free | guardian.co.uk

The threat to Google Videos shows businesses are not suitable cultural custodians — they can’t be held accountable to the public.

A History of the Future in 100 Objects | Mssv

Adrian Hon’s Kickstarter project has already reached its goal. I can’t wait for the podcasting to start.

Geek Ninja Battle Night | Stuff and Nonsense

Andy hammers home the benefit of a long-term format like HTML compared to the brittle, fleeting shininess of an ephemeral platform-specific app.

A Memory of Webs Past - IEEE Spectrum

A detailed look at how French archivists go about preserving websites.

Digitale data in gevaar! - Datanews.be

If you speak Flemish, you might enjoy this article based on a chat I had with a Belgium journalist.

If you don’t speak Flemish, well, just move along.

Wikipedia:List of articles with doomed BBC links - Wikipedia, the free encyclopedia

Read it and weep. Here are the articles on Wikipedia that reference URLs that are getting axed as part of the BBC’s upcoming cull.

James Bridle on Phare Conference on Vimeo

Everything is worth preserving and protecting.

Jeremy Keith on Phare Conference on Vimeo

I answered a few questions right after giving my talk at the Phare conference in Ghent.

Long Bets - The original URL for this prediction (www.longbets.org/601) will no longer be available in eleven years.

This is my prediction. If you think it’s wrong, challenge it. We shall then partake in a wager.

Safe-keeping - Preoccupations

I wish I had a teacher like David when I was in school.

URLs, permalinks, archives … preservation. It all matters so very much.

Swiss Fort Knox

This is the stuff James Bond stories are made of. Except in this case, the fortress exists to store data rather than criminal masterminds.

TIME CAPSULE ..::HOME::..

On 18 May 2010, the Planets (Preservation and Long-term Access through Networked Services) Project deposited a time capsule in the vaults of datacenter, Swiss Fort Knox, in Saanen, Switzerland. It contained the decoding information for five digital file formats on media ranging from paper, microfilm and floppy discs to CDs, DVDs and USB sticks.

Open Planets Foundation | digital, forever

This consortium of institutions and universities came together “to provide practical solutions and expertise in digital preservation.”

PLANETS stands for Preservation and Long-term Access through Networked Services.

Main Articles: ‘Domesday Redux: The rescue of the BBC Domesday Project videodiscs’, Ariadne Issue 36

The fascinating story of the BBC Domesday Project and its subsequent fate.

The purpose of the CAMiLEON project was to demonstrate the value of emulation in preserving not only the data stored in obsolete systems but the behaviour of the systems themselves - in this case one of the very first interactive multi-media systems. The aim was to reproduce the original user experience as accurately as possible, and the CAMiLEON team argued that the slight faults in images as displayed from the analogue discs were a part of that experience, and should not be cleaned up as Andy proposed to do. Our aim was different - we wanted to preserve the data with the highest quality available consistent with longevity.

Near Arctic, Seed Vault Is a Fort Knox of Food - New York Times

A tour of the Global Seed Bank in Svalbard.

Forever Future | Sascha Pohflepp

The intriguing tale of a fictional archivist, storing past visions of the future in a storage facility that acts as a space ark.

He has put money in the bank which will pay for the space well beyond his lifetime. Each year he collects technological predictions that had been made for that year and conserves the ones that didn’t come true in the form of 35mm slides. The ship itself consists of a refrigeration unit to help preserve the slides, a slide projector and light box in case these technologies have become extinct by the time of its recovery, and a system to get power from the outside. In an annual ritual on April 11th Walker adds another box to the mission.

Link Rot « The Bygone Bureau

Brilliant; just brilliant. Connor O’Brien remains skeptical about the abstract permanence of “the cloud.” The observations are sharp and the tone is spot-on.

If your only photo album is Facebook, ask yourself: since when did a gratis web service ever demonstrate giving a flying fuck about holding onto the past?

Pulling the plug on the BBC’s internet history « 853

The BBC’s decision to actively delete old content (rather than simply allowing it to take up some space on a server) really gets my blood boiling.

The BBC asked the public to contribute their memories of World War Two to a website between June 2003 and January 2006…” and five years later some suit decided to bin them.

Bletchley Park and History Hackday Request | Amplified

Let’s make the Bletchley Park data machine-readable so we can start mining the stories they contain (like Old Weather).

Bletchley Park need help to catalogue and create a proper archive of these decrypts.

I want in!

Outline of a Digital Preservation System (Aaron Swartz’s Raw Thought)

Aaron Swartz gets technical about online digital preservation.

Carlos Bueno: A Paper Internet

Preserving the papernet.

We Didn’t Stop The Fire. – Jeffrey Zeldman Presents The Daily Report

Jeffrey points out another point of failure in our online storage: the willingness of site owners to sell their product (and your data) to a big company for a quick payout.

Forever / from a working library

Mandy writes about digital preservation:

The technological means to produce an archive are not beyond our skills; sadly, right now at least, the will to do so is insufficient.

Home - LOCKSS

Lots Of Copies Keep Stuff Safe — a digital preservation initiative based at Stanford.

Cyberspace When You’re Dead - NYTimes.com

An accurately-downbeat look at digital preservation.

Scripting News: Upcoming: The minimal blogging tool

Dave Winer is putting together technology to battle share-cropping and enable the Pembertonisation of your content: you host the canonical copy and distribute to third-party services.

If You Didn’t Blog It, It Didn’t Happen - Anil Dash

A thoughtful piece on how Twitter can complement blogging, but is far too often used as an impermanent substitute.

…if you didn’t blog it, it didn’t happen. In fact, I first wrote about this idea a bit on Twitter a few years ago. See if you can find it.

Bulkr - Download Flickr photos in batches (Mac, Windows & Linux)

This looks like it could be a handy tool for backing up Flickr photos.

ASCII by Jason Scott / Yahoo!locaust

A viciously accurate assessment of Yahoo’s scorched earth policy towards our online collective culture:

All I can say, looking back, is that when history takes a look at the lives of Jerry Yang and David Filo, this is what it will probably say: Two graduate students, intrigued by a growing wealth of material on the Internet, built a huge fucking lobster trap, absorbed as much of human history and creativity as they could, and destroyed all of it.

..about validating

An oldie but a goodie. This is why we have standards.

Archive Fever: a love letter to the post real-time web | mattogle.com

Matt encapsulates a lot of what I've been thinking about recently: the real-time web is all well and good, but let's not forsake the enormous potential for fulfilment in archives.

Welcome to Open Library! (Open Library)

One web page for every book. I love this project.

On Digital History- - Georgian London

Lucy Inglis, curator of Georgian London, on the role she and other bloggers play.

Does the Web remember too much — or too little? — Scott Rosenberg

Yes! Yes! Yes! An excellent fisking of that ridiculous New York Times article that confused problems in the present with data longevity.

Saving our digital heritage - Dan Gillmor - Salon.com

Dan Gilmore, reporting from a conference on digital preservation. I should pay attention to this

The Web Means the End of Forgetting - NYTimes.com

This article needs a great big "citation needed" slapped on it. Yes, people need to think about what they post on the web, but no, that stuff will not stay around "forever." If anything, the web suffers from the opposite problem: memory loss.

Rams’ Principles Series: 7 of 10 | Inksie Journal of Design & Culture

Mandy's take on Dieter Rams's design principle that "good design is long-lasting."

Raiding Eternity - Myspace - Gizmodo

This is wonderful: sad, beautiful, and wonderful ...it's what I've been trying so hard to clumsily articulate. Read it. And smile. And weep.

Reocities , rising from the ashes - RIP Geocities...

Here lies what we could salvage from the ashes of GeoCities.

yws-search-general : Message: Term Extraction and Contextual Web Search services to be discontinued

Crap. The very powerful term extractor API from Yahoo is being closed down. Sad developer is sad.

Will my site be archived? Yahoo! GeoCities Help

Archive.org is indexing Geocities sites (as it always has). Yahoo are going to fuck all about their users data/dreams/memories and Yahoo are going to do fuck all about the URLs.

Hivelogic - Backing Up Flickr

A python script from Dan Benjamin to help you do your bit in battling the datapocalypse.

Hypertext Style: Cool URIs don't change.

Eleven years old and more relevant than ever.

Ugly and neglected fragments (Phil Gyford’s website)

Phil Gyford on why he will miss Geocities. "It’s only thanks to the efforts of people like the Internet Archive and Archive Team that we’ll have a record of what people, rather than companies, published in the past. As companies like Yahoo! switch off swathes of our online universe little fragments of our collective history disappear. They might be ugly and neglected fragments of our history but they’re still what got us where we are today."

kewlchops: A new leaf.

The perfect person for the job—George will be working on the Internet Archive's Open Library project: a webpage for every book ever published.

husk.org. chaff. Aggregation and the Edge.

Paul Mison shares his thoughts on moving towards a decentralised web of services rather than silos of data. "Now I'm wondering: is there a space for a piece of user-installable software, like Movable Type or Wordpress, that aggregates their data from sites across the web, and then presents it as a site? If there is, is it even possible to write it in a way that anyone who couldn't have written it themselves can even use it?"

Adam Franco.com » Blog Archive » Twitter Export Script

Archive your Twitter updates with this PHP script.