“Aging, quite simply, is a loss of information.”*…
And as it is in the human condition, so it is on the internet. As the Pew Research Center reports: 38% of webpages that existed in 2013 are no longer accessible a decade later…
The internet is an unimaginably vast repository of modern life, with hundreds of billions of indexed webpages. But even as users across the world rely on the web to access books, images, news articles and other resources, this content sometimes disappears from view.
A new Pew Research Center analysis shows just how fleeting online content actually is:
A quarter of all webpages that existed at one point between 2013 and 2023 are no longer accessible, as of October 2023. In most cases, this is because an individual page was deleted or removed on an otherwise functional website.
For older content, this trend is even starker. Some 38% of webpages that existed in 2013 are not available today, compared with 8% of pages that existed in 2023.
This “digital decay” occurs in many different online spaces. We examined the links that appear on government and news websites, as well as in the “References” section of Wikipedia pages as of spring 2023…
The not-so-pretty results of their study, and an account of their methodology at “When Online Content Disappears,” from @pewresearch.
Happily, the Internet Archive‘s wonderful Wayback Machine, where one can find saved copies of (many, many, if not all) web pages that have disappeared, is a(n at least partial) antidote. Indeed, via a background script, the Wayback Machine supplies the most recent archived version of many Wikipedia links that have gone dead.
(Image above: source— where one can find the origin of “404” as the designator of a broken link…)
* David Sinclair, controversial anti-aging researcher
###
As we search for the missing, we might send elegantly-designed birthday greetings to John Cocke; he was born on this date in 1925. A computer engineer, he made numerous important contributions to computer architecture and to optimizing compiler design. Most notably, he is considered by many to be “the father of RISC architecture” (which first appeared in his design of the IBM 801).


You must be logged in to post a comment.