Bonum Certa Men Certa

Archiving Web Sites to Ensure They Last Decades, Not Years, Outliving or Outlasting Various Disruptive Events

Video download link | md5sum b29da11a5ae25c7597c459e8e4c320b2



Summary: Today we upload 15 years' worth of blog posts to the Internet Archive (IA), or close to 32,000 stories along with Daily Links; we suggest that other sites do the same in order to tackle 'Internet rot' and preserve information (otherwise there's room for obscene revisionism)

THE INTERNET won't stay around forever. The Soviets, back in the old days, tried to develop something similar to it. The Internet will probably survive the next decade or two, but fifty years is a stretch; as for the World Wide Web, it has already devolved into a transport layer for JavaScript and DRM, having been rendered bloated and malicious in practice (albeit not in theory; one can still produce elegant Web sites).



Earlier this year we moved to Gemini and more than a year ago we adopted IPFS, which is used to circulate daily bulletins and IRC logs in a decentralised fashion. Our IRC channels all became self-hosted (in our network) earlier this year -- an ambition that we've had for years but didn't get around to until Freenode collapsed.

Archiving a Web site isn't the same as format changes and protocol changes. It's also not about making more copies, especially if those copies are as vulnerable to censorship as one another. Here in this site we have some public domain (PD) works that are of relevance to us and can be accessed in gemini://. Most of the works, however, use a Creative Commons licence. We are not a curation site per se, but it helps to keep copies of historical material, such as antitrust material demonstrating Microsoft's crimes (as tactics barely change over time). Well, by Internet standards we have enjoyed a long span of 15 years (articles and daily links) and we remain active on the daily basis. The same is true for Tux Machines, which turns 18 this coming summer, so a lot of the material we have here is no longer available anywhere else, except the Internet Archive (IA).

A few years ago we started making site archives in IA and we also recommended the site to people, dubbing it the most important site on the Web. It's no eternal site however; as an associate of ours explains, "the IA is very important but it will succumb as the WWW is phased out in favor of obfuscated, proprietary JavaScript."

IA can barely cope with (e.g. spider/index/save/navigate) many of the "modern" Web. When you add DRM to the mix (EME), then it's not a "format-shifting'" task as that too becomes an impossibility. Sites need to evolve or perish, which may mean getting off the Web and one day planning for the demise of the Internet as a whole. Like IA, our associate explains, "archive.is is interesting, but it'll die one day. In the long run they will all pass away. In formal archives, one of the initial decisions the institution has to make about any given artifact is that of how long it shall be preserved for. Nothing lasts forever, but there are ways of stretching things out and the duration determines the methods of preservation."

For a site such as ours it makes sense to keep the material available for 50 years, which is maybe how much longer I can live (if I'm lucky).

"Media shifting will obviously be involved," the associate notes, "but at a loss for some items. The plan pre-dates AWA by a great many years."

Last weekend we turned 15. "Already in 15 short years," our associate remarks, "many whole sites are gone. And of the sites that remain, many have lost all their old articles in clumsy reorgs. Of that which is left, some of those have purged documents with "inconvenient" messages or themes... even Groklaw purged its comments. I suppose few to none of the Groklaw comments made it into the Library of Congress archives."

At the time of writing I'm still uploading 205 MB of archives (as shown in the video above). We hope it can inspire other sites to think ahead and do the same. It's not a big task and it's better done before it's "too late"...

Our associate concludes by saying that "many programmers and even engineers are conscientious in erasing anything "old" even important records. Now with electronic media, there is often only a single copy of anything any more and that introduces, obviously, a single point of failure. So in the old days, one could maintain a relevant personal or professional archive. Now those are all centralized and continue to exist only at the whim of participant consensus. Anyone with administrative privileges, can "tidy" up and easily erases the world's last copy of a standard or other evidence or similar material."

We are going to add more material to IA and it can be found here as that piles up along with some material that isn't ours.

Recent Techrights' Posts

Dr. Andy Farnell on Why Calling Slop or Chaff "Hey Hi" (AI) Harm Us All, Except for "Ten or Twenty Rich Industrialists"
"words to avoid"
Internet Trolls Likely Trying to Distract From the Demise of IBM, Problems With Red Hat
there seems to be trolling online aimed at suppressing discussion
Debian Upgrade Coming Up (Soon)
Yesterday we contacted the datacentre staff about it
Getting Aggressive Suggestive of Loss - Part III - Threats From Burner Accounts Formally Treated as a Crime
Countries that cannot preserve freedom from self-censorship are countries where free press ultimately cannot prevail
24/7 Wall St. Editor-In-Chief and CEO Calls IBM Is "America’s Worst Big Tech Company", Talent is Leaving, Supposedly Strategic Units Culled
21 hours ago by Douglas A. McIntyre
IBM's Debt Increased Over $5 Billion in 3 Months While IBM Laid Off Many in Europe, US, Confluent, HashiCorp, and Red Hat
An increase of $5,000,000,000+ in debt in just 3 months!
 
Corporate Media Did Not Specify What Microsoft Means by "Buyouts" (Layoffs), It May Be Hardly Different From Severance
Time will tell, but investigative journalism hardly exists anymore, so we won't hold our breath
The Corrupt Lecture the Non-Corrupt - Part V - "Diversity" and "Inclusion" at EPO Means Sleeping With Sister of "Cocaine Communication Manager" and Making Them Millionaires
Remember that top applicants or key stakeholders of the EPO are already complaining about a lack of quality
Links 25/04/2026: Fake GAFAM Valuations (Gripping the Market Based on False Accounting), "Evidence Isn't Just for Research", and "Putin Defends Mobile Internet Outages"
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Friday, April 24, 2026
IRC logs for Friday, April 24, 2026
Gemini Links 25/04/2026: 3.4k+ Capsules, Microsoft Layoffs, Call for Nuclear Disarmament, "Internet is Sad and Lonely"
Links for the day
Links 24/04/2026: Zelenskyy Says Ukraine's War Position "Most Stable", Samsung Workers on Strike Due to Pay
Links for the day
Recent Happenings at IBM Reaffirm Rumours About the CEO; He Might be Resigning (or Pushed Out) Soon
If the rumours are true (no, we did not check those tax records for ourselves), it's not unthinkable that IBM is already doing what Apple did months ago
Gemini Links 24/04/2026: Public Reticulum Gateway Node, Smol Computers, and Old E-mail
Links for the day
Links 24/04/2026: Intel Abandoning Computer Freedom (Even Further), Iran Reports That American Software and Hardware Remotely Sabotaged/Hijacked During War
Links for the day
The Great Wonders of Slop "Efficiency"
Thankfully nothing was lost in the transmission and lots of work (datacentre emissions) got "done"
IBMers Expect Another Giant Wave of Layoffs, Talk (and Sing) About the PIPs
The media won't be covering the key facts
Drama at the European Patent Office (EPO) This Week
We'll be covering the EPO quite a lot this weekend and next week
As We Predicted, Francophonie Countries in the EU and Outside the EU Dumping Microsoft for National Security Reasons
We expected Belgium or some other Francophonie place to do so next
Even to Microsoft Insiders It Seems Like XBox Has Already Died or Surrendered to the Japanese Companies
Now the Microsoft layoffs are evident for people to see
EPO Cocainegate Escalates - Part VI - The Strikes Go On and On (Major Strike Today)
We'll be covering this later today in relation to what the Office dubs "ethics"
Absolutely Terrible Journalism About Microsoft Layoffs This Week
7 hours ago by Leila Sheridan
SLAPP Censorship - Part 56 Out of 200: 5RB and Brett Wilson LLP's Copy-Paste Machination for Garrett and Graveley
Here is another straightforward example of their junior barrister overusing copy-paste on his Mac
Getting Aggressive Suggestive of Loss - Part II - Lawyers Are Not "Hired Guns" (and Should Never Act Like Ones)
The matter is being investigated
Nadella is Killing Microsoft. Slop Kills It Even Faster.
A decade from now we'll look back at slop like we look back at skateboards
Huge Microsoft Layoffs Coming Shortly (With Financial Report)
There will be lots of slop layoffs. Be ready. It's a bubble.
Gemini Links 24/04/2026: Data Breaches and Unofficial Gemini Protocol Specification Archive
Links for the day
Microsoft Offers About 10,000 of Its Senior American (Read: Expensive) Workers to be Laid Off
How many slopfarms and media parrots play along?
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, April 23, 2026
IRC logs for Thursday, April 23, 2026
SLAPP Censorship - Part 55 Out of 200: Strangled Women, Charged for Strangulation, Cannot Find a Job Now (After Microsoft)
merits public awareness and wider scrutiny
Gemini Links 23/04/2026: Spirituality and Detachment, Shoplifting in the UK, and "Introducing Scout, an iOS Native Gemini Client"
Links for the day
Links 23/04/2026: YouTube Age Limits Expanded and 'Secret' Model With Bug-Finding Hype Campaign 'Leaks'
Links for the day
Media Operatives of Microsoft Paint Microsoft Layoffs as Buyouts (Intentionally False Narrative)
Those are mass layoffs disguised as something else
IBM's Stock Has Collapsed Over 10% in One Day, Insiders Explain What's Happening
Today, due to a lack of time, we mostly present an outline of what people say (not IBM-sponsored media hacks with LLM slop)
Getting Aggressive Suggestive of Loss - Part I - Threats Sent From Burner Accounts Since February, Belatedly Reported to British Police
Threats connected to Graveley or Garrett or 5RB or Brett Wilson LLP [...] We're not dealing with a law firm here; we're dealing with the underworld
EPO Cocainegate Escalates - Part V - Where Does the António Campinos 'Family Affair' Go From Here?
Do cocaine in public, get caught, take paid "sick leave", come back to lead Europe's second-largest organisation
Links 23/04/2026: Legal Trouble for Microsoft, Chronic Fatigue Syndrome, and DMCA Whac-a-Mole
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, April 22, 2026
IRC logs for Wednesday, April 22, 2026
Gemini Links 23/04/2026: Sunrise Chasing Season, Going Back to Older Software, New Gemini Client for Mobile Devices
Links for the day