Bonum Certa Men Certa

Archiving Web Sites to Ensure They Last Decades, Not Years, Outliving or Outlasting Various Disruptive Events

Video download link | md5sum b29da11a5ae25c7597c459e8e4c320b2



Summary: Today we upload 15 years' worth of blog posts to the Internet Archive (IA), or close to 32,000 stories along with Daily Links; we suggest that other sites do the same in order to tackle 'Internet rot' and preserve information (otherwise there's room for obscene revisionism)

THE INTERNET won't stay around forever. The Soviets, back in the old days, tried to develop something similar to it. The Internet will probably survive the next decade or two, but fifty years is a stretch; as for the World Wide Web, it has already devolved into a transport layer for JavaScript and DRM, having been rendered bloated and malicious in practice (albeit not in theory; one can still produce elegant Web sites).



Earlier this year we moved to Gemini and more than a year ago we adopted IPFS, which is used to circulate daily bulletins and IRC logs in a decentralised fashion. Our IRC channels all became self-hosted (in our network) earlier this year -- an ambition that we've had for years but didn't get around to until Freenode collapsed.

Archiving a Web site isn't the same as format changes and protocol changes. It's also not about making more copies, especially if those copies are as vulnerable to censorship as one another. Here in this site we have some public domain (PD) works that are of relevance to us and can be accessed in gemini://. Most of the works, however, use a Creative Commons licence. We are not a curation site per se, but it helps to keep copies of historical material, such as antitrust material demonstrating Microsoft's crimes (as tactics barely change over time). Well, by Internet standards we have enjoyed a long span of 15 years (articles and daily links) and we remain active on the daily basis. The same is true for Tux Machines, which turns 18 this coming summer, so a lot of the material we have here is no longer available anywhere else, except the Internet Archive (IA).

A few years ago we started making site archives in IA and we also recommended the site to people, dubbing it the most important site on the Web. It's no eternal site however; as an associate of ours explains, "the IA is very important but it will succumb as the WWW is phased out in favor of obfuscated, proprietary JavaScript."

IA can barely cope with (e.g. spider/index/save/navigate) many of the "modern" Web. When you add DRM to the mix (EME), then it's not a "format-shifting'" task as that too becomes an impossibility. Sites need to evolve or perish, which may mean getting off the Web and one day planning for the demise of the Internet as a whole. Like IA, our associate explains, "archive.is is interesting, but it'll die one day. In the long run they will all pass away. In formal archives, one of the initial decisions the institution has to make about any given artifact is that of how long it shall be preserved for. Nothing lasts forever, but there are ways of stretching things out and the duration determines the methods of preservation."

For a site such as ours it makes sense to keep the material available for 50 years, which is maybe how much longer I can live (if I'm lucky).

"Media shifting will obviously be involved," the associate notes, "but at a loss for some items. The plan pre-dates AWA by a great many years."

Last weekend we turned 15. "Already in 15 short years," our associate remarks, "many whole sites are gone. And of the sites that remain, many have lost all their old articles in clumsy reorgs. Of that which is left, some of those have purged documents with "inconvenient" messages or themes... even Groklaw purged its comments. I suppose few to none of the Groklaw comments made it into the Library of Congress archives."

At the time of writing I'm still uploading 205 MB of archives (as shown in the video above). We hope it can inspire other sites to think ahead and do the same. It's not a big task and it's better done before it's "too late"...

Our associate concludes by saying that "many programmers and even engineers are conscientious in erasing anything "old" even important records. Now with electronic media, there is often only a single copy of anything any more and that introduces, obviously, a single point of failure. So in the old days, one could maintain a relevant personal or professional archive. Now those are all centralized and continue to exist only at the whim of participant consensus. Anyone with administrative privileges, can "tidy" up and easily erases the world's last copy of a standard or other evidence or similar material."

We are going to add more material to IA and it can be found here as that piles up along with some material that isn't ours.

Recent Techrights' Posts

24/7 Wall St. Editor-In-Chief and CEO Calls IBM Is "America’s Worst Big Tech Company", Talent is Leaving, Supposedly Strategic Units Culled
21 hours ago by Douglas A. McIntyre
IBM's Debt Increased Over $5 Billion in 3 Months While IBM Laid Off Many in Europe, US, Confluent, HashiCorp, and Red Hat
An increase of $5,000,000,000+ in debt in just 3 months!
Drama at the European Patent Office (EPO) This Week
We'll be covering the EPO quite a lot this weekend and next week
EPO Cocainegate Escalates - Part VI - The Strikes Go On and On (Major Strike Today)
We'll be covering this later today in relation to what the Office dubs "ethics"
Huge Microsoft Layoffs Coming Shortly (With Financial Report)
There will be lots of slop layoffs. Be ready. It's a bubble.
 
Links 24/04/2026: Zelenskyy Says Ukraine's War Position "Most Stable", Samsung Workers on Strike Due to Pay
Links for the day
Dr. Andy Farnell on Why Calling Slop or Chaff "Hey Hi" (AI) Harm Us All, Except for "Ten or Twenty Rich Industrialists"
"words to avoid"
Recent Happenings at IBM Reaffirm Rumours About the CEO; He Might be Resigning (or Pushed Out) Soon
If the rumours are true (no, we did not check those tax records for ourselves), it's not unthinkable that IBM is already doing what Apple did months ago
Gemini Links 24/04/2026: Public Reticulum Gateway Node, Smol Computers, and Old E-mail
Links for the day
Links 24/04/2026: Intel Abandoning Computer Freedom (Even Further), Iran Reports That American Software and Hardware Remotely Sabotaged/Hijacked During War
Links for the day
The Great Wonders of Slop "Efficiency"
Thankfully nothing was lost in the transmission and lots of work (datacentre emissions) got "done"
IBMers Expect Another Giant Wave of Layoffs, Talk (and Sing) About the PIPs
The media won't be covering the key facts
As We Predicted, Francophonie Countries in the EU and Outside the EU Dumping Microsoft for National Security Reasons
We expected Belgium or some other Francophonie place to do so next
Even to Microsoft Insiders It Seems Like XBox Has Already Died or Surrendered to the Japanese Companies
Now the Microsoft layoffs are evident for people to see
Absolutely Terrible Journalism About Microsoft Layoffs This Week
7 hours ago by Leila Sheridan
SLAPP Censorship - Part 56 Out of 200: 5RB and Brett Wilson LLP's Copy-Paste Machination for Garrett and Graveley
Here is another straightforward example of their junior barrister overusing copy-paste on his Mac
Getting Aggressive Suggestive of Loss - Part II - Lawyers Are Not "Hired Guns" (and Should Never Act Like Ones)
The matter is being investigated
Nadella is Killing Microsoft. Slop Kills It Even Faster.
A decade from now we'll look back at slop like we look back at skateboards
Gemini Links 24/04/2026: Data Breaches and Unofficial Gemini Protocol Specification Archive
Links for the day
Microsoft Offers About 10,000 of Its Senior American (Read: Expensive) Workers to be Laid Off
How many slopfarms and media parrots play along?
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, April 23, 2026
IRC logs for Thursday, April 23, 2026
SLAPP Censorship - Part 55 Out of 200: Strangled Women, Charged for Strangulation, Cannot Find a Job Now (After Microsoft)
merits public awareness and wider scrutiny
Gemini Links 23/04/2026: Spirituality and Detachment, Shoplifting in the UK, and "Introducing Scout, an iOS Native Gemini Client"
Links for the day
Links 23/04/2026: YouTube Age Limits Expanded and 'Secret' Model With Bug-Finding Hype Campaign 'Leaks'
Links for the day
Media Operatives of Microsoft Paint Microsoft Layoffs as Buyouts (Intentionally False Narrative)
Those are mass layoffs disguised as something else
IBM's Stock Has Collapsed Over 10% in One Day, Insiders Explain What's Happening
Today, due to a lack of time, we mostly present an outline of what people say (not IBM-sponsored media hacks with LLM slop)
Getting Aggressive Suggestive of Loss - Part I - Threats Sent From Burner Accounts Since February, Belatedly Reported to British Police
Threats connected to Graveley or Garrett or 5RB or Brett Wilson LLP [...] We're not dealing with a law firm here; we're dealing with the underworld
EPO Cocainegate Escalates - Part V - Where Does the António Campinos 'Family Affair' Go From Here?
Do cocaine in public, get caught, take paid "sick leave", come back to lead Europe's second-largest organisation
Links 23/04/2026: Legal Trouble for Microsoft, Chronic Fatigue Syndrome, and DMCA Whac-a-Mole
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, April 22, 2026
IRC logs for Wednesday, April 22, 2026
Gemini Links 23/04/2026: Sunrise Chasing Season, Going Back to Older Software, New Gemini Client for Mobile Devices
Links for the day
Upcoming Mass Layoffs at Microsoft Not Limited to Gaming/XBox
from Microsoft staff
What Could Run the World Instead of "Linux"
Had it not been for GNU (the software, the licence, the compiler GCC), we'd probably not have Linux and perhaps BSD would be more widespread (no copyleft, so expect proprietary derivatives)
IBM's Shares Have Just Collapsed Again as a Result of the Phony 'Results'
Of course all the so-called news is shallow parroting of IBM or "churnalism" void of real analysis
EPO President to Meet the Union, But He Needs to Resign
Colleagues or workers of the EPO have only just been told that the boyfriend of the sister of "Cocaine Communication Manager" will be talking to the union (SUEPO) tomorrow mornin
Gemini Links 22/04/2026: Movies, Vim, and Bash
Links for the day
International Business Machines Corporation: Paying Peanuts, Getting Monkeys
they don't pay enough to retain key people
No, Finding Security Bugs Takes Time and Care (Human Touch, Real Grasp of Real Code)
This too shall pass
Move to GNU/Linux, Save This Planet
If you are an environmentalist, it's hard to justify still using stuff from Apple or Microsoft
SLAPP Censorship - Part 54 Out of 200: Alex-Matt/Automate Twin Cases, Separated at Birth, Drafted by Brett Wilson LLP and 5RB
Perhaps their solicitor K.C. (not the legal title) sought actual redemption and followed the Cross, not the dagger
When Peak Oil Isn't Just "Alarmist Propaganda"
the current conditions favour less consumption
Combatting Racist Abuse
Take racism seriously
They've Failed to Ruin Our Community, But They Still Try
The cost of liberty is not zero. The cost of it can be supremely high.
IBM "Results" as a Smokescreen to Distract From Mass Layoffs at IBM Every Month in 2026
How can we as a society function if we do not get properly informed and educated about what goes on around us?
'Nuclear Winter' at Microsoft This Summer?
At Microsoft so far this year there have been many layoffs, but the company tries to keep them secret
Links 22/04/2026: LLM Slop "Damaging Users’ Cognitive Abilities", UK-based Publishers Urge CMA to Curb Slop-Wielding Plagiarists Like GAFAM
Links for the day
EPO Cocainegate Escalates - Part IV - António Campinos Allegedly Sleeping With Sister of "Cocaine Communication Manager" Luis Berenguer to Secure Third Mandate
Based on our understanding, "the f---ing president" Campinos - to quote rather than merely paraphrase his description of himself - is dating Ana Berenguer, sister of "Cocaine Communication Manager" (Luis Berenguer) and daughter of another Luis Berenguer, a friend of the late Jorge Campinos (António's father)
Clownflare (Cloudflare) and the 'Ecosystem' It Wants to Replace
Vercel & Next.JS Hacked - Nothing New to Report
Today, or Tonight, Look for What IBM is Hiding, Not What It's Telling Shareholders
It shapes the narrative while cooking the books
Brett Wilson LLP Working for Racists and Losing (at the Same Time It Works for Men Who Assault Women in America)
Brett Wilson LLP is basically attacking whistleblowers
The Corrupt Lecture the Non-Corrupt - Part IV - Demanding Respect From Those You Are Attacking and Robbing
"literature" aimed at staff looks increasingly comical, hypocritical, one might say inappropriate
What It Will Take for More Nations in Europe to Move Fully to GNU/Linux
It would be false to say that France is hostile towards the US
Gemini Links 22/04/2026: Voyage into Cheapness, Heat and Pressure in a Contained Ideal Gas, Tidepools
Links for the day
Links 22/04/2026: YouTube Deletes Channels to Promote US Hegemony, "Kash Patel’s Defamation Suit Against The Atlantic Is Designed To Generate Headlines, Not Win In Court"
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Tuesday, April 21, 2026
IRC logs for Tuesday, April 21, 2026