Bonum Certa Men Certa

Archiving Web Sites to Ensure They Last Decades, Not Years, Outliving or Outlasting Various Disruptive Events

Video download link | md5sum b29da11a5ae25c7597c459e8e4c320b2



Summary: Today we upload 15 years' worth of blog posts to the Internet Archive (IA), or close to 32,000 stories along with Daily Links; we suggest that other sites do the same in order to tackle 'Internet rot' and preserve information (otherwise there's room for obscene revisionism)

THE INTERNET won't stay around forever. The Soviets, back in the old days, tried to develop something similar to it. The Internet will probably survive the next decade or two, but fifty years is a stretch; as for the World Wide Web, it has already devolved into a transport layer for JavaScript and DRM, having been rendered bloated and malicious in practice (albeit not in theory; one can still produce elegant Web sites).



Earlier this year we moved to Gemini and more than a year ago we adopted IPFS, which is used to circulate daily bulletins and IRC logs in a decentralised fashion. Our IRC channels all became self-hosted (in our network) earlier this year -- an ambition that we've had for years but didn't get around to until Freenode collapsed.

Archiving a Web site isn't the same as format changes and protocol changes. It's also not about making more copies, especially if those copies are as vulnerable to censorship as one another. Here in this site we have some public domain (PD) works that are of relevance to us and can be accessed in gemini://. Most of the works, however, use a Creative Commons licence. We are not a curation site per se, but it helps to keep copies of historical material, such as antitrust material demonstrating Microsoft's crimes (as tactics barely change over time). Well, by Internet standards we have enjoyed a long span of 15 years (articles and daily links) and we remain active on the daily basis. The same is true for Tux Machines, which turns 18 this coming summer, so a lot of the material we have here is no longer available anywhere else, except the Internet Archive (IA).

A few years ago we started making site archives in IA and we also recommended the site to people, dubbing it the most important site on the Web. It's no eternal site however; as an associate of ours explains, "the IA is very important but it will succumb as the WWW is phased out in favor of obfuscated, proprietary JavaScript."

IA can barely cope with (e.g. spider/index/save/navigate) many of the "modern" Web. When you add DRM to the mix (EME), then it's not a "format-shifting'" task as that too becomes an impossibility. Sites need to evolve or perish, which may mean getting off the Web and one day planning for the demise of the Internet as a whole. Like IA, our associate explains, "archive.is is interesting, but it'll die one day. In the long run they will all pass away. In formal archives, one of the initial decisions the institution has to make about any given artifact is that of how long it shall be preserved for. Nothing lasts forever, but there are ways of stretching things out and the duration determines the methods of preservation."

For a site such as ours it makes sense to keep the material available for 50 years, which is maybe how much longer I can live (if I'm lucky).

"Media shifting will obviously be involved," the associate notes, "but at a loss for some items. The plan pre-dates AWA by a great many years."

Last weekend we turned 15. "Already in 15 short years," our associate remarks, "many whole sites are gone. And of the sites that remain, many have lost all their old articles in clumsy reorgs. Of that which is left, some of those have purged documents with "inconvenient" messages or themes... even Groklaw purged its comments. I suppose few to none of the Groklaw comments made it into the Library of Congress archives."

At the time of writing I'm still uploading 205 MB of archives (as shown in the video above). We hope it can inspire other sites to think ahead and do the same. It's not a big task and it's better done before it's "too late"...

Our associate concludes by saying that "many programmers and even engineers are conscientious in erasing anything "old" even important records. Now with electronic media, there is often only a single copy of anything any more and that introduces, obviously, a single point of failure. So in the old days, one could maintain a relevant personal or professional archive. Now those are all centralized and continue to exist only at the whim of participant consensus. Anyone with administrative privileges, can "tidy" up and easily erases the world's last copy of a standard or other evidence or similar material."

We are going to add more material to IA and it can be found here as that piles up along with some material that isn't ours.

Recent Techrights' Posts

Former Debian Project Leader Branden Robinson Cautions Against Cover-up and Censorship in Debian
Debian drama. Again.
It's Friday Again and Many People Leave IBM for Good (IBM Should be Reported for Illegal NDAs That Hide Layoffs)
we very seldom see anyone deviating a lot from the "template-like" narrative, let alone mentioning "layoffs" or "RA" or some other term that implies non-consensual departure
What Do People Ever Buy From Microsoft Anyway (Not PCs)?
Microsoft sells two things these days: 1) vapourware/promises. 2) its stock.
Gemini Links 20/02/2026: "Mainstream Unix, Underground Unix", Slop Staging DDoS Attacks Against Small Sites
Links for the day
 
Links 21/02/2026: "Moving Away From Cloudflare", Many Layoffs or Shutdowns in Games (Including XBox/Microsoft)
Links for the day
GNU Linux-libre is a Grown-Up Today
"before that, every distro that wanted to respect its users' freedom had to remove itself all of the binary blobs that were distributed as part of the kernel Linux's so-called sources"
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Friday, February 20, 2026
IRC logs for Friday, February 20, 2026
Gemini Links 21/02/2026: "The Evil of Action" and Slop Bots Causing Great Harm Online (Not Just the Web)
Links for the day
Like a Shell
Overreactions can backfire
Not Only Leaders of XBox Got Sacked (Layoffs)
Phil Spencer and Sarah Bond got laid off
9PM on a Friday Night: Microsoft Says the Layoffs Are Not Layoffs
We've said for a long time that XBox is doomed this year
Gemini Links 20/02/2026: Misfin Server and Magic in Programming
Links for the day
analytics.usa.gov Reckons Windows "Market Share" Fell to Just 38%, Vista 11 Not Even a Third of Windows Users
This coming summer Vista 11 turns 5
The New Digg.com is Slop
Slop "summaries" and Serial Sloppers are drowning out the site with fake 'articles' (plagiarism)
Linus Torvalds: Bill Epsteingate Good Enough for Me to Wine and Dine With
Torvalds is more connected to Jeffrey Epstein than Richard Stallman ever was
Our Uptimes Are Always Better Than Any Site That Uses Clownflare
Clownflare as a company operates like a cult
GNU/Linux Apparently Rose to 6% in Uzbekistan
If accurate, this represents a new problem for Microsoft and a big win for Software Freedom
Sponsored Videos and 'Articles' in The Register MS, Stenography as a Service/Product
They should more accurately label these actors
The Little Clique of Sloppers/Spammers About "Linux" Got Even Smaller
Thankfully there are still genuine and legit GNU/Linux sites out there
Links 20/02/2026: Microsoft Intentionally Kills Older Hardware, "The Story of XBox" Shows How Defective Microsoft Hardware Really Was
Links for the day
Turkmenistan One of Many Countries Where Microsoft Fell to Distant Third in Search
We expect many layoffs in Bing some time soon
Don't Wait for "Red Hat Layoffs" Because After Bluewashing They're IBM RAs and Don't Wait for "IBM Layoffs" Because They're Perpetual
IBM layoffs are silent and "forever" (small trickle that never ends and is widespread - after all IBM is a very global and ubiquitous firm)
Links 20/02/2026: Standards, Science, and Politics
Links for the day
IBM Inclusivity: Red Hat Summit is for Rich Sponsors Like Microsoft and Rich Guests Who Pay $500 a Day
Nothing signals societal tolerance more than paying a large military contractor
GNU/Linux Adoption is Higher in Richer Countries
Is it because freedom is actually expensive - something that only privileged people can pursue?
Links 20/02/2026: Windows TCO Versus Deutsche Bahn, Europe Seeks More Independent Digital Future
Links for the day
IBM, Red Hat and Fedora: Don't Say "Master", It Offends People. Also IBM, Red Hat and Fedora: "Master Podman".
The hypocrisy at Red Hat and Fedora shows no boundaries
IBM Layoffs Aren't Just in IBM 'Proper'
Who is still using Lotus after the HCL move?
The Register MS Gets Paid by Gartner to Promote a Ponzi Scheme for Gartner, Microsoft, and Others
The credibility of that site will suffer because it tries to sell a major scam to its audience
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, February 19, 2026
IRC logs for Thursday, February 19, 2026
Gemini Links 19/02/2026: "Towards a Gemini Famicom Resource" and Dumping Microsoft
Links for the day
IBM Behaves Like a Company Looking for Loose Change Between Sofa Cushions
Chasing laid-off workers for dollars and even pennies, making excuses and devising loopholes (such as PIPs) to flout severance obligations
Microsoft Found Another Bailout Opportunity: Killing People
Good thing that Nadella is not racist!
No "Smart Mobs" (Social Control Media) in BRIC?
It looks like the "Social" "Media" sites tracked by statCounter see little from (or of) BRIC, and moreover it is declining fast
The Few Slopfarms We Saw Today
The sentiment has changed a lot
Links 19/02/2026: Protecting Framework Laptop 13, Hardware Drive Shortages
Links for the day
In Africa's Second-Largest Nation, Democratic Republic of the Congo (DRC), Opera 10 Times Bigger Than Firefox (and GNU/Linux Now at 5%)
This will become an accessibility problem
Links 19/02/2026: "A.I.pocalypse" Inevitable and "Butlers to LLMs"
Links for the day
An Inherently Royal (Monarchs') Legal System Where Size Matters (Big Capital Eats the Small)
This reinforces the notion that justice is only for those who can afford it
These Statistics Should Keep Microsoft Shareholders Awake at Night
Windows is, in general (all versions collectively), declining over time
Economic Failure and Other Harsh Realities Have Nothing to Do With Slop 'Innovation'
Advanced propaganda, not advanced 'AI' [...] They attack workers while insulting their intelligence
Spaniards Shutting Down MElon's Digital Weapon of "Smart Mobs"
Are the Spanish people already acting based on gut feeling and shunning/shutting out the provocation vector?
Bitcoin: government engagement contradictions
Reprinted with permission from Daniel Pocock
Richard Stallman in the United States - Part II - "Haters Gonna Hate"
we shall carry on with this series at the right pace
Typical! Solicitors Regulation Authority (SRA) Tells Victims of Fraud to Wait 10 Weeks
justice delayed is justice denied
EPO Union Leaders in Rijswijk Explain Where EPO Strikes Stand and How to Prepare for Next Week's
We have some revelations to share in a few days
statCounter: Only One in 350 Iranians Would Use Microsoft for Web Search
Microsoft is trying to fake "demand"
Slides Shown a Week Ago by the EPO's Staff Committee Ahead of the Second Very Large Strike
This coming weekend we'll drop a 'bombshell' of sorts
EPO "Cocaine Communication Manager" - Part II - Illegal Drug Addicts Mobbing the Wrong People, This Will Definitely Backfire
This year may well be the last year of Team Campinos. Nobody will hire them after that.
Mass Layoffs (But Silent Layoffs) Still Happening in IBM, You Need Only Look Closely (There Are NDAs, PIPs, 'Early Retirement' Sweeteners and IBM - Like Microsoft - Skirts the WARN Act)
the layoffs are definitely happening
Microsoft's "AI CEO" (Slop Propagandist) is Projecting, Many Microsoft "Jobs to be Replaced With All-Indian Low-Paid Staff in 12 Months"
Windows is perishing
Very Little Slop
We are not finding much slop anymore
Links 19/02/2026: Illegal Kangaroo Court for Patents Attracts Aggressive Firms, Public Domain Review Grows
Links for the day
Gemini Links 19/02/2026: Taxing the Rich, Raspberry Pi 4 Tinkering
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, February 18, 2026
IRC logs for Wednesday, February 18, 2026