Bonum Certa Men Certa

Archiving Web Sites to Ensure They Last Decades, Not Years, Outliving or Outlasting Various Disruptive Events

Video download link | md5sum b29da11a5ae25c7597c459e8e4c320b2



Summary: Today we upload 15 years' worth of blog posts to the Internet Archive (IA), or close to 32,000 stories along with Daily Links; we suggest that other sites do the same in order to tackle 'Internet rot' and preserve information (otherwise there's room for obscene revisionism)

THE INTERNET won't stay around forever. The Soviets, back in the old days, tried to develop something similar to it. The Internet will probably survive the next decade or two, but fifty years is a stretch; as for the World Wide Web, it has already devolved into a transport layer for JavaScript and DRM, having been rendered bloated and malicious in practice (albeit not in theory; one can still produce elegant Web sites).



Earlier this year we moved to Gemini and more than a year ago we adopted IPFS, which is used to circulate daily bulletins and IRC logs in a decentralised fashion. Our IRC channels all became self-hosted (in our network) earlier this year -- an ambition that we've had for years but didn't get around to until Freenode collapsed.

Archiving a Web site isn't the same as format changes and protocol changes. It's also not about making more copies, especially if those copies are as vulnerable to censorship as one another. Here in this site we have some public domain (PD) works that are of relevance to us and can be accessed in gemini://. Most of the works, however, use a Creative Commons licence. We are not a curation site per se, but it helps to keep copies of historical material, such as antitrust material demonstrating Microsoft's crimes (as tactics barely change over time). Well, by Internet standards we have enjoyed a long span of 15 years (articles and daily links) and we remain active on the daily basis. The same is true for Tux Machines, which turns 18 this coming summer, so a lot of the material we have here is no longer available anywhere else, except the Internet Archive (IA).

A few years ago we started making site archives in IA and we also recommended the site to people, dubbing it the most important site on the Web. It's no eternal site however; as an associate of ours explains, "the IA is very important but it will succumb as the WWW is phased out in favor of obfuscated, proprietary JavaScript."

IA can barely cope with (e.g. spider/index/save/navigate) many of the "modern" Web. When you add DRM to the mix (EME), then it's not a "format-shifting'" task as that too becomes an impossibility. Sites need to evolve or perish, which may mean getting off the Web and one day planning for the demise of the Internet as a whole. Like IA, our associate explains, "archive.is is interesting, but it'll die one day. In the long run they will all pass away. In formal archives, one of the initial decisions the institution has to make about any given artifact is that of how long it shall be preserved for. Nothing lasts forever, but there are ways of stretching things out and the duration determines the methods of preservation."

For a site such as ours it makes sense to keep the material available for 50 years, which is maybe how much longer I can live (if I'm lucky).

"Media shifting will obviously be involved," the associate notes, "but at a loss for some items. The plan pre-dates AWA by a great many years."

Last weekend we turned 15. "Already in 15 short years," our associate remarks, "many whole sites are gone. And of the sites that remain, many have lost all their old articles in clumsy reorgs. Of that which is left, some of those have purged documents with "inconvenient" messages or themes... even Groklaw purged its comments. I suppose few to none of the Groklaw comments made it into the Library of Congress archives."

At the time of writing I'm still uploading 205 MB of archives (as shown in the video above). We hope it can inspire other sites to think ahead and do the same. It's not a big task and it's better done before it's "too late"...

Our associate concludes by saying that "many programmers and even engineers are conscientious in erasing anything "old" even important records. Now with electronic media, there is often only a single copy of anything any more and that introduces, obviously, a single point of failure. So in the old days, one could maintain a relevant personal or professional archive. Now those are all centralized and continue to exist only at the whim of participant consensus. Anyone with administrative privileges, can "tidy" up and easily erases the world's last copy of a standard or other evidence or similar material."

We are going to add more material to IA and it can be found here as that piles up along with some material that isn't ours.

Recent Techrights' Posts

EPO Strikes Are Already Working
Campinos is already going "into hiding"
The Brand 'Watsonx' is a Terrible Name for IBM 'Hey Hi' (Chatbots) Because Watson Agreed With Adolf Hitler
Almost a century has passed and IBM still believes that selling "intelligence", chatbots in particular, should be done under the name "Watson"
 
The Broken Window Industry and Its Ongoing Desires to Make Technology Less Dependable
Reliable computing is becoming harder to find
Microsoft Windows Falls to Another New All-Time Low in Guatemala, It is a Bottomless Pit
Maybe users come to realise that Windows means back doors and those doors are open to a regime that ought not be trusted
"XBox" Will Become Slop After Mass Layoffs
When all else fails, "AI it"
Links 22/02/2026: Hardware Price Hikes Across the Board, "Microsoft Issues Statement on Potential Layoffs"
Links for the day
Microsoft "Layoffs Incoming"
This transition isn't about promoting games; it's about canning the console
Links 22/02/2026: "Bloat of Modern Fitness Apps" and Wikipedia Deprecates Archive.today
Links for the day
Our IRC 5-Year Anniversary (for Self-Hosted) is Fast Approaching
A week from now it's March already
Gemini Links 22/02/2026: Dream Job Gone and Slop in Taskwarrior
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Saturday, February 21, 2026
IRC logs for Saturday, February 21, 2026
GNU/Linux Grew a Lot in Nicaragua
We've not noticed until today
Techrights Has Over 1,000 Good Articles 'in the Tank'
Drafts, notes, and lengthy documents
New Article Challenges Solicitors Regulation Authority (SRA) for Choosing the Wrong SLAPP Cases to Investigate
The one point we can agree on is that SRA does not know how to correctly select the worst culprits/offenders
Why IBM is Still Scary and Dangerous
Keep a distance from "Big Blue" Bully
Measuring the Growth of Our Mission and Community
Something between experiment and prototype
Richard Stallman in the United States - Part III - Georgia Tech Did a Fine Job Upholding Free Speech Principles
The real problem was social control media (toxic)
Debian's Master is Deleting Criticism of SystemD and Other Things (On-Topic and Published by Debian Developers), Resorts to the Excuse Messages Are "Too Long"
Censorship serves nobody except the masters that control this censorship
Digg's Latest Incarnation Already Failed, It's Infested With LLM Slop
Many submissions go to slopfarms and some get summarised by slop
Gemini Links 21/02/2026: Veganism and DeskPi RackMate T0
Links for the day
On The Web, XBox Already a Dying Breed
Down to about 0.05% on large machines, based on statCounter [...] Microsoft will never publicly admit or say how many billions it lost on the XBox
2026 a Year of 'Top-Down' Microsoft Layoffs (Management First)
Stay tuned for what comes next
Your "Likes" Aren't Yours and They're Mostly "Worthless Clicks"
Social hermits are not popular, irrespective of how many "Facebook friends" or "likes" they get
Waggener Edstrom/Frank Shaw Lied, There Are Definitely Microsoft Layoffs
Microsoft never issued a formal statement, it made allusions by proxy
Microsoft-Controlled Media With Embargo and Press Operatives
This won't be the last example of media manipulation for narrative control or face-saving "damage control"
Slop Hype Makes Our Core Technology Less Reliable and Far Less Resilient (We Pay for the Catastrophe That Follows)
Only slop-free projects can be trusted
Going for 1,000 (Days of Uptime)
universal records are vastly better
Firefox is No-Go in China, Not Even 1% "Market Share" Anymore
Given Mozilla's utterly rubbish marketing these days (politics over technical aspects), set aside the cheerleading for slop, there's hardly a chance of Mozilla Firefox reaching or exceeding 10% again
EPO "Cocaine Communication Manager" - Part III - It's in His Eyes
Workers are free to draw their own conclusions
Links 21/02/2026: Tensions Over Iran and Illegal Cheeto Tariffs, Presidential Approval Sags
Links for the day
Links 21/02/2026: "Moving Away From Cloudflare", Many Layoffs or Shutdowns in Games (Including XBox/Microsoft)
Links for the day
GNU Linux-libre is a Grown-Up Today
"before that, every distro that wanted to respect its users' freedom had to remove itself all of the binary blobs that were distributed as part of the kernel Linux's so-called sources"
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Friday, February 20, 2026
IRC logs for Friday, February 20, 2026
Gemini Links 21/02/2026: "The Evil of Action" and Slop Bots Causing Great Harm Online (Not Just the Web)
Links for the day
Like a Shell
Overreactions can backfire
Not Only Leaders of XBox Got Sacked (Layoffs)
Phil Spencer and Sarah Bond got laid off
9PM on a Friday Night: Microsoft Says the Layoffs Are Not Layoffs
We've said for a long time that XBox is doomed this year
Gemini Links 20/02/2026: Misfin Server and Magic in Programming
Links for the day
Former Debian Project Leader Branden Robinson Cautions Against Cover-up and Censorship in Debian
Debian drama. Again.
analytics.usa.gov Reckons Windows "Market Share" Fell to Just 38%, Vista 11 Not Even a Third of Windows Users
This coming summer Vista 11 turns 5
The New Digg.com is Slop
Slop "summaries" and Serial Sloppers are drowning out the site with fake 'articles' (plagiarism)
Linus Torvalds: Bill Epsteingate Good Enough for Me to Wine and Dine With
Torvalds is more connected to Jeffrey Epstein than Richard Stallman ever was
Our Uptimes Are Always Better Than Any Site That Uses Clownflare
Clownflare as a company operates like a cult
GNU/Linux Apparently Rose to 6% in Uzbekistan
If accurate, this represents a new problem for Microsoft and a big win for Software Freedom
Sponsored Videos and 'Articles' in The Register MS, Stenography as a Service/Product
They should more accurately label these actors
It's Friday Again and Many People Leave IBM for Good (IBM Should be Reported for Illegal NDAs That Hide Layoffs)
we very seldom see anyone deviating a lot from the "template-like" narrative, let alone mentioning "layoffs" or "RA" or some other term that implies non-consensual departure
The Little Clique of Sloppers/Spammers About "Linux" Got Even Smaller
Thankfully there are still genuine and legit GNU/Linux sites out there
Links 20/02/2026: Microsoft Intentionally Kills Older Hardware, "The Story of XBox" Shows How Defective Microsoft Hardware Really Was
Links for the day
Turkmenistan One of Many Countries Where Microsoft Fell to Distant Third in Search
We expect many layoffs in Bing some time soon
Don't Wait for "Red Hat Layoffs" Because After Bluewashing They're IBM RAs and Don't Wait for "IBM Layoffs" Because They're Perpetual
IBM layoffs are silent and "forever" (small trickle that never ends and is widespread - after all IBM is a very global and ubiquitous firm)
Links 20/02/2026: Standards, Science, and Politics
Links for the day
What Do People Ever Buy From Microsoft Anyway (Not PCs)?
Microsoft sells two things these days: 1) vapourware/promises. 2) its stock.
Gemini Links 20/02/2026: "Mainstream Unix, Underground Unix", Slop Staging DDoS Attacks Against Small Sites
Links for the day
IBM Inclusivity: Red Hat Summit is for Rich Sponsors Like Microsoft and Rich Guests Who Pay $500 a Day
Nothing signals societal tolerance more than paying a large military contractor
GNU/Linux Adoption is Higher in Richer Countries
Is it because freedom is actually expensive - something that only privileged people can pursue?
Links 20/02/2026: Windows TCO Versus Deutsche Bahn, Europe Seeks More Independent Digital Future
Links for the day
IBM, Red Hat and Fedora: Don't Say "Master", It Offends People. Also IBM, Red Hat and Fedora: "Master Podman".
The hypocrisy at Red Hat and Fedora shows no boundaries
IBM Layoffs Aren't Just in IBM 'Proper'
Who is still using Lotus after the HCL move?
The Register MS Gets Paid by Gartner to Promote a Ponzi Scheme for Gartner, Microsoft, and Others
The credibility of that site will suffer because it tries to sell a major scam to its audience
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, February 19, 2026
IRC logs for Thursday, February 19, 2026