Bonum Certa Men Certa

Archiving Web Sites to Ensure They Last Decades, Not Years, Outliving or Outlasting Various Disruptive Events

Video download link | md5sum b29da11a5ae25c7597c459e8e4c320b2



Summary: Today we upload 15 years' worth of blog posts to the Internet Archive (IA), or close to 32,000 stories along with Daily Links; we suggest that other sites do the same in order to tackle 'Internet rot' and preserve information (otherwise there's room for obscene revisionism)

THE INTERNET won't stay around forever. The Soviets, back in the old days, tried to develop something similar to it. The Internet will probably survive the next decade or two, but fifty years is a stretch; as for the World Wide Web, it has already devolved into a transport layer for JavaScript and DRM, having been rendered bloated and malicious in practice (albeit not in theory; one can still produce elegant Web sites).



Earlier this year we moved to Gemini and more than a year ago we adopted IPFS, which is used to circulate daily bulletins and IRC logs in a decentralised fashion. Our IRC channels all became self-hosted (in our network) earlier this year -- an ambition that we've had for years but didn't get around to until Freenode collapsed.

Archiving a Web site isn't the same as format changes and protocol changes. It's also not about making more copies, especially if those copies are as vulnerable to censorship as one another. Here in this site we have some public domain (PD) works that are of relevance to us and can be accessed in gemini://. Most of the works, however, use a Creative Commons licence. We are not a curation site per se, but it helps to keep copies of historical material, such as antitrust material demonstrating Microsoft's crimes (as tactics barely change over time). Well, by Internet standards we have enjoyed a long span of 15 years (articles and daily links) and we remain active on the daily basis. The same is true for Tux Machines, which turns 18 this coming summer, so a lot of the material we have here is no longer available anywhere else, except the Internet Archive (IA).

A few years ago we started making site archives in IA and we also recommended the site to people, dubbing it the most important site on the Web. It's no eternal site however; as an associate of ours explains, "the IA is very important but it will succumb as the WWW is phased out in favor of obfuscated, proprietary JavaScript."

IA can barely cope with (e.g. spider/index/save/navigate) many of the "modern" Web. When you add DRM to the mix (EME), then it's not a "format-shifting'" task as that too becomes an impossibility. Sites need to evolve or perish, which may mean getting off the Web and one day planning for the demise of the Internet as a whole. Like IA, our associate explains, "archive.is is interesting, but it'll die one day. In the long run they will all pass away. In formal archives, one of the initial decisions the institution has to make about any given artifact is that of how long it shall be preserved for. Nothing lasts forever, but there are ways of stretching things out and the duration determines the methods of preservation."

For a site such as ours it makes sense to keep the material available for 50 years, which is maybe how much longer I can live (if I'm lucky).

"Media shifting will obviously be involved," the associate notes, "but at a loss for some items. The plan pre-dates AWA by a great many years."

Last weekend we turned 15. "Already in 15 short years," our associate remarks, "many whole sites are gone. And of the sites that remain, many have lost all their old articles in clumsy reorgs. Of that which is left, some of those have purged documents with "inconvenient" messages or themes... even Groklaw purged its comments. I suppose few to none of the Groklaw comments made it into the Library of Congress archives."

At the time of writing I'm still uploading 205 MB of archives (as shown in the video above). We hope it can inspire other sites to think ahead and do the same. It's not a big task and it's better done before it's "too late"...

Our associate concludes by saying that "many programmers and even engineers are conscientious in erasing anything "old" even important records. Now with electronic media, there is often only a single copy of anything any more and that introduces, obviously, a single point of failure. So in the old days, one could maintain a relevant personal or professional archive. Now those are all centralized and continue to exist only at the whim of participant consensus. Anyone with administrative privileges, can "tidy" up and easily erases the world's last copy of a standard or other evidence or similar material."

We are going to add more material to IA and it can be found here as that piles up along with some material that isn't ours.

Recent Techrights' Posts

You Know Microsoft's "Value" is 100% Fictional When in One Single "Trading" Day in Wall Street It Loses THREE TIMES More in "Value" Than It Was 'Worth' in 2009
Microsoft does not behave like a company riding trillions but like a company that struggles with payroll
Better Outcomes When Facing the Discomfort of Conflict
Don't take the easy way out when the "hard way" is the right way and it can result in positive revelations
Leaving the United States 3 Years Ago Was the Best Decision We Made
A lot of stuff is being consolidated
BillBC (BBC) Covered Up Pedophilia, Now It's Covering Up for Its Sponsor Bill Gates by Reprinting His Lies, Which His Own Wife Disputes
Is Bill Gates having orgies (group sex)?
 
Links 04/02/2026: Extreme Malice in Microsoft's Visual Studio Code on GNU/Linux, More Hey Hi (AI) Chaos
Links for the day
Sexism & GNOME: shaming men, hiding women, Sonny Piers update
Reprinted with permission from Daniel Pocock
Gemini Links 04/02/2026: Humanity and Animality, systemd (Controlled by Amutable, a Proxy of Microsoft) Moves on to "Extinguish" Phase
Links for the day
Certificate Authority Let's Encrypt Used to be Widely Used in Geminispace, Now It's Down to Just 0.2% of the Whole
Let's Encrypt is not your friend
What IBM Does Is Clearly Illegal in the US: Tying Severance Packages to NDAs (Non-Disparagement Agreement/Clause)
The NDAs make things worse; they keep people isolated and silent
Microsoft's Giant Snowball of Layoffs and PIPs (in 2026)
They would delay until March or April if they wanted to, but then we can expect numbers exceeding 10,000 layoffs (Microsoft always low-balls the real figure/s)
Mozilla Turned Firefox Into Shovelware, Adding 'Kill Switch' for Slop Still Means Mozilla is Participating in a Pyramid Scheme, Plagiarism, Grifting
Mozilla is still a slop pusher
Links 04/02/2026: "Laws of Succession" and Microsoft's VS Code as Code-Stealing Malware
Links for the day
Phoronix Swims With the Real Trolls, People Who Fancy Proprietary Software and Back Doors
If Larabel begins to actively participate in provocation with the "Microsoft GitHub fans club", what does this tell us about Phoronix?
They Know Microsoft Layoffs Are About to Hit Them Hard
The gaming division at Microsoft is a complete catastrophe, lots of money (debt) down the drain [...] Buying Activision was all about misleading shareholders or hiding the deep trouble/problems XBox was having
Red Hat is Not a Linux Company, It's IBM's Ponzi Scheme Enabler
Had we still been stuck in 2021, perhaps IBM would plaster "NFT" or "metaverse" all over RedHat.com
Keep Grinding
"Don't let the bastards grind you down"
Mobbing at the European Patent Office (EPO) - Part III - Who's Going to Pay for the EPO's Corruption? (Aside From European Citizens)
Some people inside the EPO reached out to us
"Investors Are Concerned About an AI Bubble" (That GAFAM and IBM Ride)
A few decades from now IBM will only be remembered in the same sense many so-called 'AI' companies will be remembered
EPO Staff Union: "Very High Strike Participation on Friday 30 January", Another Strike Starts 19 Days From Now
EPO management in a bit of a panic
Censorship/Free Speech and Social Control Media
It's important to have a grasp of how contemporary censorship works and how to tackle it
Google News as Slop Booster
this is what Google links to
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Tuesday, February 03, 2026
IRC logs for Tuesday, February 03, 2026
Gemini Links 04/02/2026: "Raspberry Pi Relaxes the Rules for Its RP2040 Hacking Challenge" and "Long Web Society"
Links for the day
IBM Falls by Over 10%
a recipe for disasters like accounting fraud
Links 03/02/2026: Windows Copies GNU/Linux, Windows TCO Shown Again
Links for the day
Gemini Links 03/02/2026: Alhena Turns One, Slop Rejected, and Max Roy Carrouges Recalled
Links for the day
How to Identify Demonisation or Dehumanisation Tactics Against Interesting Figures or Luminaries in Free Software
Rather than in general or generally in technology
We Should Learn From Bulgaria
Why can't European companies and government recognise and react to a threat (when they see one)?
Dr. Andy Farnell on Why and How European Authorities Can Adopt Free Software, Parenting in the Age of Digital Abundance
Will Europe use technology that Europe controls (not the hegemon), for a change?
Canonical: Ubuntu is GAFAM (US), We're Resellers of American Proprietary Software
They want people to pay for a licence
Seems Like IBM Trolls Use Chatbots to Vandalise Platform That Discusses IBM's Secret Layoffs, Forever Layoffs
Not for the first time either
You Know Your Company is Dead or Basically a Pyramid Scheme When Jim Cramer Keeps Promoting Its Stock
How much does IBM pay for "puff pieces" or "fluff" about QC?
Red Hat (Under IBM) Works for Microsoft (Proprietary Software) and Slop
Yesterday Red Hat's official site, redhat.com, published exactly 5 new blog posts
IBM is Dying (More Layoffs), Red Hat Will Continue to Suffer From the Acquisition
Financial engineering
Colombia Adopting GNU/Linux Even Faster (at Microsoft's and Apple's Expense)
Do politics play any role in this?
An Effort to Tackle Slavery in 'Open Source' Clothing
"a civil rights lawsuit to examine the concerns of censored developers in the free, open source software ecosystem"
$15 billion lawsuit: Ubuntu, Google & Debian crowdfunding campaign launch
Reprinted with permission from Daniel Pocock
The Solicitors Regulation Authority (SRA) Delusion - Part II - Why We Need to Expose the SRA to More Daylight, Public Scrutiny
SRA is neither effective nor regulated
Links 03/02/2026: "Distraction is a Sin" and Fake "Encryption" (Surveillance With Good Marketing)
Links for the day
400-Page US Federal Court Against Abuses by Google, Microsoft and Front Groups That Abuse Volunteers for American Corporations
There are 386 pages in total (in the US claim)
Corporate Influence Never Impacted Us
There's no reason to assume we'll ever "sell out"
Growth of GNU/Linux in Cuba
Right now a lot of the world drafts or already implements a GAFAM exit plan
A Day After EPO Strikes an Escalation to Heads of Delegations to the Administrative Council
They rely on the European media playing along, helping them to hide major blunders, even crimes
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Monday, February 02, 2026
IRC logs for Monday, February 02, 2026
Gemini Links 03/02/2026: Stargazing, Development Boards, and Tcl/Tk Slop
Links for the day
Microsoft Lost 20% of Its Money in the Past 6 Months
Microsoft is hiding what's really happening while mocking critics
Great News, IBM 'Gained' Almost 10% in "Goodwill" Value After Firing Tens of Thousands in 2025
"goodwill" will be inflated despite IBM staff getting sick of IBM
Americans Move to GNU/Linux
some of the biggest American populations
I Still Like Drawing and Various Other Arts (They Help My Activism and Journalism), Slop is an Enemy of Creative People
Recognise that slop isn't intelligence; it's a generational excuse for plagiarism and privatisation of not only the Commons but also proprietary knowledge (without authorisation)
Carmen-Lisandrette Maris (Mission:Libre) Explains to Adolescents and Young Adults How Free Software Improves Privacy
Based on what we've seen and read, Mission:Libre has a solid grasp of Software Freedom
Chatbots Didn't Do Any Good for Microsoft
Google "AI" = search + copypasta
Links 02/02/2026: Cultural Cleansing by China and 'Living Behind Firewalls" in Iran
Links for the day
GNU/Linux Measured at More Than 4% in Russia
growing adoption of GNU/Linux in Russia
Gemini Links 02/02/2026: Stages of Age, Workflows, and Counting Capsules
Links for the day
Oracle's Debt Rose Over 20 Billion Dollars in Just 3 Months
Is "hey hi" becoming a synonym for debt?
Oligarchs' 'Speech Zones' Are Not the "Public Square"
The apologists of social control media, including press that got "addicted" to such fake "media", are helping dictators and oligarchs grab the public attention away from the real press
IBM Misleads and Gaslights Investors With Slop Sold as "AI" (the Business is Waning, Mass Layoffs Continue)
People who do this are dishonest. They should not be put in charge.
Links 02/02/2026: 'Melania' a Horror Movie "Will They Inherit Our Blogs?"
Links for the day
Doing More Detailed Series (Long-Form Works)
Long readings or book-like reading binges are only possible when parts are suitably labeled (name and numbers) if not interlinked
Mobbing at the European Patent Office (EPO) - Part II - Racism, Cocaine Use and White-Collar Corruption
When you hire people illegally, to work for cocaine users and keep quite about the cocaine use, what will be the impact on the reputation of an institution?
A Can of WORMS - Part II - Darkening the Name of RMS, Associating It With Crime
Beware projection tactics
Submit Your Suggestions for EU's Embrace of Software Freedom by Tomorrow
Time to leave GAFAM (US) hegemony behind
Slopless Weekend
This is not sustainable
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Sunday, February 01, 2026
IRC logs for Sunday, February 01, 2026