Bonum Certa Men Certa

Wikipedia Can Lower Its Hosting Bill by Going More Static, Not Just by Caching, But It Would Not Solve Its Biggest Problems (Bribes and AstroTurfing)

posted by Roy Schestowitz on Apr 08, 2025,
updated Apr 08, 2025

back at the pawnshop

LLMs are not the biggest problem at Wikipedia

2008: Microsoft Agents from Waggener Edstrom Airbrush Wikipedia, Glorify Paymaster

For about 15 years we had a Wiki in this site (it's still there, but not as a wiki; it's a wikidump, i.e. static pages). We adopted the same software developed and used by Wikipedia. Spam and vandalism meant we just had to limit who can edit; script kiddies did so much damage (defacing or adding bunk pages) that rolling back changes became a chore, even while on holiday.

By 2023 it also became a nuisance due to read-only bots; online spiders/scrapers would constantly hit unique ('on-demand') pages that made no sense; no person would repeatedly request those. Caching would not help much if many different "pages" were repeatedly requested (sometimes hundreds of times per second), invoking the back end (MariaDB/MySQL and PHP) so many times for no good reason. At times we got thousands of requests per second. That's just too much, even for a decent router.

Wikipedia recently bemoaned LLM scrapers; it really ought to moan about LLMs distorting Wikipedia articles. Moreover, Wikipedia ought to complain about: 1) Microsoft and Bill Gates bribing Wikipedia [1, 2, 3, 4] to be passive while they distort Wikipedia articles (for PR purposes, revisionism/lies/selective omissions as "articles"); 2) Microsoft providing servers and money for LLM scrapers, such as those which harm Wikipedia (overwhelming the back end).

Wikipedia isn't a site of integrity; not anymore. I wrote a great deal about Wikipedia over the years. More than 16 years ago the cofounder of Wikipedia openly blasted Microsoft for bribing people to edit Wikipedia articles (interjecting Microsoft lies/spin or 'guarding' pages of interest against facts). Nowadays this cofounder (the greedy one, Wales, not Sagner) would simply look the other way while his bank balance grows.

If Wikipedia is serious about lowering its hosting bills (its financial disclosures show that this expense is very minor compared to other things) or making the site faster/more resilient, then it should consider becoming more like Britannica, which is a lot trickier for corporations to manipulate.

As a kid I used Britannica and other encyclopedias quite a lot. Nowadays I feel disillusioned and dissatisfied about the "page anyone can manipulate" approach; it's becoming a lot more like Social Control Media, not literature. It's not about what's true but about "brigading" and persistence/perseverance (or budget).

Wikipedia needs to get its act together or lose what's left of its former reputation. Britannica isn't a good yardstick, but in my experience it's nowadays a lot more accurate and reliable than Wikipedia, where many articles are "unfinished works" or ads disguised as legitimate pages (sometimes people or companies writing about themselves).

Demoting or altogether abandoning Wikipedia isn't easy; people have nostalgic memories (sentimental facets) about what it used to be, however a lot has changed. Many ordinary people "contributed" to Wikipedia (edits, funds etc.), so rejecting Wikipedia feels like self-loathing or self-betrayal.

Wikipedia has new masters. They work against you. Wikipedia is just another "Advertising Channel" to them ("Reputation Management"). For only a little money ("slush funds") they can get a lot out of it. It's another "investment".

Wikipedia: Bill and Melinda Gates Foundation

Wikipedia: Microsoft Matching Gifts Program

Other Recent Techrights' Posts

Gemini Links 20/05/2025: LLM Scraper Bots in Gopher and "Starmer and the Somewheres"
Links for the day
Skype Fell Off a Cliff (Microsoft Killed It), All Microsoft Has Left Now is Slop and Spaghetti Code
"This isn’t about AI. This is a puppet show to drive stock prices up and down."
Slopfarms (Machine-Generated Fake News Sites Authored by Bots With Slop Images) Spread GNU FUD
This isn't about Linux (GNU doesn't run just on Linux)
United States Federal Government's Digital Analytics Program (DAP): GNU/Linux Users Represent Close to 6% of Visitors This Year
How far has GNU/Linux gotten? Very far!
The "LLM Ouroboros of Shit" is Complemented by Even Worse Phenomena Caused by Microsoft's Contribution of SPAM and Pollution
Microsoft became a world leader in promotion of LLM slop
The LLM Ouroboros Phenomenon
Fact #1: over time slop gets worse (training set is like some blurry JPEG). Fact #2: People's "smell" for slop improves over time, as they 'train' on slop and can detect it based on prior encounters. Put 1 and 2 together.
How We Defeated DDoS Attacks
One of the best things one can do is migrate to an SSG
Microsofters Issuing Threats to Microsoft Critics Who Blog About Microsoft
So far we see that their "legal strategy" revolves around trying to discredit people like Theodore Ts'o
 
Openwashing of Windows, Back Doors, Persistent Surveillance, Keyloggers, Screen Loggers, DRM and So On
WSL is not "Linux", it's Windows
New 'Interview' With - or Talk Coverage of - Richard Stallman in the European Union
automated English translation
IBM Mass Redundancies Likely This Coming Thursday
We're not in a position to judge if that's true or false
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Monday, May 19, 2025
IRC logs for Monday, May 19, 2025
Microsoft a Top Sponsor at Red Hat Summit (IBM Selling Proprietary Spyware and Back Doors in a "Red" Trench Coat)
They both work for Microsoft
The Official SUSE Blog Uses LLM Slop to Compose Fake Articles Promoting Microsoft and Azure
even a little slop spoils the broth
Links 19/05/2025: Charges of Blackmailing Over Son Heung-min, Chad Opposition Leader Detained
Links for the day
Gemini Links 19/05/2025: Ableism, Silicon Monkeys, and More
Links for the day
Links 19/05/2025: Political Catchup and CISA Advisories
Links for the day
TheLayoff.com Has Begun Deleting Trolls/AstroTurfers Infesting the IBM Section to Discourage On-Topic Discussion About Culls and Maladministration (Bad Strategy)
Moderators have realised there's a problem
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Sunday, May 18, 2025
IRC logs for Sunday, May 18, 2025
Gemini Links 18/05/2025: Five Years on Gemini and Atom Feeds over Gopher
Links for the day
Links 18/05/2025: F.D.A. More Sceptical of COVID-19 Vaccines, UK Charges 3 Iranian Nationals In Alleged Attack Plot Against Journalists
Links for the day
Gemini Links 18/05/2025: "Finally Upgraded" and "Rebooting"
Links for the day
There Are Days or Occasions Where gemini:// Requests Almost Exceed http(s):// and Gemini Protocol Isn't Even 6 Yet
Gemini Protocol turns 6 one month from now
Abundance of Good Code, "Just Like Air."
Richard Stallman's seminal manifesto and foundational (practical) work on GNU gave us a very solid system that facilitates productive work without concerns over spyware
Messages in TheLayoff.com Drowned Out by LLM Slop (Comments Focused on Replying to Bot-Generated Provocation)
apparently shaking hands with nazis isn't as bad as calling your git repository's main branch "master"
The Importance of Full Disclosure and Transparency Online
there will be full transparency, as always
Slopwatch: Slopfarms and Serial Sloppers Still at It
Apparently Google is too understaffed to figure that out
Links 18/05/2025: Decreased Prospects of Science Careers, Disappearance of Journalists
Links for the day
Microsofters Have a Long History Trying to Take Down Techrights by Sending Threats to Webhosts
picking on women
Links 18/05/2025: Science, Censorship and European Commission Taking on Monopoly Abuse by Microsoft
Links for the day
Gemini Links 18/05/2025: Šibenik and SFJAZZ Historical Archive
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Saturday, May 17, 2025
IRC logs for Saturday, May 17, 2025