Bonum Certa Men Certa

The LLM Ouroboros Phenomenon

posted by Roy Schestowitz on May 19, 2025,
updated May 19, 2025

An ouroboros in a 1478 drawing in an alchemical tract

Ancient Greek mythology came up with this concept of an ouroboros, wherein some animal - typically a snake for a feasible "IRL" (in real life) metaphor - eats itself by eating its own tail. We would not be the first to point out the analogy here for LLMs because an ouroboros is a good parable. This morning we catalogued two BSD and Linux sites complaining about desperate LLM scrapers staging a DDoS attack in pursuit of original, as in human-written, code or words. This isn't a new problem for us and in the past few days we served about half a million pages in Gemini Protocol, likely due to to LLM scrapers. It's obnoxious to say the least, but distinguishing benign from malicious (or worthless junk) requests is hard and a "moving target" (it's never enough as parasites learn to adapt).

This morning in IRC we made an assertion about LLMs and fake (slop) images. We also made several observations. Fact #1: over time slop gets worse (training set is like some blurry JPEG). Fact #2: People's "smell" for slop improves over time, as they 'train' on slop and can detect it based on prior encounters. Put 1 and 2 together.

Are LLMs bound to not only get worse but also more easily detectable by an increasingly sceptical general public? TheLayoff.com has just responded to this.

An associate opines that fact #1 (that slop gets worse over time) is exacerbated by the flood of slop on the Net being snarfed up by newer bots and mistaken for training data. "Thus the feedback loop I mentioned a long time back and which Andy wrote about in depth." (He was referring to Dr. Farnell's good writings about this dilemma - as he did several times in The CyberShow's blog)

To a certain extent my Ph.D. thesis (dissertation) covered this about two decades ago. The associate says that it's a "well-known problem from days of old".

There are several unique aspects to this, including validation bias. To me it seemed a bit related to but not the same as over-training because, as an associate explains, "overtraining is something else: too much data and the patterns become locked too tightly to the training set and less useful for new data".

For an LLM to scan online its own output serves to affirm the mistakes, or the errors, often euphemised as mere "hallucinations", which are innocent, not libellous, and by no means "intentional" and "harmful". Dr. Farnell and Dr. Kate Brown responded to this last October in "Radical disbelief and its causes".

In the context of my thesis (dissertation), a concern was raised about what we back then called "synthetic data" finding its way "back" into the training set. So when you check brain MRI scans (which is what we did back then) you must ensure you only ever deal with real data, not mock or manipulated data that can confirm your own biases and "fit into" the model that generated it in the first place (in generative mode). To use the analogy of text-based LLMs, your BS is "truth" if your input is your own BS (output/s) and it would be deemed accurate, based on you (opposite of the notion of peer review in science). The associate correctly points out, based on a scan of my thesis (dissertation), that the strings "overtraining" and "over-training" are not in the dissertation, but we used different terms back then.

A squat toilet (also known as an Eastern, Turkish, Iranian or Natural-Position toilet). This one is in Turkey

"An LLM Ouroboros of shit", as the associate dubs it, would be statistical models (such as PDMs or AAMs*) treating computer-generated images as something from "the real world".

The so-called "generative hey hi" (genAI) "bros" won't allow the media to talk about such issues, at least if they can downplay the issues and deny/misportray them (in the media). But it's a real and growing problem. Its magnitude likely grows quadratically, not linearly. Just like other bubbles (overabundance based around hype), don't expect linear implosions. When it's gone (poof!), it's gone.

____

* PDM and AAM need expansion in the explanatory sense, not just words (in the acronyms). PDMs go back several decades ago they were invented or pioneered by the people who tutored me. They use mathematical, statistical models to perform multidimensional analysis of data variations, based upon principal component analysis (PCA). AAMs are an extension but with textures, not only points. This is really old stuff; even AAMs are over 23 years old; now the mainstream media pretends those are some kind of "revolution".

Other Recent Techrights' Posts

How to Identify Demonisation or Dehumanisation Tactics Against Interesting Figures or Luminaries in Free Software
Rather than in general or generally in technology
We Should Learn From Bulgaria
Why can't European companies and government recognise and react to a threat (when they see one)?
Canonical: Ubuntu is GAFAM (US), We're Resellers of American Proprietary Software
They want people to pay for a licence
Links 03/02/2026: "Distraction is a Sin" and Fake "Encryption" (Surveillance With Good Marketing)
Links for the day
 
BillBC (BBC) Covered Up Pedophilia, Now It's Covering Up for Its Sponsor Bill Gates by Reprinting His Lies, Which His Own Wife Disputes
Is Bill Gates having orgies (group sex)?
Phoronix Swims With the Real Trolls, People Who Fancy Proprietary Software and Back Doors
If Larabel begins to actively participate in provocation with the "Microsoft GitHub fans club", what does this tell us about Phoronix?
They Know Microsoft Layoffs Are About to Hit Them Hard
The gaming division at Microsoft is a complete catastrophe, lots of money (debt) down the drain [...] Buying Activision was all about misleading shareholders or hiding the deep trouble/problems XBox was having
Red Hat is Not a Linux Company, It's IBM's Ponzi Scheme Enabler
Had we still been stuck in 2021, perhaps IBM would plaster "NFT" or "metaverse" all over RedHat.com
Keep Grinding
"Don't let the bastards grind you down"
Mobbing at the European Patent Office (EPO) - Part III - Who's Going to Pay for the EPO's Corruption? (Aside From European Citizens)
Some people inside the EPO reached out to us
"Investors Are Concerned About an AI Bubble" (That GAFAM and IBM Ride)
A few decades from now IBM will only be remembered in the same sense many so-called 'AI' companies will be remembered
EPO Staff Union: "Very High Strike Participation on Friday 30 January", Another Strike Starts 19 Days From Now
EPO management in a bit of a panic
Censorship/Free Speech and Social Control Media
It's important to have a grasp of how contemporary censorship works and how to tackle it
Google News as Slop Booster
this is what Google links to
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Tuesday, February 03, 2026
IRC logs for Tuesday, February 03, 2026
Gemini Links 04/02/2026: "Raspberry Pi Relaxes the Rules for Its RP2040 Hacking Challenge" and "Long Web Society"
Links for the day
IBM Falls by Over 10%
a recipe for disasters like accounting fraud
Links 03/02/2026: Windows Copies GNU/Linux, Windows TCO Shown Again
Links for the day
Gemini Links 03/02/2026: Alhena Turns One, Slop Rejected, and Max Roy Carrouges Recalled
Links for the day
Dr. Andy Farnell on Why and How European Authorities Can Adopt Free Software, Parenting in the Age of Digital Abundance
Will Europe use technology that Europe controls (not the hegemon), for a change?
Seems Like IBM Trolls Use Chatbots to Vandalise Platform That Discusses IBM's Secret Layoffs, Forever Layoffs
Not for the first time either
You Know Your Company is Dead or Basically a Pyramid Scheme When Jim Cramer Keeps Promoting Its Stock
How much does IBM pay for "puff pieces" or "fluff" about QC?
Red Hat (Under IBM) Works for Microsoft (Proprietary Software) and Slop
Yesterday Red Hat's official site, redhat.com, published exactly 5 new blog posts
IBM is Dying (More Layoffs), Red Hat Will Continue to Suffer From the Acquisition
Financial engineering
Colombia Adopting GNU/Linux Even Faster (at Microsoft's and Apple's Expense)
Do politics play any role in this?
An Effort to Tackle Slavery in 'Open Source' Clothing
"a civil rights lawsuit to examine the concerns of censored developers in the free, open source software ecosystem"
$15 billion lawsuit: Ubuntu, Google & Debian crowdfunding campaign launch
Reprinted with permission from Daniel Pocock
The Solicitors Regulation Authority (SRA) Delusion - Part II - Why We Need to Expose the SRA to More Daylight, Public Scrutiny
SRA is neither effective nor regulated
400-Page US Federal Court Against Abuses by Google, Microsoft and Front Groups That Abuse Volunteers for American Corporations
There are 386 pages in total (in the US claim)
Corporate Influence Never Impacted Us
There's no reason to assume we'll ever "sell out"
Growth of GNU/Linux in Cuba
Right now a lot of the world drafts or already implements a GAFAM exit plan
A Day After EPO Strikes an Escalation to Heads of Delegations to the Administrative Council
They rely on the European media playing along, helping them to hide major blunders, even crimes
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Monday, February 02, 2026
IRC logs for Monday, February 02, 2026
Gemini Links 03/02/2026: Stargazing, Development Boards, and Tcl/Tk Slop
Links for the day
Microsoft Lost 20% of Its Money in the Past 6 Months
Microsoft is hiding what's really happening while mocking critics
Great News, IBM 'Gained' Almost 10% in "Goodwill" Value After Firing Tens of Thousands in 2025
"goodwill" will be inflated despite IBM staff getting sick of IBM
Americans Move to GNU/Linux
some of the biggest American populations
I Still Like Drawing and Various Other Arts (They Help My Activism and Journalism), Slop is an Enemy of Creative People
Recognise that slop isn't intelligence; it's a generational excuse for plagiarism and privatisation of not only the Commons but also proprietary knowledge (without authorisation)
Carmen-Lisandrette Maris (Mission:Libre) Explains to Adolescents and Young Adults How Free Software Improves Privacy
Based on what we've seen and read, Mission:Libre has a solid grasp of Software Freedom
Chatbots Didn't Do Any Good for Microsoft
Google "AI" = search + copypasta
Links 02/02/2026: Cultural Cleansing by China and 'Living Behind Firewalls" in Iran
Links for the day
GNU/Linux Measured at More Than 4% in Russia
growing adoption of GNU/Linux in Russia
Gemini Links 02/02/2026: Stages of Age, Workflows, and Counting Capsules
Links for the day
Oracle's Debt Rose Over 20 Billion Dollars in Just 3 Months
Is "hey hi" becoming a synonym for debt?
Oligarchs' 'Speech Zones' Are Not the "Public Square"
The apologists of social control media, including press that got "addicted" to such fake "media", are helping dictators and oligarchs grab the public attention away from the real press
IBM Misleads and Gaslights Investors With Slop Sold as "AI" (the Business is Waning, Mass Layoffs Continue)
People who do this are dishonest. They should not be put in charge.
Links 02/02/2026: 'Melania' a Horror Movie "Will They Inherit Our Blogs?"
Links for the day
Doing More Detailed Series (Long-Form Works)
Long readings or book-like reading binges are only possible when parts are suitably labeled (name and numbers) if not interlinked
Mobbing at the European Patent Office (EPO) - Part II - Racism, Cocaine Use and White-Collar Corruption
When you hire people illegally, to work for cocaine users and keep quite about the cocaine use, what will be the impact on the reputation of an institution?
A Can of WORMS - Part II - Darkening the Name of RMS, Associating It With Crime
Beware projection tactics
Submit Your Suggestions for EU's Embrace of Software Freedom by Tomorrow
Time to leave GAFAM (US) hegemony behind
Slopless Weekend
This is not sustainable
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Sunday, February 01, 2026
IRC logs for Sunday, February 01, 2026