Bonum Certa Men Certa

The LLM Ouroboros Phenomenon

posted by Roy Schestowitz on May 19, 2025,
updated May 19, 2025

An ouroboros in a 1478 drawing in an alchemical tract

Ancient Greek mythology came up with this concept of an ouroboros, wherein some animal - typically a snake for a feasible "IRL" (in real life) metaphor - eats itself by eating its own tail. We would not be the first to point out the analogy here for LLMs because an ouroboros is a good parable. This morning we catalogued two BSD and Linux sites complaining about desperate LLM scrapers staging a DDoS attack in pursuit of original, as in human-written, code or words. This isn't a new problem for us and in the past few days we served about half a million pages in Gemini Protocol, likely due to to LLM scrapers. It's obnoxious to say the least, but distinguishing benign from malicious (or worthless junk) requests is hard and a "moving target" (it's never enough as parasites learn to adapt).

This morning in IRC we made an assertion about LLMs and fake (slop) images. We also made several observations. Fact #1: over time slop gets worse (training set is like some blurry JPEG). Fact #2: People's "smell" for slop improves over time, as they 'train' on slop and can detect it based on prior encounters. Put 1 and 2 together.

Are LLMs bound to not only get worse but also more easily detectable by an increasingly sceptical general public? TheLayoff.com has just responded to this.

An associate opines that fact #1 (that slop gets worse over time) is exacerbated by the flood of slop on the Net being snarfed up by newer bots and mistaken for training data. "Thus the feedback loop I mentioned a long time back and which Andy wrote about in depth." (He was referring to Dr. Farnell's good writings about this dilemma - as he did several times in The CyberShow's blog)

To a certain extent my Ph.D. thesis (dissertation) covered this about two decades ago. The associate says that it's a "well-known problem from days of old".

There are several unique aspects to this, including validation bias. To me it seemed a bit related to but not the same as over-training because, as an associate explains, "overtraining is something else: too much data and the patterns become locked too tightly to the training set and less useful for new data".

For an LLM to scan online its own output serves to affirm the mistakes, or the errors, often euphemised as mere "hallucinations", which are innocent, not libellous, and by no means "intentional" and "harmful". Dr. Farnell and Dr. Kate Brown responded to this last October in "Radical disbelief and its causes".

In the context of my thesis (dissertation), a concern was raised about what we back then called "synthetic data" finding its way "back" into the training set. So when you check brain MRI scans (which is what we did back then) you must ensure you only ever deal with real data, not mock or manipulated data that can confirm your own biases and "fit into" the model that generated it in the first place (in generative mode). To use the analogy of text-based LLMs, your BS is "truth" if your input is your own BS (output/s) and it would be deemed accurate, based on you (opposite of the notion of peer review in science). The associate correctly points out, based on a scan of my thesis (dissertation), that the strings "overtraining" and "over-training" are not in the dissertation, but we used different terms back then.

A squat toilet (also known as an Eastern, Turkish, Iranian or Natural-Position toilet). This one is in Turkey

"An LLM Ouroboros of shit", as the associate dubs it, would be statistical models (such as PDMs or AAMs*) treating computer-generated images as something from "the real world".

The so-called "generative hey hi" (genAI) "bros" won't allow the media to talk about such issues, at least if they can downplay the issues and deny/misportray them (in the media). But it's a real and growing problem. Its magnitude likely grows quadratically, not linearly. Just like other bubbles (overabundance based around hype), don't expect linear implosions. When it's gone (poof!), it's gone.

____

* PDM and AAM need expansion in the explanatory sense, not just words (in the acronyms). PDMs go back several decades ago they were invented or pioneered by the people who tutored me. They use mathematical, statistical models to perform multidimensional analysis of data variations, based upon principal component analysis (PCA). AAMs are an extension but with textures, not only points. This is really old stuff; even AAMs are over 23 years old; now the mainstream media pretends those are some kind of "revolution".

Other Recent Techrights' Posts

Microsoft May Already Be Shutting Down More Gaming Studios
the writings are on the wall: XBox is in disarray.
 
European Patent Office (EPO) Series: An Advisor to the President
he had recently advanced to membership of the "inner circle" of Team Campinos.
Two Weeks Ahead of July Three Studios Microsoft Plans to Shut Down Already Named
This is what happens when companies try to establish themselves on a mountain of promises and false assumptions, kicking the can down the road until payroll becomes hard to complete
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Monday, June 15, 2026
IRC logs for Monday, June 15, 2026
IBM Works for Microsoft
Hours ago in IBM.com
European Patent Office (EPO) Series: The EPO's Brussels Liaison Officer
It would appear that in January 2020, Pellegrino was induced by Campinos to jump ship from the EUIPO and take up his current position as Brussels Liaison Officer for the EPO
European Patent Office (EPO) Receiving Section (RS) and Elimination of Many Roles
Open letter to Mr Rowan (VP1) and Mr Aledo Lopez (COO) [...] Does the EU leadership intend to tolerate this?
Microsoft's XBox is Disintegrating, Executives Are Quitting
We're basically witnessing the slow-motion "end of XBox"
Gemini Links 15/06/2026: Slop Code Benchmarked, Wireguard on NixOS and Guix
Links for the day
Links 15/06/2026: More Own Goals for the Slop Industry, Palantir Trouble in UK
Links for the day
Apple Wants Everybody to Forget About "Vision Pro" Because It Was a Giant Flop
worthless gadgets with no obvious use case/s
The Cyber Show is Adopting 'Book Form' (or Long Form Publications)
Andy and Helen nowadays invest more time in making their site faster
Richard Stallman's Software Freedom/Digital Sovereignty Tour in Europe
As things stand at present, the vast majority of people have their interactions controlled/policed by GAFAM
Estimates of Scale of Microsoft Layoffs, Will Likely Happen "in Batches"
"Heard 10 to 15 percent eventually but idk date."
IBM Has Put Red Hat on a Poor Diet of Slop, Now Fedora and Red Hat Suffocate or Choke on It
Over the weekend we saw more people leaving the company
Estimates of Microsoft Layoffs: 3,000 Staff to be Culled Just in Gaming, How Many in Other Divisions?
Now the XBox division has its own "fall guy", but it is a woman
Straw Man Arguments Against Rust
If anything, it teaches the importance of auditing packages
Tesla Debt Rose Sharply, Sales Declined, Wall Street's Claim of Tesla "Value" is Merely a Fairytale (and Not Just Tesla)
We would gladly sell land on Mars to anyone who honestly believes a company that loses money is somehow "worth" trillions in Wall Street
Stop Calling Losses "Investment"
XBox is losing money, it is a sinkhole
For Justice We Need More Speech, Not Less Speech
When you attack something you are just giving that something a bigger platform
SLAPP Censorship - Part 107 Out of 200: Keeping Law Accessible to Everybody
We'll have stories related to this in the future
Links 15/06/2026: Slop "Beg Bounties", Wall Street Fakes 'Worth', and Arkansans Saved PBS
Links for the day
Gemini Links 15/06/2026: Dating Oaks, Simulation, and Theremin
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Sunday, June 14, 2026
IRC logs for Sunday, June 14, 2026
Links 14/06/2026: Energy Cost and Reality Strikes at Heart of Slop Bubble, 75 Data Center Build-outs "Successfully Blocked"
Links for the day
Microsoft CEO Says XBox is Not a Sustainable Business
"Now, we have to turn this into a sustainable business," he said about XBox
MElon (MUSK, Elon) is a Trillionaire Like Penguins Are Mammals
Have media outlets told the truth?
Unlikely Heroes
One personal hero who is not alive (anymore) is Navalny
Bruce Schneier Was Probably Wrong About Slop
Right now politicians who openly speak in favour of slop are committing "political suicide"
SLAPP Censorship - Part 106 Out of 200: 100 Kilograms of Legal Papers
When one party's communications and filings weigh at about 3 KG of paper and another's... at about 100 KG of paper
Links 14/06/2026: More Google Layoffs, Wall Street Deems Companies That Lose Money "Worth" Trillions
Links for the day
Gemini Links 14/06/2026: "The Universe is a Hologram", "Matrix Brain Download", and "Happy 0th Year"
Links for the day
European Patent Office (EPO) Series: Battistelli's "Baltic Crusader"
Gilles Requena, Battistelli's erstwhile "Baltic Crusader" and the loyal servant of his successor Campinos
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Saturday, June 13, 2026
IRC logs for Saturday, June 13, 2026