Bonum Certa Men Certa

The LLM Ouroboros Phenomenon

posted by Roy Schestowitz on May 19, 2025,
updated May 19, 2025

An ouroboros in a 1478 drawing in an alchemical tract

Ancient Greek mythology came up with this concept of an ouroboros, wherein some animal - typically a snake for a feasible "IRL" (in real life) metaphor - eats itself by eating its own tail. We would not be the first to point out the analogy here for LLMs because an ouroboros is a good parable. This morning we catalogued two BSD and Linux sites complaining about desperate LLM scrapers staging a DDoS attack in pursuit of original, as in human-written, code or words. This isn't a new problem for us and in the past few days we served about half a million pages in Gemini Protocol, likely due to to LLM scrapers. It's obnoxious to say the least, but distinguishing benign from malicious (or worthless junk) requests is hard and a "moving target" (it's never enough as parasites learn to adapt).

This morning in IRC we made an assertion about LLMs and fake (slop) images. We also made several observations. Fact #1: over time slop gets worse (training set is like some blurry JPEG). Fact #2: People's "smell" for slop improves over time, as they 'train' on slop and can detect it based on prior encounters. Put 1 and 2 together.

Are LLMs bound to not only get worse but also more easily detectable by an increasingly sceptical general public? TheLayoff.com has just responded to this.

An associate opines that fact #1 (that slop gets worse over time) is exacerbated by the flood of slop on the Net being snarfed up by newer bots and mistaken for training data. "Thus the feedback loop I mentioned a long time back and which Andy wrote about in depth." (He was referring to Dr. Farnell's good writings about this dilemma - as he did several times in The CyberShow's blog)

To a certain extent my Ph.D. thesis (dissertation) covered this about two decades ago. The associate says that it's a "well-known problem from days of old".

There are several unique aspects to this, including validation bias. To me it seemed a bit related to but not the same as over-training because, as an associate explains, "overtraining is something else: too much data and the patterns become locked too tightly to the training set and less useful for new data".

For an LLM to scan online its own output serves to affirm the mistakes, or the errors, often euphemised as mere "hallucinations", which are innocent, not libellous, and by no means "intentional" and "harmful". Dr. Farnell and Dr. Kate Brown responded to this last October in "Radical disbelief and its causes".

In the context of my thesis (dissertation), a concern was raised about what we back then called "synthetic data" finding its way "back" into the training set. So when you check brain MRI scans (which is what we did back then) you must ensure you only ever deal with real data, not mock or manipulated data that can confirm your own biases and "fit into" the model that generated it in the first place (in generative mode). To use the analogy of text-based LLMs, your BS is "truth" if your input is your own BS (output/s) and it would be deemed accurate, based on you (opposite of the notion of peer review in science). The associate correctly points out, based on a scan of my thesis (dissertation), that the strings "overtraining" and "over-training" are not in the dissertation, but we used different terms back then.

A squat toilet (also known as an Eastern, Turkish, Iranian or Natural-Position toilet). This one is in Turkey

"An LLM Ouroboros of shit", as the associate dubs it, would be statistical models (such as PDMs or AAMs*) treating computer-generated images as something from "the real world".

The so-called "generative hey hi" (genAI) "bros" won't allow the media to talk about such issues, at least if they can downplay the issues and deny/misportray them (in the media). But it's a real and growing problem. Its magnitude likely grows quadratically, not linearly. Just like other bubbles (overabundance based around hype), don't expect linear implosions. When it's gone (poof!), it's gone.

____

* PDM and AAM need expansion in the explanatory sense, not just words (in the acronyms). PDMs go back several decades ago they were invented or pioneered by the people who tutored me. They use mathematical, statistical models to perform multidimensional analysis of data variations, based upon principal component analysis (PCA). AAMs are an extension but with textures, not only points. This is really old stuff; even AAMs are over 23 years old; now the mainstream media pretends those are some kind of "revolution".

Other Recent Techrights' Posts

Don't Use the Future Tense to Discuss the Slop Bubble
Wall Street does not react to reality; it reacts to panic, which is related to expectations
The Broken Window Industry and Its Ongoing Desires to Make Technology Less Dependable
Reliable computing is becoming harder to find
New XBox CEO Typecast in Social Control Media
Microsoft apologists will fall back on (or shuffle between) the "racist" and "sexist" angle
Sites Without JavaScript Deserve Your Visits
We're not arguing that the Web should be as simple or barebones like Gemini Protocol/GemText
EPO Strikes Are Already Working
Campinos is already going "into hiding"
 
IBM Layoffs Definitely Still Happening
Contrary to what some apologists try to say
More and More Projects Quit Microsoft GitHub This Year, XBox Will See the Same
Microsoft GitHub's embrace of slop as "strategic" gives us a clue of what'll happen to XBox very soon
Google "Intelligence": Despite Slam-Dunk or "Smoking Gun" Proof, Drug Abuse in EPO Leadership is "Unverified Allegations"
Google's slop (so-called 'AI') lacks intelligence
8,000 Pages/Articles Per Year
We're eager to maintain a good production/publication pace and illuminate the sinister attempts to interfere with Freedom of the Press in the UK
Gemini Links 22/02/2026: Okonomiyaki and Midcrunch Crisis
Links for the day
Freedom Means Accepting He or She Who is Different
In the Debian community we're sadly seeing some authoritarian overreach this month
Microsoft Windows Falls to Another New All-Time Low in Guatemala, It is a Bottomless Pit
Maybe users come to realise that Windows means back doors and those doors are open to a regime that ought not be trusted
"XBox" Will Become Slop After Mass Layoffs
When all else fails, "AI it"
Links 22/02/2026: Hardware Price Hikes Across the Board, "Microsoft Issues Statement on Potential Layoffs"
Links for the day
Microsoft "Layoffs Incoming"
This transition isn't about promoting games; it's about canning the console
Links 22/02/2026: "Bloat of Modern Fitness Apps" and Wikipedia Deprecates Archive.today
Links for the day
Our IRC 5-Year Anniversary (for Self-Hosted) is Fast Approaching
A week from now it's March already
Gemini Links 22/02/2026: Dream Job Gone and Slop in Taskwarrior
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Saturday, February 21, 2026
IRC logs for Saturday, February 21, 2026
GNU/Linux Grew a Lot in Nicaragua
We've not noticed until today
Techrights Has Over 1,000 Good Articles 'in the Tank'
Drafts, notes, and lengthy documents
New Article Challenges Solicitors Regulation Authority (SRA) for Choosing the Wrong SLAPP Cases to Investigate
The one point we can agree on is that SRA does not know how to correctly select the worst culprits/offenders
The Brand 'Watsonx' is a Terrible Name for IBM 'Hey Hi' (Chatbots) Because Watson Agreed With Adolf Hitler
Almost a century has passed and IBM still believes that selling "intelligence", chatbots in particular, should be done under the name "Watson"
Why IBM is Still Scary and Dangerous
Keep a distance from "Big Blue" Bully
Measuring the Growth of Our Mission and Community
Something between experiment and prototype
Richard Stallman in the United States - Part III - Georgia Tech Did a Fine Job Upholding Free Speech Principles
The real problem was social control media (toxic)
Debian's Master is Deleting Criticism of SystemD and Other Things (On-Topic and Published by Debian Developers), Resorts to the Excuse Messages Are "Too Long"
Censorship serves nobody except the masters that control this censorship
Digg's Latest Incarnation Already Failed, It's Infested With LLM Slop
Many submissions go to slopfarms and some get summarised by slop
Gemini Links 21/02/2026: Veganism and DeskPi RackMate T0
Links for the day
On The Web, XBox Already a Dying Breed
Down to about 0.05% on large machines, based on statCounter [...] Microsoft will never publicly admit or say how many billions it lost on the XBox
2026 a Year of 'Top-Down' Microsoft Layoffs (Management First)
Stay tuned for what comes next
Your "Likes" Aren't Yours and They're Mostly "Worthless Clicks"
Social hermits are not popular, irrespective of how many "Facebook friends" or "likes" they get
Waggener Edstrom/Frank Shaw Lied, There Are Definitely Microsoft Layoffs
Microsoft never issued a formal statement, it made allusions by proxy
Microsoft-Controlled Media With Embargo and Press Operatives
This won't be the last example of media manipulation for narrative control or face-saving "damage control"
Slop Hype Makes Our Core Technology Less Reliable and Far Less Resilient (We Pay for the Catastrophe That Follows)
Only slop-free projects can be trusted
Going for 1,000 (Days of Uptime)
universal records are vastly better
Firefox is No-Go in China, Not Even 1% "Market Share" Anymore
Given Mozilla's utterly rubbish marketing these days (politics over technical aspects), set aside the cheerleading for slop, there's hardly a chance of Mozilla Firefox reaching or exceeding 10% again
EPO "Cocaine Communication Manager" - Part III - It's in His Eyes
Workers are free to draw their own conclusions
Links 21/02/2026: Tensions Over Iran and Illegal Cheeto Tariffs, Presidential Approval Sags
Links for the day
Links 21/02/2026: "Moving Away From Cloudflare", Many Layoffs or Shutdowns in Games (Including XBox/Microsoft)
Links for the day
GNU Linux-libre is a Grown-Up Today
"before that, every distro that wanted to respect its users' freedom had to remove itself all of the binary blobs that were distributed as part of the kernel Linux's so-called sources"
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Friday, February 20, 2026
IRC logs for Friday, February 20, 2026
Gemini Links 21/02/2026: "The Evil of Action" and Slop Bots Causing Great Harm Online (Not Just the Web)
Links for the day
Like a Shell
Overreactions can backfire
Not Only Leaders of XBox Got Sacked (Layoffs)
Phil Spencer and Sarah Bond got laid off
9PM on a Friday Night: Microsoft Says the Layoffs Are Not Layoffs
We've said for a long time that XBox is doomed this year
Gemini Links 20/02/2026: Misfin Server and Magic in Programming
Links for the day
Former Debian Project Leader Branden Robinson Cautions Against Cover-up and Censorship in Debian
Debian drama. Again.
analytics.usa.gov Reckons Windows "Market Share" Fell to Just 38%, Vista 11 Not Even a Third of Windows Users
This coming summer Vista 11 turns 5
The New Digg.com is Slop
Slop "summaries" and Serial Sloppers are drowning out the site with fake 'articles' (plagiarism)
Linus Torvalds: Bill Epsteingate Good Enough for Me to Wine and Dine With
Torvalds is more connected to Jeffrey Epstein than Richard Stallman ever was
Our Uptimes Are Always Better Than Any Site That Uses Clownflare
Clownflare as a company operates like a cult
GNU/Linux Apparently Rose to 6% in Uzbekistan
If accurate, this represents a new problem for Microsoft and a big win for Software Freedom
Sponsored Videos and 'Articles' in The Register MS, Stenography as a Service/Product
They should more accurately label these actors
It's Friday Again and Many People Leave IBM for Good (IBM Should be Reported for Illegal NDAs That Hide Layoffs)
we very seldom see anyone deviating a lot from the "template-like" narrative, let alone mentioning "layoffs" or "RA" or some other term that implies non-consensual departure
The Little Clique of Sloppers/Spammers About "Linux" Got Even Smaller
Thankfully there are still genuine and legit GNU/Linux sites out there
Links 20/02/2026: Microsoft Intentionally Kills Older Hardware, "The Story of XBox" Shows How Defective Microsoft Hardware Really Was
Links for the day
Turkmenistan One of Many Countries Where Microsoft Fell to Distant Third in Search
We expect many layoffs in Bing some time soon
Don't Wait for "Red Hat Layoffs" Because After Bluewashing They're IBM RAs and Don't Wait for "IBM Layoffs" Because They're Perpetual
IBM layoffs are silent and "forever" (small trickle that never ends and is widespread - after all IBM is a very global and ubiquitous firm)
Links 20/02/2026: Standards, Science, and Politics
Links for the day
What Do People Ever Buy From Microsoft Anyway (Not PCs)?
Microsoft sells two things these days: 1) vapourware/promises. 2) its stock.
Gemini Links 20/02/2026: "Mainstream Unix, Underground Unix", Slop Staging DDoS Attacks Against Small Sites
Links for the day
IBM Inclusivity: Red Hat Summit is for Rich Sponsors Like Microsoft and Rich Guests Who Pay $500 a Day
Nothing signals societal tolerance more than paying a large military contractor
GNU/Linux Adoption is Higher in Richer Countries
Is it because freedom is actually expensive - something that only privileged people can pursue?
Links 20/02/2026: Windows TCO Versus Deutsche Bahn, Europe Seeks More Independent Digital Future
Links for the day
IBM, Red Hat and Fedora: Don't Say "Master", It Offends People. Also IBM, Red Hat and Fedora: "Master Podman".
The hypocrisy at Red Hat and Fedora shows no boundaries
IBM Layoffs Aren't Just in IBM 'Proper'
Who is still using Lotus after the HCL move?
The Register MS Gets Paid by Gartner to Promote a Ponzi Scheme for Gartner, Microsoft, and Others
The credibility of that site will suffer because it tries to sell a major scam to its audience
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, February 19, 2026
IRC logs for Thursday, February 19, 2026