Bonum Certa Men Certa

The Great LLM Delusion - Part IV: Academic Papers as Microsoft Marketing for LLMs

posted by Roy Schestowitz on Jan 31, 2024

Photo capturing two very different architectural styles for student buildings at the UCLA Campus, California

"Strange and misleading article about LLMs," as explained by an anonymous contributor

THIS series coincides with Microsoft hype and vapourware (much-needed distractions). This part is a guest post of sorts, an unedited version of something a reader sent us. The supporting material is in there.

To be clear, you can talk to a chatbot. The chatbot won't talk back to you. It'll just spew out words, some of them merely plagiarised based on words similar to what you said (which the chatbot does not grasp, it's more like a Web search, except there's no attribution/link to source). Chatbots might lower the bar for journalism - to the point where Web as a whole will lose legitimacy, trust, value etc. Then what? Going back to physical libraries? Saying you can compose a physical book using a chatbot is like saying you can make a very large meal by assembling trash and cooking parts of it (LLMs are "digital pagpag"). Given reports like "Scammy AI-Generated Book Rewrites Are Flooding Amazon", this is already a real problem. "These 'AI' stock increases based on fake increases in revenue," an associate has remarked, "appear funded by mass firings to appease the LARPers in the financial community, That can only go on so long before they run out of people to take care of the core income-generating activities, a line which I suspect they have already crossed."

Will Microsoft also start spewing out "papers" or "publication" made by its chatbots, in order to generate hype about chatbots? That probably would not work, as the quality would not meet basic criteria.

Without further ado, here is the contributor's message:


I stumbled upon a recent article you may find curious.

While reading comments on a post at Bruce Schneier's blog, I saw a user who posted the following link as a kind of "proof" that conversations with LLM-based chatbots can be "useful" and "interesting."

Of course, it sparked my interest at first, but as I started reading it, red flags started to pop up here and there.

I do not know much about Quanta Magazine's credibility. At first, I thought that it was some semi-crackpot pop science news site, but after a shallow search, I saw a good rank from a fact-checking site.

The article was published on January 22, 2024, and the research it discussed was released on October 26, 2023. May be it does not mean much—just a few months—but it is a bit suspicious that the research paper is apparently not peer reviewed (just published on arXiv and cited in ~2–3 sources), and the article about it came out in parallel with "AI" swindle failure unraveling.

It seems like the article is desperately trying to spark new interest in readers regarding LLMs and chatbots, saying that there is some evidence that there is "much more than just autocomplete."

Following are some dubious parts.

1. The article talks about the "understanding" of something by LLMs but presents no clear definition of it.

The thing that can pass as a semi-definition (from the research paper)—"combinations that were unlikely to exist in the training data"—is, in my opinion, misleading for ordinary people. Much like other misnomers in the field (e.g., "hallucinations").

I guess it may be suitable to talk about "competence" instead, as in the "competence without comprehension" phrase from Dennet's writings.

2. The paper described in the article seems to support (or go in the direction of) the vague idea that if you shovel a lot of data and complexity into "AI" (LLM in this case), then "something" will emerge ("skills" and "ability to generalize" in this case, as stated in the paper and researcher's comments in the article). I find it concerning.

3. "Research scientist at Google DeepMind" among the authors of the paper, so it is probably not clearly independent (from corporate influence) research.

4. “[They] cannot be just mimicking what has been seen in the training data,” said Sébastien Bubeck, a mathematician and computer scientist at Microsoft Research who was not part of the work. “That’s the basic insight.”

Wait, what? Why is this part inserted in the article at all? Some guy from Microsoft is eager to tell us that LLMs are "something more." No bullshit. What a surprise!

5. The paper starts with this passage: "With LLMs shifting their role from statistical modeling of language to serving as general-purpose AI agents..."

I mean, what the fuck?! LLMs are not "shifting" anywhere; they are poorly shoehorned into use cases where a "general-purpose AI agent" is required (whatever it is, it does not exist in our reality anyway) by people who want to reap profits from selling half-assed "products" based almost entirely on lies! LLMs are definitely not suitable for general-purpose tasks other than text manipulation or some kinds of entertainment where facts, preciseness, and responsibilities do not matter at all.

One of the researchers acknowledges that it is not about accuracy.

"Arora adds that the work doesn’t say anything about the accuracy of what LLMs write. “In fact, it’s arguing for originality,” he said. “These things have never existed in the world’s training corpus. Nobody has ever written this. It has to hallucinate.”

I need to make it clear: I have no competence to review the actual paper; this task requires actual experts in the field.

As far as I understand the paper, the researchers devised some abstractions to describe observations they already made and try to construct a method that would be useful to work with their definitions and hypotheses that have a little in common with laymen's definitions (e.g., for terms like "understanding" and "creativity") and perceptions of the matter.

I tried to read the paper with an open mind to avoid at least some obvious biases. I have no problems with the paper; maybe it is actual useful research that will serve to advance the field (and not the companies of con artists)—I cannot say for sure.

What bothers me are the misnomers, misleading, and vague terms and descriptions in the paper (less) and the article (a great deal) based on it. In my opinion, the article commits the crime of severely misinforming the reader.

Other Recent Techrights' Posts

IBM Cannot Even Do Payroll, Now a "Legitimate Target" of Iran
Missiles or not, it seems like IBM systems will be targeted more by cybercriminals
Microsofters' SLAPP Censorship - Part 10 Out of 200: Showing Public Tweets is Not a Privacy Violation, But This Isn't About Justice, It's About Censorship
It's time to put a stop to this abuse of process (which is what the Judge deemed it to be last year)
 
European Qualifying Examination (EQE) Being Reduced to Pieces of Papers One Can Buy, Patent System Rapidly Losing Its Legitimacy
Welcome to the "new Europe"
Priorities in 2026
2026 is an interesting year
Willis Towers Watson (WTW) Producing More Propaganda for EPO "Cocaine Communication Managers"
The Local Staff Committee The Hague (LSCTH) has this new paper about Willis Towers Watson (WTW) and its annual EPO-sponsored propaganda, pretending all is well when things are clearly dire
Head of Microsoft Office and Microsoft 360 is Leaving Microsoft Amid Problems and Mass Layoffs
Microsoft is like a "legacy" company
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, March 12, 2026
IRC logs for Thursday, March 12, 2026
Gemini Links 13/03/2026: "Someone to Take Over Antenna" and Random Seed/RNG
Links for the day
By Expanding to Advocacy of Ponzi Schemes and Bill Epsteingate (Sex Trafficking), Linux Foundation Revenue Grew to $220,730,594, But Salary of Linus Torvalds Not Even in Top 10 Anymore!
true!
In the Name of Transparency, Today We Show Our Defence and Counterclaim
already uploaded by the other side
Links 12/03/2026: Heating Bills to Soar, "Banks in Gulf Evacuate Their Offices"
Links for the day
Gemini Links 12/03/2026: On Phone Anxiety and Bjorn "Looking for Someone to Take Over Antenna"
Links for the day
Cultification: best candidates avoiding Debian leader elections
Reprinted with permission from Daniel Pocock
Richard Stallman (RMS) et al Cited in 'Nature' (Journal/Site) Today, "CODE beyond FAIR"
Under Open Access
The Register MS, on Verge of Collapse, Keeps Promoting a Ponzi Scheme for China
Publishers that participate in this simply don't care about their readers
Overview of False Narratives and Lies Used to Lower Salaries at the European Patent Office (EPO), Abandoning Patent Quality and the EPC
Many of the latter slides are the same as Munich's
Links 12/03/2026: Atlassian Layoffs, GAFAN Covering up Slop-Induced Outages, "Age-verification in Operating Systems and the Internet"
Links for the day
The EPO's President, Who Covers Up Cocaine Use, is Trying to Suppress Communication Between EPO Staff Under the Guise of 'Privacy' (and in Defiance of a Court Ruling)
Why does Europe's second-largest institution: 1) curtail communication among staff (including union) and 2) go out of its way to avoid obeying a court order from ILOAT in Geneva?
Exactly One Week Before Next EPO Strike, Media Intentionally Not Mentioning EPO Strikes
One form of propaganda technique/s involves the systematic suppression of certain topics, or of particular "narratives"
Suicide of disgruntled employee? Bus fire at Kerzers / Chiètres, Switzerland, at least six dead
Reprinted with permission from Daniel Pocock
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, March 11, 2026
IRC logs for Wednesday, March 11, 2026
Gemini Links 12/03/2026: "on Urbit" and the True Cost (or Criticism) of "Social Control Media"
Links for the day
Slop About "linux" in Google News
Once people recognise that those sites are fake it's hard to 'unsee' what they are
An American War on GNU/Linux, Software Freedom, and British Investigative, Science-Based Reporting - Part V - Attempts to Take Down and Suppress Criticism of Back Doors Controlled by Microsoft and the American Government
The cost of maintaining illusions
IBM's Payroll: Cannot Even Pay the People What They're Legally Entitled to
How financially-stressed is IBM at this point?
Slides From the European Patent Office (EPO) Explain Why They're Striking, How They're Striking, and What Comes Next
A week from now the strike will go ahead
GAFAM Datacentres Are Facilities of War, So Risk of Downtime by Missiles or State-Sponsored Cracking Has Vastly Increased
How safe is your business in "clown computing" or DCs marked as some "legitimate targets" at wartime?
Companies That Take Away Blood and Sweat From the Community to Sell a Ponzi Scheme to Everybody
We need Free software that is run by communities
1,234 People Gather Online to Plan Next EPO Strikes and Other Industrial Actions
yesterday an online gathering orchestrated the next moves by EPO staff
Links 11/03/2026: Fake Videos Swarm YouTube, "Ukraine Can Now Manufacture ‘China-Free’ Drones"
Links for the day
Gemini Links 11/03/2026: Lagrange for iOS and Android and "Turning a Folder of Git Repos Into Project Launcher"
Links for the day
Kafkaesque: Unlawful Activities in the UK to Cover Up Unlawful Activities in the United States of America
Why is bribery and even extortion seen is OK? Because rich people do those things?
Former IBM Executive, Ron Hovsepian, Doomed S.u.S.E. (SUSE)
SUSE is like a child nobody wants to raise
Quiet Layoffs or Silent Layoffs Alleged at Microsoft
Will some investigative journalists do their job now and ask Microsoft tough questions?
After a Long Lull LinuxTeck (linuxteck.com) Came Back Only as a Slopfarm
Unlike Linuxiac, LinuxTeck wasn't very active in recent years
Links 11/03/2026: EPO and USPTO Software Patents Thrown Out Again, Copyright Concerns Over Slop (Plagiarism Using Buzzwords)
Links for the day
Microsofters' SLAPP Censorship - Part 9 Out of 200: 5RB Barrister Does Not Even Know the Name of His Own Client (That He Was Paid Well Over $200,000 to 'Speak' or 'Cover' for)
If you assault women in the United States, there's a barrister available for you in the UK
IBM's Fedora is Now Led by GAFAM Slop
The official word of Fedora is partly slop
IBM 'Dinobabies' Speak Out
"They want newbies out of school at a much cheaper rate"
Links 11/03/2026: "Drill, Baby, Drill" and Social Control Media Recognised as Threat to Democracy
Links for the day
5 Years Since Freenode Conflict
IRC isn't going away
A Week Ahead of Next EPO Strike the Staff Representatives Show the Administrative Council That the Office Lost the Best Staff, It's No Longer Attractive
the message circulated regarding the open letter to the Administrative Council
Jeff Bezos as an Individual Said to Have Enough Capital to Buy IBM
Assuming a market capitalisation of 234.70 billion
Starting Soon: Another New Series About Richard Stallman
There are some inside stories we can tell
Gemini Links 11/03/2026: School, Code Slop, and "Fancy Weapons"
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Tuesday, March 10, 2026
IRC logs for Tuesday, March 10, 2026