Bonum Certa Men Certa

Richard Stallman Explains Stochastic Parrots (LLMs)

posted by Roy Schestowitz on Sep 11, 2024

Enhanced vintage colour illustration with a transparent background

From his latest talk:

There are various kinds of artificial intelligence programs that within specific domains can understand a problem and can understand how to get the correct answer for that kind of problem at least usually as often as humans can do it or more but one kind of program which is not intelligence is the Large Language Models because intelligence implies understanding and those programs generate output but they have no idea what the output means. They're thinking of word usage only. And so we shouldn't be surprised that they generate statements that are false very often or even statements that are almost nonsensical. They're grammatical but they don't mean anything. They present imaginary fictitious events as if they were real, and yet they are being called artificial intelligence and most people on seeing that assume that the output of these programs can be believed. But it can't be. They have no idea of what's true. They don't understand the statements they generate an so we shouldn't call them A I and I never do. Sometimes I call them bullshit generators. Bullshit is defined as generating statements, producing statements with indifference to their truth or falsehood.

Of course if you can't understand truth and falsehood, you can't be anything but indifferent to it. And that's what those programs are like. There are also humans that output bullshit who are presumably capable of understanding whether they're true or not but don't care. For instance, Trump. [applause]

But I suppose as a human being, he would be capable of caring about the truth of his statements if it ever occurred to him to care. You know Trump has no heart but he still needs a defibrillator. But a program that can't have any idea of what is true certainly can't care. So we know that those bullshit generators are not intelligence they can't understand. So, moving on from that, one thing we can see is that a web site should never use a bullshit generator to do any job that depends on accuracy or validity or truth, because it's going to go wrong and more often than you might think. So it's a very bad thing to change a web site to be so-called smarter by having it pass what you tell it through a bullshit generator or having it pass what other people have published through a bullshit generator and giving you a summary that might be total nonsense.

It'll be written in good English though.

Other Recent Techrights' Posts

Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Saturday, November 23, 2024
IRC logs for Saturday, November 23, 2024
[Meme] GAFAMfox
Mozilla Firefox in a state of extreme distress
Google Can Kill Mozilla Any Time It Wants
That gives Google far too much power over its rival... There are already many sites that refuse to work with Firefox or explicitly say Firefox isn't supported
Free (as in Freedom) Software Helps Tackle the Software Liability Issue, It Lets Users Exercise Greater Control Over Programs
Microsofters have been trying to ban or exclude Free software
In the US, Patent Laws Are Up for Sale
This problem is a lot bigger than just patents
ESET Finds Rootkits, Does Not Explain How They Get Installed, Media Says It Means "Previously Unknown Linux Backdoors" (Useful Distraction From CALEA and CALEA2)
FUD watch
Techdirt Loses Its Objectivity in Pursuit of Money
The more concerning aspects are coverage of GAFAM and Microsoft in particular
Techrights' Statement on Code of Censorship (CoC) and Kent Overstreet: This Was the Real Purpose of Censorship Agreements All Along
Bombing people is OK (if you sponsor the key organisations), opposing bombings is not (a CoC in a nutshell)
Links 23/11/2024: Press Sold to Vultures, New LLM Blunders
Links for the day
Links 23/11/2024: "Relationship with Oneself" and Yretek.com is Back
Links for the day
Links 23/11/2024: "Real World" Cracked and UK Online Safety Act is Law
Links for the day
Links 23/11/2024: Celebrating Proprietary Bluesky (False Choice, Same Issues) and Software Patents Squashed
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Friday, November 22, 2024
IRC logs for Friday, November 22, 2024
Gemini Links 23/11/2024: 150 Day Streak in Duolingo and ICBMs
Links for the day
Links 22/11/2024: Dynamic Pricing Practice and Monopoly Abuses
Links for the day
Topics We Lacked Time to Cover
Due to a Microsoft event (an annual malware fest for lobbying and marketing purposes) there was also a lot of Microsoft propaganda
Microsofters Try to Defund the Free Software Foundation (by Attacking Its Founder This Week) and They Tell People to Instead Give Money to Microsoft Front Groups
Microsoft people try to outspend their critics and harass them
[Meme] EPO for the Kids' Future (or Lack of It)
Patents can last two decades and grow with (or catch up with) the kids
EPO Education: Workers Resort to Legal Actions (Many Cases) Against the Administration
At the moment the casualties of EPO corruption include the EPO's own staff
Gemini Links 22/11/2024: ChromeOS, Search Engines, Regular Expressions
Links for the day
This Month is the 11th Month of This Year With Mass Layoffs at Microsoft (So Far It's Happening Every Month This Year, More Announced Hours Ago)
Now they even admit it
Links 22/11/2024: Software Patents Squashed, Russia Starts Using ICBMs
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, November 21, 2024
IRC logs for Thursday, November 21, 2024