
(ℹ) Join us now at the IRC channel | ䷉ Find the plain text version at this address (HTTP) or in Gemini (how to use Gemini) with a full GemText version.
| *agentcas2y (~dracos@spnyw2fxpi54g.irc) has joined #techbytes | May 29 00:17 | |
| *agentcas2y is now known as agentcasey | May 29 00:18 | |
| *x-amarsh04 has quit (Quit: Konversation terminated!) | May 29 02:06 | |
| *x-amarsh04 (~amarsh04@atu2uhbvxaeyw.irc) has joined #techbytes | May 29 02:14 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 02:15 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 02:16 | |
| *psydruid (~psydruid@jevhxkzmtrbww.irc) has left #techbytes | May 29 02:22 | |
| *psydruid (~psydruid@jevhxkzmtrbww.irc) has joined #techbytes | May 29 02:32 | |
| *agentcasey has quit (Ping timeout: 2m30s) | May 29 05:04 | |
| *psydroid3 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 08:14 | |
| *psydroid3 has quit (Quit: KVIrc 5.2.10 Quasar http://www.kvirc.net/) | May 29 08:21 | |
| *psydroid3 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 08:24 | |
| *psydroid3 has quit (Ping timeout: 2m30s) | May 29 09:08 | |
| schestowitz[TR2] | https://www.thelayoff.com/t/1ksnecn6r | May 29 09:26 |
|---|---|---|
| -TechBytesBot/#techbytes- ( status 403 @ https://www.thelayoff.com/t/1ksnecn6r ) | May 29 09:26 | |
| schestowitz[TR2] | "ntil 1993 | May 29 09:26 |
| schestowitz[TR2] | 19 hours ago by Anonymous | May 29 09:26 |
| schestowitz[TR2] | | 1 reaction (+1/-0) | Reply | May 29 09:26 |
| schestowitz[TR2] | Post ID: @dc+1ksnecn6r | May 29 09:26 |
| schestowitz[TR2] | @d0 you must mean Indian monkeys, since those are the only ones that Alvind, Krabanaugh and the CIO look to hire. If you pay peanuts, you hire monkeys, of course. | May 29 09:26 |
| schestowitz[TR2] | IBM is the laughingstock of the tech world these days. It can only acquire clients through deceit and deception. | May 29 09:26 |
| schestowitz[TR2] | " | May 29 09:26 |
| *psydroid2 has quit (Quit: KVIrc 5.0.0 Aria http://www.kvirc.net/) | May 29 09:29 | |
| schestowitz[TR2] | https://www.thelayoff.com/t/1ksj4adk3 | May 29 09:38 |
| -TechBytesBot/#techbytes- ( status 403 @ https://www.thelayoff.com/t/1ksj4adk3 ) | May 29 09:38 | |
| schestowitz[TR2] | " only 3 badges ?? You must be joking. | May 29 09:38 |
| schestowitz[TR2] | As a wannbe executive, you must get at least 7 or more badges (worthless or not) in a week or get a PIP immediately. Make sure your devout, church going first line knows about this, or he will be on a PIP too." | May 29 09:38 |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 09:42 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 09:59 | |
| schestowitz[TR2] | "Have a goal of inserting undetected backdoors (and other exploitable vulnerabilities) into popular software? | May 29 10:11 |
| schestowitz[TR2] | There's a few things that will dramatically help that goal: | May 29 10:11 |
| schestowitz[TR2] | 1) Increase the quantity of code, in a project, written by an unknown party. [Example: AI, & Rust Crates] | May 29 10:11 |
| schestowitz[TR2] | 2) Increase the speed at which new, unreviewed code is introduced into a project... thus rendering future code reviews increasingly difficult and unlikely. [Example: AI] | May 29 10:11 |
| schestowitz[TR2] | 3) Make sure new versions of a compiler can only be built with earlier versions of the same compiler. This makes it possible to inject replicating backdoors not present in the current code base (see "Trusting Trust" by Ken Thompson). [Example: Rust] | May 29 10:11 |
| schestowitz[TR2] | 4) Make sure a programming language has a single prominent compiler. This makes the vulnerability above especially powerful. [Example: Rust] | May 29 10:11 |
| schestowitz[TR2] | If I wanted to insert undetected backdoors into a software project, I would encourage them to use Rust (the Rustc compiler) along with AI coding "assistants"." | May 29 10:11 |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 10:34 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 11:09 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 11:11 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 12:03 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 12:26 | |
| *psydroid3 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 12:33 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 13:14 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 13:41 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 13:55 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 15:10 | |
| schestowitz[TR2] | https://www.thelayoff.com/t/1ksr3548x#replies | May 29 15:31 |
| -TechBytesBot/#techbytes- ( status 403 @ https://www.thelayoff.com/t/1ksr3548x#replies ) | May 29 15:31 | |
| schestowitz[TR2] | "" | May 29 15:31 |
| schestowitz[TR2] | "Why the vitriol for a man who is leading IBM to become more profitable and competitive?" | May 29 15:31 |
| schestowitz[TR2] | Profitable for the Indian tribe ? | May 29 15:31 |
| schestowitz[TR2] | At the expense of throwing US based employees under the bus. | May 29 15:31 |
| schestowitz[TR2] | What are his achievements ? Spewing sh-t from his disgusting mouth ? | May 29 15:31 |
| schestowitz[TR2] | https://lists.gnu.org/archive/html/libreplanet-discuss/2026-05/msg00019.html | May 29 17:13 |
| -TechBytesBot/#techbytes-lists.gnu.org | Re: LLM code as public domain and copyleft | May 29 17:13 | |
| schestowitz[TR2] | " | May 29 17:13 |
| schestowitz[TR2] | I'm curious if such fear is justified. | May 29 17:13 |
| schestowitz[TR2] | LLMs are not much more than plagiarism engines. | May 29 17:13 |
| schestowitz[TR2] | That LLMs are "not much more than plagiarism engines" is gross oversimplification that reveals narrow understanding of how large language models actually work. LLMs are generative synthesis engines that operate on probabilistic reasoning, contextual understanding, and semantic compression. Plagiarism requires intent. LLMs do not have any intent. | May 29 17:13 |
| schestowitz[TR2] | If you ask an LLM to explain quantum entanglement in simple terms, it doesn’t ‘pull’ a definition from Wikipedia. It constructs a novel explanation using its understanding of the underlying concepts. That’s synthesis, not plagiarism. | May 29 17:13 |
| schestowitz[TR2] | LLMs are trained on vast corpora of human text. But so are humans. When you read a book and then write an essay inspired by it, are you a ‘plagiarist’? No—you’re learning from context. LLMs do the same, but at scale. The difference is mechanism, not essence. | May 29 17:13 |
| schestowitz[TR2] | You can use any LLM to generate what you wish to call "plagiarism", and you can use it to be very creative. | May 29 17:13 |
| schestowitz[TR2] | How do you generate plagiarism by your definition? Ask the LLM to give you literal output of some works. Set the temperature to zero. that specific mode, the LLM acts as a high-speed search-and-retrieve engine, reproducing existing text with minimal variation. It is citation on steroids. | May 29 17:13 |
| schestowitz[TR2] | Conversely, "authorship" emerges when you use the LLM as an augmentative layer for your own original ideas. You define the core concept, the intent, and the direction. The LLM then refines, expands, and polishes your voice, not its own. It is not generating content from scratch; it is elevating your expression. The difference is not in the tool, but in the source of the initial spark. | May 29 17:13 |
| schestowitz[TR2] | You’re still mistaking the map for the territory, rather than discovering the territory itself. Try it out. Method is there, in previous paragraph. | May 29 17:13 |
| schestowitz[TR2] | Thus, in the matter of software, one of the main uses of LLMs is to strip both attribution and licensing information from whole code bases. So while it is correct that machine generated *output* cannot be copyrighted, if it were actually generated from scratch by the LLM. But, one has to raise questions about the input which it is regurgitating minus attribution and licensing information. LLM output is not generated from scratc | May 29 17:13 |
| schestowitz[TR2] | h, it is instead generated from models trained on licensed code under copyright protection. | May 29 17:13 |
| schestowitz[TR2] | There are many LLMs today that are not trained on copyrighted code. We talk of millions of published models, like those on huggingface.co | May 29 17:13 |
| schestowitz[TR2] | I have downloaded 902 models. And I run those which are not trained on copyrighted sets and which are not proprietary. Now you come with such mature looking statement how the main use of the LLM is to strip both attribution and licensing information. Surely you can do that, but it is not main use. For these models, the 'stripping of attribution' narrative falls apart entirely. | May 29 17:13 |
| schestowitz[TR2] | Yet, you present your statement as if it’s a universal truth. This kind of broad generalization is dangerous. It’s not just an academic error; it misleads people who don’t have the time or expertise to dig into the nuances of these diverse models. They’ll just jump on the bandwagon, protesting a problem that isn’t actually universal. | May 29 17:13 |
| schestowitz[TR2] | Put more effort in research. It is enough to research the website HF I gave you. | May 29 17:13 |
| schestowitz[TR2] | So if you wish to use it in the mode of plagiarism, of course, you are free to do so. Same thing with actual books, you could take them and kind of re-write them, without referencing. | May 29 17:13 |
| schestowitz[TR2] | If you wish to use it in full authorship mode, you can, there will be no plagiarism, and LLM is your augmentation tool. | May 29 17:13 |
| schestowitz[TR2] | As for software freedom, there is an additional pair of problems caused by LLMs: using the plagiarized output separates potential project collaborators from the very projects which are getting exploited, while at the same time the projects which are getting exploited are getting isolated from any potential collaborators trapped behind LLMs. | May 29 17:13 |
| schestowitz[TR2] | That is under condition that some user use it as plagiarized output. It simply is not so in majority of cases. I am sure you need to do more experience with it. Basically, run the LLM. | May 29 17:13 |
| schestowitz[TR2] | If something is plagiarized it means user knew it, and had intent to plagiarize, and would not provide references. | May 29 17:13 |
| schestowitz[TR2] | So try doing that, practically, then come back with the report. Then we have the fact to talk about. | May 29 17:13 |
| schestowitz[TR2] | But, yeah, using LLMs to strip the GPL and other software licenses from code, as well as stripping copyright attribution, is a real problem now. | May 29 17:13 |
| schestowitz[TR2] | For who? Do you have specific case? You can complain. But let us not generalize. | May 29 17:13 |
| schestowitz[TR2] | The abundance of software created with the LLM also generates more GPL/Apache/Mit/free software. The abundance will soon make the GPL obsolete, because everybody is then able to generate their software, which is in the end free software. It is evolution. | May 29 17:13 |
| schestowitz[TR2] | There was free software before the GPL. | May 29 17:13 |
| schestowitz[TR2] | GPL has the purpose to protect free software to remain free software. | May 29 17:13 |
| schestowitz[TR2] | But once you have free software generated on every corner, every monitor, every computer, the GPL may become obsolete, but users may still have the freedom. | May 29 17:13 |
| schestowitz[TR2] | So keep working on the freedom, don't judge the temporary momentum of LLm-affairs which you still did not research well. | May 29 17:13 |
| schestowitz[TR2] | -- | May 29 17:13 |
| schestowitz[TR2] | Jean Louis | May 29 17:13 |
| schestowitz[TR2] | " | May 29 17:13 |
| schestowitz[TR2] | " | May 29 17:15 |
| schestowitz[TR2] | LLMs are generative synthesis engines that operate on probabilistic reasoning, contextual understanding, and semantic compression. | May 29 17:15 |
| schestowitz[TR2] | No, LLMs have nothing to do with generation. They merely ingest 'tokens' and recombine sets of them into statistically plausible combinations. Lots of people get confused about that, perhaps (to be generous) because of wishful thinking on their part. | May 29 17:15 |
| schestowitz[TR2] | > Plagiarism requires intent. | May 29 17:15 |
| schestowitz[TR2] | Intent is not a prerequisite for plagiarism as defined in academia over the ages. | May 29 17:15 |
| schestowitz[TR2] | > LLMs do not have any intent. | May 29 17:15 |
| schestowitz[TR2] | Agreed, LLMs have no intent. Thus LLMs just plagiarize despite being automatons. But those who interface with LLMs can have intent. One can question the intent of those interfacing with LLMs because most people know, at least on some level, that the 'tokens' are slurped up verbatim from the WWW at large without attribution. | May 29 17:15 |
| schestowitz[TR2] | >> But, yeah, using LLMs to strip the GPL and other software licenses | May 29 17:15 |
| schestowitz[TR2] | >> from code, as well as stripping copyright attribution, is a real | May 29 17:15 |
| schestowitz[TR2] | >> problem now. | May 29 17:15 |
| schestowitz[TR2] | > | May 29 17:15 |
| schestowitz[TR2] | > For who? Do you have specific case? You can complain. But let us not | May 29 17:15 |
| schestowitz[TR2] | > generalize. | May 29 17:15 |
| schestowitz[TR2] | In the context of code, one recent example is the use of LLMs to try to strip the Chardet project of its license, among quite a few other similar attacks some more public some less public. Noticeably, the original code is in the training set so the stripped version can hardly be described as a cleanroom implementation. " | May 29 17:15 |
| schestowitz[TR2] | https://lists.gnu.org/archive/html/libreplanet-discuss/2026-05/msg00014.html | May 29 17:16 |
| -TechBytesBot/#techbytes-lists.gnu.org | Re: LLM code as public domain and copyleft | May 29 17:16 | |
| schestowitz[TR2] | " | May 29 17:16 |
| schestowitz[TR2] | In Part II of the report on Copyright and Artificial Intelligence, | May 29 17:16 |
| schestowitz[TR2] | https://www.copyright.gov/ai/ | May 29 17:16 |
| schestowitz[TR2] | it concludes with the observation that LLM output, and AI output in general, is ineligible for copyrights unless that output is modified by a human for the purposes of creative expression: | May 29 17:16 |
| schestowitz[TR2] | "As described above, in many circumstances these outputs | May 29 17:16 |
| schestowitz[TR2] | will be copyrightable in whole or in part—where AI is | May 29 17:16 |
| schestowitz[TR2] | used as a tool, and where a human has been able to | May 29 17:16 |
| schestowitz[TR2] | determine the expressive elements they contain. Prompts | May 29 17:16 |
| schestowitz[TR2] | alone, however, at this stage are unlikely to satisfy | May 29 17:16 |
| schestowitz[TR2] | those requirements." | May 29 17:16 |
| schestowitz[TR2] | May 29 17:16 | |
| schestowitz[TR2] | https://www.copyright.gov/ai/Copyright-and-Artificial-Intelligence-Part-2-Copyrightability-Report.pdf | May 29 17:16 |
| schestowitz[TR2] | For further clarification, there are written guidelines on the scope of copyright in the context of slop which has been handled meaningfully by humans: | May 29 17:16 |
| schestowitz[TR2] | "Consistent with the Office’s policies described above, | May 29 17:16 |
| schestowitz[TR2] | applicants have a duty to disclose the inclusion of | May 29 17:16 |
| schestowitz[TR2] | AI-generated content in a work submitted for registration | May 29 17:16 |
| schestowitz[TR2] | and to provide a brief explanation of the human author’s | May 29 17:16 |
| schestowitz[TR2] | contributions to the work" | May 29 17:16 |
| schestowitz[TR2] | However, slop which has not been handled meaningfully by humans remains ineligible for copyright: | May 29 17:16 |
| schestowitz[TR2] | "... a work “autonomously created by artificial intelligence | May 29 17:16 |
| schestowitz[TR2] | without any creative contribution from a human actor” was | May 29 17:16 |
| schestowitz[TR2] | “ineligible for registration”. | May 29 17:16 |
| schestowitz[TR2] | https://www.copyright.gov/ai/ai_policy_guidance.pdf | May 29 17:16 |
| schestowitz[TR2] | Thus the problem raised in Andy's original message, that LLMs can be and are being misused to strip licensing (and attribution) from projects' code bases. The code goes into the large language model with a license and a copyright holder. Calling that infringement a training set doesn't change its nature. Then the code comes back out of the model as slop, and while slop in general is ineligible for copyright status the original code | May 29 17:16 |
| schestowitz[TR2] | remains strongly protected by copyright, whether plagiarized by an LLM or not. | May 29 17:16 |
| schestowitz[TR2] | And, again, as mentioned in an earlier reply, there is the further problem of the LLMs separating users from projects and vice versa. That is a whole other discussion about duplicated effort, missing communications, and general lost opportunities. | May 29 17:16 |
| schestowitz[TR2] | PS. COPE (Committee on Publication Ethics) writes the following: | May 29 17:16 |
| schestowitz[TR2] | "COPE joins organisations, such as WAME and the JAMA | May 29 17:16 |
| schestowitz[TR2] | Network among others, to state that AI tools cannot | May 29 17:16 |
| schestowitz[TR2] | be listed as an author of a paper." | May 29 17:16 |
| schestowitz[TR2] | May 29 17:16 | |
| schestowitz[TR2] | https://publicationethics.org/guidance/cope-position/authorship-and-ai-tools | May 29 17:16 |
| schestowitz[TR2] | It would not be a stretch to apply a similar conclusion to code which has been mangled by an LLM. " | May 29 17:16 |
| -TechBytesBot/#techbytes-www.copyright.gov | Copyright and Artificial Intelligence | U.S. Copyright Office | May 29 17:16 | |
| -TechBytesBot/#techbytes- ( status 403 @ https://publicationethics.org/guidance/cope-position/authorship-and-ai-tools ) | May 29 17:17 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 17:32 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 17:47 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 18:01 | |
| *agentcas2y (~dracos@spnyw2fxpi54g.irc) has joined #techbytes | May 29 18:21 | |
| *agentcas2y is now known as agentcasey | May 29 18:22 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 19:08 | |
| *psydroid2 has quit (Ping timeout: 2m30s) | May 29 19:15 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 19:15 | |
| *agentcasey has quit (Ping timeout: 2m30s) | May 29 20:41 | |
| *psydroid2 has quit (Quit: KVIrc 5.2.10 Quasar http://www.kvirc.net/) | May 29 21:04 | |
| *psydroid2 (~psydroid@36imbvshpgubk.irc) has joined #techbytes | May 29 21:04 | |
| *agentcas2y (~dracos@spnyw2fxpi54g.irc) has joined #techbytes | May 29 21:13 | |
| *agentcas2y is now known as agentcasey | May 29 21:14 | |
| *psydroid2 has quit (Quit: KVIrc 5.2.10 Quasar http://www.kvirc.net/) | May 29 23:12 | |
Generated by irclog2html.py 2.6 | ䷉ find the plain text version at this address (HTTP) or in Gemini (how to use Gemini) with a full GemText version.