Bonum Certa Men Certa

Turning Away Unwanted and/or Predatory Bots

posted by Roy Schestowitz on Sep 15, 2024

Sleep Tight

If no human will ever read it, what's the point of serving it?

ROGUE bots (programs without operators) are ruining the Web. One of us recently contacted Semrush Holdings, Inc. (founded by Oleg Shchegolev and Dmitri Melnikov) to complain about the misbehaving bots, which offer no benefit to anyone and basically just waste bandwidth and burn the planet. Semrush responded, but it's difficult to actually anticipate better behaviour. It's like another bubble; they probably have no concrete plan as a company (Semrush Inc. became Semrush Holdings, Inc. - one can guess why).

Companies like Semrush ruin the Web for real people. They also unnecessarily increase people's hosting bills. To them, that's just an "externality" - like LLM pests, they simply couldn't care less! Companies like these motivated us to go static; their bots misuse programs with a database back end (e.g. wikis) because they don't behave like sane people. They scrape away mercilessly and selfishly. They disregard HTTP headers and bypass caching.

That's not to say that Gemini Protocol is free of annoying bots; we wrote about some of these before and many still traverse Geminispace for little purpose other than maintaining lists like these:

There are 4056 capsules. We successfully connected recently to 2872 of them.

Those 10,000 pages sent from Techrights were retrieved for no purpose other than Lupa gathering statistics or surveying what's out there. Since midnight today we've served 13344 requests over Gemini; yesterday it was 12917, and the day before that 14257. A high proportion of these are requests from bots.

An associate has adjusted the domain's configurations to respond with "429 Too Many Requests" to unwanted Web requests that might cause denial of service (at sufficiently high volume). "I think this will be an appropriate tool against bots hitting the server too hard," he said. "Changes were required in NFTables and in the Apache2 configuration." There's probably no information of use to an attacker here, as merely knowing that NFTables is used barely gives an advantage.

But the very fact one needs to deploy and use NFTables means extra complexity. The misbehaving, out-of-control bots have certainly caused many sites to just throw in the towel and shut down.
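
To illustrate the general idea behind answering with "429 Too Many Requests", here is a minimal Python sketch of a per-client sliding-window limiter. It is not our NFTables or Apache2 configuration; the class name, the limit, the window length and the example address are all hypothetical and chosen only for illustration.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most `limit` requests per client within `window` seconds.

    Hypothetical illustration only; the real setup lives in NFTables
    and Apache2 and is not reproduced here.
    """

    def __init__(self, limit, window, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock  # injectable so the behaviour can be tested
        self.hits = defaultdict(deque)  # client address -> recent request times

    def status_for(self, client):
        """Return the HTTP status to send for one request from `client`."""
        now = self.clock()
        recent = self.hits[client]
        # Forget requests that have aged out of the window.
        while recent and now - recent[0] >= self.window:
            recent.popleft()
        if len(recent) >= self.limit:
            return 429  # Too Many Requests
        recent.append(now)
        return 200


if __name__ == "__main__":
    # Assumed values: 3 requests per 10 seconds, documentation-range address.
    limiter = SlidingWindowLimiter(limit=3, window=10.0)
    print([limiter.status_for("192.0.2.1") for _ in range(5)])
```

Refused requests are not recorded in the window, so a client that backs off regains access once its earlier requests age out. Doing the same thing in the firewall and Web server is cheaper than doing it in application code, which is one reason to accept the extra complexity.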

Making the site and capsule serve pages fast to real visitors is of utmost importance, not gaming the numbers upwards. If you want fakes, go use Facebook; or do what Clickfraud Spamnil (Swapnil Bhartiya) does at YouTube.

In Geminispace, the capsules known to be using the Certificate Authority Let's Encrypt are a dying breed; the total has fallen again. Lupa sees only 31 such capsules today:

2576 (89.7 %) capsules are self-signed, 31 (1.1 %) use the Certificate Authority Let's Encrypt, 265 (9.2 %) are signed by another CA (may be not a trusted one).

So this coming week we might see the Certificate Authority Let's Encrypt at under 1%. It used to be in use at around 200 capsules, or around 12% of Gemini capsules.
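
The shares quoted from Lupa can be checked with a few lines of Python. This is only a sketch: the function names are made up for illustration, and the "under 1%" threshold assumes the number of reachable capsules stays at 2872, which it will not do exactly.

```python
def shares(counts):
    """Return each category's share of the total, as a percentage to one decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


def largest_count_below(total, percent):
    """Largest whole number of capsules that is strictly below `percent` of `total`.

    Assumes `percent` is a whole number and `total` does not change.
    """
    return (percent * total - 1) // 100


# Figures quoted from Lupa above.
counts = {"self-signed": 2576, "Let's Encrypt": 31, "other CA": 265}

if __name__ == "__main__":
    print(sum(counts.values()))    # reachable capsules
    print(shares(counts))          # percentage per category
    print(largest_count_below(sum(counts.values()), 1))
```

The three counts add up to the 2872 capsules that Lupa reached, and under the stated assumption Let's Encrypt would need to fall to 28 capsules or fewer to drop below 1%.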
