Bonum Certa Men Certa

There Are Probably Over a Million Pages in Geminispace

posted by Roy Schestowitz on Aug 09, 2025


Occasionally people online ask how large "Gemini" is (by which they mean Gemini Protocol) and how widely it is used. The naysayers don't want answers; they spread FUD.

Well, there are two elements here: readers and authors. Gemini Protocol has many users, many of whom read or lurk rather than write (or only habitually post something in somebody else's capsule). Some capsules sort of emulate social control media, akin to Reddit.

It seems safe to assume all the authors are also readers, but not all readers are also authors.

Regarding size, two main limitations merit a mention when it comes to assessing magnitude: one is the lack of a complete list of all capsules and the other is crawling limits. Tux Machines and Techrights, for instance, contain about 10 times more pages than Lupa allows itself to index and thus count. So counts aren't complete, setting aside the fact that crawling is never complete for a variety of other reasons (some pages are islands in the traversal sense).

Since we've already mentioned Lupa, consider the statistics published by Lupa 2 hours ago:

Currently, our database includes 716,787 URIs, 591,936 of them having been checked successfully (status code 20) and recently. Among the recently accessed, 435,150 URIs serve a Gemini content.
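To make "status code 20" and "Gemini content" concrete: a Gemini response begins with a header line made of a two-digit status, a space, and a meta field, which for status 20 carries the MIME type. The following is a minimal sketch of how a crawler might sort header lines into those buckets. The function name and the bucket labels are hypothetical, and this is not necessarily how Lupa itself classifies responses.

```python
def classify_header(header: str) -> str:
    """Classify one Gemini response header line, e.g. '20 text/gemini\\r\\n'.

    Hypothetical helper for illustration; the labels are not Lupa's.
    """
    # Header layout: <STATUS><SPACE><META><CR><LF>
    status, _, meta = header.strip().partition(" ")
    if status != "20":
        return "not successful"
    # For status 20 the meta field is a MIME type, possibly with parameters.
    mime = meta.split(";")[0].strip().lower()
    return "gemini content" if mime == "text/gemini" else "other content"


print(classify_header("20 text/gemini; charset=utf-8\r\n"))
print(classify_header("20 image/png\r\n"))
print(classify_header("51 Not found\r\n"))
```

Under this reading, the gap between the 591,936 successful URIs and the 435,150 serving Gemini content would be successful responses carrying other media types such as images or plain text.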

Consider crawling constraints and artificial limits. It seems fair to guess that Lupa never sees or crawls half of all pages.
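As a rough back-of-the-envelope check, the Lupa figures quoted above can be scaled up by an assumed coverage fraction. The sketch below uses 0.5, which is only this post's guess and not a measured value; the function and variable names are illustrative.

```python
# Figures quoted from Lupa's statistics page.
KNOWN_URIS = 716_787
CHECKED_OK = 591_936       # status code 20, checked recently
GEMINI_CONTENT = 435_150   # of the recently accessed, serving Gemini content

# Assumption: the crawler sees about half of all pages (a guess, not a measurement).
ASSUMED_COVERAGE = 0.5


def extrapolate(counted: int, coverage: float) -> int:
    """Scale a crawler's count up by the fraction of pages it is assumed to see."""
    if not 0 < coverage <= 1:
        raise ValueError("coverage must be in (0, 1]")
    return round(counted / coverage)


print(extrapolate(CHECKED_OK, ASSUMED_COVERAGE))      # 1183872
print(extrapolate(GEMINI_CONTENT, ASSUMED_COVERAGE))  # 870300
```

With that assumption, the successfully checked URIs alone extrapolate to more than a million working objects, while the narrower Gemini-content count lands somewhat below a million. The result is only as good as the coverage guess.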

The search engine TLGS knows of fewer than 3,000 capsules that it crawls, and it has extracted close to half a million pages/objects from them:

The number of pages and domains known to TLGS at 2025-08-08 21:04:19. These figures are updated every 6Hrs or so.

Suppose TLGS does not fully crawl very large capsules and cannot find many pages in many capsules. Also bear in mind networking and computational limits. TLGS isn't some company; it is a personal project.

It seems safe to assume there are over a million working (functional) gemini:// objects out there and over 100,000 new ones every year. That's enough to keep anybody busy.
