Bonum Certa Men Certa

Commentary: StatCounter 'Global' Statistics

StatCounter bias



Summary: How StatCounter turns 4-5% of the world's population into 25% and reduces the world's largest Internet population (China) to just 2.46%, then claims to be measuring global market share (other surveys do the same thing)

AL submits: "Thank you for all your hard work in bringing us news through Techrights. I am reading it daily and find lots of interesting information.



"I read one of the comments from Mad Hatter in which he was talking about Wikipedia article on OS market share. I went to check it out and found that they use 1% for Linux (globally) based on the research by StatCounter Global. I was interested to see how this group is gathering their statistical data. If you go to their FAQ section they talk about sample size per country/region and there is a link to the full list of all countries. As they stated themselves their pool is 16,3 bln hits. Quite large I would say. But there is something interesting - the biggest group (region) is United States with 3,965,972,279 hits. That is almost 25% of the total pool. Now, my days of statistical studies are long gone but I still remember that in order to have accurate result you cannot over-represent one group. The result will be obviously skewed. We have one country that contributes almost 25% to the result compared to the rest of the world. As StatCounter states that they choose randomly that makes it very likely that lots of data on hits would be taken from USA. You know, for example, how much is the share of hits from China? 2,46%! In fact, looking at the whole list you can see that starting from Korea and further down the share is less than 1%! That includes countries like Poland, Greece, Japan, Russia, Switzerland etc.

“The result will be obviously skewed. We have one country that contributes almost 25% to the result compared to the rest of the world.”
      --Al
"I know some can say that there are many more computers sold in USA than in other countries (can't be true). But market share is more complex. If we have 95% (example) Linux presence on desktops in China, they would hardly make any influence with representation of only 2,46% on the StatCounter data. Do you see what I mean? There are of course many more problems with that. What kind of websites StatCounter is using to get hits? If we put hit counter on the website with Silverlight I don't think we will get many hits from Linux OS desktops, right? And even if the websites are getting hits from same amount of Linux OS and other OS desktops what will happen? StatCounter will randomly select hits from global pool and as data from USA will be more likely to get selected it will greatly skew the result and linux will always get under-represented. Lets say you have two crates: one with 10 pears and one with 250 tomatoes + 150 pears and you draw five times. However 3 times from first crate and 2 times from the second. You will have selected more pears than tomatoes. Even though there are 250 tomatoes and 150+10=160 pears. Is this reliable representation?"

Comments

Recent Techrights' Posts

Legal Letters Are Not Postcards
It seems like intimidation, nothing more
European Patent Office (EPO) Strikes Persist, EPO Management Tries to Give False Impression of "Happy Staff"
EPO is trying to broadcast to the world a totally phony image of itself
 
The "Next Big" Bonus for IBM's CEO Apparently Comes From American Taxpayers While Veteran IBMers Are PIP'd and RA'd (Laid Off)
the next big thing will be the CEO's bonus
Links 23/05/2026: Starbucks Scraps Disastrous Slopfest, Colbert’s Final ‘Late Show’
Links for the day
Gemini Links 23/05/2026: Poetry, Hobbies, ROOPHLOCH, and More
Links for the day
Government Bailouts Won't be Enough to Save IBM
Bailouts from taxpayers in the US
Links 23/05/2026: Social Media Bans and Demise of Userbase of LLM Chatbots
Links for the day
SLAPP Censorship - Part 85 Out of 200: The United Kingdom's Rating for Press Freedom Has Improved, But We Can Do Even Better
we see the US at #64
Sites Realise That Becoming More Active by Using Bots (LLM Slop) is Self-Destructive
We'll soon (maybe next year) also show that some of the 85+ KG of legal papers sent our way are computer-generated garbage, which might run afoul of some rules
Gemini Links 23/05/2026: Patience, LLM Chatbts Being Bad, and Unexpected Computer Surgery
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Friday, May 22, 2026
IRC logs for Friday, May 22, 2026
Links 22/05/2026: Ebola Crisis and Samsung Averts a Walkout With Big Bonuses
Links for the day
The End of FOSSPost (fosspost.org), It Has become an LLM Slopfarm Like FOSSLinux
These sites will never get lucky with slop. These experiments always end badly.
Links 22/05/2026: Inflation Fears and Thailand Tightens Visa Rules for Tourists From Dozens of Nations
Links for the day
EPO Staff Representation Speaks of This Week's Discussion With the EPO's Budget and Finance Committee (BFC) Amid Mass Strikes
The Central Staff Committee's outline (prepared in a rush) or the "flash report"
SLAPP Censorship - Part 84 Out of 200: New Legislation Against SLAPPs on the Way (After We Reached Out to Ministers)
They dealt with the matter individually too, but we won't share this in public, at least not at this time
The Corrupt Lecture the Non-Corrupt - Part XXX - Where Was "The Ethics and Compliance Team" When the Family of EPO President Campinos Was Caught Doing Cocaine?
It remains to be seen if national delegates will tolerate this in future meetings
Gemini Links 22/05/2026: Esperanto Music History, Suspicious Adoption of Signal, and Unauthorised LLM Slop in Code
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, May 21, 2026
IRC logs for Thursday, May 21, 2026
Links 21/05/2026: "Declining America" and Why Slop 'Code' is Made to Fail
Links for the day
Techrights and Tux Machines Subjected to Cyberattacks for Several Weeks
In the past I spoke to the cybercrime unit of British Police. Maybe it's time to do so again.
The Register MS Has Become a 'Content' Farm Promoting Slop for Hostile Corporations
Now they call it "PARTNER CONTENT" - not "SPONSORED" - as if semantics make the difference
Latest Example of Widespread Fake Assertions (False News) About "Hey Hi"
The false narrative of "Hey Hi layoffs"
Links 21/05/2026: Facebook Rewarded With Tax Breaks to Destroy the Environment and Cause Global Warming, Shortages, Pollution; SpaceX (SPCX) Continues Losing Billions of Dollars
Links for the day
Codecs and Software Patents - Part VIII - GNU Audio/Video Team Has Chosen the AV1 Video Codec and It Explains Why (They've Researched Their Options)
AV1 video codec will be used to encode and share GNU videos online
Dr. Stallman Helps Establish Free Software Advocacy Outside the Free Software Foundation (FSF) as Well
The ideals or principles of Free Software needn't be centralised or monopolised; they can be federated
22 Years of Tux Machines and a Community Stronger Than Ever Before
We've already received some feedback from the community and improved it accordingly
Microsoft Under Investigation for Breaches of Law in the UK
Just like the Microsofters
More Microsoft Layoffs on the Way (June and July 2026)
with or without PIPs
LWN Sponsored by the Linux Foundation (Monopolies)
We must be able to casually point this out
The Corrupt Lecture the Non-Corrupt - Part XXIX - European Patent Office (EPO) Tells Staff "Speaking up" is Good, But Not When the "Brother-in-law" of EPO's President Does Cocaine
Do we still have a functioning democracy and potent press?
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, May 20, 2026
IRC logs for Wednesday, May 20, 2026
Gemini Links 21/05/2026: Immigration, Slop, and Slop 'Code' Suggestions Infesting Code Repositories
Links for the dayGemini Links 21/05/2026: Immigration, Slop, and Slop 'Code' Suggestions Infesting Code Repositories