Bonum Certa Men Certa

Can the Scale of Linux Adoption Ever be Gauged?

Determining the usage and growth of Free software has always been a challenge. For over a decade, arguments have been held -- sometimes flamewars -- whose central point was the usage scale of software which is freely distributed. Market share can be estimated based on the number of sales, but Free software is rarely sold. It usually replaces existing software that is proprietary, i.e. its ownership lies with a vendor and it is usually treated as an integral part of another product.



When discussing Free software, the term "installed base" seems rather popular. It is installation, not embedment or preinstallation, that tailors a product to the owner's personal needs. Unfortunately, installed base, as opposed to market share, proves to be a tricky thing to gauge.

At the center of this debate, one typically finds the GNU/Linux operating system. Many perceive Linux as the greatest contender, the one which can bring Free software to the mainstream. Linux is commonly obtained through the exchange of CDs, which can be modified, change hands, and be used to deploy the same software on multiple computers. The content of these CDs is usually (albeit not always) downloaded from the Internet. Lesser-known Linux distributions are sometimes obtained through peers or via BitTorrent, which cannot be properly tracked. These channels of communication are decentralized by nature.

Endless attempts have been made to count Linux users. Userbase vanity harbours confidence and leads to better support from the industry. Attempts to quantify growth included Web sites whose sole purpose is to have Linux users register and provide details about their computers. Even the most prominent among these Web sites met with very limited success. They were not able to keep up with change, let alone attract the attention of all Linux users. Most Linux users were simply apathetic toward this cause.

In more recent years, the ubiquity of interconnected devices and computers played an important role in statistics. Computing units that offer Web access have generated large piles of data. Statistical analysis of this data was thought to be another opportunity to study presence and geography of Linux users around the planet. It has, however, been a very deficient analysis. For a variety of reasons, too many assumptions were made, which led to flawed conclusions. To this date, no proper and valid analysis has been carried out.

Looking more closely at the difficulties in interpreting Web statistics, there are numerous factors to consider. Some problems are obvious. The sample of selectively chosen Web sites often draws particular audiences which, on average, do not represent the entire population. Additionally, because Linux comes as a large number of distributions rather than under a single identity, identification strings are hard to interpret. As such, many Linux users are simply treated as though they use an “unknown” operating system. This “unknown” component is statistically significant, yet it tends to be ignored and discarded.

There are more problems that need to be taken into consideration. For example, data gathered by Web sites fails to identify computers that operate behind proxies and caches such as Squid. This data also assumes that everyone identifies himself or herself honestly. The fact of the matter is that certain Web sites were designed to reject access from every Web browser other than Internet Explorer. As a result, many Linux users are forced to pretend (by altering HTTP headers) that they use a typical Windows setup. This is known as spoofing or forging, and it is done as a matter of convenience.
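The identification problems described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the keyword table, the `classify_os` and `shares` helpers, and the four sample hits (including the made-up "SomeDistro" string) do not come from any real log analyser or data set. The sketch only shows that a string without a recognised keyword lands in "unknown", that a spoofed header is counted as Windows, and that discarding the "unknown" bucket changes every remaining share.

```python
from collections import Counter

# Hypothetical keyword table; real log analysers use far longer lists.
OS_KEYWORDS = [
    ("Windows", "Windows"),
    ("Mac OS X", "Mac OS"),
    ("Linux", "Linux"),
]


def classify_os(user_agent):
    """Return the first OS whose keyword appears in the string, else 'unknown'."""
    for keyword, os_name in OS_KEYWORDS:
        if keyword in user_agent:
            return os_name
    return "unknown"


def shares(counts, drop_unknown=False):
    """Percentage share per OS, optionally discarding the 'unknown' bucket."""
    kept = {
        name: hits
        for name, hits in counts.items()
        if not (drop_unknown and name == "unknown")
    }
    total = sum(kept.values())
    return {name: round(100 * hits / total, 1) for name, hits in kept.items()}


# Made-up hits: (OS actually in use, User-Agent header the site receives).
sample_hits = [
    ("Windows", "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"),
    ("Linux", "Mozilla/5.0 (X11; U; Linux i686; en-US) Gecko/20070309 Firefox/2.0.0.3"),
    # Names a distribution but never says "Linux", so it goes unrecognised.
    ("Linux", "Konqueror/3.5 (SomeDistro)"),
    # A Linux user spoofing Internet Explorer on Windows to get past a site check.
    ("Linux", "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"),
]

observed = Counter(classify_os(agent) for _, agent in sample_hits)
actual = Counter(true_os for true_os, _ in sample_hits)

print(shares(observed, drop_unknown=True))  # {'Windows': 66.7, 'Linux': 33.3}
print(shares(actual))                       # {'Windows': 25.0, 'Linux': 75.0}
```

In this toy sample three of the four visitors run Linux, yet the report built from the headers credits Linux with a third of the traffic.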

The last factor to consider here is botnets (zombie PCs), which are essentially travelling the World Wide Web. It is a relentless Web journey, and it happens without the awareness of the rightful owner of the computer. This troublesome phenomenon means that large amounts of Web traffic are consumed in a very wasteful fashion. This traffic does not reflect human consumption of information. Botnets 'pollute' log data and therefore skew statistics. This rarely (if ever) works in favor of secure operating systems and Web browsers.

Web statistics and the research that revolves around them suffer from yet another false assumption. One must not simply accept the contention that all computers are connected to the Internet nowadays. Even if they are, their users do not necessarily visit an identical number of Web sites or consume an equal number of pages. Different operating systems are used in different settings. They serve a particular purpose and facilitate working tasks that might not require the Internet at all.

To use an example, Hollywood is considered a place where production studios adopt Linux, even on the desktop. In a recent interview with the press, CinePaint's Project Manager said that "Linux is the default operating [system] on desktops and servers at major animation and visual effects studios, with maybe 98 percent [or more] penetration". These computers, which include user-facing workstations, get used heavily for design and rendering work, but probably not for Web surfing.

There have been other projects that are intended to keep track of the number of Linux users by setting up a communication channel that connects a computer to the Linux distributor's servers. These projects are neither mature nor widely adopted.

On the other hand, the increasing adoption of online software repositories has made this process more feasible without it being considered "spying". And yet, the absence of a registration process leaves room for dynamic addressing, so a single unique user is still hard to identify. The user will remain a moving target on the network as long as system registration is absent. The Free software community is averse to such privacy-compromising steps, so they are unlikely to ever become mandatory.
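To see why addresses are a poor stand-in for users, here is a minimal sketch with a made-up access log. The machine names and the `count_distinct` helper are hypothetical, the addresses come from ranges reserved for documentation, and the "machine" column is precisely what a repository server cannot observe without registration. The shared-address case is an added assumption that follows from the proxy problem mentioned earlier.

```python
# Hypothetical repository access log: (machine, address seen by the server).
access_log = [
    ("laptop-a", "203.0.113.10"),  # one laptop, new address on each reconnect
    ("laptop-a", "203.0.113.57"),
    ("laptop-a", "203.0.113.91"),
    ("office-1", "198.51.100.4"),  # several machines behind one shared address
    ("office-2", "198.51.100.4"),
    ("office-3", "198.51.100.4"),
    ("office-4", "198.51.100.4"),
    ("office-5", "198.51.100.4"),
]


def count_distinct(log):
    """Return (distinct addresses, distinct machines) for a list of log entries."""
    addresses = {address for _, address in log}
    machines = {machine for machine, _ in log}
    return len(addresses), len(machines)


laptop_only = [entry for entry in access_log if entry[0] == "laptop-a"]
office_only = [entry for entry in access_log if entry[0].startswith("office")]

print(count_distinct(laptop_only))  # (3, 1): one user looks like three
print(count_distinct(office_only))  # (1, 5): five users look like one
print(count_distinct(access_log))   # (4, 6)
```

The error can run in either direction, which is why a count of "users or addresses" says little about the number of people behind it.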

Last year, in an interview with Red Herring, Canonical's CEO Mark Shuttleworth commented about the activity on his company's repositories. At the time, at least 8 million distinct users or addresses with a particular version of his Linux distribution could be identified. That was only months after the release of this distribution, which many of us had already known as "Ubuntu".

Regardless of the adoption rate of Linux on the desktop, Linux enjoys double-figure inter-quarter growth on the server side. This trend has sustained itself for several years. There are, however, great difficulties to overcome when it comes to tracking how widespread -- not just profitable -- Linux has become in the datacenter. Market figures regularly come from analysts, but these figures are based purely on sales. They only gauge revenue. They fail to account for the fact that Linux is free and that it is becoming easier to set up each year. Many companies take the do-it-yourself route and build their own server farms. They do not require much assistance, so deployment can be completed without a Linux purchase per se ever being made. The true growth of Linux will therefore stay an enigma for quite some time to come.
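The gap between revenue and deployment is simple arithmetic, sketched here with invented figures. None of the numbers below are real market data; the categories, unit counts, prices, and the `linux_share` helper are all assumptions chosen only to show how a revenue-weighted share can understate a unit-weighted one when some deployments cost nothing.

```python
# Hypothetical figures for illustration only; not real market data.
# Each row: (category, servers deployed, revenue per server in dollars).
servers = [
    ("Proprietary OS", 600, 1000.0),
    ("Linux, purchased with support", 150, 800.0),
    ("Linux, self-installed", 250, 0.0),
]


def linux_share(rows, weight):
    """Percentage of the weighted total that belongs to the Linux rows."""
    total = sum(weight(row) for row in rows)
    linux = sum(weight(row) for row in rows if row[0].startswith("Linux"))
    return round(100 * linux / total, 1)


by_revenue = linux_share(servers, lambda row: row[1] * row[2])
by_units = linux_share(servers, lambda row: row[1])

print(by_revenue)  # 16.7
print(by_units)    # 40.0
```

With these made-up inputs, an analyst counting sales would report about 17 percent, while 40 percent of the machines actually run Linux.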

At the end of the day, let us all remember that Free software was not created to thrive on profits. There is no marketing department to boast about growth either. Whether we use a search engine, or connect to a mail server, or acquire some snazzy gadget, Linux is likely to be there. The desktop, however, is perceived as the ultimate destination. It has the most visibility. Laptops and desktops can demonstrate that Linux has arrived and that it is here to stay and thrive. The back room usually escapes people's attention, despite a gradual shift in paradigm, which encourages adoption of remote services and thinner clients.

Counting the number of Linux users might always remain an impossibility. Should you mind?

Originally published in Datamation in 2007; it reached the front page of Slashdot
