Bonum Certa Men Certa

[Meme] Google: Nothing is Beyond Our Reach

Thunderbird



US Spy Agency Boasts 'Nothing Is Beyond Our Reach' With New LogoSummary: When it comes to Google (Gulag), E-mail messages are merely "content" (to be scanned and indexed)

Google Has Most of My Email Because It Has All of Yours



Republished by Slate. Translations available in French (Français), Spanish (Español), Chinese (中文)

For almost 15 years, I have run my own email server which I use for all of my non-work correspondence. I do so to keep autonomy, control, and privacy over my email and so that no big company has copies of all of my personal email.

A few years ago, I was surprised to find out that my friend Peter Eckersley — a very privacy conscious person who is Technology Projects Director at the EFF — used Gmail. I asked him why he would willingly give Google copies of all his email. Peter pointed out that if all of your friends use Gmail, Google has your email anyway. Any time I email somebody who uses Gmail — and anytime they email me — Google has that email.

Since our conversation, I have often wondered just how much of my email Google really has. This weekend, I wrote a small program to go through all the email I have kept in my personal inbox since April 2004 (when Gmail was started) to find out.

One challenge with answering the question is that many people, like Peter, use Gmail to read, compose, and send email but they configure Gmail to send email from a non-gmail.com “From” address. To catch these, my program looks through each message’s headers that record which computers handled the message on its way to my server and to pick out messages that have traveled through google.com, gmail.com, or googlemail.com. Although I usually filter them, my personal mailbox contains emails sent through a number of mailing lists. Since these mailing lists often “hide” the true provenance of a message, I exclude all messages that are marked as coming from lists using the (usually invisible) “Precedence” header.

The following graph shows the numbers of emails in my personal inbox each week in red and the subset from Google in blue. Because the number of emails I receive week-to-week tends to vary quite a bit, I’ve included a LOESS “smoother” which shows a moving average over several weeks.

Emails, total and from GMail, over timeFrom eyeballing the graph, the answer to seems to be that, although it varies, about a third of the email in my inbox comes from Google!

Keep in mind that this is all of my personal email and includes automatic and computer generated mail from banks and retailers, etc. Although it is true that Google doesn’t have these messages, it suggests that the proportion of my truly “personal” email that comes via Google is probably much higher.

I would also like to know how much of the email I send goes to Google. I can do this by looking at emails in my inbox that I have replied to. This works if I am willing to assume that if I reply to an email sent from Google, it ends up back at Google. In some ways, doing this addresses the problem with the emails from retailers and banks since I am very unlikely to reply to those emails. In this sense, it also reflects a measure of more truly personal email.

I’ve broken down the proportions of emails I received that come from Google in the graph below for all email (top) and for emails I have replied to (bottom). In the graphs, the size of the dots represents the total number of emails counted to make that proportion. Once again, I’ve included the LOESS moving average.

Proportion of emails from GMail over timeThe answer is surprisingly large. Despite the fact that I spend hundreds of dollars a year and hours of work to host my own email server, Google has about half of my personal email! Last year, Google delivered 57% of the emails in my inbox that I replied to. They have delivered more than a third of all the email I’ve replied to every year since 2006 and more than half since 2010. On the upside, there is some indication that the proportion is going down. So far this year, only 51% of the emails I’ve replied to arrived from Google.

The numbers are higher than I imagined and reflect somewhat depressing news. They show how it’s complicated to think about privacy and autonomy for communication between parties. I’m not sure what to do except encourage others to consider, in the wake of the Snowden revelations and everything else, whether you really want Google to have all your email. And half of mine.

If you want to run the analysis on your own, you’re welcome to the Python and R code I used to produce the numbers and graphs.



Licence: Copyright Benjamin Mako Hill, distributed as free cultural work under the Creative Commons Attribution-ShareAlike license.

Recent Techrights' Posts

State of Tech Journalism in 2026: Follow the Money
in order to understand what motivates an opinion piece one must follow the money
 
Europe in 2026: Over 5% GNU/Linux, Not Counting Chromebooks
2026 has started strongly
Slopfarm Says Microsoft's "Biggest Business" is the 'Business' Where It Loses Tens of Billions of Dollars
TOI still pretends to have a lot of output
At the Start of January 2025 Microsoft President Said Microsoft Would Spend 80 Billion Dollars on "AI" Data Centres. That Didn't Happen. Microsoft Laid Off 30,000 Workers, Debt Surged.
Maybe this coming Monday Microsoft will come up with more false promises and vapourware
Links 02/01/2026: Insurrectionist Attacks Musicians Critical of Him With Lawfare, Project Gutenberg Now Has Over 75,000 Books
Links for the day
Decline in LLM Slop About "Linux" is a Good Start for 2026
When the only remaining proponents of slop are slop, which is pretty much what's happening right now, the bubble is popping
EPO People Power - Part XXII - Contact Officials and Inform Your National Representatives (Delegates) of the EPO's Cocainegate
Europe's largest media intentionally covers up serious scandals in Europe's second-largest institution
Slopwatch Still Dead, Not Enough LLM Slop About "Linux"
this is the desirable thing
LibXML2 Will Carry on (Without or With the Name "LibXML2")
The proprietary software boosters are projecting
Gemini Links 02/01/2026: ThinkPad, SHARP Zaurus, Lagrange Handheld Support
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, January 01, 2026
IRC logs for Thursday, January 01, 2026
Links 01/01/2026: "Biophobia" and Renewed Effort to Locate MH370
Links for the day
Gemini Links 01/01/2026: Bot Accounts Online and Reading in 2025
Links for the day
IBM’s and Red Hat’s "Operation Evolution initiative" Just Long, Fancy Term for Bluewashing, Redundancies, Layoffs
Gerstner is still alive, but he's shorter and more arrogant
Designing a Better Mousetrap or Tools for the SSG
Static Site Generators (SSGs) - unlike all modern Content Management Systems (CMSs) - are so simple that extending them is easy
Links 01/01/2026: 1930 Works in the Public Domain, Electricity Pricing 'a Mystery'
Links for the day
Firefox is Toast Because It Got Toasted by Mozilla
Firefox cannot keep above 2% and hasn't been able to for quite some time
Ignore the LLM Slop and the Noise, Microsoft is in a Death Spiral
So what does Microsoft have left to sell?
Red Hat is Vanishing Before Our Eyes
With some Red Hat staff "transitioning" we wonder if it's an HR hack, wherein they "reset the clock" on employment duration so as to lessen severance obligations
In 2025 Microsoft Lost Palau
Palau now has GNU/Linux at steadily high levels
Microsoft Mocked UNIX/Linux for Not Handling Dates After 2038, Microsoft Breaks Down on 2026!
Only a truly moronic company would design it that way
Another New Year's Resolution: Public Domain Sources, Credits
In addition to our first one
Combatting Slop Images (and ClownFlare)
we won't use or reuse slop images
The End of Red Hat
expect many more layoffs soon
A New Year's Resolution: Maximal Transparency
We'll do our very best to be transparent about everything that's going on, even legal matters
Gemini Links 01/01/2026: 2025 Comes to a Close and Capsular Gemlog Manager
Links for the day
Free Software Foundation (FSF) Raised About 1.3 Million Dollars in the Past Couple of Months!
the FSF's Board now has 10 people in it
2026 IBM Phaseout of Red Hat
Red Hat won't fare any better than most IBM acquisitions
Microsoft Budget Issues, XBox Thrown Under the Bus
They're cutting budget. Soon they'll cut the staff.
Only Hours Into the New Year People Already Discuss the Next Round of Layoffs at Red Hat/IBM
2026 will be another tough year for Red Hat and IBM
EPO People Power - Part XXI - Europe's Second-Largest Institution Became a Corrupt For-Profit Company Run by Drug Addicts
it'll be the demise of the Rule of Law in Europe and maybe a death blow to the EU (eventually), not just the EPO
Another Very Productive Year Commences
"a total of over 17,000 pages in a year"
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, December 31, 2025
IRC logs for Wednesday, December 31, 2025
Fiji: GNU/Linux Has Risen From Almost Nothing to Almost 5% in Recent Years
It's not as small as people are led to believe
Gemini Links 31/12/2025: Blogosphere is Growing and New Year Begins
Links for the day
Recruiters Don't Use Microsoft LinkedIn, Spammers Use LinkedIn
One of my best friends, a university professor, lost all of his life's savings due to Microsoft LinkedIn
You've Only Wasted Your Life in Social Control Networks
In a sense, social control media is a giant delusion
2025 Was a Very Bad Year for Social Control Media
statCounter sees a gradual demise in Social Control Media access
Don't "Go Paperless", Go Paperful [sic] (for What Really Matters)
Why should we favour paper use sometimes? Well, many reasons.
Complexity Considered Harmful: We Used to Run an Operating System on 64KB of RAM, Not 64GB of RAM (a Million Times More)
"Initially confined to single-tasking on 8-bit processors and no more than 64 kilobytes of memory"
The Slop Industry is Failing So Badly (Mountains of Debt, Losses) That It's Merging With the SPAM Industry
we reckon that Google will eventually delist all slopfarms, recognising they're just a form of SPAM
Links 31/12/2025: Cheeto Pushing for More Wars, ‘Security is a Shared Responsibility’
Links for the day
Enshittification of Postal Services Isn't Technological Advancement
Societies that say the aim is to "go digital" and eliminate paper trail aren't advanced; they're moving backwards
IBM Starts 2026 a Much Smaller Company (Not Homage to Gerstner)
People who get bluewashed out of their job (or bluewashed into unemployment) are gagged by NDAs
XBox is Likely Dead Already, But the Threat It Posed to Us All for Two Decades Isn't Over
"the Xbox was never about gaming and merely served as a test bed for DRM in commodity systems."
Ahead of 2026 Mass Layoffs at Microsoft the Tree Gets Shaken to See Who 'Falls' (Resigns/Retires)
"We had a quiet meeting last week about budget realignment. No one said layoffs, but it’s clear where the focus is shifting."
Almost 6,5000 Pages in 2025, Aiming Higher in 2026
if we can keep focused, then quantity will increase
Microsoft XBox Having a "Dog Ate My Homework" Moment: No New Console Until 3 Years From Now... Because "RAM Prices"
Who will ever remember this in 2028? Nobody.
Gemini End of Year Capsules Tally (Based on Lupa) Shows About 10% Growth
What a difference a year makes
Gemini Links 31/12/2025: New Resolution, Reverse Hexdump, and Programming Languages
Links for the day
Dr. Andy Farnell Explains Why Chatbots Became Dishonesty on Top of Dishonesty (Hiding Usage of Dishonest Salads of Words)
new article from CyberShow
Links 31/12/2025: Nvidia Faces Bubble-Bursting Moment, Saudi Oil Money Pumped Into Chatbots to Keep the Energy Waste Going (Circular Financing Again)
Links for the day
Richard Stallman's First Talk in a U.S. College Since 2018
Greetings from Georgia Tech!
EPO People Power - Part XX - Why António Campinos Chose to Put His Cokehead Friend on 'Sick Leave'
EPO Cocainegate will be covered for months to come
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Tuesday, December 30, 2025
IRC logs for Tuesday, December 30, 2025