Bonum Certa Men Certa

[Meme] Google: Nothing is Beyond Our Reach




[Image: US Spy Agency Boasts 'Nothing Is Beyond Our Reach' With New Logo]

Summary: When it comes to Google (Gulag), E-mail messages are merely "content" (to be scanned and indexed)

Google Has Most of My Email Because It Has All of Yours



Republished by Slate. Translations available in French (Français), Spanish (Español), Chinese (中文)

For almost 15 years, I have run my own email server which I use for all of my non-work correspondence. I do so to keep autonomy, control, and privacy over my email and so that no big company has copies of all of my personal email.

A few years ago, I was surprised to find out that my friend Peter Eckersley — a very privacy conscious person who is Technology Projects Director at the EFF — used Gmail. I asked him why he would willingly give Google copies of all his email. Peter pointed out that if all of your friends use Gmail, Google has your email anyway. Any time I email somebody who uses Gmail — and anytime they email me — Google has that email.

Since our conversation, I have often wondered just how much of my email Google really has. This weekend, I wrote a small program to go through all the email I have kept in my personal inbox since April 2004 (when Gmail was started) to find out.

One challenge with answering the question is that many people, like Peter, use Gmail to read, compose, and send email but configure Gmail to send email from a non-gmail.com “From” address. To catch these, my program looks through the headers in each message that record which computers handled the message on its way to my server and picks out messages that have traveled through google.com, gmail.com, or googlemail.com. Although I usually filter them, my personal mailbox contains emails sent through a number of mailing lists. Since these mailing lists often “hide” the true provenance of a message, I exclude all messages that are marked as coming from lists using the (usually invisible) “Precedence” header.
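The header check described above can be sketched in Python roughly as follows. This is a minimal illustration and not the program used for the analysis: the function names, the regular expression, and the set of "Precedence" values treated as list mail are all assumptions made for the example.

```python
import email
import re

# Assumption: a message "traveled through" Google if any Received header
# names a host under one of these three domains.
GOOGLE_HOST = re.compile(
    r"(?<![\w-])(?:google|gmail|googlemail)\.com(?![\w-])", re.IGNORECASE
)

# Assumption: these Precedence values mark mailing-list traffic.
LIST_PRECEDENCE = {"list", "bulk"}


def is_list_mail(msg):
    """True if the Precedence header marks the message as list traffic."""
    return (msg.get("Precedence") or "").strip().lower() in LIST_PRECEDENCE


def passed_through_google(msg):
    """True if any Received header mentions a Google mail host."""
    received = msg.get_all("Received", [])
    return any(GOOGLE_HOST.search(str(header)) for header in received)


def classify(raw_message):
    """Return 'list', 'google', or 'other' for one raw message string."""
    msg = email.message_from_string(raw_message)
    if is_list_mail(msg):
        return "list"  # excluded from the counts
    return "google" if passed_through_google(msg) else "other"
```

Because the check reads the "Received" headers rather than the "From" address, a message sent through Gmail with a custom sender address is still counted as coming from Google.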

The following graph shows the numbers of emails in my personal inbox each week in red and the subset from Google in blue. Because the number of emails I receive week-to-week tends to vary quite a bit, I’ve included a LOESS “smoother” which shows a moving average over several weeks.

[Graph: Emails, total and from GMail, over time]

From eyeballing the graph, the answer seems to be that, although it varies, about a third of the email in my inbox comes from Google!

Keep in mind that this is all of my personal email and includes automatic and computer generated mail from banks and retailers, etc. Google doesn’t have these automated messages, which suggests that the proportion of my truly “personal” email that comes via Google is probably much higher.

I would also like to know how much of the email I send goes to Google. I can do this by looking at emails in my inbox that I have replied to. This works if I am willing to assume that if I reply to an email sent from Google, it ends up back at Google. In some ways, doing this addresses the problem with the emails from retailers and banks since I am very unlikely to reply to those emails. In this sense, it also reflects a measure of more truly personal email.
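The two proportions discussed here (all email versus email I replied to) could be computed along these lines. This is only a sketch under stated assumptions: the function name and the record layout are hypothetical, and it takes as given that each message has already been labeled with whether it came via Google and whether it was replied to. How the "replied" flag is obtained depends on the mail client and is not shown.

```python
from collections import defaultdict


def google_share_by_year(records):
    """Compute the share of messages that arrived via Google, per year.

    records: iterable of (year, via_google, replied) tuples, where
    via_google and replied are booleans (a hypothetical layout).
    Returns {year: {"all": share, "replied": share}}; a share is None
    when there were no messages in that group.
    """
    # counts[year][group] = [messages via Google, total messages]
    counts = defaultdict(lambda: {"all": [0, 0], "replied": [0, 0]})
    for year, via_google, replied in records:
        groups = ["all", "replied"] if replied else ["all"]
        for group in groups:
            counts[year][group][0] += int(via_google)
            counts[year][group][1] += 1

    shares = {}
    for year in sorted(counts):
        shares[year] = {
            group: (hits / total if total else None)
            for group, (hits, total) in counts[year].items()
        }
    return shares
```

Restricting the count to replied-to messages works as a rough filter for personal correspondence, since automated mail from banks and retailers rarely gets a reply.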

I’ve broken down the proportions of emails I received that come from Google in the graph below for all email (top) and for emails I have replied to (bottom). In the graphs, the size of the dots represents the total number of emails counted to make that proportion. Once again, I’ve included the LOESS moving average.

[Graph: Proportion of emails from GMail over time]

The answer is surprisingly large. Despite the fact that I spend hundreds of dollars a year and hours of work to host my own email server, Google has about half of my personal email! Last year, Google delivered 57% of the emails in my inbox that I replied to. They have delivered more than a third of all the email I’ve replied to every year since 2006 and more than half since 2010. On the upside, there is some indication that the proportion is going down. So far this year, only 51% of the emails I’ve replied to arrived from Google.

The numbers are higher than I imagined and reflect somewhat depressing news. They show how it’s complicated to think about privacy and autonomy for communication between parties. I’m not sure what to do except encourage others to consider, in the wake of the Snowden revelations and everything else, whether you really want Google to have all your email. And half of mine.

If you want to run the analysis on your own, you’re welcome to the Python and R code I used to produce the numbers and graphs.



Licence: Copyright Benjamin Mako Hill, distributed as free cultural work under the Creative Commons Attribution-ShareAlike license.
