Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Gmail can read your emails and attachments to train its AI, unless you opt out (malwarebytes.com)
113 points by taubek 29 days ago | hide | past | favorite | 58 comments


From the dialogues in the pictures it doesn’t sound like they are using anyones emails for training. The messaging indicates it’s more like using as context and/or generating embeddings for RAG. Unless there’s something else I’m not aware of.

I know that Google does a lot of bad stuff but we don’t need to make up stuff they just aren’t doing

This doomsday messaging an alarmism is only serves to degrade the whole cause

edit: and before someone say that they also don’t want that then let’s criticize it for what it is (opting users to feature without consent). We don’t need to make stuff up, it really doesn’t help.


>When smart features are on, your data may be used to improve these features. Across Google and Workspace, we’ve long shared robust privacy commitments that outline how we protect user data and prioritize privacy. Generative AI doesn’t change these commitments — it actually reaffirms their importance. Learn how Gemini in Gmail, Chat, Docs, Drive, Sheets, Slides, Meet & Vids protects your data.

When I click "Learn more" in toggling the smart features on/off

It may not do it now, but I really don't like the implications. Especially a tone of "it's not actually bad, it's good!"


> It may not do it now

I don't believe "may" is being used to indicate possibility, but rather permission.

That is to say, there's no reason to think it's not being used, given that wording.


I agree. What I really don’t like is that we have to choose between having smart search and giving up our data.

Is it too much to ask to be able to not give up data for “improvements” but keep the functionality?


This seems to indeed be confirmed over at https://support.google.com/mail/answer/14615114

"Your data stays in Workspace. We do not use your Workspace data to train or improve the underlying generative AI and large language models that power Gemini, Search, and other systems outside of Workspace without permission."


But then if the terms include a vague permission and/or license to use the data for improving the results, the text is factually correct while obscuring the fact that they do in fact solicit your permission and thus use the data, with your permission.


Discovering new settings that I was opted in to without being asked does not scream good faith.

Separately, their help docs are gibberish. They must use this phrase 20 times: "content is not used for training generative AI models outside of your domain without your permission." Without telling you if that checkbox is that permission; where that permission is set; or indeed, even if that permission is set. From reading their documentation, I cannot tell if that checkbox in gmail allows using my data outside my organization or not.


i don't understand one thing - isn't the Venn Diagram of people who use Gmail in 2025 and who are on HN just 2 circles that don't touch?


Sorry, but that "doomsday" "alarmism" is exactly what is needed and warranted. But this practice of sneakily opting users in into things they don't want instead of a very clear full on pop up saying "We now use your data and private emails for AI training" is exactly the problem.

> I know that Google does a lot of bad stuff but we don’t need to make up stuff they just aren’t doing

No no. a) they ARE doing a lot of bad stuff and b) that shit ain't made up and they ARE exactly doing that. Or do you also think that Github is NOT using priI know that Google does a lot of bad stuff but we don’t need to make up stuff they just aren’t doingvate repos to train Copilot? Do you honestly and truly believe that?

If you do truly believe that I got a bunch of bridges to sell to you.


Link for the lazy: https://mail.google.com/mail/u/0/#settings/general

Find "Smart Features", uncheck, and press save, which will reload Gmail.

Then find "Google Workspace smart features" and click the "Manage Workspace smart features settings" button and unselect everything.


They disable so many features when you remove "Smart features" i.e. Grammar, Spelling, Autocorrect, Smart Compose/Reply (those templated suggested replies), Nudges, Package tracking, Desktop notifications. Google really wants to punish you for doing that.


Turning off the inbox categories feature was particularly annoying. A feature they had for a good decade before deciding they weren't jsut happy with collecting my data.


It is unbelievably manipulative that they tied this organizational feature with this new LLM-training scam they have running now.

Learning to not rely on inbox categories does make it easier for me to finally leave Gmail for a real email provider though, so maybe this will all work out in the end.


Yep I went through this sad journey with my gmail this week. Got tired of seeing "coming soon" packages cluttering up my inbox, so I looked into how to turn them off. It turned off the categories. Reminds me of the dark pattern used by many apps, where if you turn off notifications to avoid ads/spam, you also lose useful notifications.


There's an entire setting for "Turn on package tracking" alone. You do not need to disable Smart Features altogether.


I never enabled that and I verified it's not enabled. However I still have the "Happening Soon" section in my inbox that has status of some packages.

My guess is that Smart Features will (along with everything else it does) scan your emails to populate "Happening Soon" with package status, and if you then enable "Turn on package tracking", it will also periodically poll the shippers for those packages to keep the status up to date.

My complaint still stands. I want to entirely remove "Happening Soon" without disabling categories. It's not even the "Google reading your emails" creepiness. I just don't want my UI to be cluttered.


I see. The "Happening Soon" section of the inbox is populated with stuff other than order tracking, such as your airline tickets. So I can see why you'd have to disable the whole shooting match to get rid of it. But I agree that it would be nice for some people if there was an inbox layout setting to just get rid of it altogether.


ugh, yeah, I was going to turn this off but not having inbox categories is a deal breaker


What do you believe you have gained by throwing those switches?


I think I’m improving my security posture. For example, I’d hope that prompt injection attacks against Gemini AI will be less likely to scoop my data.


These switches don't control whether your emails are used to train models. They control whether you get to use machine inference features on your own emails.


It makes Gmail a normal email client again?


Reclaiming my English.

I'm Australian, I use Australian idioms, spelling, contractions, et al in emails to regular contacts for 20+ years via gmail (Yet Another Early Gmail Invite User).

Despite having selected UK English (there's no 'Strayla option) gmail via the web still insists on suggesting I morph into a cookie cutter middle north American Engrish typer.

Yes, Engrish .. AmerEngrish is an abomination.

Feck that shite. Hard.


For me this feature has been disabled for as long as I remember it being there. I think it predates the AI craze by quite some time.


Google will use your family's pictures and memories "to improve Google’s AI assistants, like Smart Compose or AI-generated replies."

Someone will ask a Google AI service to generate an image some day and your daughter will be used. And that's one of the least worrying outcomes.


That's what training means, yeah, but it isn't clear Google is actually doing what the article claims it is.


They're claiming that these options allow Google to use your data to train its AI, but that's not what it says at all. Where are they getting that idea from?


https://support.google.com/mail/answer/15604322?sjid=1029580...

> When smart features are on, your data may be used to improve these features.


"training on our data" has turned into a catchphrase like "taking our guns" or "banning our books" - dumb propaganda for anti-AI crowd to enrage people. Whether personalized AI-based experience is useful can be debated but everything has to be twisted into culture wars, thats just how media is nowadays


Privacy is not a culture war issue. Not wanting massive amounts of personal data hoovered up to train an AI for Google is reasonable. Arguing against this kind of invasion of privacy is not "dumb propaganda".


Thank god they also got consent from the non google people who sent the email.


It's like with Gemini and Google smart devices. You need to opt-in for data training to use Gemini apps. This means you won't be able to access basic features like asking Gemini to turn off your smart light bulbs. Essentially, Google is preventing you from using any smart features unless you allow data training on your own. Even to access basic features like chat history, you need to enable Gemini activity. This essentially allows Google to train on all of your conversations. This is even for paid tiers


Ha ! I disabled the smart features in gmail and then used gemini to ask something unrelated. The massage ended with:

"By the way, to unlock the full functionality of all Apps, enable Gemini Apps Activity." with a link to myactivitydotgoogledotcom


Google can read the emails of everyone who corresponds with a Gmail user - which is almost everyone - and they can't opt out.


One could use encryption but most receiving users have not the faintest clue on how to use that. If most people would care, then I wouldn't give a flying fuck about what Google does because they can't read my mail even if they wanted to.


You can't really use encryption for a couple of reasons:

1. Encrypted email only protects the email body. All the metadata - who is emailing who, when, from what servers - as well as the subject line, are in the clear regardless. The metadata is more valuable than the data in many cases.

2. Unless users download all their Gmail messages, leaving none on the server (essentially, like an old POP account), then the decrypted message bodies will be in their Gmail account for Google to read.


Hey, if anyone wants to use AI to draft replies but wants to make sure their data isn't trained by it, I've built a Chrome extension that does exactly that! You can plug in your API key, and it supports all SOTA models.

https://jetwriter.ai


The article is useful overall, but the following line is such bad journalism that I want to call it out, since we can't afford to have people dumbed-down.

> The reason behind this is Google’s push to power new Gmail features with its Gemini AI, helping you write emails faster and manage your inbox more efficiently.


Alright, I see that line, but in what sense is that bad journalism? I could in the same vein call this out as a bad comment.


A journalist is stating a corporate PR explanation for a sketchy action as fact. And it's not even a very believable reason, at least not completely, and the journalist certainly isn't verifying it. And it's important.


But journalists have to include the statements from both sides. And then it's up to the audience to make an opinion of it.


No they don't. And that's not what the article did here.



They're already reading your email to train their ads, so a small step to also train the AI right?


Ok correction: they don't scan your mail for ads since 2017 since the EU GPDR regulation


Oh Well, I left gmail years ago, now it is used for junk mail, good luck loading ads going to gmail into your AI :)


What did you move too? I was thinking of moving my personal email off of Gmail for a while. It's just 15+ years of accounts to untangle.

I long made a "spam/unimportant site verification " account, so that's set.


Honestly, get a small VM at openbsd.amsterdam or another good hoster and just do it yourself. OpenBSD has outstanding documentation, standing up SMTPD and Dovecot there is super simple, especially for a single user and not a whole group or family (where LDAP or Kerberos would be warranted). Just take initiative and own your data. Does it cost a little bit upfront time? Sure, but maybe just weekend. Does it require a little maintenance here and there? Sure. But still cheaper than paying for fastmail or waiting for fastmail to also exploit your data because the shareholders said so....just own your data.


I like Fastmail with my own domain for personal email, but the reality is nothing is a complete replacement for a Google account, given how tied in it is with auth and the whole Google ecosystem. I still have to use Google for work.

Proton is another one people often suggest. Hey.com sometimes too. No experience with those myself.

There are other options (such as the big guys, iCloud mail or Outlook.com), but aside from self-hosting (which I don't want to spend time maintaining just for my personal mail), I personally haven't seen much outside of those ones that are recommended often.


My own domain.


Assumedly you need an email hosting service to place behind your domain, correct? Or do you self host an email server? If you are completely self-hosting, how do you deal with being marked as spam by large spam filter organizations for being a low-trust sender?


In daily practice you don't. Unless you host in a DC with lots of spam. Use small but trusted hosters.

But as long as you get SSL/DKIM/SPF and the other stuff right, and it's not THAT difficult, then most hosters will let you through. Unless it's German Telecom, because for some reason t-online.de decided to only allow emails from hosters they whitelisted and there's a whole approval process which even requires registering with them with an email that is NOT from your domain and even a fax, but honestly fuck anyone using that domain.



Funny how the law and the corporations see the meaning of "optional" differently.


Reading comprehension and media literacy at all-time lows, below what cognitive scientists formerly believed was a hard floor of "absolute zero comprehension".


how do we turn back?


It's just for "improving", not training! Why would anyone assume the worst? They're the good guys.. more like a cuddly teddy bear than a monopolistic megacorp! Surely they'd explicitly tell us exactly what they were doing before they do it. Stop fear-mongering, people! anxiously checks the price of my GOOG stock


actually, I do want them to read my auto subscribed newsletter slop for random services that I register to. I do want them to use that messy, unorganized, faulty or maybe even useless data to train their AI models.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: