alexi.sh
All articlesBrowser securityNetwork privacyPrivacy toolingThreat modelingAI codingDev tooling

alexi.shResearch

privacy-tooling

AI and Data Privacy: What Happens to What You Type (2026)

PrivSec Lab3 min read
A digital illustration of a brain made of circuit-board lines, representing artificial intelligence

AI tools are useful because they ingest your data β€” which is exactly the privacy problem. What AI services do with your inputs, how training and retention work, the real risks, and the practical ways to keep your data private while still using AI.

A digital illustration of a brain made of circuit-board lines, representing artificial intelligence

AI tools are useful for one reason: they ingest your data and act on it. That is also the entire privacy problem. Every prompt, document and screenshot you feed an AI is data leaving your device for someone else's servers β€” so "AI data privacy" is really the question of what those servers do with it. Here's the honest picture and what you can actually control.

What AI services do with your data

When you use a cloud AI tool, a few things happen to what you type. It's processed on the provider's servers (not your device). On consumer plans, it may be used to help improve the models unless you opt out. And account and usage data is stored like any online service. The exact behaviour varies by provider and plan, but the safe default assumption is simple: anything you put into a cloud AI tool has left your control.

The part people underestimate is how much they put in. A chatbot feels like a private scratchpad, so people paste contracts, code, medical questions and personal details β€” all of which become text on a company's servers rather than a local note.

Training and retention β€” the two levers

Two things determine your exposure:

  • Training β€” whether your inputs are used to improve the model. Consumer tiers often default to yes (with an opt-out); business/enterprise plans and APIs are usually excluded from training by default. Opting out is forward-looking: it stops future use, it doesn't pull back data already used.
  • Retention β€” how long your conversations are stored. Most providers keep history unless you use a temporary/incognito mode or delete it, and "delete" on a cloud service typically means removed from your view, then purged over a retention window. Legal obligations can also require providers to preserve certain logs.

A person's hands using a smartphone, tapping the screen

The real risks

The risks aren't science-fiction; they're mundane and concrete:

  • Sensitive inputs stored on third-party servers β€” the most common exposure is simply what you paste.
  • Breaches β€” aggregated AI data is a valuable target, so a provider breach can expose inputs you never shared elsewhere.
  • Profiling and reuse β€” data tied to your account can build a profile, or be used for features and analytics you didn't expect.
  • Compelled disclosure β€” providers can be legally required to produce stored data.

How to use AI without giving away your data

You don't have to stop using AI β€” you have to use it deliberately:

  • Don't paste secrets. Passwords, API keys, ID numbers, health data, confidential client or company material β€” keep them out of consumer AI tools.
  • Turn off training in the tool's data settings if you're on a consumer plan, and use temporary/incognito modes for one-off sensitive questions.
  • Pick the right tier. For sensitive work, a business/enterprise plan or an API with a no-training, reduced-retention policy is far safer than a free consumer account.
  • Use a local model for the sensitive stuff. A model running on your own hardware never sends your prompts anywhere β€” the strongest privacy option by far.
  • Cover the network layer with a VPN. It hides your IP and connection from your ISP and the network (useful on public Wi-Fi), though it doesn't change what the provider does with your text.

The honest takeaway

AI data privacy comes down to a single habit: assume everything you type into a cloud AI tool is stored and possibly used, and decide accordingly. Share less of what matters, turn off training, choose trustworthy providers or local models for sensitive work, and keep the rest of your setup private around it. Used that way, you keep most of AI's usefulness while giving away far less.

Image: Pixabay (source)

Also available in

FAQ

Do AI tools use my data?
Generally yes. Cloud AI tools process whatever you type or upload on their servers, and on consumer plans many may use your conversations to help improve their models unless you opt out. They also store account and usage data like any online service. The exact behaviour varies by provider and plan: consumer tiers tend to be the most data-hungry, while business/enterprise plans and APIs are usually contractually excluded from training by default. The safe assumption is that anything you put into a cloud AI tool leaves your device.
Is it safe to put personal or confidential data into AI?
Treat it like posting to a third-party server, because that's what it is. Avoid pasting secrets β€” passwords, API keys, ID numbers, health details, or confidential client/company data β€” into a consumer AI tool. If you must use AI on sensitive material, use a business/enterprise plan with stronger data terms, a provider that contractually excludes training, or a local model that runs entirely on your own machine so nothing leaves it.
How do I stop AI from training on my data?
On most consumer chatbots there's a setting (often under Data Controls or Privacy) to turn off using your content to improve the model β€” turning it off is forward-looking, it doesn't remove past data already used. Many tools also offer a temporary or incognito mode that isn't saved to history. Business and API tiers are typically no-train by default. And you can delete conversations, though 'delete' on a cloud service usually means removed from view then purged over a retention window.
What's the most private way to use AI?
A local model running on your own hardware is the most private option β€” your prompts never leave the device, so there's nothing to log, train on, or leak. It's less convenient and needs a capable machine, but for sensitive work it's the gold standard. Short of that, choose providers that are transparent about data use, turn off training, avoid pasting secrets, and treat AI like any other cloud service you don't fully control.
Does a VPN make AI use more private?
A VPN hides the network layer β€” your IP and the fact that you're connecting, from your ISP and the network β€” but it does not change what the AI provider does with the text you send them. So a VPN is a useful base layer (especially on public Wi-Fi) but it is not a substitute for the in-tool privacy steps: not pasting secrets, opting out of training, and choosing trustworthy providers or local models.