Skip to content
ahead x
Data protection in AI: what you can put in the prompt

Law, society & ethics · Explained

Data protection in AI: what you can put in the prompt

GDPR + ChatGPT, Claude, Copilot, the most common misconceptions. What technical and contractual steps actually protect, and rules of thumb for daily use.

Lukas Wagner, Founder & Curator von ahead 7 min read

The most common question in every ahead-future-labs workshop: “Am I even allowed to do this? I dump customer data into ChatGPT, that can’t possibly be legal.”

The short answer: it depends. On which tier, which data, which legal basis. The detailed answer follows now, in a format that lets you decide after 10 minutes what works for you.

The GDPR basics (in 5 sentences)

  1. Personal data may only be processed if a legal basis exists (Art. 6 GDPR).
  2. For external processing, a Data Processing Agreement (DPA) is required.
  3. Without a DPA, transferring personal data to external providers is unlawful, including into “the cloud”, including to the USA.
  4. What counts as personal data is broader than most people think: name, email, IP address, device IDs, often also phone number plus industry.
  5. For sensitive data (health, religion, trade union membership, sexual orientation), Art. 9 applies, which is stricter.

Which tiers give you GDPR compliance?

ProviderTierGDPR-ready?DPA available?Data location
ChatGPT (OpenAI)FreeNoNoUSA
ChatGPT PlusPersonalNoNoUSA
ChatGPT TeamB2BYes*YesUSA + EU options
ChatGPT EnterpriseEnterpriseYesYesEU region selectable
OpenAI APIAPIYes*Yes (DPA)depending on region
Claude (Anthropic)Free / ProPersonal: noNoUSA
Claude for WorkB2BYesYesUSA + EU
Microsoft Copilot (Office 365)BusinessYesYes (Microsoft DPA)EU region selectable
Google Gemini WorkspaceWorkspaceYesYesEU region selectable
Mistral / Aleph AlphaAPIYesYesEU

“Yes” with an asterisk: additionally switch on privacy settings, disable “Train on my data”.

The simple decision tree

1. Is a DPA in place with the provider?
   ├── No → Don't paste in any personal data. No clauses from contracts, no customer emails.
   └── Yes → continue to 2

2. Is sensitive data involved (health, religion, etc.)?
   ├── Yes → Additional review of the legal basis (Art. 9 GDPR). When in doubt: don't.
   └── No → continue to 3

3. Is the processing compatible with the original purpose of collection?
   ├── No → Obtain the consent of the data subjects or review another legal basis.
   └── Yes → OK, into the prompt with it. But: mind logging and retention periods.

What actually happens in daily practice

In most SMEs, daily practice looks like this:

  • Employees use private ChatGPT accounts (“it’s just faster”)
  • Customer data ends up in there
  • No one has a DPA
  • GDPR violation: yes
  • Probability that anyone notices: low
  • Probability that there are problems if someone does notice: high

The solution is not “ban it and threaten penalties”, that drives shadow IT. The solution is: procure an approved tier, train people, document it.

Three rules of thumb for daily use

1. Anonymize before pasting. Instead of “Maria Huber, mariahuber@firma.at, complained about…” → “A customer complained about…”. Works in 80% of cases.

2. For sensitive material: local model or Enterprise. Personnel files, health data, contracts with clauses → either local (Llama, Mistral self-hosted) or Enterprise with an EU region.

3. Check the output. If the model answers you with “Anna K. lives at Hauptstr. 12”, even when you didn’t enter it, AI can also hallucinate personal data. Never pass it on one-to-one.

Contractual steps for your company

StepWhat happens?
Conclude a DPAWith every AI provider you use
Record of Processing Activities (RoPA)Mandatory document; enter AI tools there
TIA / Data Protection Impact AssessmentFor high-risk use cases (HR, applicants)
Document trainingWho was trained on which tools and when
Retention periodsWhich prompts/responses are stored and for how long

Common misconceptions

“We use Microsoft 365 Copilot, so everything is GDPR-compliant.” Wrong. Microsoft has the DPA, but you have to have concluded it. Also: set the region in the tenant config to EU.

“The AI doesn’t learn from my inputs.” It depends. With most Enterprise tiers: correct. With Free / Plus / Pro: often it does after all, until you manually disable it.

“Pseudonymizing is enough.” Often it isn’t. Pseudonymized data remains personal data if it can be re-attributed. Strict anonymization is the only safe option.

“We host Llama ourselves, so data protection isn’t an issue.” It is. GDPR applies regardless of hosting. What changes: no DPA with an external party is needed. But: RoPA, legal basis, impact assessment, all remain.


Disclaimer: This article is editorial and does not replace legal advice. For concrete GDPR compliance in your own company: involve a specialized lawyer or data protection officer.

Frequent questions

Your turn

What question is still on your mind?

Ask us. Selected questions turn into new explainers, glossary entries or topics for our events.

See all questions →

characters left

Thank you — your question has arrived. We will check whether we can answer it in the magazine, the glossary or at an ahead x event.

See all questions →

Read next

Explained

EU AI Act: What businesses actually need to know

The EU AI Act is in full effect from 2026. What it means for your business, risk classes, obligations, deadlines, penalties. Plain English, no legalese.

More on this topic

New articles in your inbox. At most once a month as a roundup of new articles, analysis and explainers. No spam, no sales pitches, just content. Newsletter →
← All Explained Back to Knowledge