Your Data in AI Systems
When you use ChatGPT, Claude, Gemini, or any AI service, you send information to that provider's servers. Your question, your context, and any data you share become records on the provider's systems. This chapter is about understanding what happens to your data and what rights you have.
The Data Flow
Step 1: You input data. You ask a question, paste text, upload documents, or provide images to an AI system. That data leaves your device and travels to the provider's servers.
Step 2: The provider stores your data. The data arrives on the provider's computers. It is stored in a database, potentially with metadata about you (your account, what device you used, when you accessed it, your location).
Step 3: The AI processes your data. The model runs and generates an output. This processing involves analyzing your input and producing a response.
Step 4: Data is retained. Even after you see the response, the provider retains your data for some period (determined by their policy). This retention serves purposes like abuse detection, system improvement, and record-keeping.
Step 5: Data might be used for training. Some providers use conversation data (with sensitive information removed or anonymized) to further train their models. Others do not. This depends on their privacy policy and your settings.
Your data is not deleted the moment you close the chat. It sits on the provider's servers for some period, and during that time it could be accessed by employees, exposed in a breach, or used in ways you did not anticipate if you never read the full privacy policy.
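To make step 1 concrete, here is a minimal sketch of what actually leaves your device, using an OpenAI-style chat endpoint as an illustration. The URL, payload shape, and model name vary by provider; the point is that everything in the request body, plus connection metadata, is what arrives on the provider's servers.

```python
import requests

# A minimal sketch of step 1: your prompt leaves your device as an
# HTTPS request. The endpoint and payload follow OpenAI's public chat
# API as an illustration; other providers differ in the details.
API_KEY = "sk-..."  # your API key identifies your account

response = requests.post(
    "https://api.openai.com/v1/chat/completions",
    headers={
        "Authorization": f"Bearer {API_KEY}",  # ties the request to you
        "Content-Type": "application/json",
    },
    json={
        "model": "gpt-4o",
        # Everything in `messages` arrives verbatim on the provider's
        # servers, alongside metadata such as your IP address and a timestamp.
        "messages": [{"role": "user", "content": "Summarize this contract: ..."}],
    },
    timeout=30,
)
print(response.json()["choices"][0]["message"]["content"])
```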
Reading Privacy Policies
Most AI providers have privacy policies that explain what they do with your data. These policies are often dense and legalistic, but they contain crucial information. Here is what to look for:
Data Collection
What data is collected? Just your prompt/query? Or also metadata like your IP address, device information, location? Most services collect some metadata. Understanding what is collected helps you decide what to share.
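As an illustration of what "metadata" means here, below is the kind of record a provider might store alongside your prompt. Every field name in this sketch is invented; real schemas vary and are described, at varying levels of detail, in each provider's privacy policy.

```python
# Hypothetical illustration of prompt-plus-metadata storage. All
# field names are invented; consult your provider's policy for what
# is actually collected.
request_record = {
    "account_id": "user_19fa...",          # who you are
    "ip_address": "203.0.113.42",          # roughly where you are
    "user_agent": "Mozilla/5.0 ...",       # what device and app you used
    "timestamp": "2024-06-01T14:03:22Z",   # when you asked
    "prompt": "Summarize this contract: ...",  # what you asked
}
```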
Data Use for Training
This is the crucial question: does the provider use your conversations to train or improve its models? Policies phrase this in different ways. Look for language like "We may use conversations to improve our models" or "Conversations are used for training and development." If you see this language and it concerns you, look for settings to opt out.
Data Retention
How long is your data kept? Typical retention periods: 30 days, 90 days, 1 year, or indefinitely. Shorter retention periods are better for privacy. If the policy does not specify a period, the data might be kept indefinitely.
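A quick worked example of what those periods mean in practice, assuming data submitted on a given date and the retention windows named above:

```python
from datetime import date, timedelta

# Worked example: the earliest date data submitted on June 1, 2024
# becomes deletable under each common retention window. A policy with
# no stated period implies no such date at all.
submitted = date(2024, 6, 1)
for label, days in [("30 days", 30), ("90 days", 90), ("1 year", 365)]:
    print(f"{label}: retained until {submitted + timedelta(days=days)}")
# 30 days: retained until 2024-07-01
# 90 days: retained until 2024-08-30
# 1 year: retained until 2025-06-01
```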
Data Deletion
Can you request deletion of your data? Most privacy-conscious providers allow you to request deletion. Some allow you to delete conversations directly in the app. Check if your provider allows deletions and how to request them.
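Deletion mechanisms range from a button in the app to a privacy portal to an API call. The sketch below shows the general shape of a programmatic deletion request; the host, path, and identifier are hypothetical placeholders, so check your provider's documentation for the real mechanism.

```python
import requests

# Sketch of a programmatic deletion request. The endpoint and
# conversation ID below are hypothetical -- real providers document
# their own deletion endpoints, or offer deletion only through the
# app or a privacy portal.
API_KEY = "sk-..."
conversation_id = "conv_abc123"  # hypothetical identifier

response = requests.delete(
    f"https://api.example.com/v1/conversations/{conversation_id}",
    headers={"Authorization": f"Bearer {API_KEY}"},
    timeout=30,
)
response.raise_for_status()  # a 2xx status means the provider accepted the request
```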
Third-Party Sharing
Does the provider share your data with third parties? Broad or vague sharing language is a red flag. Ideally, your data stays with the provider and is neither sold nor shared. Some policies allow sharing with "service providers" (contractors who help operate the service), which is common, but data should never be sold or shared for marketing.
Enterprise Agreements
Most privacy-conscious organizations get enterprise agreements with AI providers. These agreements typically include: your data will not be used for training, data retention periods are specified, and your organization controls data deletion. If your organization uses AI, ask if you have an enterprise agreement with privacy guarantees.
How Different Services Handle Your Data
Free Services
Free AI services often use conversations to improve models unless you opt out, so your privacy protection is weaker. The trade-off: free access in exchange for contributing your data to model improvement. If you use free services, assume your data might be used for training unless the policy explicitly says otherwise.
Paid Individual Services
Paid individual subscriptions (ChatGPT Plus, Claude Pro, Gemini Advanced) often come with stronger privacy terms or opt-out settings, but paying alone does not guarantee your data is excluded from training; verify the policy and settings for your specific plan. Configured carefully, a paid plan is often the best option for individuals with sensitive data.
Enterprise Services
Enterprise agreements usually have the strongest privacy protections: data is not used for training, retention periods are specified, and deletion is guaranteed. Large organizations can negotiate additional protections. If your organization is sensitive about data, ensure you have an enterprise agreement.
Practical Steps to Protect Your Data
1. Know the Service's Privacy Policy
Spend 15 minutes reading the key sections. Focus on: data collection, training use, retention, and deletion rights. If the policy is unclear, contact the provider for clarification.
2. Adjust Privacy Settings
Most AI services have privacy settings. Enable any option that opts you out of training data use, and delete conversations you no longer need. These are simple steps that increase your privacy.
3. Minimize Sensitive Data Sharing
If you have sensitive data, do not paste it into public AI services. Use enterprise versions, paid versions, or do not use AI for sensitive information. The safest approach: if the data is sensitive, find an alternative solution that does not involve third-party AI.
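If you must send text that may contain identifiers, scrub it before it leaves your device. Below is a minimal pattern-based redaction sketch. Regexes catch obvious formats (emails, US-style SSNs and phone numbers) but miss names, addresses, and indirect identifiers, so treat this as a baseline, not a guarantee.

```python
import re

# Minimal client-side redaction before a prompt is sent to any AI
# service. Pattern-based scrubbing is a baseline only: it misses
# names, addresses, and context that identifies someone indirectly.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}

def redact(text: str) -> str:
    """Replace each matched pattern with a [LABEL] placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

prompt = "Email jane.doe@example.com or call 555-867-5309 about SSN 123-45-6789."
print(redact(prompt))
# Email [EMAIL] or call [PHONE] about SSN [SSN].
```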
4. Understand What You Are Trading
Free AI services trade data for access. That might be acceptable to you, but understand the trade: you are contributing to model improvement, and your data may be used for training. If that concerns you, use paid services instead.
5. Request Your Data / Exercise Your Rights
Major privacy laws such as the GDPR and CCPA give you rights to access your data, delete your data, and understand how it is used. You can exercise these rights with AI providers. The process varies, but usually involves filing a formal request through their privacy portal or contacting their privacy team.
Your Organization's Relationship With AI Data
If you are using AI tools through your organization, your organization might have agreements that control data. Ask your IT or compliance team:
- Does our organization have an enterprise agreement with these AI providers?
- What data can and cannot be shared with external AI services?
- Are there specific AI tools we should use vs. avoid?
- Do we have data classification guidelines that apply to AI use?
Organizations often have stricter data policies than individuals. Respect your organization's guidelines. If guidelines are unclear, ask. If you are unsure whether something is allowed, err on the side of caution.
Key Takeaway
Understanding Your Data Rights
Your data shared with AI providers is stored, potentially used for training, and retained for varying periods depending on the provider's policy. You have rights: to know what is collected, to understand how it is used, to request deletion, and in many jurisdictions, to access a copy of your data.
The key is being informed. Read privacy policies. Understand that free services often trade privacy for access. Use paid or enterprise services if you have sensitive data. Adjust your privacy settings. And when in doubt, do not share sensitive information with third-party AI services. This approach protects your data while still letting you benefit from AI.