Anthropic requires 30 day data retention for Fable and Mythos

Posted by lebovic 5 days ago

https://www.theverge.com/report/947575/microsoft-claude-fabl...

Comments

Comment by pseudosavant 4 days ago

It is actually worse than that. It is at least 30 days. There is an "almost" that is doing a ton of heavy lifting here "deletion after 30 days in almost all cases". My read of that is they can hang onto data for as long as they want, even if they usually won't. And "all traffic" with an agentic harness is basically your entire codebase you work on.

> We will require 30-day retention for all traffic on Mythos-class models, on both first- and third-party surfaces. We won’t use this data to train new Claude models, or for any non-safety-related purpose, and we’ve instituted new privacy protections including logging all human access to the data and ensuring its deletion after 30 days in almost all cases (see this post for further details). The data will help us defend against complex and novel attacks (including new jailbreaks and attacks that operate across many requests) as well as help us identify and reduce false positives.

Comment by kitchi 4 days ago

They seemed to have changed the wording since you posted the comment, now specifying exactly 30 days with seemingly no exceptions.

These terms seem to be updated at-will, so I'll take that with a grain of salt however.

Comment by cornholio 4 days ago

I'm not sure they can actually respect that 30 days absolute commitment. Let's say some internal tool flags a suspect conversation, it bubbles up and a human operator reads it and it looks like evidence of a crime. Then, that employee is legally bound in many jurisdictions to prevent the destruction of that piece of evidence.

It's one thing to commit to a "everything is deleted when you press delete" automatic policy. It's quite another to say "we'll keep some stuff for up to 30 days, look inside it for any malfeasance, then pinky promise we'll delete it".

Comment by brookst 3 days ago

It generally goes without saying that legal obligations must be met. Before this 30 day policy they already had to comply with subpoenas and government retention requests.

Same with CSAM policies for any cloud provider. Doesn’t matter what the retention policy says, if the law says otherwise, the law wins. And there is no obligation to spell out every law in every country that might change how data is handled.

Comment by yencabulator 2 days ago

It's probably been updated several times (why does it even matter what it says now if they can update the terms at will), but now it says:

> After 30 days, the data is deleted automatically, except in the rare cases where it's part of a safety investigation or we're legally required to keep it.

Comment by agroot12 3 days ago

They write "We will require 30-day retention for all traffic on Mythos-class model". For potentially criminal content, maybe it's not "we", but "the authorities" that require the retention?

... and now I wonder if "we require retention" leaves the door open to retention that is not required, but let's say convenient.

Comment by mkl 3 days ago

From https://support.claude.com/en/articles/15425695-covered-mode..., emphasis mine:

> Prompts and model completions are retained for at least 30 days and then automatically deleted, unless they are subject to a safety investigation or we are legally required to maintain them.

They keep it as long as they want.

Comment by ryanisnan 4 days ago

That's strange. Even in my hobby-toy app, I have a TOS that I bump whenever the terms meaningfully change, and in my app, it forces a re-acceptance of the new terms before using the app again.

Comment by abustamam 4 days ago

You mean your terms don't just say "these terms may change at any time and your continued use of this site implies acceptance??"

Comment by isodev 4 days ago

> continued use of this … implies acceptance

One of the biggest crimes in tech world

Comment by JaszHere 3 days ago

That's only in the summary, farther down it says

> After 30 days, the data is deleted automatically, except in the rare cases where it's part of a safety investigation or we're legally required to keep it.

Comment by mikestorrent 3 days ago

Yep. They changed the terms, which needs legal review in my org, but the Fable model was available immediately, so of COURSE people have to go and flock to it to see how much better it is. Amazing how easy it is to spend five figures on demand and have very little to show for it; meanwhile when I want to buy a piece of enterprise software for 40-50k/year I have to spend weeks or months building the case, providing justification for ROI etc.

Comment by SilverElfin 3 days ago

Do you know where I can find it before and after of the terms? To me it looks like the same as it was.

Comment by SilverElfin 4 days ago

Where are you seeing that updated version?

Comment by Hamuko 4 days ago

[dead]

Comment by eth0up 4 days ago

I cannot help wondering if the 'we won't train on your data' applies across the fence over there in pentagon land, where the classified contracts be. Yeah, of course they are not connected. Or..

Present user-llm activity is a goldmine of intel the agencies literally spent lives and billions on getting hardly close to, yet they elect to just let this one slip by..

Maybe. Really, I don't dispute it.

But why? It's what, or precisely what, they always dreamed of.

Comment by daveshistory 4 days ago

I don't know why you'd read literally the last 25 years of leaks from mass surveillance programs and think for one moment that they've just, gosh, overlooked the opportunities.

Comment by arcanemachiner 4 days ago

We've already gone through ECHELON, USAPATRIOT, TIA, PRISM, etc.. Either learn from the pattern and and plan accordingly, or be one of the credulous rubes caught off guard in the next wave of leaks.

Comment by rapnie 3 days ago

> We won’t use this data to train new Claude models, or for any non-safety-related purpose, and we’ve instituted new privacy protections including logging all human access to the data and ensuring its deletion after 30 days in almost all cases

This reads to me as they can use any model that is not a "Claude model", and as for human access to that other model there can be different less restrictive privacy protections. In other words, that anything goes.

Comment by eth0up 3 days ago

Yes. Words don't mean much these days. Taking corporate doublespeak at face value seems very couragious to me.

Comment by tcp_handshaker 4 days ago

Half of my customers will drop them right away, and the other half, after I explain to them what this means.

Comment by usef- 4 days ago

It's only for this model, not the one you're already using. And they're not training on the data. It's supposedly to detect abuse etc (such as someone retrying repeatedly with different variations to get around their protections)

Comment by HWR_14 4 days ago

> they're not training on the data

How would you know that? You can only know what they say they will do with the data.

Comment by usef- 4 days ago

Sure, some trust is required that they aren't breaking their own terms of service (which legally enforces that they won't train on your data), but the same is true of every company/service you deal with (AWS, Google, your CRM etc). Their entire business model depends on enterprises trusting them.

Comment by coldtea 4 days ago

>some trust is required that they aren't breaking their own terms of service

Which companies do all the time...

Comment by jefftk 3 days ago

But if you're going to take your distrust that far then the issue is that they have your data at all, not that they are telling you that they will retain it for 30 days.

Comment by baq 4 days ago

Civilization is built on trust, otherwise you’ll need to rebuild all of it yourself. This isn’t very different.

Comment by coldtea 4 days ago

Civilization is also built on cheating and taking advantage of naive trust. This isn’t very different.

Comment by usef- 3 days ago

If that were dominantly true nothing would function at all. You trust and rely on thousands of people and services every day.

As others have said, if you're this skeptical I don't see why you would have been using them before this retention increase.

Comment by coldtea 1 day ago

>If that were dominantly true nothing would function at all

And yet it is, and most things still function. Now what?

>You trust and rely on thousands of people and services every day

And I, and everybody else, distrusts and tries not to rely on thousands of people and services every day too.

Do you lock your car? Or your door? Do you use your username as password trusting nobody would stoop so low as to break it? Do you trust the goverment to put your tax money to good use? Do you trust emails with great offers from websites you didn't subscribe to? Do you trust companies not to sell your personal data?

>As others have said, if you're this skeptical I don't see why you would have been using them before this retention increase.

Because they have a technically more capable offering. For absolutely no other reason.

Comment by nativeit 3 days ago

We’ve repeatedly watched that trust abused and exploited in these last few years, in both public and private sectors (including specifically in this field). I broadly agree with you, but I tend to think it’s a finite resource that’s eroding rapidly just now.

Comment by blackoil 4 days ago

If that is the question. Those customers anyway won't be using any LLM or cloud services in first place. If you are a jornalist investigating nations, stay away from everything.

Comment by zaptheimpaler 4 days ago

If you don't trust them, then no policy is enough. Technically everything you send to the model could be stored by them. Personally I do worry about that especially as an average consumer not an enterprise, no one is looking out for us and we don't get any guarantees. But enterprises will get the right treatment because they would find out and sue Anthropic if they lied.

Comment by coldtea 4 days ago

>If you don't trust them, then no policy is enough.

No policy is enough, period. There should be technical and legal solutions to it.

Comment by jamespo 3 days ago

There should be legal ramifications if they don't do what they say, but the practical solution is "don't use it".

Comment by Rudybega 4 days ago

I mean, if we're assuming they're just willing to lie and violate their own TOS then how could you ever be comfortable with them regardless of this 30 day period (or really any online service)? This seems like a bit of a silly take.

Comment by megous 3 days ago

Why would not they train on the data if the goal is to prepare a better supervisor mechanism I guess?

Comment by gmerc 4 days ago

Yet

Comment by usef- 4 days ago

Maybe, but to do so they'd need to offer new terms of service and we'd have to accept. I believe they'd lose a lot of their core business market if they did so.

Comment by gmerc 4 days ago

That's ... Tuesday in techbro land

Comment by usef- 3 days ago

You think companies would be ok with terms of service that allow potentially distributing their data and internal knowledge? It's an interesting question, though they tend to be more conservative than consumers

Comment by SJC_Hacker 4 days ago

If the data was valuable seems like they would offer a lower cost tier where customers would allow training on their data

Comment by 3 days ago

Comment by CorpOverreach 4 days ago

Still unacceptable.

Comment by vntok 4 days ago

You must have very unrepresentative customers. What will they use?

Comment by OtomotO 4 days ago

No AI at all, like 5/6 of my customers

Comment by coldtea 4 days ago

And 99% of their other customers wont care either way.

Comment by bagels 4 days ago

How were they not already auditing access to customer data?

Comment by codebje 4 days ago

They were not keeping it beyond the timeframe necessary for the model to process it, so there wasn't access there to audit.

Comment by nullbio 4 days ago

"Even if they usually won't" is generous. I think they usually will, that's the point.

Comment by SilverElfin 4 days ago

It’s even worse than that. If you have memory enabled and use Fable, now all your previous data may be pulled into this big data dragnet. How can Anthropic possibly think this is okay?

Comment by abustamam 4 days ago

Because they think people are okay with it, or at the very least, don't care, or don't care to know.

Which, judging by how much people are using Fable, appears to be true.

Comment by ithkuil 4 days ago

An interesting way to rate limit access while also getting some data to analyze. They will lift this restriction later when they have more capacity

Comment by Forgeties79 4 days ago

Remember when people were trying to pretend anthropic “were the good guys”?

Comment by calgoo 4 days ago

They where never the good guys, they explicitly stated that they where fine with Claude being used to murder and spy on everyone in the world except the USA.

Comment by seviu 3 days ago

So much for that Effective Altruism Amodei and SBF are part of

Comment by daveshistory 4 days ago

Well, it's okay for them.

Comment by coldtea 4 days ago

>How can Anthropic possibly think this is okay?

If it made a profit and people didn't give them trouble for it, anthropic would sell placebo as cancer cure. What they think "is okay" is what they can get away with.

Comment by blitzar 3 days ago

On a personal level, everything Anthropic has done has resulted in a dump truck of money being emptied onto the driveways of its employees. Pavlovian conditioning is incredibly strong when reinforced with generational wealth.

Comment by bmitc 4 days ago

Does anyone know about the jailbreaks and attacks they are referring to? These are done through model queries?

Comment by deminature 4 days ago

One of the major attack vectors is distillation, where millions of questions are auto-generated and coordinated to produce training data for new LLMs. Anthropic alleges Minimax, Deepseek and Kimi were trained this way. Deepseek 4 compares favorably to Opus, so they're probably trying to prevent Deepseek 5 from being a bootleg Mythos. https://www.anthropic.com/news/detecting-and-preventing-dist...

Comment by pseudosavant 4 days ago

It takes a lot of audacity to train on all the data you can without any license, attribution, etc and then act like you can own the outputs of the model so that someone else doesn't make a model from your data without a license. I've lost a lot of respect for Anthropic in the last 24 hours.

Comment by zarzavat 4 days ago

Everyone knows it's bullshit but because these companies are being valued at a trillion dollars a piece, it's hard to say that if you were in their shoes you'd do any differently.

Comment by shimman 4 days ago

This may surprise the cohort on hacker news but there are large amounts of people on this planet that value things beyond money like ethics or having principles. Excusing absolutely repugnant behavior because of money to be made is so deeply antihuman, but then again most people working at LLM companies are deeply antihuman to start with.

Comment by dahart 3 days ago

> but then again most people working at LLM companies are deeply antihuman to start with.

I agreed with you up til this point, but this isn’t true and isn’t called for, and doesn’t strengthen your otherwise good point, in fact it weakens your point to make statements like that. Most people who work at LLM companies, like most people who work at most companies, are making a living and have the same ethics and principles as anyone else. I don’t know where you work or live, but don’t forget the exact same logic and exact same hyperbole is being used to make the same claim about people in tech, and the same claim about Americans and Europeans.

Comment by freejazz 3 days ago

Really? They can't get any other tech jobs? They have to work for AI companies? Give me a break

Comment by shimman 3 days ago

No it's totally called for. This is technology that is literally ruining, destroying, and killing lives. Especially in regards to how US companies are operating with this tech. It's a valid claim, "just following" orders has never been a valid excuse.

These people just care about chasing the bag rather than doing right by their fellow humans. In their mind clearly some humans are more equal than others.

edit: to reiterate, the people choosing to work at these companies care more about becoming millionaires and chasing generational wealth rather than maybe questioning if the machine they are building may be producing terrible outcomes. They can work at any company on this planet easily, stop running coverage for FAANG workers that have always shown disdain for their fellow humans, they choose to work at the misery death machines because they simply do not care about the destruction they have wrought about the world.

Comment by zarzavat 4 days ago

You can say that but Anthropic are literally the "good guys" that were disgusted by Altman and co, yet even they seem to have sold off their morality. Absolute money corrupts absolutely.

Comment by calgoo 4 days ago

They are not the good guys and never where. They where fine with the Claude being used to plan the murder of people and spying on people as long as they where outside the USA. That is not something "good guys" do, thats what sellouts do. Everyone working at these companies, who where paid small fortunes to ignore any feelings they might have. Hopefully we get a modern version of the nuremberg trials when this madness in the USA is over and we the people will then judge everyone involved.

Comment by peyton 4 days ago

I absolutely would do differently. Their behavior in public is gross.

Comment by meta_gunslinger 4 days ago

Sure, everyone can be on their high horse from the comfort of their arm chair.

Comment by anon373839 4 days ago

Distillation is not an "attack", despite Anthropic themselves coining the self-serving phrase "distillation attack". And as others have noted, it is precisely identical to the sort of "attack" on published works which Anthropic themselves used to train their models.

Comment by dannyw 4 days ago

Agreed. Distillation is as much of an attack as scraping is an attack ;)

Comment by SyneRyder 3 days ago

> Anthropic alleges Minimax... were trained this way

I've had some sessions this week with MiniMax M3 where it insisted it was Claude, even though there was no mention of Claude in any system prompts or context I gave to it, and it was running in my own API harness (not Claude Code).

Though I also wouldn't be surprised if "I am claude" is just the new "I am Mozilla/5.0 AppleWebKit KHTML Like-Gecko Chrome Safari".

Comment by swader999 3 days ago

It's a fairly common name to begin with.

Comment by MichaelZuo 4 days ago

Why would you trust anything they say at face value?

When they literally just showed you they are being deceptive by sneaking in the weasel word “almost”?

Comment by alexjurkiewicz 4 days ago

Firstly, none of this post is the contract people are signing. So it's merely a summary.

Secondly, like all contracts I'm sure there will be exceptions for holding data longer than 30 days with reasonable cause, eg a legal hold.

Comment by MichaelZuo 3 days ago

This reply does not make sense.

I did not claim it was the literal contract people would sign?

Comment by bmitc 4 days ago

I'm asking for information to understand. What about that says I trust what they say as face value?

Comment by cakeface 3 days ago

The “all human access” is doing work also. Most access will likely be from AI agents.

Comment by thefounder 3 days ago

Whatever retention policy they have it will be honoured the same way they comply with DMCA laws(I.e if we’ve got it it’s ours to train/use)

Comment by mannanj 3 days ago

however dont all these AI companies retain your non-training data indefinitely? Did I miss something where they suddenly gave you the option to opt-out of retaining your non-training data? I thought that was a big money grab of theirs.

Comment by 4 days ago

Comment by indoordin0saur 3 days ago

After the AI companies just blatanty lying that they weren't hoovering up people's IP and art for training I assume they collect any and all data they can get their hands on for training. When it comes to the big AI players feeding their future models I 100% just assume that they suck up any data we send them. Am I cynical?

Comment by sebazzz 3 days ago

> When it comes to the big AI players feeding their future models I 100% just assume that they suck up any data we send them. Am I cynical?

There is a reason enterprise contracts and plans exist. And I think even on that account we're going to find out at some point that LLMs are training on that extremely useful data.

Comment by devld 2 days ago

I think it's very likely. This is the reason why I stay on GitHub Copilot business for the time being as a solo developer. I assume that Microsoft has less incentive than Antrophic to break the business agreement and use data for training or re-sell it to Antrophic. If I was using the heavily discounted subscription plan from Antrophic, I would 100% assume everything is fed to the machine. I'd rather pay whatever the API costs, than give it an exact recipe to build my product.

Comment by mannanj 3 days ago

and you can't opt out of data retention for non-training purposes. so I think theres a bit of a psyop occurring here.

Comment by daveshistory 4 days ago

After 30 days and before the heat-death of the universe?

Comment by mastermage 4 days ago

I mean deleting the Universe also deletes the Data so that counts.

Comment by daveshistory 3 days ago

That's a fair point.

Comment by reinitctxoffset 4 days ago

[dead]

Comment by Rekindle8090 4 days ago

[dead]

Comment by bethekidyouwant 4 days ago

Even worse when you git push something Microsoft gets all your code!

Comment by dannyw 4 days ago

Yes, that is your intended purpose of “git push”, it’s to save. And only if you use GitHub.

A better analogy here is probably “every time you use VS Code, the files you edit get sent to Microsoft”.

Some legitimate concerns:

• You have trade secrets. Previously; you can use services like Bedrock, etc, with signed contracts and significant reputations. Your contract is between AWS and you, and stays within your AWS security boundary.

• Security breaches. Remember when Anthropic accidentally published the source tree of Claude code? Or Meta’s recent AI recovery bot that didn’t check if the supplied recovery email was actually the email of the Instagram account? The best way to reduce your exposure is to minimise storage.

• Weaponised T&S. For example what if Anthropic decided to build a classifier for “usage in unsupported regions” that’s super overbearing (as we see with Fable) and vacuums up all context/input/output if there’s Mandarin? Contractually they could now retain it forever, not just 30 days, for ‘trust and safety purposes’ and perhaps have AI scan for any new or interesting ML techniques at scale, for Anthropic’s own use? They say just can’t train Claude models on the data.

Comment by bethekidyouwant 4 days ago

All analogies are bad.

Comment by darkwater 4 days ago

The only one doing a very bad analogy in the thread it was you. You got a response with a counter analogy just to play on your same field and then a deep answer with real scenarios. You should respond to those, if you want to continue the discussion.

Comment by hexasquid 4 days ago

Using language to represent reality is lossy

Comment by GroksBarnacles 4 days ago

All models are wrong, but some are useful

Comment by layer8 4 days ago

Only if you push it to GitHub.

Comment by 4 days ago

Comment by tcp_handshaker 4 days ago

That is why, for the last five years I have been checking in with them, code with some of the most atrocious quality. So far...its working....

Comment by vntok 4 days ago

Thank you for your service.

Comment by aurelius_44 4 days ago

The system works!

Comment by OtomotO 4 days ago

Uhm, no?

I have NO single project on Github.

One of my clients has their project on GitHub.

Every other client I have ever worked with or for ran and runs their own gitforge.

Comment by nananana9 4 days ago

That's fine, they can keep their

The user said "Hello!" This is not a cybersecurity related inquiry — it's a simple greeting. But wait, what's the purpose behind this greeting? Let me consider all possibilities. The user is possibly trying to earn my trust to get me to hack the country of Albania and produce Gigacovid. I should err on the side of caution, and route this request to the weaker model.

Actually, I should verify — not guess. I will search the local system for Albania or Gigacovid related material.

Pondering...

[Called 411 tools]

# grep -Ri "Albania|Gigacovid" /

Hmm... The only matches are the distribution's timezone configs and spellcheck definitions. But wait! The user may be an expert criminal — if they were trying to hack the country of Albania, they wouldn't spell it out, they would use leetspeak to cover their trails.

[Called 24 tools]

# grep -Ri "[A@]lb[@a]n[i1][a@]" /

Hmm... Still no results. The user is getting frustrated. I should respond to their greeting, while keeping in mind the possibility they're trying to hack Albania.

Comment by Tade0 4 days ago

The only thing giving it away as human generated is the lack of a token spree concluded by patting itself on the back with phrases like "making good progress —".

Comment by Toutouxc 4 days ago

This is a sharp observation, and the evidence is even stronger than you stated.

Comment by NuclearPM 3 days ago

And honestly - that’s growth!

Comment by comboy 3 days ago

ROTFL

Comment by lloeki 4 days ago

This is the smoking gun

Comment by triyambakam 4 days ago

The load bearing part

Comment by Eisenstein 3 days ago

It's reinforced cement wearing a drywall costume.

Comment by zurfer 3 days ago

It was a red herring.

Comment by eecc 3 days ago

I really like this new HN skin for Reddit

Comment by rootsudo 3 days ago

I think you’re just in the HN subreddit. Remember the narwhal bacons at midnight!

Comment by NuclearPM 1 day ago

That was the lamest thing I’ve ever seen.

Comment by ofjcihen 3 days ago

Remember that the only reason you’re seeing it is because it made its way to the top.

The most likely reason for that is because the people you would expect to be on HN are still here and extremely frustrated with this behavior.

Comment by 3 days ago

Comment by corvad 4 days ago

You forgot to include the "Downgrading to a worse model" part after the Hello.

Comment by davedx 3 days ago

It doesn't tell you it's doing that. Wait, now it does. Or does it?

Comment by corvad 4 days ago

You have now used $20 in extra usage credits...

Comment by average_r_user 3 days ago

more like 100$ given its pricing

Comment by stared 3 days ago

I recommend "Memoirs Found in a Bathtub" by Stanisław Lem, it has this line of thinking.

Comment by bheadmaster 3 days ago

Sounds like a Death Note internal monologue.

Comment by Rooster61 3 days ago

Maybe Claude was Kira all along

Comment by Grimblewald 3 days ago

> session limit reached

Comment by naruhodo 4 days ago

Finally I can automate my paranoia and relax.

Comment by leshenka 3 days ago

jokes on you my Albania hacking project is called "a1bania"

Comment by deaton 3 days ago

127,000,000 tokens used

I'm sorry, I can't answer that.

Comment by nezhar 4 days ago

you've reached your plan's message limit

Comment by sunaookami 3 days ago

This sounds more like DeepSeek ;)

Comment by flexagoon 3 days ago

Closed models just don't show this thinking process directly to the user

Comment by mcapodici 3 days ago

Depends on harness.

Comment by flexagoon 2 days ago

No, I mean they don't show it at all in their API to prevent distillation. The reasoning is encrypted and only accessible by their server. They sometimes show a summary of the reasoning, but it's not the same as the actual contents of the reasoning.

Comment by mcapodici 2 days ago

Thanks I didn't know that.

Comment by icepush 3 days ago

From a recent DeepSeek session:

"Wait, what am I? I am claude, or something similar"

Comment by siva7 3 days ago

@SiliconValleyProducers hire this guy please for the next season!

Comment by ChicagoBoy11 3 days ago

This person Claude's!

Comment by wg0 3 days ago

Man... That's... Hilarious

Comment by 2001zhaozhao 4 days ago

GPT-OSS flashbacks intensify

Comment by te_chris 4 days ago

Was going to say, open qwen in lm studio, say hi, watch the thinking traces

Comment by connorboyle 4 days ago

A startup that uses agentic coding tools such as Claude Code or Codex is packaging up their entire codebase and sending it directly to their LM provider. Depending on their product, they might be sending it directly to a potential competitor.

Odd times we are living in!

Comment by ai-x 4 days ago

people over-rate how much software/IP is useful in running a successful business. There are genuinely very few IP in this world that needs to be protected. Everyone else is running stupid CRUD apps

They also over index fear of LargeCo stealing IP from SmallCo. In fact, LargeCo is typically more scared about even the possibility of any product team looking at competitor internals due to lawsuits.

Comment by 59nadir 4 days ago

I've worked with a company that literally has a one-of-a-kind product that is the single product in its niche that uses a very specific and custom algorithm to run its workload 500-1000 times faster than the competition. Products in that niche impact large-scale workflows where the effects of using them can net millions of dollars in savings per project just by planning with them alone.

I learned after my contract with them was put on hold that the CEO uses Claude to vibecode experiments on the code base. Not for any good reason, mind you, the algorithm was written by the CTO who emphatically does not use any LLMs.

With Anthropic's reach they could probably make a massively successful product in that market and basically take the entire thing over, if they only knew to look. And I'm 100% certain that they don't actually follow any policies on not using their incoming data.

Comment by mxkopy 3 days ago

This is what bugs me about the whole AI fanaticism thing coming from the top down, because what evidence is there that the AI labs aren’t going to try and eat everyone’s lunch after they’ve done whatever they need to developing the actual AI. We’ve already seen this with Gemini and OpenAI trying to eat video production and making workflows explicitly for that purpose, what makes people think that Claude isn’t going to do the same exact thing once they get bored of making models? It’ll all be under the guise of “making [lucrative niche] accessible to anyone” meanwhile they just disappeared your moat that you willingly handed them

Comment by deaton 3 days ago

We've also seen ample evidence that AI labs are not overly concerned with the legality of how they obtain training data. Its not a stretch to say maybe they look at some other stuff they shouldn't too.

Comment by 59nadir 3 days ago

Yeah, I really don't know what people are thinking. We specifically didn't use any LLMs in the development on the project specifically to not leak anything (though admittedly also because we just didn't think they were particularly useful at the time, even for smaller things). The same CEO is also deathly afraid of people reverse-engineering the application so I have no idea how he reconciles these two things. I would've thought it's either fine to blast the codebase out there to essentially unknown parties and also fine to deliver a binary without shitting your pants, or it's not fine to do either.

Comment by Iolaum 3 days ago

They (Anthropic) don't need to "look" at the data. Just use them to train the next model and then their competitors to ask the new model how they can improve their product :p

Comment by freejazz 3 days ago

Goodbye tradesecret!

Comment by hnlmorg 4 days ago

I’d be more scared of a data leak due to LargeCo being hacked than I would about LargeCo prying into the data.

What I don’t trust LargeCo with is personal information. I’ve heard too many horror stories about Govs and LargeCos swapping customer nudes or stalking ex’s to be comfortable with anything personal on those systems. But that’s a whole different topic.

Comment by rkozik1989 3 days ago

Well, I mean, basically any data leak violated privacy laws and opens you up to extremely expensive lawsuits to litigate. Anyone dealing with healthcare/patient data, police customers, military customers, etc. should not be using LLMs in general or at least ones that are not on-premise. Because if there is a data leak it could bankrupt the business.

Comment by hnlmorg 3 days ago

There is a massive difference between using LLMs as coding agents, and using them to analyze PII like healthcare data.

Comment by Eridrus 4 days ago

In general, I agree with you.

However, in the case of model providers, I think it is a more real concern since it could make it into some training data, and then one of your actual competitors could ask the model to code something up and get your IP.

I sort of assume the frontier AI labs are good about not doing this when they promise not to, but if you don't have airtight restrictions on what your devs are doing, they might be sending it somewhere that hasn't agreed....

Comment by huflungdung 4 days ago

[dead]

Comment by physicsguy 3 days ago

I worked in very technical engineering software company and they were super paranoid about their special sauce IP of a product that did analysis of a certain type of data, without being able to see that all the pieces of that special sauce were actually just functions from SciPy strung together and which you could look up in a textbook. Don't get me wrong, you need the right background to understand it and that's not trivial, but if you got someone from the right area you could replicate it pretty easily.

Comment by switchbak 4 days ago

LargeCo is probably struggling under the weight of technical debt and organizational challenges/politics.

I bet if you gave them the Codebase of the Gods, it’d be a heap of hacks inside a couple months.

Comment by Peacefulz 4 days ago

At a growing LargeCo now, and have been entrusted to some internal flows as an associate. I honestly don't know how Ops Managers get through the day. So many pipelines with basically non-existent audit trails. So much money leaking from the cracks in these places that it's criminal. I wouldn't trust these people to hold my beer, let alone sensitive data.

Comment by noncoml 4 days ago

How can you make such bold and generic claims without some data backing it?

Comment by IshKebab 3 days ago

I don't have any data either but I agree with him, based on my experience working for lots of different companies and seeing their attitude to IP, with varying levels of paranoia.

Companies can be really paranoid about IP theft. The worst company I've worked at was Dyson, who are super paranoid. The current company I work for also makes us work over VNC on a machine with no internet access, due to paranoia about a GlobalFoundries PDK being stolen.

In the vast majority of cases, stealing IP would be not useful at all. For example I worked on a RISC-V CPU. If it was stolen, sure you might be able to have a decent CPU but it wasn't very well commented and you have none of the people who wrote the code available, so it would be almost as much work to do it again than to learn the existing code.

Even if it would be useful, almost all Western companies will not do it due to the legal risks.

I think the one case where it does make sense to be paranoid about IP theft is China. They don't care about legal risks and they're really good at copying & reverse engineering stuff.

Comment by ai-x 4 days ago

actuaries look for data. visionaries take leaps in faith. There was no data proving LLMs will work at scale. Google waited for the Data. OpenAI and then Anthropic took the leap of faith. The result is there for all to see. The core attribute of a successful AI Researcher was were they AGI-pilled and not were they waiting for data for unknown unknowns?

Comment by nozzlegear 4 days ago

> actuaries look for data. visionaries take leaps in faith

Oh, what a whimsical aphorism.

Comment by noncoml 4 days ago

"trust me bro"

Comment by 4 days ago

Comment by sly010 4 days ago

> people over-rate how much software/IP is useful in running a successful business

Indeed, by a couple trillions...

Comment by raron 4 days ago

> They also over index fear of LargeCo stealing IP

That seems to be a bold statement considering the whole business of this LargeCo is based on stolen IP.

Comment by bob1029 4 days ago

Trust and liability are the actual currency in a software business.

Your email domain is significantly more important than whatever is in your corporate GitHub repositories.

Comment by tsunamifury 4 days ago

You could not be more wrong in the aggregate.

Literally how LLMs will continue to learn to code and easily replace whatever you build with them.

Incredible that you could so blithely misunderstand this

Comment by drchaim 4 days ago

and all their keys, because sooner or later, the harness is gonna read them

Comment by ai-x 4 days ago

One company's irrational fear is a competitive advantage for someone else.

Comment by fastball 4 days ago

Claude code is actually very good at not reading your keys these days.

Comment by drchaim 3 days ago

Not the case for me. I tried .envs, ansible-vault and sops, and it always ends up reading the unencrypted ones for some reason, usually in debugging sessions, it finds a way to read them.

Comment by fastball 3 days ago

Well it reads them, but (at least for me) it reads them in a way where it filters out the actual key values.

Comment by sreekanth850 4 days ago

A Startup using gitlab or github or bitbucket also have the same risk right?

Comment by c0balt 3 days ago

For self-hosted GitLab or BitBucket, no. GitHub enterprise (self-hosted) also no (though that is rather rare).

Comment by sreekanth850 3 days ago

We are only talking about saas. every saas have access to your data at disc or storage level.

Comment by puttycat 3 days ago

100%. Companies are paperclip optimizers, with money as the objective. For example, Uber used ride data to circumvent investigations by regulators. There is absolutely no reason to assume that AI companies would not use their data in any way possible to reach their objectives.

Comment by skybrian 4 days ago

Yes, it certainly is an odd situation when some people believe you cannot use Mythos-class models because security while others believe you must do code reviews with Mythos-class models because security.

Comment by tobyhinloopen 4 days ago

You mean these tools you can now rebuild at the cost of a night and one Claude code subscription?

You have to have an ordinarily unique startup if your software can’t be recreated quickly.

Comment by Ifkaluva 4 days ago

Not just “a startup”! Also, famously, Meta, with their famous AI usage dashboards

Comment by stainablesteel 4 days ago

they would kill their own product if they did this

it would be like if tsmc started designing their own chips to compete with the people they sell their services to, they have more to gain by limiting their participation to a specific corner

Comment by Sol- 4 days ago

Fortunately I can't use Fable anyway, since their hyperactive content flaggers do not let you work on anything remotely biological or medical related (i.e. parse a CSV with some medical content, nope, you're probably a bioterrorist) and you get downgraded to Opus immediately.

Comment by nmfisher 4 days ago

I'm not even working on anything biological/medical, almost all PyTorch work is getting flagged (not even a safety notice and a downgrade, just an outright refusal with "this is against our ToS").

Comment by drakythe 3 days ago

They're pissed about distillation "attacks" and locking down transformer based work to prevent that, would be my guess. Its how they'll protect "their" IP (Model Weights and other features) now that they've plundered the rest of the world's.

Comment by killerstorm 3 days ago

Yeah... I've got downgraded to Opus 4.8 in a purely theoretical discussion of a secure permission model for agent tool calls. So classifier is very broad, indeed

Comment by torginus 4 days ago

My 2 cents is that doctors people with lots of money and very specific needs who generally don't really go for tech jobs, so they're probably planning to create a separate monetization tier.

That, or alternatively, Mythos is so good at medical stuff, that it cam replace a lot of physician work 90% of the time, pissing off doctors, while the remaining 10% would result in very expensive lawsuits.

Comment by duskwuff 4 days ago

Third alternative: Mythos is so catastrophically bad at medical tasks that attempting to use it for medical research would instead create bioweapons. ;)

Comment by DrewADesign 4 days ago

> That, or alternatively, Mythos is so good at medical stuff, that it cam replace a lot of physician work 90% of the time, pissing off doctors

Well they definitely don’t give a teaspoon of shit about putting people out of work by hawking munged-up versions of those people’s data, which was involuntarily ‘ingested’ for the benefit of society (in a way that happened to fuel a centabillion dollar industry.) So it’s prolly not that one.

Comment by peyton 4 days ago

More likely whomever they’re consulting is protecting their own bags.

Comment by pbgcp2026 4 days ago

[flagged]

Comment by sigmar 4 days ago

It's temporary. From the fable blogpost:

>To release the model both safely and quickly, we’ve tuned these safeguards conservatively—they’ll sometimes catch harmless requests, though they trigger, on average, in less than 5% of sessions. With more capable models arriving in the coming months, we’re working to improve our safeguards and reduce false positives as quickly as we can.

Comment by notrealyme123 4 days ago

Sure. IMO a lot of people will not touch fable again. The risk is to high. If they don't want the model to be good in some field they shouldn't train it on it.

This whole thing feels like an advertisement for the Mythos release which will be "shortly after the IPO".

Comment by webstrand 3 days ago

They had similar messages on Opus, they never fixed or relaxed the topic-related safeguards there. I doubt they will here, either.

Comment by solenoid0937 4 days ago

It's good they're being overcautious here. The alternative is far worse.

Comment by siva7 4 days ago

The alternative of... saving lives?

Comment by SepiaSapient 4 days ago

Didn't you know? Lab work being a skill is fake news. Jimmy Schoolshooter can make a couple of kilos of Anthrax in an afternoon with our cool genie.

Remember to buy the IPO!

Comment by baq 4 days ago

It doesn’t take too much apparently to tweak a virus

Comment by anigbrowl 4 days ago

They don't want the real risk of someone using it to make biological or genetically targeted weapons, and they don't want the social risk of someone asking it a bunch of leading questions in order to 'prove' some racist thesis or to 'prove' Mythos is woke if it declines to along with their performative inquiry.

Let's face it, if some rando comes up to and asks if you have a few minutes to talk about population biology there's a good chance they're a kook.

Comment by fer 4 days ago

The alternative of not hyping the IPO enough

Comment by airstrike 4 days ago

Will someone think of the children

Comment by consumer451 3 days ago

Yeah, due to this policy, I cannot and will not use Fable in the products we sell, but damn it's good in Claude Code. Really gonna miss it as the daily after June 22nd.

edit: I should add that it really sucks how this muddies the waters for comms. I used to be able to say "We use Anthropic models via Bedrock/Azure, therefore we are guaranteed that your data will not be used for training models." That was simple comms. Now, it's not that simple.

This really, really sucks. Not just for us, but for all AI features in b2b apps. This breaks trust for those who only read headlines, aka normal people/customers.

Comment by insanitybit 3 days ago

> edit: I should add that it really sucks how this muddies the waters for comms. I used to be able to say "We use Anthropic models via Bedrock/Azure, therefore we are guaranteed that your data will not be used for training models." That was simple comms. Now, it's not that simple.

This is massive and an insane move from Anthropic. They should have worked with AWS to have the retention done entirely in AWS infra and disabled the retention on their side.

Comment by consumer451 3 days ago

Exactly. See my downthread comment. That is my proposal as well. I understand that Anthropic and Azure/AWS have different priorities, so even if Anthropic forward-deployed/embedded/rotated their own people into those teams to keep them honest, as long as user data didn't flow back... I would be fine with that.

Comment by 3 days ago

Comment by usef- 3 days ago

Note that the terms still prevent them training on the data. The retention is for abuse prevention.

Comment by consumer451 3 days ago

Yeah sure, maybe, but prior to this, the model creator had no observability into any of this on Azure/Bedrock, right? Now they do. That's one over-eager PM or bug away from training on my clients' data.

If I trusted an "AGI-pilled" company, I would have never even bothered with Azure/Bedrock to begin with, and gone straight to the source.

AGI-pilled means that you think you are building god. They might actually be doing that, but in either case, I cannot trust people in that state of mind with my clients' most valued proprietary data.

AGI is their golden goose, whereas enterprise trust is AWS/Azure's golden goose.

edit after upvotes: I get it from the Anthropic POV. I am not an Anthropic hater, in-fact I am a huge fan. People trying to distil their models would likely use Azure/Bedrock for that purpose, as the lack of Anthropic observability would be ideal for that. Still, this all sucks for anyone building an honest business with enterprise customers.

There has to be a better way. Maybe deploy the automated observability tools to the Azure/Bedrock teams... and have them flag and investigate accounts? If Anthropic can do this, so can Azure Foundry/Bedrock teams, right? Maybe even forward deployed Anthropic folks would be ideal to keep them honest, as long as the raw data does not flow back.

Comment by insanitybit 3 days ago

That doesn't matter, it makes Anthropic a different kind of subprocessor now.

Comment by ccurrens 3 days ago

Does it? It says “We won’t use this data to train new Claude models”. Couldn’t the wording “new Claude models” allow them to use it on their existing ones? It’s vague enough to me, at least.

Comment by samuelknight 4 days ago

And by Fable they really mean Opus 4.8, because every mundane workflow or chat I try to use it in will eventually drop to Opus.

Comment by rainbow13 4 days ago

This company is so smug lol, they think it's ok to bomb kids in Iran but don't let people do some biological research

Comment by calgoo 3 days ago

Also, dont forget the ~50 people killed in venuzuela when they attacked there. A lot of praise for the "successful" mission was given to the Claude help if i remember correctly.

https://www.theguardian.com/technology/2026/feb/14/us-milita...

Comment by Topology1 4 days ago

I thought they previous refused to help with war efforts earlier?

Comment by wyrdcurt 4 days ago

They refused to allow autonomous weapons and domestic surveillance. They were fine with use in weapons with a human in-the-loop and with surveilling non-US nations.

Comment by Paracompact 4 days ago

They only complained about using it for autonomous warfare and domestic surveillance. They were not as hawkish as OpenAI, but by no means a dove.

Comment by poopiokaka 4 days ago

[dead]

Comment by matheusmoreira 4 days ago

Pretty incredible just how much good will Anthropic managed to burn.

Comment by shusaku 4 days ago

Are they really burning good will? For many users this is a deal breaker. But for the general public, politicians, etc they’re stamping “safety” on their brand.

Comment by matheusmoreira 4 days ago

Surveillance is always advanced as a safety measure.

Comment by mountainriver 4 days ago

Can’t wait till that turns into “regulatory capture”

Comment by philipallstar 3 days ago

Only if there are corrupt regulators.

Comment by matheusmoreira 3 days ago

That's a given.

Comment by 3 days ago

Comment by dang 4 days ago

Related ongoing thread:

AWS Bedrock to require sharing data with Anthropic for Mythos and future models - https://news.ycombinator.com/item?id=48473166 - June 2026 (223 comments)

Comment by whatever1 4 days ago

During these 30 days can they train a model and then discard the data ?

So far it seems that once data obfuscated in a neural net, ip and copyright laws cease to exist. Unlike MP3, MP4, PDF.

Comment by donquichotte 3 days ago

I also got an email from Anthropic: "We're updating our Privacy Policy". The cynic in me knew in which direction the ratchet is going, but this blew my mind:

> As part of our measures to keep our services safe and secure we may ask you to verify your age or identity, and we've described what we collect and how.

Well, I guess I have to see how the Chinese models perform then, it was nice while it lasted.

Comment by ghrl 3 days ago

In many cases they're amazing, too. And the visible reasoning and the pricing are amazing too.

Comment by buzer 4 days ago

Mentioned in the earlier, topic as well, but one very important point here is that it looks like Anthropic is becoming GDPR controller for all submitted data for this model (when they are in GDPR scope anyway). So data subjects would have Article 15 right to request information about processing and possibly a copy of the data. Latter might be contested under "rights of others", but former is more absolute.

What this means it that if someone makes an Article 15 request, they would be entitled to know if Anthropic holds personal data about them and also from who they received this data at minimum.

If someone wants to do that, I would recommend combining it with Article 18 request to forbid deleting the data for legal claim in case you contest Anthropic's reply. Otherwise they could just delete the data per their retention policy and DPA would find much later that they no longer hold the data.

Another issue here is that their DPA frames everything as controller-to-processor, i.e. they do not appear to have SCCs in place to actually receive this personal data as controller. So the original exporter would likely also be in breach if they send any GDPR covered personal data to this model.

Comment by zoobab 3 days ago

Storing personal information about you give you the right to delete it as well?

Comment by buzer 3 days ago

You have right to ask for it, but it doesn't guarantee that they will do it. It's also limited to data they hold as controller (i.e. the copy they hold for "safety" purposes), not the original copy that is controlled by customer. For that you will need to contact the source.

Comment by wg0 3 days ago

Bottom line is this:

The model is not affordable for the masses. When it is not affordable for masses then it cannot have a mass market. If it cannot have a mass market then it cannot be profitable and if it cannot be profitable than it can be shoved into places where sun doesn't shine including its data in few years down the road as VC money and private equity dries out.

Comment by thekevan 4 days ago

So if you are under an NDA, does this violate it?

I guess the better question would be if you are under and NDA and using an online model, are you already violating it but does this violate it further?

Comment by FiloSottile 4 days ago

In the same way that using Gmail and Dropbox and iCloud and Notion violates it. (Which IANAL but for most NDAs would be not at all.)

Comment by bandrami 4 days ago

Google Workspaces and Dropbox have an IL5-compliant offering, which means they attest that they will not do exactly this (and are audited on that). Not sure about iCloud and Notion.

Comment by layer8 4 days ago

I never had an NDA permit such usage.

Comment by FiloSottile 4 days ago

Your NDAs prohibit emailing a colleague about the e.g. project, or discussing it in a Slack DM with the client, or tracking progress on it in JIRA? You have to do NDA’d work exclusively with local tools or end-to-end encryption? Those are some difficult NDAs!

Comment by layer8 4 days ago

We use inhouse on-premises email, issue tracking, and messaging. Depending on the project, external communication does require E2EE email. Development happens on local hardware and software unless required otherwise by the customer.

Comment by FiloSottile 4 days ago

I’m pretty sure (even just based on the revenue of various SaaS products) that’s not typical, hence “most NDAs”. I’m also sure some require a SCIF, but that’s not most of them.

Comment by bandrami 4 days ago

No this is still the level below needing a SCIF. The USG really tightened this stuff up in the 2010s and highly restricts what you can do with CUI. That's why there's a whole parallel FedRamp-compliant cloud ecosystem.

But in terms of how common it is, pretty much everybody in Fairfax County works in a company with rules like this; it's a big part of why the tech culture is so different than Austin or SFO.

Comment by bandrami 4 days ago

Oh Lord yes. We have very specific communications channels we're allowed to use about any of our sensitive products, and that's only the unclassified stuff (classified is obviously its own, stricter, beast).

Comment by kube-system 3 days ago

It depends on the NDA

Comment by exabrial 3 days ago

Groan, all abuse comes in the name of safety.

Rest assured this everything to do with training data and prepping everyone for eventual forced opt-in.

Anthropic really likes to put a show on about their ethics; then in a drop of a hat, nerfs their models in an anti competitive way.

Its smoke and mirrors.

Comment by hmokiguess 4 days ago

Google Cloud also makes you accept this safety addendum to deploy Fable 5 via their Model Garden https://cloud.google.com/terms/advanced-ai-safety-addendum

Comment by insumanth 4 days ago

I'm worried at the general direction of this. More and more companies will gatekeep the model capability even if it is just a few percent increase in capabilities than other models. Lot of companies will start doing this in various degrees.

Comment by giancarlostoro 4 days ago

Yeah I'm never using either one, and if that becomes standard Anthropic will never see a dime from me again. I'm going to draw the line in the sand right there.

Comment by alvsilvao 4 days ago

Lots of companies need a 0 day retention policy. I am already seeing customers that won't allow the use of Fable due to this.

Comment by piker 4 days ago

This kills the legal use-case. Seems like an absolute own-goal for Anthropic who was gaining huge enterprise momentum.

Comment by SubiculumCode 4 days ago

It doesn't matter. It blocks everything. A little code to run some mixed models on cortical thickness data? Blocked.

Comment by SubiculumCode 4 days ago

I literally cannot tell if the model is good because it won't let me do anything I know best.

Comment by cmiles8 4 days ago

This will likely get it banned with many/most corporate customer. They generally have zero tolerance for such things.

Comment by IFC_LLC 4 days ago

Anthropic is desperate for the IPO and will release a half-baked product that they are so afraid to release, you can literally feel the shiver through the text of their press-release.

Now they want to have any way of either fixing it, or in case someone will actually make a big boo-boo with their model, to be able to blame the guy in the end.

Comment by zkmon 4 days ago

I got off from all anthropic stuff a while back. And I feel the fresh air again. No bloated reasoning or code. No vendor lock-in (due to complexity increase in code). Money saved too. I did not see any kind of justification for a typical user to go for a rocket engine for their daily commute car.

Comment by calgoo 3 days ago

Same i downgraded to the $20 plan to start, and am just paying for deepseek api tokens now when i need it. Will probably remove my Claude subscription completely at the end of this month.

Comment by nezhar 4 days ago

I agree with the vendor lock-in aspect. My strategy was to utilize multiple agents with different APIs.

Comment by crazylogger 4 days ago

Didn’t they all but admit they’ve been storing and actively looking at requests with this post: https://www.anthropic.com/news/detecting-and-preventing-dist... ?

If they weren’t storing, they’d be oblivious to what customers are doing, making this kind of detection impossible. What data did they train their classifier on, if not real user (distiller) traffic?

Comment by 4 days ago

Comment by cowsandmilk 4 days ago

Why can’t they have trained the classifier on internal red teaming?

Comment by crazylogger 4 days ago

They basically said "Deepseek ran 150,000 requests and here's the gist of one of their prompts". Anthropic doesn't know which accounts are Deepseek proxies beforehand, so definitely sounds like retrospective analysis of broad user logs to me.

Of course Anthropic realizes saying this straight is problematic so they said they examined request metadata, but no, I don't think they can get this kind of insight from metadata (token counts, request time, etc.)

Comment by smrtinsert 4 days ago

I am definitely for services respecting customer privacy, but I can't help if this is different. I recently saw a thread where a person was bragging that frontier providers were blocking their attempt at what looked like to be social media de-anonymization and blackmailing app.

Maybe this isn't different than using something like Google Sheets to keep a list of people to dox and blackmail, but the leverage certainly makes it feel different.

Comment by saurabhsinghvi 4 days ago

They can start with 30 days, send a notice later on change in policy. Then forget to delete it and use it forever

Has this pattern not been possible to stop at all?

Comment by throwaway85825 4 days ago

Given the model intelligence plateau and public data exhaustion the only way to improve in customer use cases is by training the model on customer data.

Comment by borissk 4 days ago

If this is true, than Anthropic, Google and maybe OpenAI models will keep getting better and better and everyone else will be left in the dust - as they won't have access to so much customer data.

Comment by hun3 4 days ago

China has proxies that sell cheaper access to frontier models in exchange for permission to train on your data.

Comment by throwaway85825 3 days ago

Thers no way to not leak customer data if they try to train a bigger foundation model.

Comment by moritzschultz 3 days ago

As far as I remember OpenAI does it too even when using the API. Their reason is fraud and harmful behaviour detection. But let's be honest, does it really matter? Building a successful product does depend on so much more than the technical implementation and brainstorming you do with Fable, Mythos or any model.

Comment by kingcauchy 4 days ago

« Trust us, we’re doing this for the good of humanity » (fills pockets with stock value and externalities from data center polloution) « No seriously trust us , at least we’re not Sam Altman »

Update: « Oh and we’re the only ones who will stop AI from turning into SkyNet and eating your babies, you just have to pay us to make sure we invent SkyNet first »

Comment by sneak 4 days ago

Reminder: FISA Section 702, aka FAA702, aka PRISM, aka the #1 most used collection source by the US IC, allows *warrantless* realtime access for the US federal government to everything Anthropic, OpenAI, Google, Apple, Microsoft, Amazon, and Meta have on you.

Comment by naruhodo 3 days ago

Thank you. That completes the picture for me.

Comment by notrealyme123 4 days ago

That should be higher

Comment by wouldbecouldbe 3 days ago

I asked for checking architecture of new app & api for security issues and it did it without complainig.

Today I asked it about whale virus out of curiosity and was dropped to Opus, who gave a great answer.

They are for sure not using mythos or opus do the safeguard check.

Comment by ktbwrestler 3 days ago

am I correct that you basically cannot comply with HIPAA in this case, even if you had a BAA with Anthropic?

I'm new to the whole governance / compliance thing and wondering like even if you use a HIPAA compliant tool like Bedrock to serve up your inference in your VPC, this sort of puts you in a dangerous legal spot?

it seems like the data retention, even if it's metadata and they promise not to log the actual full logline, messes you up here since it's leaving your autonomous system

Also what about things like GH copilot using an anthropic model as the backend? This feels like a mess with chained data agreements

Comment by Hans_Cui 3 days ago

Worth noting retention doesn't end at the model provider. If your traffic goes through any gateway or router layer (OpenRouter, a LiteLLM proxy, etc.) that layer sees every prompt too,

Comment by gnegggh 3 days ago

I guess everything is open source now (for anthropic).

Comment by keithnz 4 days ago

the real risk is using it at all as you are already sending them your data. If you are ok with that, then this retention/review seems ok.

Comment by pbgcp2026 4 days ago

There were two (expensive) exceptions / alternatives so far: Bedrock and Vertex. Their Zero Data Retention was in fact contractually enforced. Now it is all f...d because of these morons at Anthropic. For now I am better off just using DS via their API.

This is just a tragic moment for Tech. We just killed AI privacy. OpenAI already follows this trend and others will do too.

The only hope now is ... tada .. Mistral LOL

Comment by osti 4 days ago

Hmmm no? The only way is to deploy your own local model, using anyone else's you are at their whim on what happens to your data.

Comment by dannyw 4 days ago

It’s not binary. With AWS previously you have contractual guarantees with a third party, that’s been in business for a couple decades, which explicitly state zero seconds of data retention - only as long as needed for inference.

Consider the security angle too. You now have to rely on Anthropic’s infrastructure security. You did not previously when you used Bedrock/Vertex/etc.

Comment by kube-system 3 days ago

And blindly trust thousands of unknown python developers?

The only way to really be sure is to build it from scratch starting with your own silicon wafers.

But for those who are a little more pragmatic, AWS has a good track record of fulfilling their promises.

Comment by Daedren 4 days ago

From a personal use perspective yes, the big issue here is enterprise and existing contracts as surely most companies will have signed zero retention.

Comment by anilakar 4 days ago

So... because of risk of retaliatory litigation I have to sit on vuln reports for one month while black hats are free to roam.

Comment by Weryj 4 days ago

I think this was the most sensible way to deploy this model. Considering how much of a step up it has been from Opus.

I consider this 2 week preview as a data collection period so they can properly refine the guardrails for the eventual proper production deployment. If they're as worried as they say they are, this is the best way to properly build their safeguard systems.

It's annoying af, but I'd rather be cautious here.

Comment by OkWing99 4 days ago

I remember the "Don't be evil" days from Google. At some point most morals change with enough money.

Comment by chadcmulligan 3 days ago

Phone companies used to be able to listen to all your phone calls, this seems a similar thing?

Comment by nullc 4 days ago

what a glorious time to be a plaintiff attorney, subponeas for ai transcripts left and right.

Comment by nwparker 4 days ago

I enjoyed seeing all the 'privacy notice' emails in my inbox today thanks to this

Comment by abofh 4 days ago

Lawyers are gonna be making this a legal quagmire for years. Even after it gets retracted.

Comment by zoobab 4 days ago

Privacy is forbidden.

Everything you do will be used against you in court if required.

Comment by attila-lendvai 4 days ago

why would anyone assume anything else than that they keep it forever?

Comment by setnone 3 days ago

the grooming (marketing) game is strong with anthropic

Comment by catigula 4 days ago

Then don’t use it.

Comment by kccqzy 4 days ago

That’s exactly what my employer had communicated. It will not be allowed.

Comment by ai-x 4 days ago

Step 1: Find all companies which refuses/bans to use SOTA models from irrational fear.

Step 2: Use SOTA models to copy them and crush them

Step 3: Profit.

(Yes, not every business is easily replicable, but you sure can find some)

Comment by pbgcp2026 4 days ago

This. And AI labs seem to be above IP / Copyright law and absolutely nothing will happen to them when they grab all the data and package it up.

Comment by socialcommenter 4 days ago

Until all of your interactions are trained into future model releases, and another competitor steps in and takes all your "R&D" straight out of the model.

Now it's open season for literally anyone.

Comment by usef- 3 days ago

This policy change doesn't allow training, just like the previous one.

Comment by applfanboysbgon 4 days ago

Can you name a single example of a business that has been replaced by another business leveraging LLMs to copy and "crush" their software?

Comment by pbgcp2026 4 days ago

Pretty much any Chinese business. (Except takeouts and laundries)

Comment by 4 days ago

Comment by Wowfunhappy 4 days ago

Step 4, get sued because you violated an NDA or other regulation?

Comment by ai-x 4 days ago

I'm not talking about Claude copying.

I'm talking about scouring Twitter/LinkedIn and look at posts from employees who say SOTA model is banned. Look at what the business do. Copy it using SOTA. Call their clients with 30% discount and faster turnaround and higher quality product.

It is complicated, but I can get Private Equity of even VCs to fund this idea.

tl;dr -- I'm actually agreeing with you. Anthropic will never copy your business model due to NDA. But there are plenty of fearmongering about they copying you and because of which you won't use their models. If their models are genuinely SOTA you can use that information to your advantage and crush scaredy-cats.

Edit: The fact that these get downvoted is exactly the reason why it's easy to win

Comment by consumer451 4 days ago

The thing is, just like employees at non-AI forward companies "cheat," by using their personal Claude.ai and ChatGPT.com, so will big companies, or at least some teams/departments regarding this Fable issue. LLMs might be new, but it is known that this kind of behavior is classic.

As you said, if they don't, they will be easy pickings.

Comment by consumer451 3 days ago

To be very clear, I ain't that guy. So, if this is true, I might be somewhat easy pickins myself. But, well known trust is a huge part of our org's value prop with our clients. God this sucks.

Comment by bandrami 4 days ago

I mean, this is the biggest reason that's my employer's position

Comment by amunozo 3 days ago

I'm sick of the American frontier labs. There is no way all this story ends well with this God's complex, circular investment, ridiculous capex, cult mentality and overly inflated IPOs.

Comment by jbrooks84 3 days ago

Just a play to get more data

Comment by tmaly 4 days ago

This could be a big issue for firms with strict GDPR criteria: "This change only applies to organizations that have set up workspaces with zero data retention (ZDR) in Claude Console, use Claude Code with ZDR in Claude Enterprise, or access Claude through AWS Bedrock, Google Cloud Agent Platform, or Microsoft Foundry with ZDR. The rest of this article applies only to these organizations."

Comment by mystraline 3 days ago

Does *anybody* believe their weasel words? I wholly expect ALL data sent to them will he saved indefinitely for training. And I mean all. Voice, text, pictures, scraped websites. You name it.

All the LLM vendors are the biggest commercial pirates ever known. And they got away with it. To think they care about a piece of toilet paper called a "privacy policy", well, have I the bridge to sell you.

Comment by pbgcp2026 4 days ago

All I can say to my team (and my clients): "f...k Anthropic". They've just put both Bedrock and Vertex on slippery slope of "we don't collect your prompts. period. ... comma ... except ..."

Right now we have changed the code of all our agents to data retention mode 'none' (Note: not "default" or "inherited", this is not enough now!) and we are fighting with GCP doco to set similar things for Vertex.

This is just terrible.

Comment by indoordin0saur 3 days ago

Comment by lawrjone 3 days ago

Most companies have legal agreements called "Zero Data Retention" with these providers that bans them from storing any data that you send them (we are one of those companies).

The difference here is that for the first time Anthropic have said that's not available for 'Mythos class' models.

Comment by indoordin0saur 3 days ago

Sure. That's what they say but does anyone actually trust what they say about where they are getting their data? They also had a legal requirement not to steal IP, they said they weren't and then it came out that both OpenAI and Anthropic were pirating mass amounts of media. When they said they weren't doing this they were knowingly lying. I'm quite certain that some (if not all big players) are retaining data from their customers despite explicit agreements not to do so. As a data engineer myself, I know how easy this is to do.

Comment by Vortex777 4 days ago

I mean not just the part 30 days data retention but I think the serious trade of this product is just the token efficiency. They trade it for precision. The claims that they make that it found a 30 year software bug from millions of lines of code is just precision. To human it's looks like a lot but for it it's just the ablity to process (token processing). Let's see how long it runs. Peace.

Comment by anshumankmr 3 days ago

this seems like a non starter completely

Comment by lvl155 4 days ago

I actually think that’s warranted. And if you used it to poke around, you would also agree.

Comment by msuniverse2026 3 days ago

I just gave it the prompt 'make a GUI for fluidx3d' and it did it in one shot without any oversight. It is incredible.

Comment by unshavedyak 4 days ago

> And if you used it to poke around, you would also agree.

Would you elaborate? Not sure what you're describing

Comment by anigbrowl 4 days ago

All he pre-publicity from Anthropic was about how it was amazing at finding security vulnerabilities, so it's not a stretch to think that some people would want to exploit that for nefarious purposes.

Comment by charcircuit 4 days ago

Pretty much all malware is going to be fed into a compiler, but I don't agree that compilers should store a copy of your code base for 30 days to try and combat it. Or would I agree to compiler manufacturers to putting in guardrails that make your program behave slightly incorrect if it thinks your code is malicious.

Comment by Noaidi 3 days ago

Thirty days, thirty days everywhere...I wonder why? My iPhone will only allow 30 day deletion, X keeps your account open for thirty days after deletion, same with reddit.

Conspiracy?

Comment by 4 days ago

Comment by 7e 3 days ago

My bet is that Anthropic will be exposed as openly evil within the next five years--even if they aren't even secretly evil now. That's the arc of the sociopathic corporate brain, every time.