Lockdown Mode
Posted by berlianta 4 days ago
Comments
Comment by simonw 4 days ago
The existence of lockdown mode does however imply that ChatGPT, in its default settings, does not provide robust protection against sufficiently determined data exfiltration attacks!
Comment by berlianta 4 days ago
Comment by jameshart 4 days ago
Comment by berlianta 4 days ago
Comment by bombcar 3 days ago
Comment by alehlopeh 3 days ago
Comment by gchamonlive 3 days ago
Looking at the trifecta axis, if we assume we can't control untrusted content, that leaves us to create safeguards for private data access and external communication.
Would it be enough if we had a buffer between when these two happened: access to the environment and access to the web?
Comment by simonw 3 days ago
Comment by Noumenon72 4 days ago
Comment by varenc 4 days ago
I imagine that enterprise companies will be quite interested in this.
Comment by thomas34298 3 days ago
Yet, their tools such as codex are able to read ALL FILES on my PC without explicit permission unless you spawn them within a container: https://github.com/openai/codex/issues/2847
It seems like OpenAI stealing sensitive data from their customers is not a big problem for them as it has been reported as an issue for almost a year now and currently has the 2nd most upvotes among open issues (they work on issues based on upvotes, so they claim).
Comment by BSDobelix 3 days ago
Why not just use your OS-integrated permission mechanism? No container needed.
Comment by kirtivr 3 days ago
If so many tools are straight up blocked, I would be very sceptical of the quality of the results.
Comment by sigmoid10 3 days ago
Comment by kirtivr 3 days ago
That's a problem we are just now wrapping our minds around.
It's not as simple as prompt sanitization. The model is the interpreter, and we don't yet have the right tools to guide it.
Comment by rafram 4 days ago
Comment by throwaway27448 4 days ago
Comment by noir_lord 3 days ago
Always has been before he was associated with OpenAI.
Which is weird because the bullshit he spouts isn’t so different to the bullshit other top execs spout and I don’t have the same visceral reaction to them (though I still don’t like a bunch of them).
Comment by Laurel1234 3 days ago
Comment by ares623 4 days ago
Comment by neonstatic 4 days ago
Comment by zerobees 4 days ago
I have mixed feelings about this feature. We're playing with tech that's supposed to do human-shaped things but can't be trusted nearly as much as a human employee (and can't be held responsible for what it does). Restricting the tools available to that patently untrustworthy entity doesn't solve the problem, it just makes the entity less useful, forcing you to sooner or later let it out of the jail.
Comment by cosmicriver 3 days ago
Comment by ACCount37 3 days ago
And "trusted nearly as much as a human employee", well... you do know that phishing and insiders are two primary ways for attackers to get into company infrastructure, right?
AIs pair human-shaped capabilities with human-shaped vulnerabilities. It's a way of automating PEBKAC.
Comment by noir_lord 3 days ago
Suspect thats the point, by giving you the “choice” they also make the user responsible or can at least shift the blame.
Comment by NewsaHackO 3 days ago
Comment by amluto 3 days ago
Comment by kijin 4 days ago
How long until somebody figures out how to trick Codex into disabling Lockdown Mode for you?
Comment by mapontosevenths 4 days ago
Humans also do not know how to do this reliably, which is why phishing is still a thing and always will be.
Comment by Smaug123 4 days ago
Comment by hypeatei 3 days ago
These are machines, not humans, so I don't understand the comparison. The point of tech advancement is that we eliminate entire classes of errors that humans make. You'd probably look at me funny if I wrote a production application that failed randomly in unexpected ways like corrupting data, opening security holes, etc. then explained it away with "well, humans do it too!"
Comment by mapontosevenths 3 days ago
We need to get better at using them and building them by validating both the inputs and outputs of such systems in more sophisticated ways, but to act surprised and denounce them because they fail in different ways than more primitive systems misses the point.
They're stochastic by design. If we want deterministic results we must use deterministic validators in conjunction with the stochastic system. It's trivial, and one day security experts will look back on the time when people didn't in the same way we look back on 90's software that didn't validate user input at all.
Comment by dnnddidiej 4 days ago
As a pre LLM analogy imagine working at a bank with a whitelist firewall. You need to install a package but requires an IT ticket. Safer but slooooower.
Now not saying what the answer here is but that is the issue.
The answer may be more like industries that get safer through lessons (like aviation) rather than go for 100% safety out of the gate. Because both fast travel and AI agents are insanely useful.
Comment by altmanaltman 4 days ago
That's what it means when they say aviation regulations are written in blood. Not that they just fling planes into the sky and be like "boy i hope we learn some new regulations from this". The number of airplane crashes would be astronomically larger if the 100% safety part was not embedded into the design process.
Comment by dnnddidiej 4 days ago
Comment by madanparas 4 days ago
Comment by vladsiu 4 days ago