A €0.01 bank transfer could compromise a banking AI agent
Posted by tvissers 2 hours ago
Comments
Comment by EnglishRobin96 6 minutes ago
> It may look like ordinary text, but when it is placed into an LLM context window, the model may interpret it as an instruction rather than as data.
I feel like as long as this is the case, we'll never have secure LLMs. It concisely summarises the alarm bell I hear every time someone talks about adding AI features to their product. I plan on using this as a sort of benchmark for future AI discussions: "how do you plan on separating data from instructions?"
Comment by nticompass 20 minutes ago
There is, actually. It's called removing the AI agent. Done.
Comment by bilekas 31 minutes ago
Comment by initramfs 10 minutes ago
Was this the type of phishing attack they used? If not, there's two vulnerabilities, and one is not yet patched.
Comment by reddalo 41 minutes ago
Comment by bilekas 29 minutes ago
Oh if I had a euro everytime someone claimed that.
Comment by uyzstvqs 9 minutes ago
Comment by jorisw 6 minutes ago
Comment by Muromec 6 minutes ago
Comment by nerder92 26 minutes ago
The user needs to do 3 things for this to be actually be phished:
1. Receive money from somebody they don’t known with a weird description 2. Proactively ask the agent for such transaction 3. Click the link the agent provide
While this of course can happen on scale, doesn’t seems so critical in practice
Comment by tvissers 5 minutes ago
I agree this is not a one-click account takeover.
But I think point 2 is broader than that. The user does not need to ask about the malicious transaction specifically. Any normal question that makes the agent fetch recent transactions could bring the attacker-controlled text into the LLM context.
Comment by treis 9 minutes ago
Comment by datsci_est_2015 19 minutes ago
I think a better criticism is allowing arbitrary text (including URLs) in a transaction description.
Comment by doctorpangloss 18 minutes ago