lionkor 8 days ago

I think most commenters are really missing the point. This is not a "maybe" possible attack that only works if the stars align. This is "if you follow the AI hype and use this tool naiively, anyone can access your private repos".

This is a security vulnerability. This is an attack. If I leave my back door unlocked, it's still a burglary when someone walks in and takes everything I own. That doesn't mean that suddenly "it's not an attack".

This is victim blaming, nothing else. You cannot expect people to use hyped AI tools and also know anything about anything. People following the AI hype and giving full access to AIs are still people, even if they lack a healthy risk assessment. They're going to get hurt by this, and you saying "its not an attack" isn't going to make that any better.

The reality is that the agent should only have the permissions and accesses of the person writing the request.

1
lbeurerkellner 8 days ago

I agree. It is also interesting to consider how AI security, user eduction/posture and social engineering relate. It is not traditional security in the sense of a code vulnerability, but is is a real vulnerability that can be exploited to harm users.

nssnsjsjsjs 8 days ago

Furthermore once you are inside the LLM you could try to invoke other tools and attempt to exfiltrate secrets etc. An inject like this on a 10k star repo could run on 100s of LLMs and then tailor it to cross to another popular tool for exfiltration even if the GH key is public and readonly access.

nstart 7 days ago

This! It's actually quite frustrating to see how people are dismissing this report. A little open mindedness will show just how wild the possibilities are. Today it's GitHub issues. Tomorrow it's the agent that's supposed to read all your mails and respond to the "easy" ones (this imagined case is likely going to hit a company support inbox somewhere someday).

nicce 8 days ago

We should handle LLMs as insider threat instead of typical input parsing problem and we get much better.

nssnsjsjsjs 8 days ago

All text input is privileged code basically. There is no delimiting possible.