andix 6 days ago

1:50 is a great detection ratio for finding a needle in a haystack.

2
epolanski 6 days ago

I don't think the author agrees as he points out the bugs weren't that difficult to find.

Aachen 5 days ago

Nah. I'm not an expert code auditor myself but I've seen my colleagues do it and I've seen ChatGPT try its hand. Even when I give it a specific piece of code and probe/hint in the right direction, it produces five paragraphs of vulnerabilities, none of which are real, while overlooking the one real concern we identified

You can spend all day reading slop or you can get good at this yourself and be much more efficient at this task. Especially if you're the developer and know where to look and how things work already, catching up on security issues relevant to your situation will be much faster than looking for this needle in the haystack that is LLM output