BaxBench: Can LLMs Generate Secure and Correct Back Ends? (baxbench.com)
2 points by chillax 16 hours ago
lucasluitjes 16 hours ago
I've seen LLMs generate plenty of wildly insecure code, but the share of functional solutions that are insecure is even higher than I expected.
Also, I'm curious how the average coder would fare on this benchmark.