Comment on ChatGPT's o3 Model Found Remote Zeroday in Linux Kernel Code

knokelmaat@beehaw.org ⁨5⁩ ⁨days⁩ ago

I hate AI. Why?

However

I also took the time to read the original blog post, and it is a fascinating story.

The author starts out with using an existing vulnerability as a benchmark for ChatGPT testing. They describe how they took the code specific to the vulnerability and packaged it for ChatGPT, how they formatted the query and what their results were. In 100 runs only 8 correctly identify the targeted vulnerability, the rest are false positives or claim that there are no vulnerabilities in the given code.

Then they take their test a step further and increase the amount of code shared with ChatGPT so that it also includes stuff of the module that had nothing to do with the original vulnerability. As expected, this larger input decreases performance and also reduces the vulnerability detection rate for the targeted vulnerability. However, in those 100 runs, another vulnerability was described that wasn’t a false positive. An actual new vulnerability that the author didn’t know about was discovered. Again, the signal to noise ratio is very low, and one has to sift through a lot of wrong reports to get a realistic one, but this proved that it could be used as a useful tool for helping to detect vulnerabilities.

I highly recommend reading the blog post.

As much as I like to be critical about AI, it doesn’t help if we put our heads in the sand and act as if it never does something cool.

source
Sort:hotnewtop