OpenAI has developed technology to reliably detect AI-generated text, according to inside sources and documents reported by the Wall Street Journal. However, the company is reluctant to release it, likely due to concerns about its own business model.
The detector provides an assessment of how likely it is that all or part of the document was written by ChatGPT. Given a sufficient amount of text, the method is said to be 99.9 percent effective.
That means given 100 pieces of text and asked if they are made by ChatGPT or not, it gets maybe one of them wrong.
Allegedly, that is, and with the caveat of “sufficient amount of text”, whatever that means.
A false positive is when it incorrectly determines that a human written text is written by AI. While a detection rate of 99.9% sounds impressive, it’s not very reliable if it comes with a false positive rate of 20%.
That means given 100 pieces of text and asked if they are made by ChatGPT or not, it gets maybe one of them wrong. Allegedly, that is, and with the caveat of “sufficient amount of text”, whatever that means.
It’s actually 1 in 1000, 99.0% would be 1/100.
A false positive is when it incorrectly determines that a human written text is written by AI. While a detection rate of 99.9% sounds impressive, it’s not very reliable if it comes with a false positive rate of 20%.
I know what a false positive is, and it’s not a thing when talking about effectiveness, they claim it gets it right 99.9% of the time.
Right, I see what you mean now. I misread your comment as explaining something that was already clear.