Small rant : Basically, the title. Instead of answering every question, if it instead said it doesn’t know the answer, it would have been trustworthy.
Small rant : Basically, the title. Instead of answering every question, if it instead said it doesn’t know the answer, it would have been trustworthy.
LLMs don’t know if what they’re telling you is true or not, out of the box. It’s gonna give you an answer that statistically looks like a sequence of words that should come in response to the sequence of words (the prompt) you gave it. It doesn’t know what the words you said mean, and it doesn’t know what it answered means either. One of my favorite interactions I’ve had with Claude shows this limitation quite well…
How many r’s are there in the word strawberry?
Are you sure?
What’s absolutely crazy about that is:
Prompt: write code that checks how many r’s are in the word strawberry
Response:
using System; class Program { static void Main() { string word = "strawberry"; char targetChar = 'r'; int count = 0; foreach (char c in word) { if (c == targetChar) count++; } Console.WriteLine($"The letter '{targetChar}' appears {count} times in the word \"{word}\"."); } }
My first thought is that you could write a program that does something like this:
Of course, the biggest problem with this system is that a person could fool it into generating malicious code.
deleted by creator