You know how Google’s new feature called AI Overviews is prone to spitting out wildly incorrect answers to search queries? In one instance, AI Overviews told a user to use glue on pizza to make sure the cheese won’t slide off (pssst…please don’t do this.)
Well, according to an interview at The Vergewith Google CEO Sundar Pichai published earlier this week, just before criticism of the outputs really took off, these “hallucinations” are an “inherent feature” of AI large language models (LLM), which is what drives AI Overviews, and this feature “is still an unsolved problem.”
In the interest of transparency, I don’t know if this guy is telling the truth, but it feels very plausible.
and our parents told us Wikipedia couldn’t be trusted…
Conservapedia to the rescue.
Since when has feeding us misinformation been a problem for capitalist parasites like Pichai?
Misinformation is literally the first line of defense for them.
But this is not misinformation, it is uncontrolled nonsense. It directly devalues their offering of being able to provide you with an accurate answer to something you look for. And if their overall offering becomes less valuable, so does their ability to steer you using their results.
So while the incorrect nature is not a problem in itself for them, (as you see from his answer)… the degradation of their ability to influence results is.
But this is not misinformation, it is uncontrolled nonsense.
The strategy is to get you to keep feeding Google new prompts in order to feed you more adds.
The AI response is just a gimmick. It gives Google something to tell their investors, when they get asked “What are you doing with AI right now? We hear that’s big.”
But the real money is getting unique user interactions for the purpose of serving up more ad content. In that model, bad answers are actually better than no answers, because they force the end use to keep refining the query and searching through the site backlog.
I don’t believe they will retain user interactions if the reason for the user interactions dissapears. The value of Google is they provide accurate search results.
I can understand some users just want to be spoonfed an answer. But that’s not what most people expect from a search engine.
I want google to use actual AI to filter out all the nonsense sites that turn a Reddit post into an article of 500 words using an LLM without any actual value. That should be googles proposition.
The value of Google is they provide accurate search results.
They offer the most accurate results of search engines you’re familiar with. But in a shrinking field with degrading quality, that’s a low bar and sinking quick.
I want google to use actual AI to filter out all the nonsense sites
So did the last head of Google search, until the new CEO fired him.
If you don’t know the answer is bad, which confident idiots spouting off on reddit and being upvoted into infinity has proven is common, then you won’t refine your search. You’ll just accept the bad answer and move on.
Your logic doesn’t follow. If someone doesn’t know the answer and are searching for it, they likely won’t be able to tell if the answer is correct. We literally already have that problem with misinformation. And what sounds more confident than an AI?
Nothing is going to change until people die because of this shit.
I’ve been around for a while and shit has changed significantly under a relatively long lived set of dudes.
Good. Nothing will get us through the hype cycle faster than obvious public failure. Then we can get on with productive uses.
I don’t like the sound of getting on with “productive uses” either though. I hope the entire thing is a catastrophic failure.
I hate the AI hype right now, but to say the entire thing should fail is short sighted.
Imagine people saying the following: “The internet is just hype. I get too much spam emails. I hope the entire thing is a catastrophic failure.”
Imagine we just shut down the entire internet because the dotcom bubble was full of scams and overhyped…
Genuinely curious, what pieces do you suggest we can keep from LLM/GenAI/etc?
?
Have you never used any of these tools? They’re excellent at doing simple things very fast. But it’s like a word processor in the 90s. It’s just a tool, not the font of all knowledge.
I guess younger people won’t know this, but word processor programs were very impressive when they first came out. They replaced typewriters; a page printed from a printer looked much more professional than even the best typewriters. This lent an air of credibility to anything that was printed from a computer because it was new and expensive.
Think about that now. Do you automatically trust anything that’s just printed on a piece of paper? No, because that’s stupid. Anyone can just print whatever they want. LLMs are like that now. They can just say whatever they want. It’s up to you to make sure it’s true.
Font of all knowledge sounds like an excellent font. I assume it’s serifed?
these hallucinations are an “inherent feature” of AI large language models (LLM), which is what drives AI Overviews, and this feature "is still an unsolved problem”.
Then what made you think it’s a good idea to include that in your product now?!
“We do AI now”. Shareholders creaming themselves, stocks going to the moon. New yacht for PichAI.
This is what happens every time society goes along with tech bro hype. They just run directly into a wall. They are the embodiment of “Didn’t stop to think if they should” and it’s going to cause a lot of problems for humanity.
They keep saying it’s impossible, when the truth is it’s just expensive.
That’s why they wont do it.
You could only train AI with good sources (scientific literature, not social media) and then pay experts to talk with the AI for long periods of time, giving feedback directly to the AI.
Essentially, if you want a smart AI you need to send it to college, not drop it off at the mall unsupervised for 22 years and hope for the best when you pick it back up.
No he’s right that it’s unsolved. Humans aren’t great at reliably knowing truth from fiction too. If you’ve ever been in a highly active comment section you’ll notice certain “hallucinations” developing, usually because someone came along and sounded confident and everyone just believed them.
We don’t even know how to get full people to do this, so how does a fancy markov chain do it? It can’t. I don’t think you solve this problem without AGI, and that’s something AI evangelists don’t want to think about because then the conversation changes significantly. They’re in this for the hype bubble, not the ethical implications.
We do know. It’s called critical thinking education. This is why we send people to college. Of course there are highly educated morons, but we are edging bets. This is why the dismantling or coopting of education is the first thing every single authoritarian does. It makes it easier to manipulate masses.
It’s called critical thinking education.
Yeah, I mean, we have that, and parents are constantly trying to dismantle it. No amount of “critical thinking education” can undo decades of brainwashing from parents and local culture.
You could only train AI with good sources
I mean yes, but also no. If you only train it with “good sources” then you miss out on a whole bunch of other valuable information.
Just like scholar.google.com only has “good sources” but generally it’s not going to have the information that 90% of your search queries will be about.
The truth is, this is the perfect type of a comment that makes an LLM hallucinate. Sounds right, very confident, but completely full of bullshit. You can’t just throw money on every problem and get it solved fast. This is an inheret flaw that can only be solved by something else than a LLM and prompt voodoo.
They will always spout nonsense. No way around it, for now. A probabilistic neural network has zero, will always have zero, and cannot have anything but zero concept of fact - only stastisically probable result for a given prompt.
It’s a politician.
They will always
for now.
I let you in on a secret: scientific literature has its fair share of bullshit too. The issue is, it is much harder to figure out its bullshit. Unless its the most blatant horseshit you’ve scientifically ever seen. So while it absolutely makes sense to say, let’s just train these on good sources, there is no source that is just that. Of course it is still better to do it like that than as they do it now.
The issue is, it is much harder to figure out its bullshit.
Google AI suggested you put glue on your pizza because a troll said it on Reddit once…
Not all scientific literature is perfect. Which is one of the many factors that will stay make my plan expensive and time consuming.
You can’t throw a toddler in a library and expect them to come out knowing everything in all the books.
AI needs that guided teaching too.
Google AI suggested you put glue on your pizza because a troll said it on Reddit once…
Genuine question: do you know that’s what happened? This type of implementation can suggest things like this without it having to be in the training data in that format.
In this case, it seems pretty likely. We know Google paid Reddit to train on their data, and the result used the exact same measurement from this comment suggesting putting Elmer’s glue in the pizza:
https://old.reddit.com/r/Pizza/comments/1a19s0/my_cheese_slides_off_the_pizza_too_easily/
And their deal with Reddit: https://www.cbsnews.com/news/google-reddit-60-million-deal-ai-training/
It’s going to be hilarious to see these companies eventually abandon Reddit because it’s giving them awful results, and then they’re completely fucked
You’re wrong. Anyone who has ever used Google knows Reddit is an absolute goldmine of valuable information. The problem is it’s also full of jokes and puns and bad information, and AI isn’t able to sort one from the other (yet).
This doesn’t mean that there are reddit comments suggesting putting glue on pizza or even eating glue. It just means that the implementation of Google’s LLM is half baked and built it’s model in a weird way.
Genuine question: do you know that’s what happened?
Yes
Replace the CEO with an AI. They’re both good at lying and telling people what they want to hear, until they get caught