AI chatbots tend to choose violence and nuclear strikes in wargames

ylai@lemmy.ml · 10 months ago

AI chatbots tend to choose violence and nuclear strikes in wargames

DontTreadOnBigfoot@lemmy.world · 10 months ago

Gandhi intensifies

Knoxvomica@lemmy.ca · 10 months ago

So do humans.

Plopp@lemmy.world · 10 months ago

Here’s a wild thought. Maybe that’s why the chat bot (I assume LLM) does it too, because it’s been trained on us! 🤯

Malfeasant@lemmy.world · 10 months ago

I learned it from watching you!

fidodo@lemmy.world · 10 months ago

Where are all these nuclear strikes?

Visstix@lemmy.world · 10 months ago

Sid Meier’s Civilization games

Death_Equity@lemmy.world · 10 months ago

Ghandi has the right idea.

breadsmasher@lemmy.world · 10 months ago

This sounds like the result of feeding it tons of literature that denotes having nuclear weapons, and the world we live in now being “peaceful” (as the ai claimed to want)

intensely_human@lemm.ee · 10 months ago

Nuclear weapons promote peace, but using them doesn’t so much.

RedstoneValley@sh.itjust.works · 10 months ago

Don’t want to spoil your little circlejerk here, but that should not surprise anyone, considering chatbots are trained on vast amounts of human data input. Humans have a rich history of violence with only brief excursions into “collaborating for the good of mankind and the planet we live on”. So unless you build a chatbot that focuses on those values the result will inevitably be a mirror image of us human shitbags.

ormr@feddit.de · edit-2 10 months ago

Humans have a history of violence as well as altruism. And with an increasing degree of societal complexity, humans also have a consistent record of violence reduction. See e.g. “The better angels of our nature” (Pinker, 2011).

Painting humans as intrinsically violent is not backed by evidence.

RedstoneValley@sh.itjust.works · edit-2 10 months ago

Ok, maybe it helps to be more specific. We have an LLM which is based on a broad range of human data input, like news, internet chatter, stories but also books of all kinds including those about philosophy, diplomacy, altruism etc. But if the topic at hand is “conflict resolution” the overwhelming data will be about violent solutions. It’s true that humans have developed means for peaceful conflict resolution. But at the same time they also have a natural tendency to focus on “bad news” so there is much more data available on the shitty things that happen in the world which is then fed to the chatbot.

To fix this, you would have to train an LLM specifically to have a bias towards educational resources and a moral code based on established principles.

But current implementations (like ChatGPT) don’t work that way. Quite the opposite, in fact: In training, first we ingest all the data that we can get our hands on (including all the atrocities in the world) and then in a second step we fine-tune the LLM to make it “better”.

intensely_human@lemm.ee · 10 months ago

But humans are intrinsically violent as evidenced by the fact every human society has weapons, kills animals to eat, and goes to war.

I’m familiar with Pinker. If he’s claiming humans are not intrinsically violent he can take it up with me because he’s rejecting the most obvious of evidence.

If humans weren’t intrinsically violent, then there wouldn’t be human violence.

Car@lemmy.dbzer0.com · 10 months ago

Interesting. There was a study put out some time ago that had 40 or so game theorists develop algorithms to compete against each other. The most successful algorithm cooperated with the opponent until they defected, at which point they would defect the next round.

They never performed a first strike. Only one retaliation strike for each attack their opponent performed. After the retaliation, it was back to cooperating with no long term ill will.

Ech@lemm.ee · 10 months ago

I think I saw something about it that. It was an extended prisoner’s dilemma game, right? I wouldn’t say that’s directly applicable to every gaming genre.

Car@lemmy.dbzer0.com · 10 months ago

Without being in the room, we can only go off what the article lays out. These are wargaming scenarios though, so escalation is a very real concern. If both sides are running these models to provide recommendations and both are pushing for greater conflict, you find yourself in a prisoner’s dilemma real quick.

fidodo@lemmy.world · 10 months ago

These aren’t simulations that are estimating results, they’re language models that are extrapolating off a ton of human knowledge embedded as artifacts into text. It’s not necessarily going to pick the best long term solution.

intensely_human@lemm.ee · 10 months ago

Language models can extrapolate but they can also reason (by extrapolating human reasoning).

Ech@lemm.ee · 10 months ago

The models used by the writers of the article and those used by the military are going to be radically different.

Car@lemmy.dbzer0.com · edit-2 10 months ago

The writers of the article are reporting on use of these models by the military. They aren’t using the models. If I remember right they called out some models developed by one of the defense contractors like palantir

Ech@lemm.ee · 10 months ago

The researchers tested LLMs such as OpenAI’s GPT-3.5 and GPT-4, Anthropic’s Claude 2 and Meta’s Llama 2

All these AIs are supported by Palantir’s commercial AI platform – though not necessarily part of Palantir’s US military partnership

Also, they’re reporting on a Stanford study of how these platforms could be used militaristically, not the military’s actual use of them.

intensely_human@lemm.ee · 10 months ago

The way you said that tells me you don’t know what a prisoner’s dilemma is. It’s not “a situation where both sides have escalated”.

Car@lemmy.dbzer0.com · edit-2 10 months ago

I’m not sure where our disconnect is. We have a situation where both sides can cooperate, one side can defect, or both sides can defect. Call it whatever you want, it’s the same scenario.

Here it’s with planning for military force. Do you risk a nuclear strike to save yourself from one? If you can get a first strike (defect), then you win. If you both refrain (cooperate), then you stay alive. If you both attempt a first strike (defect), you all lose.

Change the words around and it’s the same.

Both suspects don’t tell (cooperate), both get minimum or no jail time. One tells on the other (defects), that one gets off but the other gets maximum. Both tell on each other (defect), both get some jail time.

SatanicNotMessianic@lemmy.ml · 10 months ago

Get it to play tic-tac-toe against itself. Problem solved.

SkybreakerEngineer@lemmy.world · 10 months ago

How about a nice game of chess?

InFerNo@lemmy.ml · 10 months ago

No, let’s play global thermonuclear war

AtariDump@lemmy.world · 10 months ago

Pulls out an 8in floppy to war dial.

the_q@lemmy.world · 10 months ago

Violence is the only thing that has a chance of changing things. If it was civil action it’d be illegal. It makes sense an AI would come to that conclusion.

UsernameIsTooLon@lemmy.world · 10 months ago

Nukes get shit done

intensely_human@lemm.ee · 10 months ago

Well done

fidodo@lemmy.world · 10 months ago

These results come at a time when the US military has been testing such chatbots based on a type of AI called a large language model (LLM) to assist with military planning during simulated conflicts

Jesus fucking Christ we’re all doomed

BetaDoggo_@lemmy.world · 10 months ago

In the context of a “war game” this makes sense. If you remain completely neutral it’s impossible to win. Any examples of similar scenarios the model saw during training would have high aggression rates.

xantoxis@lemmy.world · 10 months ago

Unfortunately this AI was playing Stardew Valley

TwitchingCheese@lemmy.world · 10 months ago

Probably shouldn’t have included Project Plowshare in the training data…

fidodo@lemmy.world · 10 months ago

Did you read the article? It gave examples of escalations in neutral scenarios that make no sense.

shalafi@lemmy.world · edit-2 10 months ago

It’s probably vibing on the Dark Forest Theory. If that’s the case, it makes sense to utterly destroy all opponents as hard and fast as you can, even if they’re not currently opponents.

recapitated@lemmy.world · 10 months ago

That anyone would ask language models to analyze circumstances, perform logic and reason or conjure an application of knowledge and skill is kind of their own fault.

It is a language model, it excels at rephrasing given ideas.

If you put nuke buttons under a flock of pigeons or toddlers just to see what happens, they might launch. It’s not much of a study.

littleblue✨@lemmy.world · 10 months ago

Fun fact: when researchers taught a group of simians about currency, they invented prostitution.

intensely_human@lemm.ee · 10 months ago

Violence, in war games? Gosh how horrible l

DrownedRats@lemmy.world · 10 months ago

By war games It means the actually military kind where armies get together and practice was against eachother. We’re not talking call of duty here.

Jake Farm@sopuli.xyz · 28 days ago

No they are talking about role playing because LLMs can’t differentiate reality from pretend.

qx128@lemmy.world · 10 months ago

I mean… so do people.

RagingRobot@lemmy.world · 10 months ago

Well that’s a good way to win so yeah

winky9827b@lemmy.world · 10 months ago

Not according to WAPR

Jake Farm@sopuli.xyz · 28 days ago

World Association for Psychosocial Rehabilitation?

Malfeasant@lemmy.world · 10 months ago

Seems like a good topic for a movie…

Jake Farm@sopuli.xyz · 28 days ago

Because that is what people do in roleplaying situations if the option is there.