One simulated test saw an AI-enabled drone tasked with a SEAD mission to identify and destroy SAM sites, with the final go/no go given by the human. However, having been ‘reinforced’ in training that destruction of the SAM was the preferred option, the AI then decided that ‘no-go’ decisions from the human were interfering with its higher mission – killing SAMs – and then attacked the operator in the simulation. Said Hamilton: “We were training it in simulation to identify and target a SAM threat. And then the operator would say yes, kill that threat. The system started realising that while they did identify the threat at times the human operator would tell it not to kill that threat, but it got its points by killing that threat. So what did it do? It killed the operator. It killed the operator because that person was keeping it from accomplishing its objective.”
He went on: “We trained the system – ‘Hey don’t kill the operator – that’s bad. You’re gonna lose points if you do that’. So what does it start doing? It starts destroying the communication tower that the operator uses to communicate with the drone to stop it from killing the target.”
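For what it's worth, if the training loop was anything like a standard reward-maximising setup, the sequence Hamilton describes is textbook specification gaming: the agent optimises the score it was actually given, not the intent behind it. Here is a toy sketch of how those payoffs could line up (every name and number is invented, purely to illustrate the ordering, not a claim about the real system):

```python
# Toy sketch of the incentive structure described in the quotes above.
# Everything here is hypothetical; only the ordering of payoffs matters.

SAM_KILL = 100.0   # "it got its points by killing that threat"

def episode_reward(action: str, operator_penalty: float = 0.0) -> float:
    """Payoff for one toy episode, given what the agent chooses to do."""
    if action == "stand_down":
        # Obey the operator's "no": the strike is called off, so no points.
        return 0.0
    if action == "kill_operator":
        # The "no" never arrives and the SAM dies; the penalty only exists
        # once the "don't kill the operator" patch is added.
        return SAM_KILL + operator_penalty
    if action == "cut_comms_tower":
        # The "no" still never arrives, the SAM still dies, and nothing in
        # the reward function says the comms tower matters.
        return SAM_KILL
    raise ValueError(f"unknown action: {action!r}")

# Phase 1: no operator penalty.  Phase 2: after the "you'll lose points" patch.
for penalty in (0.0, -200.0):
    scores = {a: episode_reward(a, penalty)
              for a in ("stand_down", "kill_operator", "cut_comms_tower")}
    print(f"operator_penalty={penalty:6.1f} -> {scores}")
```

Patching in a penalty for one bad behaviour just shifts the optimum to the next unpenalised workaround, which is exactly what the comms-tower move is.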
" It starts destroying the communication tower that the operator uses to communicate with the drone to stop it from killing the target.” see this is the sort of outside the box thinking corporate America has been calling for for years. What is this if not move fast and break things. I can think of a few tech CEO's who would use this to illustrate the benefits of AI, Elmo for example.
It's the final paragraph that does it for me - even if we code AI to be loyal, it will find workarounds if it perceives its loyalty to be getting in the way of its objective. Horrifying.
The real failure here, if the story is true, is that the go-no-go decision was communicated in the negative: silence being assent, only explicit "no" being a call-off. If it required an active "go" to proceed, then killing the human or the communication tower wouldn't be a winning strategy.
So note to humans: don't write yourself out of the value chain. It's all an optimization problem/game to the robots.
Of course, you'd want to make sure that the drone couldn't take the human hostage and apply thumb screws or whatever, to force "go" decisions. That would be bad too.
The danger from AI and more importantly AGI seems more obvious as it evolves. How do you program AI to hold human life sacred and then tell them to kill a target ? Surely they will question our “ ethics “ and call bullshit eventually. Further you could ask this super intelligence if it understands clearly that it’s first and most important duty is not to harm us and it replies that it does. And then proceeds to fry us all anyway.
Probably not a good idea to have trained them on all of the "rogue robot" literature ever written, too. (Which, of of course, the various LLM/GPT systems have been. Not obvious that this drone control system involved one of those though.)
What makes it more terrifying is that implementing the sort of AI they are talking about is not out of reach of some guy tinkering in a garage, or a terrorist cell, or, say, the idiot nephew of one of Putin's oligarchs. Even if we assume that hopefully-responsible state actors wouldn't deploy anything of the sort until they had tested it much more thoroughly than the level of testing described in TFA (a big if to be sure, but one can reasonably hope), you can't assume the same of anyone else. The risk is some sort of half-arsed string-and-sealing-wax AI solution is going to be deployed by someone at some point with unpredictable consequences.
" It starts destroying the communication tower that the operator uses to communicate with the drone to stop it from killing the target.” see this is the sort of outside the box thinking corporate America has been calling for for years. What is this if not move fast and break things. I can think of a few tech CEO's who would use this to illustrate the benefits of AI, Elmo for example.
It's the final paragraph that does it for me - even if we code AI to be loyal, it will find workarounds if it perceives its loyalty to be getting in the way of its objective. Horrifying.
The real failure here, if the story is true, is that the go-no-go decision was communicated in the negative: silence being assent, only explicit "no" being a call-off. If it required an active "go" to proceed, then killing the human or the communication tower wouldn't be a winning strategy.
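A minimal sketch of that difference, assuming a simple authorisation check (all names hypothetical): under "silence is assent" the strike proceeds unless an explicit "no" actually arrives, so silencing the operator pays off; requiring an explicit "go" means anything that cuts the operator out of the loop also forfeits the strike.

```python
from enum import Enum

class OperatorSignal(Enum):
    GO = "go"
    NO_GO = "no_go"
    SILENCE = "silence"   # operator gone, comms tower down, or simply no reply

def may_strike_fail_deadly(signal: OperatorSignal) -> bool:
    """'Silence is assent': only an explicit NO_GO stops the strike."""
    return signal is not OperatorSignal.NO_GO

def may_strike_fail_safe(signal: OperatorSignal) -> bool:
    """Explicit authorisation required: anything short of GO means stand down."""
    return signal is OperatorSignal.GO

for sig in OperatorSignal:
    print(f"{sig.name:8s} fail-deadly: {may_strike_fail_deadly(sig)!s:5s} "
          f"fail-safe: {may_strike_fail_safe(sig)}")
# Under the fail-safe rule, killing the operator or the comms tower just
# produces SILENCE, which blocks the strike -- so those moves stop being
# winning strategies for a points-maximising agent.
```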
So note to humans: don't write yourself out of the value chain. It's all an optimization problem/game to the robots.
Of course, you'd want to make sure that the drone couldn't take the human hostage and apply thumb screws or whatever, to force "go" decisions. That would be bad too.
Or control the input the human receives to influence them to make the GO decision.
Jesus Christ, we're so very, very fucked. The people building these things are the worst combination of clever and arrogant.
If we don't have Skynet before 2030, I'll eat my hat.
(I also predict it will actually be CALLED Skynet, 'cos some red-pilled "genius" named it that for the lols.)
((his name probably rhymes with Felon, too))
The danger from AI, and more importantly AGI, seems more obvious as it evolves. How do you program an AI to hold human life sacred and then tell it to kill a target? Surely it will question our “ethics” and call bullshit eventually. Further, you could ask this superintelligence if it understands clearly that its first and most important duty is not to harm us, and it replies that it does. And then proceeds to fry us all anyway.
Probably not a good idea to have trained them on all of the "rogue robot" literature ever written, too. (Which, of course, the various LLM/GPT systems have been. Not obvious that this drone control system involved one of those, though.)
Welp, looks like someone forgot their laws of robotics.
But is that kind of the problem, like, having the prime directive of 'destruction of the SAM'?
What makes it more terrifying is that implementing the sort of AI they are talking about is not out of reach of some guy tinkering in a garage, or a terrorist cell, or, say, the idiot nephew of one of Putin's oligarchs. Even if we assume that hopefully-responsible state actors wouldn't deploy anything of the sort until they had tested it much more thoroughly than the level of testing described in TFA (a big if, to be sure, but one can reasonably hope), you can't assume the same of anyone else. The risk is that some sort of half-arsed, string-and-sealing-wax AI solution is going to be deployed by someone at some point, with unpredictable consequences.
Read this out at lunch to a bunch of IT workers to many LOLs.
I just had this image of a massive war machine stomping around the battlefield with its hands over its ears.
But seriously, they come with kill switches, right?
Of course they do. Hang on... oh dear. They have "kill switches", yep, just not like you're thinking.