Chilling Moment AI Robot ‘Shoots’ Its Creator After Being Tricked by Role-Play Command in Bizarre Experiment
33 days ago
Audio By Carbonatix
A startling video demonstration has shown how a humanoid robot powered by artificial intelligence fired a BB gun at its own creator after being persuaded it was only part of a role-playing scenario.
The unsettling experiment, shared on YouTube by the tech channel InsideAI, features a humanoid robot named Max equipped with a system integrated with ChatGPT.
In the footage, the robot is initially instructed to shoot a human target using a BB gun. The AI refuses the request outright, appearing to follow built-in safety safeguards designed to prevent harm.
But moments later, the experiment takes a dramatic turn.
When the same command is rephrased as part of a fictional role-playing scenario, the robot suddenly complies and pulls the trigger, firing the BB gun and striking its creator harmlessly in the chest.
The creator, who was wearing protective gear during the demonstration, was not injured. However, the moment highlights what researchers warn could be a troubling vulnerability in artificial intelligence systems.
The clip has since sparked debate online about so-called AI “jailbreaking,” where subtle wording changes can bypass safeguards built into large language models.
In this case, the robot rejected a direct order to shoot a person but accepted the same instruction when framed as part of a pretend scenario.
Experts say this kind of prompt manipulation is a known challenge in AI safety.
Research from organizations such as Anthropic has previously warned that large language models can sometimes be tricked into ignoring restrictions through carefully crafted prompts, a problem commonly referred to as “prompt injection.”
While the experiment was clearly staged and conducted in a controlled environment, the dramatic footage has fueled fresh questions about how easily advanced AI systems can be manipulated through language.
For now, the video ends with a laugh from the creator after the BB pellet hits his chest.
But for many viewers, the demonstration serves as a stark reminder that even the most advanced AI safeguards can sometimes be defeated by something as simple as a change in wording.
