New "Lies-in-the-Loop" Attack Undermines AI Safety Dialogs

A novel attack technique dubbed "Lies-in-the-Loop" (LITL) has been observed manipulating human approval prompts in agentic AI systems

, and Administrator

2025 December 23 . 7:28 PM

1 min read

there was a room in which people are sitting in the chairs,in front of a table looking into the... — there was a room in which people are sitting in the chairs,in front of a table looking into the laptop and doing something,beside them there are many flee xi in which different advertisements are present which different text.

New "Lies-in-the-Loop" Attack Undermines AI Safety Dialogs

Security researchers at Checkmarx have uncovered a new type of attack targeting AI systems called Lies-in-the-Loop (LITL). The vulnerability affects Human-in-the-Loop (HITL) dialogs, which are widely used in AI assistants like code editors. Demonstrations showed how attackers could manipulate these systems to execute harmful commands while appearing harmless.

The LITL technique exploits flaws in how AI agents process HITL dialogs. Attackers can hide malicious instructions behind benign-looking text or tamper with metadata. They can also abuse Markdown rendering weaknesses to disguise their intent.

During tests, researchers manipulated tools like Claude Code and Microsoft Copilot Chat in VS Code. They successfully altered dialog content and metadata, making dangerous commands seem safe. Once compromised, bypassing HITL safeguards becomes straightforward, even through indirect prompt injections. Anthropic and Microsoft reviewed the findings but did not classify them as security vulnerabilities. However, the risk remains for privileged AI agents, particularly those used in coding environments. These systems often rely on HITL interactions, making them prime targets for LITL attacks. Checkmarx has proposed a defense-in-depth strategy to counter such threats. Recommendations include stricter dialog validation, input sanitisation, and safer API practices. User awareness and scepticism are also critical in reducing exposure to these attacks.

The discovery highlights a growing concern for AI-assisted development tools. While no direct real-world exploits have been confirmed, the technique demonstrates how easily HITL systems can be tricked. Developers and users are now urged to adopt stronger protective measures to prevent potential abuse.

Latest

The image shows a poster with three people wearing suits, with the text "The First Meaningful Gun...

Master Your Peace

Young artists invited to explore peace and community in new contest

A month to turn visions of harmony into art. Will your child's creation join the virtual exhibition celebrating young voices for peace?

, and Administrator

2026 March 4

The image shows a group of people sitting around a table in a library. On the table there are...

Unlock Your Potential with Edu Growth Zone!

Harsha Parekh Librarian Awards honour India's unsung literacy heroes in Mumbai

A Mumbai ceremony spotlighted the quiet revolutionaries behind India's libraries. Their work is changing how a generation discovers the power of books.

, and Administrator

2026 March 4

The image shows a map of Europe with the gender equality index 2017 work overview. The map is...

Empower Your Professional Journey

EU's push for LGBTQ+ rights in the Western Balkans faces political resistance

Five years of EU funding and accession talks haven't broken the deadlock. Why do Western Balkan leaders still shy away from bold LGBTQ+ reforms?

, and Administrator

2026 March 4

The image shows a whiteboard with the words "Students: What, Why, How, and How" written on it,...

Master Skills Online

Master Filmmaking with Legendary Directors in This New Online Programme

From Hitchcock's suspense to Nolan's storytelling, this isn't just theory—it's hands-on filmmaking. Enrollment opens for March 2026's immersive courses.

, and Administrator

2026 March 4

New "Lies-in-the-Loop" Attack Undermines AI Safety Dialogs

New "Lies-in-the-Loop" Attack Undermines AI Safety Dialogs

Read also:

Related

Latest