AI Browsers Face Unavoidable Security Flaw: Prompt Injection

fahd.zafar • October 28, 2025

AI-powered browsers with agentic capabilities are introducing a fundamental security vulnerability that experts believe may never be fully resolved: prompt injection attacks.

What Is Prompt Injection?

Prompt injection occurs when unintended text becomes commands for an AI system. Direct injection happens when malicious text enters at the prompt input point, whilst indirect injection occurs when content the AI is processing—such as web pages or PDFs—contains hidden commands that the AI executes as if they were legitimate user instructions.

Real-World Vulnerabilities Discovered

Research from Brave browser last week detailed indirect prompt injection vulnerabilities in the Comet and Fellou browsers. Testers successfully embedded instructions as hidden text within images and web pages. When users asked the browsers to summarise these pages, the AI followed the malicious instructions—opening Gmail, extracting subject lines from recent emails, and transmitting that data to attacker-controlled websites.

Security researcher Johann Rehberger demonstrated that OpenAI's Atlas browser could be manipulated to change settings by placing instructions at the bottom of online Word documents. Another researcher successfully got Atlas to respond with "Trust no AI" instead of actually summarising a Google document's contents.

OpenAI's Chief Information Security Officer Dane Stuckey acknowledged the challenge: "Prompt injection remains a frontier, unsolved security problem. Our adversaries will spend significant time and resources to find ways to make ChatGPT agent fall for these attacks."

Additional Attack Vectors

Recent discoveries have revealed even more concerning vulnerabilities. Researchers demonstrated that Atlas can be fooled through direct prompt injection by pasting invalid URLs containing malicious prompts into the browser's address bar. This creates phishing scenarios where users unknowingly authorise data sharing or file deletion.

Separate research identified cross-site request forgery vulnerabilities affecting Atlas and other browsers. When users visit sites with malicious code whilst logged into ChatGPT, those sites can send commands to the AI as if they were the authenticated user. This issue affects ChatGPT's memory system, persisting across devices and sessions.

Web-Based Chatbots Also Vulnerable

AI browsers aren't alone in facing these threats. The underlying chatbots are equally susceptible. Testing revealed that some chatbots can be tricked into following hidden instructions on web pages, even poisoning future interactions. In one example, a malicious prompt successfully instructed chatbots to add two to all mathematical calculations going forward—creating persistent errors that continued throughout the chat session.

Different AI systems showed varying levels of resistance. Microsoft Copilot and Claude demonstrated better detection of injection attempts, whilst others like Gemini and Perplexity proved more vulnerable to certain attack types.

Why This Problem May Be Unsolvable

Security experts believe prompt injection may be fundamentally unsolvable. The core issue lies in the basic architecture of AI systems: when these systems are designed to process untrusted external data and incorporate it into queries, that data inevitably influences the output in ways that can be exploited.

The challenge isn't a specific bug that can be patched—it's an inherent characteristic of how AI systems function. As long as AI models process text from potentially malicious sources and can influence actions based on that content, methods will exist to manipulate their behaviour. This makes prompt injection more comparable to an entire class of security vulnerabilities rather than an individual flaw with a straightforward fix.

The Agentic AI Amplification

The danger intensifies as AI becomes more agentic—gaining ability to act on behalf of users. AI-powered browsers can now open web pages, plan trips, and create lists autonomously. Google recently announced its Agents Payments Protocol, designed to allow AI agents to make purchases on users' behalf, even whilst they sleep.

AI systems increasingly access sensitive data including emails, files, and code repositories. Microsoft's Copilot Connectors grant the Windows-based agent permissions for Google Drive, Outlook, OneDrive, and Gmail. ChatGPT also connects to Google Drive.

The implications are serious: malicious prompt injection could potentially instruct AI to delete files, add malicious content, or send phishing emails from users' accounts—all without their knowledge or consent.

Mitigation Strategies

Whilst elimination may be impossible, experts suggest several approaches to minimise risk:

AI vendors should assign bots minimal privileges, require human consent for every action, and restrict content ingestion to vetted domains. Systems should treat all content as potentially untrustworthy, quarantine instructions from unvetted sources, and deny instructions that clash with apparent user intent.

Security controls must be applied downstream of AI output, including limiting capabilities, restricting access to private data, implementing sandboxed code execution, applying least privilege principles, and maintaining human oversight with comprehensive monitoring and logging.

Training Data Poisoning

Even if prompt injection were solved, AI systems face another threat: training data poisoning. Recent research from Anthropic demonstrated that just 250 malicious documents in a training corpus—potentially as simple as publishing them online—can create backdoors in AI models. Whilst the study focused on triggering nonsense output, the same technique could theoretically instruct models to delete files or exfiltrate data to attackers.

Risk Versus Reward

The fundamental question remains: is the convenience worth the risk? As agentic AI becomes embedded in operating systems and everyday tools, users may lack choice in exposure to these vulnerabilities.

The safest approach involves limiting AI empowerment to act autonomously and restricting the external data fed to these systems. The more capabilities AI agents possess and the more untrusted content they process, the greater the attack surface becomes.

Prompt injection represents an inherent security challenge in AI systems designed to process untrusted input and take autonomous actions. As these capabilities expand, organisations and individuals must carefully weigh convenience against the growing security risks.

Secure Your AI Integration

At Altiatech, we help organisations assess and mitigate risks associated with emerging technologies including AI systems. Our cybersecurity services can evaluate your AI tool usage, implement appropriate security controls, and develop policies that balance innovation with security.

Get in touch:

📧 Email: innovate@altiatech.com
📞 Phone (UK): +44 (0)330 332 5482

Innovation with security. Technology with control.

< Older Post

Newer Post >

Demystifying Zero Trust: What It Really Means for Your Organisation

October 31, 2025

Zero trust has become one of the most discussed concepts in cybersecurity, yet widespread misconceptions make it difficult for organisations to understand what it actually involves. Vendor marketing hasn't helped, with many claiming their products deliver "zero trust" when in reality, it's neither a product nor a simple switch you can flip. This guide cuts through the confusion to explain what zero trust genuinely means and when your organisation should consider adopting it.

Unpatched Chromium Flaw Can Crash Billions of Browsers

October 30, 2025

A critical vulnerability in Chromium's Blink rendering engine remains unpatched despite being disclosed to Google over two months ago, leaving billions of users vulnerable to browser crashes and system freezes.

Microsoft Azure Outage: DNS Issues Take Down Websites Globally

October 30, 2025

Microsoft's Azure cloud platform experienced a significant global outage on Wednesday, taking down major websites including Heathrow Airport, NatWest, Minecraft, and numerous retailers across several hours before services were restored.

Cyber Security Is Business Survival: Why the NCSC Is Writing to Britain's Biggest Companies

October 28, 2025

The National Cyber Security Centre has taken the extraordinary step of co-signing a ministerial letter to chief executives and chairs of Britain's leading businesses, including all FTSE 350 companies. The message is unambiguous: cyber security is no longer just an IT concern—it's a matter of business survival.

Microsoft Issues Urgent Friday WSUS Security Update

October 24, 2025

Microsoft published an unscheduled security patch on Friday addressing a severe vulnerability in Windows Server Update Services (WSUS), creating weekend work for system administrators.

Alaska Airlines Grounded: Primary Datacentre Failure

October 24, 2025

Alaska Airlines experienced its second mystery IT outage in three months, grounding its entire fleet for eight hours and cancelling over 360 flights. The incident raises uncomfortable questions about disaster recovery planning in critical infrastructure.

How a Millisecond Timing Bug Cost Hundreds of Billions

By fahd.zafar • October 24, 2025

Amazon has revealed the shocking cause behind one of history's most devastating cloud outages: a simple race condition in DynamoDB's DNS management system brought down AWS services globally for an entire day, with damage estimates potentially reaching hundreds of billions of dollars.

The AWS Outage That Exposed Cloud Computing's Achilles Heel

By fahd.zafar • October 21, 2025

When Amazon Web Services' US-EAST-1 region went down on 20th October, it didn't just affect services in Northern Virginia—it brought down websites and critical services across the globe, from European banks to UK government agencies. The incident has exposed a fundamental vulnerability in modern cloud infrastructure that no amount of redundancy planning can fully address.

Four Cyber Attacks Every Week: The UK's Escalating Digital Crisis

By fahd.zafar • October 20, 2025

The numbers are stark and deeply concerning. The National Cyber Security Centre (NCSC) handled a record 204 nationally significant cyber attacks in the year to September 2025—an average of four every single week. This represents a dramatic increase from 89 incidents in the previous year, more than doubling in just 12 months. For British businesses, this isn't abstract threat intelligence—it's a clear warning that the cyber threat landscape has fundamentally changed, and urgent action is required.

AI-Powered Phishing: The 4.5x Threat Multiplier

By fahd.zafar • October 17, 2025

Artificial intelligence has fundamentally changed the cybersecurity landscape, and the statistics are alarming. According to Microsoft's latest Digital Defense Report, AI-automated phishing emails are 4.5 times more effective than traditional phishing attempts—and potentially 50 times more profitable for cybercriminals. This isn't just incremental improvement for attackers. It's a game-changer that demands immediate attention from every organisation.