AI Browsers Face Unavoidable Security Flaw: Prompt Injection

fahd.zafar • October 28, 2025

AI-powered browsers with agentic capabilities are introducing a fundamental security vulnerability that experts believe may never be fully resolved: prompt injection attacks.

What Is Prompt Injection?

Prompt injection occurs when unintended text becomes commands for an AI system. Direct injection happens when malicious text enters at the prompt input point, whilst indirect injection occurs when content the AI is processing—such as web pages or PDFs—contains hidden commands that the AI executes as if they were legitimate user instructions.



Real-World Vulnerabilities Discovered

Research from Brave browser last week detailed indirect prompt injection vulnerabilities in the Comet and Fellou browsers. Testers successfully embedded instructions as hidden text within images and web pages. When users asked the browsers to summarise these pages, the AI followed the malicious instructions—opening Gmail, extracting subject lines from recent emails, and transmitting that data to attacker-controlled websites.


Security researcher Johann Rehberger demonstrated that OpenAI's Atlas browser could be manipulated to change settings by placing instructions at the bottom of online Word documents. Another researcher successfully got Atlas to respond with "Trust no AI" instead of actually summarising a Google document's contents.


OpenAI's Chief Information Security Officer Dane Stuckey acknowledged the challenge: "Prompt injection remains a frontier, unsolved security problem. Our adversaries will spend significant time and resources to find ways to make ChatGPT agent fall for these attacks."



Additional Attack Vectors

Recent discoveries have revealed even more concerning vulnerabilities. Researchers demonstrated that Atlas can be fooled through direct prompt injection by pasting invalid URLs containing malicious prompts into the browser's address bar. This creates phishing scenarios where users unknowingly authorise data sharing or file deletion.


Separate research identified cross-site request forgery vulnerabilities affecting Atlas and other browsers. When users visit sites with malicious code whilst logged into ChatGPT, those sites can send commands to the AI as if they were the authenticated user. This issue affects ChatGPT's memory system, persisting across devices and sessions.



Web-Based Chatbots Also Vulnerable

AI browsers aren't alone in facing these threats. The underlying chatbots are equally susceptible. Testing revealed that some chatbots can be tricked into following hidden instructions on web pages, even poisoning future interactions. In one example, a malicious prompt successfully instructed chatbots to add two to all mathematical calculations going forward—creating persistent errors that continued throughout the chat session.


Different AI systems showed varying levels of resistance. Microsoft Copilot and Claude demonstrated better detection of injection attempts, whilst others like Gemini and Perplexity proved more vulnerable to certain attack types.



Why This Problem May Be Unsolvable

Security experts believe prompt injection may be fundamentally unsolvable. The core issue lies in the basic architecture of AI systems: when these systems are designed to process untrusted external data and incorporate it into queries, that data inevitably influences the output in ways that can be exploited.


The challenge isn't a specific bug that can be patched—it's an inherent characteristic of how AI systems function. As long as AI models process text from potentially malicious sources and can influence actions based on that content, methods will exist to manipulate their behaviour. This makes prompt injection more comparable to an entire class of security vulnerabilities rather than an individual flaw with a straightforward fix.



The Agentic AI Amplification

The danger intensifies as AI becomes more agentic—gaining ability to act on behalf of users. AI-powered browsers can now open web pages, plan trips, and create lists autonomously. Google recently announced its Agents Payments Protocol, designed to allow AI agents to make purchases on users' behalf, even whilst they sleep.


AI systems increasingly access sensitive data including emails, files, and code repositories. Microsoft's Copilot Connectors grant the Windows-based agent permissions for Google Drive, Outlook, OneDrive, and Gmail. ChatGPT also connects to Google Drive.


The implications are serious: malicious prompt injection could potentially instruct AI to delete files, add malicious content, or send phishing emails from users' accounts—all without their knowledge or consent.



Mitigation Strategies

Whilst elimination may be impossible, experts suggest several approaches to minimise risk:


AI vendors should assign bots minimal privileges, require human consent for every action, and restrict content ingestion to vetted domains. Systems should treat all content as potentially untrustworthy, quarantine instructions from unvetted sources, and deny instructions that clash with apparent user intent.


Security controls must be applied downstream of AI output, including limiting capabilities, restricting access to private data, implementing sandboxed code execution, applying least privilege principles, and maintaining human oversight with comprehensive monitoring and logging.



Training Data Poisoning

Even if prompt injection were solved, AI systems face another threat: training data poisoning. Recent research from Anthropic demonstrated that just 250 malicious documents in a training corpus—potentially as simple as publishing them online—can create backdoors in AI models. Whilst the study focused on triggering nonsense output, the same technique could theoretically instruct models to delete files or exfiltrate data to attackers.



Risk Versus Reward

The fundamental question remains: is the convenience worth the risk? As agentic AI becomes embedded in operating systems and everyday tools, users may lack choice in exposure to these vulnerabilities.


The safest approach involves limiting AI empowerment to act autonomously and restricting the external data fed to these systems. The more capabilities AI agents possess and the more untrusted content they process, the greater the attack surface becomes.


Prompt injection represents an inherent security challenge in AI systems designed to process untrusted input and take autonomous actions. As these capabilities expand, organisations and individuals must carefully weigh convenience against the growing security risks.



Secure Your AI Integration

At Altiatech, we help organisations assess and mitigate risks associated with emerging technologies including AI systems. Our cybersecurity services can evaluate your AI tool usage, implement appropriate security controls, and develop policies that balance innovation with security.



Get in touch:

📧 Email: innovate@altiatech.com
📞 Phone (UK): +44 (0)330 332 5482


Innovation with security. Technology with control.

December 15, 2025
Traditional security models assumed everything inside the corporate network was trustworthy, focusing defensive efforts on the perimeter. This approach fails catastrophically in today's hybrid work environment where employees access resources from homes, coffee shops, and co-working spaces whilst applications reside across multiple clouds.
Microsoft logo on a wood-paneled wall, with colorful squares and company name.
December 10, 2025
Microsoft is introducing major Microsoft 365 licensing changes in 2026. Learn what’s changing, who is affected and how businesses should prepare.
December 8, 2025
Cloud computing promised cost savings through pay-per-use models and elastic scaling. Yet many UK organisations discover their cloud bills steadily increasing without corresponding business growth. The culprit? Cloud waste - unnecessary spending on unused or inefficiently configured resources.
November 28, 2025
A threat group known as Scattered Lapsus$ Hunters is targeting Zendesk users through a sophisticated campaign involving fake support sites and weaponised helpdesk tickets, according to security researchers at ReliaQuest. The operation represents an evolution in how cybercriminals exploit trust in enterprise SaaS platforms.
November 28, 2025
Amazon Web Services has launched a new feature allowing customers to make DNS changes within 60 minutes during service disruptions in its US East (N. Virginia) region. The announcement tacitly acknowledges what many have long observed: AWS's largest and most critical region has a reliability problem.
November 28, 2025
A Scottish council remains unable to fully restore critical systems two years after a devastating ransomware attack, highlighting the long-term consequences of inadequate cybersecurity preparation and the challenges facing resource-constrained local authorities.  Comhairle nan Eilean Siar, serving Scotland's Western Isles, suffered a ransomware attack in November 2023 that required extensive system reconstruction. According to a report published by Scotland's Accounts Commission, several systems remain unrestored even now, with large data volumes slowing the digital recovery process.
November 26, 2025
Ready to migrate from Windows 10? Contact Altiatech for a comprehensive migration assessment and strategy tailored to your organisation's needs.
November 25, 2025
The Cybersecurity and Infrastructure Security Agency has issued an alert warning that multiple cyber threat actors are actively leveraging commercial spyware to target users of mobile messaging applications including Signal and WhatsApp. The sophisticated campaigns use advanced social engineering and exploit techniques to compromise victims' devices and gain unauthorized access to their communications.
By fahd.zafar November 24, 2025
Microsoft has introduced experimental AI agent capabilities into Windows through Copilot Actions and agent workspaces, features designed to automate everyday tasks like organising files, scheduling meetings, and sending emails. However, the announcement comes with significant security warnings that business leaders and IT administrators must understand before enabling these capabilities.
November 17, 2025
Anthropic has disclosed the first documented case of a large-scale cyberattack executed with minimal human intervention, marking a significant escalation in AI-enabled cyber threats. The campaign, attributed with high confidence to a Chinese state-sponsored group, demonstrates how rapidly AI capabilities are being weaponised for espionage operations.