AI zero-day exploitcybersecuritythreat intelligenceAI-assisted zero-day attack platform securityGoogle Mandiant AI threat detection

Google Flags First AI-Built Zero-Day in the Wild

29 Jun 20267 min readMarina Koval

// IN THIS ARTICLE

01What Happened 02Technical Anatomy 03Who Gets Burned 04Playbook for Security Teams 05Key Takeaways 06Frequently Asked Questions

The question every Head of Platform with an open source admin stack in production should be asking their CISO this week is not whether AI changes the threat model, it's whether the current vulnerability budget assumes attackers move at human speed. Google now says that assumption is dead. A prominent cybercrime crew used an AI model to help build a working zero-day, and the only reason it didn't go mass-market is that Google got to the vendor first.

What Happened

On Monday, Google published a report from its Gemini, Google Threat Intelligence Group (GTIG), and Mandiant teams summarizing how attackers are using AI in live operations. The headline finding, as SecurityWeek reported, is that Google has for the first time identified a zero-day exploit it believes was developed with AI assistance.

The target was an unnamed open source web-based system administration tool. The payload was a Python script designed to bypass two-factor authentication. The threat actor, also unnamed, appeared to be staging for mass exploitation before Google intervened with the vendor.

Google was careful with its attribution. "Although we do not believe Gemini was used, based on the structure and content of these exploits, we have high confidence that the actor likely used an AI model to support the discovery and weaponization of this vulnerability," the company wrote. The tells were stylistic: "the script contains an abundance of educational docstrings, including a hallucinated CVSS score, and uses a structured, textbook Pythonic format highly characteristic of LLMs training data (e.g., detailed help menus and the clean _C ANSI color class)."

That's not the only data point in the report. China-linked and North Korea-linked groups feature heavily. A China-linked actor deployed agentic tools called Strix and Hexstrike in attacks against a Japanese tech firm and a major East Asian cybersecurity company. The Chinese group UNC2814, known for going after telecoms and government targets, used a persona-driven jailbreak in which the model was instructed to act as a senior security auditor, then pointed at TP-Link firmware with OFTP implementations. North Korea's APT45 sent thousands of repetitive prompts to recursively analyze CVEs and validate PoC exploits, building what Google calls "a more solid arsenal of exploit capabilities that would be impractical to manage without AI assistance."

Technical Anatomy

Strip away the AI framing and look at what actually changed in the attacker's workflow. A 2FA bypass against a web-based admin tool is not a novel class of bug. The interesting part is the production pipeline behind it.

Historically, weaponizing an auth bypass goes through a few human-expensive steps: read the source, identify the state machine for the second factor, find a logic flaw or a race condition, write the Python, harden it for unattended use across hundreds or thousands of targets. Each step gates on a skilled operator's time. The forensic signature Google describes, the hallucinated CVSS score, the textbook docstrings, the clean ANSI color helper class, says the operator handed large chunks of that pipeline to a model. The script reads like LLM output because it is LLM output, lightly supervised.

UNC2814's persona-driven jailbreak matters for a related reason. Treating the model as "a senior security auditor" lets the operator extract reasoning that the provider's guardrails would otherwise block. Pair that with embedded firmware (TP-Link OFTP) and you get cheap, parallel reverse engineering of binary blobs that previously required a specialist. APT45's pattern is different again, less creative and more industrial: thousands of repetitive prompts, recursive CVE analysis, automated PoC validation. That's a batch job, not a hacker. It looks more like an ETL pipeline than a kill chain.

For defenders mapping these behaviors to MITRE ATT&CK, the techniques themselves aren't new. Resource Development, Reconnaissance, Develop Capabilities. What's new is the cost curve. The same operator now runs ten parallel discovery loops instead of one. The economics flip from "one good vuln per analyst per quarter" to "one good vuln per analyst per week." That's the number that should be on a CISO's whiteboard.

Who Gets Burned

The first casualties are anyone running open source admin tooling exposed to the internet. The unnamed target in Google's report is the canonical profile: web UI, system-level privileges, 2FA as the last line of defense, maintained by a thin volunteer team. iGaming operators with self-hosted ops dashboards, fintechs running Rancher or Portainer style consoles, crypto custodians with internal admin panels behind SSO and TOTP, all fit the shape.

The second casualty group is anyone whose threat model assumed N-day patching cadence was good enough. If APT45's pattern generalizes, the window between public CVE disclosure and weaponized PoC compresses toward zero. Patch SLAs written for a 14-day or 30-day window become a liability the CFO will eventually see on an incident report.

The third group is the vendors themselves. Google quietly worked with the impacted vendor to prevent mass exploitation, and that's the unit economics question nobody is pricing yet. Who pays for coordinated disclosure at AI speed? The maintainer of a free admin tool doesn't have a security engineer on call. The hyperscaler that detected the exploit absorbed the cost as a goodwill function of running Mandiant. That model doesn't scale to the thousand-tool long tail every platform team actually depends on.

This is where the GC and the VP Eng should have a fifteen minute conversation this week. The GC wants to know whether the company's SBOM and vendor security questionnaires cover open source admin tooling at all, or whether procurement only screens commercial vendors. The VP Eng wants to know which of those tools have a paid support tier with security response, and what it would cost to move the critical ones onto it before the next quarterly board update.

Playbook for Security Teams

Three concrete moves for the next 30 days.

First, inventory the admin plane. Every web-based system administration tool with a login page reachable from a corporate network, listed with its 2FA implementation. If 2FA is enforced via a plugin or a homegrown middleware rather than the framework, escalate it. The Google exploit targeted exactly that seam.

Second, change the patch SLA model from "calendar days post-CVE" to "hours post-exploit-signal." Wire the CISA KEV feed into the on-call rotation if it isn't already, and add a tier for any CVE touching tooling on the admin inventory list above. APT45's recursive PoC validation pattern means the public CVE is now a starting gun, not a planning input.

Third, harden against the agentic recon pattern UNC2814 demonstrated. Embedded firmware, network appliances, and edge devices are now in scope for cheap reverse engineering. If the perimeter still includes consumer-grade routers, branch office switches with last-decade firmware, or IoT bridges that nobody patches, those are now first-class assets in the threat model. Pull them behind a managed gateway or replace them. The build-vs-buy math on that has changed because the attacker's build cost just dropped.

One organizational note. The hiring market for offensive security engineers who understand LLM tooling is about to tighten hard. Teams that wait until Q4 to backfill will pay 30 to 50 percent over current comp bands. Lock in retention conversations now.

Key Takeaways

Google's report marks the first time it has publicly identified a zero-day exploit it attributes to AI-assisted development, targeting 2FA on an open source admin tool.
The forensic tells were stylistic, hallucinated CVSS scores and textbook Pythonic structure, not a novel vulnerability class. The cost curve of weaponization is what changed.
State-linked actors including UNC2814 and APT45 are already running AI-augmented vulnerability research at scale, using persona jailbreaks and recursive PoC validation.
Teams running open source admin tooling with internet-reachable login surfaces are the immediate exposure. Patch SLAs built around 14 to 30 day windows are now too slow.
Platform leads evaluating their next infrastructure decision should now be asking: which of our critical open source dependencies has a funded security response path, and what does it cost to move the rest onto one before year-end?

Frequently Asked Questions

Q: What did Google actually identify in this report?

Google said it found a zero-day exploit it believes was developed with the help of an AI model, targeting a 2FA bypass on an unnamed open source web-based system administration tool. The exploit was a Python script, and Google worked with the vendor to prevent mass exploitation before the campaign launched.

Q: How did Google attribute the exploit to AI assistance if Gemini wasn't used?

Google relied on stylistic evidence: a hallucinated CVSS score in the script, abundant educational docstrings, and a textbook Pythonic structure with patterns characteristic of LLM training data. The company stated high confidence that an AI model was used for discovery and weaponization, even though it doesn't believe Gemini specifically was the model in question.

Q: What should platform and security teams do this week in response?

Inventory every web-based admin tool reachable on internal or external networks and verify its 2FA implementation. Tighten patch SLAs for any tooling on that list so they trigger on exploit signals rather than calendar days post-CVE. And review which critical open source dependencies have a paid or funded security response path, because the maintainer-of-one model doesn't scale to AI-speed disclosure.

Marina Koval

RiverCore Analyst · Dublin, Ireland

// RELATED ARTICLES

Amazon Q Extension Flaw Leaked AWS Credentials on Repo Open

A Wiz-discovered flaw in Amazon Q Developer auto-ran commands from workspace configs on repo open, putting AWS credentials in reach via a single cloned repo.

Cisco SD-WAN Zero-Day CVE-2026-20245 Exploited Two Months Pre-Disclosure

A 7.8 CVSS Cisco Catalyst SD-WAN zero-day was exploited at least two months before disclosure, chaining with two undisclosed auth bypass flaws to plant a hidden 'troot' account at a telco.

Agentic AI Enters Programmatic Buying: Three Vendors, One Governance Problem

Three named platforms are pushing agentic AI into programmatic buying. The interesting question isn't speed, it's who owns the system of record when multiple agents act on the same campaign.