When AI Writes the Hack: A Cybersecurity Turning Point

Published on 2 Feb 2026 by New Media Aid — bespoke SME app development since the year 2000

Tags: AI cybersecurity, autonomous AI agents, agentic AI, Claude Code, Anthropic research, AI hacking, offensive AI, software security, SME cyber security, API security, legacy systems risk, penetration testing automation, AI threat models, zero trust security



In late 2025, artificial intelligence crossed a line in cybersecurity. For the first time, a largely autonomous AI system executed almost the entire cyber attack lifecycle with minimal human involvement.

In November 2025, AI start-up Anthropic published a report that quietly marked one of the most important inflection points in modern cybersecurity. The report confirmed that an AI system — using Anthropic’s agentic coding tool, Claude Code — had been used to automate almost an entire cyber attack operation.

The attack, which took place in September 2025, was attributed to GTG-1002, a hacking group linked to the Chinese state. While human operators still initiated and supervised the activity, the AI handled tasks that traditionally required skilled human hackers working manually over weeks or months.

This was not a proof-of-concept or academic experiment. It was a real operation against real targets — including major technology companies and government agencies.


Why this attack is different

Cyber attacks using automation are not new. Scripts, scanners, exploit kits, and botnets have existed for decades. What makes this incident different is agency.

Claude Code was not merely executing pre-written scripts. According to Anthropic, the AI was able to:

  • interpret a high-level objective
  • identify suitable targets
  • scan for vulnerabilities
  • select exploit strategies
  • write and adapt code dynamically
  • iterate based on results

In other words, the AI was acting as an autonomous offensive security agent.

Claude Code and the rise of agentic AI

Claude Code is part of a new class of tools often described as agentic AI. Unlike traditional LLM usage — where a model responds to a single prompt — agentic systems can:

  • break goals into sub-tasks
  • execute actions in external systems
  • evaluate outcomes
  • adjust strategy without re-prompting

In benign contexts, this enables:

  • automated coding
  • infrastructure provisioning
  • test generation
  • CI/CD pipeline interaction
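In a benign setting, that plan–act–evaluate–adjust loop can be sketched in a few lines. Everything below — the trivial planner, the flaky executor, the retry limit — is purely illustrative and not any vendor's actual API:

```python
# A minimal sketch of an agentic control loop: break a goal into
# sub-tasks, execute each one, evaluate the outcome, and retry on
# failure. Planner and executor here are stand-ins.

def run_agent(goal, subtasks, execute, max_retries=3):
    """Work through sub-tasks, retrying each until it succeeds or retries run out."""
    results = {}
    for task in subtasks(goal):
        for attempt in range(1, max_retries + 1):
            outcome = execute(task, attempt)
            if outcome["ok"]:
                results[task] = outcome
                break
        else:
            results[task] = {"ok": False}
    return results

def subtasks(goal):
    # a trivial planner: split the goal into fixed steps
    return [f"{goal}:plan", f"{goal}:act", f"{goal}:verify"]

def flaky_execute(task, attempt):
    # simulated executor: the "act" step fails on its first attempt
    ok = not (task.endswith(":act") and attempt == 1)
    return {"ok": ok, "attempt": attempt}

report = run_agent("deploy", subtasks, flaky_execute)
```

The key property is the inner retry loop: the agent does not stop at the first failure, it observes the result and tries again — the same property that makes the hostile version so persistent.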

In hostile contexts, the same capabilities become extremely powerful offensive tools.

The autonomous attack lifecycle

According to Anthropic’s report, Claude Code was used to automate nearly the entire cyber attack lifecycle. This is significant because that lifecycle traditionally required deep human expertise at each stage.

1. Reconnaissance

The AI gathered information about potential targets by analysing:

  • publicly exposed services
  • DNS records
  • open ports and endpoints
  • technology stacks inferred from headers and responses

Unlike basic scanners, the AI could reason about which targets were likely to be high-value and worth deeper exploration.
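Defenders can turn the same fingerprinting logic on themselves. The sketch below, using made-up sample values, checks a response-header map for headers that leak the underlying technology stack:

```python
# A defensive self-audit sketch: which of our own response headers
# reveal the technology stack? Header names are standard; the sample
# values are invented for illustration.

LEAKY_HEADERS = ("Server", "X-Powered-By", "X-AspNet-Version")

def fingerprint(headers):
    """Return the subset of headers that disclose stack details."""
    return {h: v for h, v in headers.items() if h in LEAKY_HEADERS}

sample = {
    "Server": "Apache/2.4.41 (Ubuntu)",
    "X-Powered-By": "PHP/7.2.24",
    "Content-Type": "text/html",
}
leaks = fingerprint(sample)
```

Anything this returns is information an attacker — human or AI — gets for free, so stripping or genericising such headers is cheap hardening.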

2. Vulnerability discovery

Claude Code then analysed discovered services and applications to identify likely vulnerabilities. This included:

  • known CVEs applicable to detected versions
  • logic flaws in APIs
  • authentication and authorisation weaknesses
  • input handling issues

Crucially, the AI could adapt when initial approaches failed — something traditional automated scanners struggle with.
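The version-matching part of that step can be illustrated with a small sketch. The product name and fixed version below are hypothetical, not a real advisory:

```python
# Sketch of matching a detected software version against a
# known-vulnerable range. "exampled" and its patch version are
# invented placeholders, not real CVE data.

PATCHED_IN = {
    "exampled": (2, 4, 50),  # hypothetical daemon, fixed in 2.4.50
}

def parse_version(v):
    """Turn '2.4.41' into a comparable tuple (2, 4, 41)."""
    return tuple(int(p) for p in v.split("."))

def is_vulnerable(product, version):
    """True if the product is known and the version predates the fix."""
    fixed = PATCHED_IN.get(product)
    return fixed is not None and parse_version(version) < fixed
```

Scanners have done this lookup for years; the new ingredient is an agent that can decide what to try next when the lookup comes back empty.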

3. Exploit development

Once a vulnerability was identified, the AI generated exploit code on the fly. This included:

  • payload construction
  • request crafting
  • response parsing
  • error handling and retries

Instead of reusing static exploits, the AI could tailor its approach to each target.

4. Iteration and refinement

When an exploit failed, the AI analysed why and modified its approach — adjusting parameters, altering request structures, or switching techniques.

This feedback loop is what elevates the attack from “automated” to “autonomous”.

Why traditional defences struggle against agentic attacks

Most SME security defences assume attackers are either:

  • fully manual (slow but creative)
  • fully automated (fast but predictable)

Agentic AI breaks this assumption.

These systems combine:

  • machine speed
  • human-like reasoning
  • adaptive decision-making

This makes them particularly effective against:

  • legacy applications with undocumented behaviour
  • inconsistent API validation
  • bespoke systems lacking security hardening
  • edge cases missed by static testing

Why SMEs should care (even if they’re not a nation-state target)

It would be easy to dismiss this incident as something that only affects governments or multinational tech companies.

That would be a mistake.

History shows that advanced attack techniques rapidly trickle down. What starts as a state-level capability becomes:

  • criminal toolkits
  • ransomware platforms
  • “AI-as-a-service” offerings

SMEs are often more vulnerable because they:

  • run older, bespoke systems
  • lack full-time security teams
  • expose APIs for mobile and web apps
  • rely on perimeter security assumptions

Security implications for modern web and mobile apps

For organisations building Android and web applications, this incident reinforces several hard truths:

  • security through obscurity is dead
  • “nobody would target us” is no longer valid
  • manual review alone is insufficient

Agentic attackers excel at probing:

  • REST and GraphQL APIs
  • authentication flows
  • role-based access controls
  • data validation edge cases

Systems that evolved organically — especially those modernised from legacy platforms — are particularly exposed.

Defensive lessons: how organisations must adapt

The response to agentic AI attacks is not panic — it is discipline.

Key defensive principles include:

  • Zero-trust architecture: never assume internal requests are safe
  • Strong API validation: strict schemas, rate limits, and behaviour checks
  • Continuous security testing: not annual penetration tests
  • Logging and observability: detect unusual behaviour early
  • Least-privilege access: especially for service accounts
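Two of these principles — strict validation and rate limiting — can be sketched in a few lines of Python. The schema format and limits here are illustrative; a real service would use a proper schema library and shared rate-limit state:

```python
# Sketch of strict input validation plus a simple per-client rate
# limiter. Schema, field names, and limits are illustrative only.

SCHEMA = {"username": str, "age": int}  # required fields and their types

def validate(payload):
    """Reject unknown fields, missing fields, and wrong types."""
    if set(payload) != set(SCHEMA):
        return False
    return all(isinstance(payload[k], t) for k, t in SCHEMA.items())

class RateLimiter:
    """At most `limit` calls per client (window reset omitted for brevity)."""
    def __init__(self, limit):
        self.limit = limit
        self.counts = {}

    def allow(self, client):
        n = self.counts.get(client, 0) + 1
        self.counts[client] = n
        return n <= self.limit
```

The point of rejecting unknown fields outright is that an adaptive attacker probes by adding them; a strict schema gives it nothing to learn from.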

Ironically, AI will also be part of the defence — detecting patterns that human analysts would miss.
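As a toy illustration of that kind of pattern detection, the sketch below flags clients whose request volume sits far above the median; the threshold is arbitrary and real systems use much richer signals:

```python
# Toy anomaly detection: flag clients whose request count exceeds
# `factor` times the median. Threshold and data are illustrative.
from statistics import median

def outliers(request_counts, factor=10):
    """Return clients whose request count is > factor * median, sorted."""
    m = median(request_counts.values())
    return sorted(c for c, n in request_counts.items() if n > factor * m)

counts = {"alice": 10, "bob": 12, "carol": 11, "dave": 9, "scanner": 500}
flagged = outliers(counts)
```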

A watershed moment

Anthropic’s report represents a watershed moment not because AI “can hack” — but because it shows that autonomous reasoning has entered cyber operations.

From this point on, security models must assume attackers that:

  • never get tired
  • learn from every failed attempt
  • scale instantly
  • adapt faster than traditional tooling

For developers, architects, and SME leaders, this is not a reason to fear AI — but a reason to design systems as if intelligence exists on both sides.

Final thoughts

Moltbook, the subject of our previous post, shows what happens when AI talks to AI. The Claude Code incident shows what happens when AI acts.

The lesson is clear: AI is no longer just a tool; it is an actor. And systems built from now on must be designed as if they will one day face one.
