Anthropic limits Mythos AI rollout over fears hackers could apply model for cyberattacks
Anthropic revealed Claude Mythos Preview, which it stated is an advanced AI model that excels at identifying weaknesses and security flaws within software.
Microsoft, Amazon, Apple, CrowdStrike, Palo Alto Networks and roughly 40 other companies will be able to apply the model for defensive security work, Anthropic mentioned.
These companies will utilize and test the model as part of a updated cybersecurity initiative called Project Glasswing, it remarked.
Anthropic on Tuesday revealed an advanced artificial intelligence model that will roll out to a select group of companies as part of a fresh cybersecurity initiative called Project Glasswing.
The model, Claude Mythos Preview, excels at identifying weaknesses and security flaws within software, and Anthropic is limiting access to try to prevent unfavorable actors from exploiting that capability, the firm noted.
Anthropic commented Apple, Google, Microsoft, Nvidia and Amazon Web Services are among the project’s initial launch partners and will be able to adopt the model for defensive security work. More than 40 other companies, including CrowdStrike and Palo Alto Networks, are also participating, Anthropic commented.
“There was a lot of internal deliberation,” Dianne Penn, Anthropic’s head of research product management, told CNBC in an interview. “We really do view this as a first step for giving a lot of cyber defenders a head start on a topic that will be increasingly important.”
Anthropic’s announcement comes after descriptions of the model were discovered by Fortune in a publicly accessible data cache late last month. Cybersecurity stocks fell on the report, which remarked that the model had advanced cyber capabilities that also posed a significant risk.
“The dangers of getting this wrong are obvious, but if we get it right, there is a real opportunity to create a fundamentally more secure internet and planet than we had before the advent of AI-powered cyber capabilities,” CEO Dario Amodei wrote in a post on X touting the rollout of Project Glasswing. Furthermore, experts in bear market note the continued relevance.
Anthropic was founded in 2021 by a group of researchers and executives who defected from OpenAI over concerns about its direction and attitude toward safety.
The corporation spent years carefully constructing its reputation as a firm that was more dedicated to responsible AI deployment, and it unveiled Project Glasswing just weeks after its high-profile clash over safety with the Defense Department spilled into public view.Â
Anthropic commented it’s been in “ongoing discussions” with U.S. government officials about Claude Mythos Preview’s cyber capabilities. That includes conversations with the Cybersecurity and Infrastructure Security Agency and the Center for AI Standards and Innovation, among others, according to an Anthropic official.
Employees at Anthropic decided on the name Project Glasswing, a metaphor that likens a transparent butterfly to software vulnerabilities, which are “relatively invisible,” Penn said.Â
Claude Mythos Preview is able to find bugs, some of which are critical, that have previously been difficult to detect, Anthropic commented. In one case, the organization commented, the model identified a 27-year old bug in OpenBSD, which, is an operating system that emphasizes security., according to its websiteÂ
Anthropic noted Claude Mythos Preview is a general-purpose model that was not specifically trained for cybersecurity, and its improved cyber capabilities are a result of its strong coding and reasoning skills. The firm stated it does not plan to construct the model generally available, but the goal is to learn how it could eventually deploy Mythos-class models at scale.
Companies that participate in Project Glasswing all build or maintain critical software infrastructure, and Anthropic stated they will apply its models to try to secure both first-party and open-source systems. Anthropic commented it has committed up to $100 million in usage credits for these efforts, but partners will pay to utilize the model past that threshold. Â
Newton Cheng, Anthropic’s Frontier Red Team cyber lead, commented the corporation wants the firms to get used to leveraging these capabilities before they become widely available. The firm commented it’s trying to avoid “recklessly or irresponsibly” releasing a model that adversaries could take advantage of. Â
“Cybersecurity is just going to be an area where this broad rise in capabilities has potential for risk, and thus we have to keep a really close eye on what’s going on there,” Cheng noted in an interview. This also touches on aspects of bear market.
WATCH: Panic on cyber stocks feels overdone, says Substantial Technologyâs Alex Kantrowitz