BIP Austin digital publishing platform

collapse
Home / Daily News Analysis / The US government’s Anthropic models ban was never about an AI jailbreak

The US government’s Anthropic models ban was never about an AI jailbreak

Jun 25, 2026  Twila Rosenbaum 3 views
The US government’s Anthropic models ban was never about an AI jailbreak

The U.S. government's enforcement letter to Anthropic, which effectively forced the company to pull its latest AI models offline just before the weekend, should be a wake-up call for any U.S. tech company — AI lab or otherwise.

On Friday afternoon, the U.S. Commerce Department sent Anthropic a letter invoking an obscure export control directive that banned non-Americans, including Anthropic's employees, from accessing Fable 5 and Mythos 5, citing an unspecified national security concern. Anthropic said it believes the letter is related to a bypass of the model's guardrails, but isn't sure because the letter doesn't provide specific details. The letter has not been made public.

In response, Anthropic shut down both of its top models to all customers to ensure compliance. The result was that the U.S. government successfully forced a tech company to pull its products offline with a swift, unilateral action that didn't appear to require court approval.

The Broader Implications for AI and Tech

Friday's intervention by the Trump administration demonstrates that the AI industry is not immune to government interference. It also serves as a warning to the wider tech industry: comply, or face shutdown. The move has sparked intense debate about the scope of executive power over software releases, especially in emerging fields like artificial intelligence.

Citing sources, Axios described a tense situation over the weekend between the two major players, noting that personality differences between Anthropic and the Trump administration led to the export directive, rather than a technical issue with the AI products. New details that emerged over the weekend cast further doubt on the government's already shaky reasoning.

Security Researchers Weigh In

Katie Moussouris, a cybersecurity veteran and founder of Luta Security, published a blog post revealing that Anthropic had recently shared with her a private copy of a paper written by security researchers describing an alleged guardrail bypass in Fable 5. The Wall Street Journal reports that the paper's authors are security researchers at Amazon. Moussouris said that Anthropic reached out to ask for her take on the paper.

Moussouris' blog post described how the researchers triggered the guardrail bypass, but emphasized that the bypass itself should never have triggered an export control. She explained the difference between asking an AI model to review code for security issues versus asking it to fix code. The end result is largely the same, even if the questions are posed slightly differently. She argued that the behavior described in the paper cannot meaningfully be fixed, and any attempt would only weaken the model for defense.

Moussouris criticized the export control directive as hasty, heavy-handed, and misguided. She and dozens of other top security researchers and experts have called on the Trump administration to revoke the order, calling the move to pull advanced cybersecurity capabilities from network defenders in the U.S. dangerous.

Historical Context of Export Controls

Past administrations have made sweeping decisions on knowledge gaps. For instance, language used by the U.S. government during the 2010s to fix export law covering cybersecurity tools that could also be used for cyberattacks was so broad that it inadvertently nearly outlawed legitimate security and vulnerability research. That incident eventually led to reforms after significant pushback from the security community.

However, the Trump administration's directive appears retaliatory. Justin Hendrix, editor of Tech Policy Press, said the move is likely to raise alarms in foreign capitals about the reliability of American AI for critical applications. The message is that AI companies in the United States cannot be trusted to operate without interference from the U.S. government.

Unanswered Questions and Potential Motives

The Trump administration hasn't confirmed why it invoked its export control directive. Did officials misread the research report and overreact? Did Amazon CEO Andy Jassy say something to senior government officials that prompted the reaction, out of caution or spite? Was something lost in translation, or was this a way to pressure Anthropic, with whom the administration already has a fractious relationship? It is possible that the White House was unaware of the far-reaching consequences of the letter's demand and officials are scrambling to undo the damage of their own making.

To quote Hendrix, the climate is one of a cloud of suspicion that senior officials are picking favorites based on personal and political factors. The aftermath is that the government has set a dangerous precedent about how much control it intends to wield over the release of American-made software.

This time the government took issue with Anthropic; tomorrow it could be with anyone else. The implications for startups and established companies alike are profound: any AI model could potentially be shut down without warning if it is perceived to violate a vaguely defined national security interest.

Technical Details of the Guardrail Bypass

Security researchers described a technique that involved crafting prompts that subtly differ from standard queries. For example, asking the model to review code for vulnerabilities triggers its safety mechanisms, but asking it to fix the same code may bypass those guardrails. The researchers demonstrated that by rephrasing requests, they could get Fable 5 to produce outputs that its safety systems were designed to prevent. However, Moussouris argues that such behavior is inherent to the model's capabilities and cannot be patched without sacrificing utility for defensive cybersecurity tasks.

The controversy highlights a fundamental challenge in AI safety: models trained to refuse certain requests can often be tricked through simple linguistic variations. The export control directive assumes that these bypasses represent a flaw that can be fixed, but many experts believe they are an intrinsic property of large language models.

Reactions from the Tech Community

The incident has drawn sharp criticism from across the political spectrum. Conservative libertarians argue that the government overstepped its authority by acting without judicial oversight. Progressive tech critics see it as evidence that the administration is using national security pretexts to settle scores with companies it dislikes. Meanwhile, industry groups have expressed concern that the uncertainty will deter investment in AI research and development.

Some have drawn parallels to the 2018 ban on ZTE, which crippled the Chinese telecom company. But unlike that case, the Anthropic directive targets a domestic company and its own employees, raising questions about the scope of export controls as applied to U.S. persons. Legal experts note that the relevant regulations typically restrict access by foreign nationals to certain technologies, but here the ban applies to Anthropic itself, effectively forcing it to cease operation of its models.

The situation remains fluid. Anthropic has not indicated whether it will challenge the directive in court. If it does, the case could set a landmark precedent for the limits of executive power over software. For now, the company's customers — many of whom rely on Fable 5 and Mythos 5 for critical cybersecurity functions — are left scrambling for alternatives.

The broader lesson is clear: in the eyes of the current administration, no AI company is immune from unilateral shutdown. The technology sector must urgently consider how to protect itself from such interventions, whether through legal safeguards, industry standards, or by building resilient systems that do not depend on a single provider.


Source:TechCrunch News


Share:

Your experience on this site will be improved by allowing cookies Cookie Policy