Anthropic argues that scaling its own power is central to responsible AI development. Critics say that logic is self-serving and dangerous. Here's what's actually at stake.
Anthropic, the AI safety company behind the Claude models, has a core argument baked into its business strategy: the safest outcome for AI development is Anthropic winning. According to a WIRED report, the company holds that responsible AI can only happen if a safety-focused lab stays at the frontier, which means growing fast, raising billions, and outcompeting rivals. Critics say that framing conveniently doubles as a justification for accumulating power, and that the two goals are harder to separate than Anthropic admits.
WIRED reported that Anthropic has built its commercial strategy around a specific claim: that its own dominance in the AI market is a precondition for AI being safe. The company frames aggressive growth, large fundraising rounds, and frontier model development not as business ambition but as a safety obligation.
The logic goes roughly like this. Powerful AI is coming regardless. If a company that prioritizes safety does not stay at the cutting edge, a company that does not will. Therefore, Anthropic must scale, must compete, and must win enough of the market to shape how the technology develops.
This argument is not new inside Anthropic. It has been part of the company’s public positioning since its founding. What has changed is the scale at which Anthropic is now operating, and the scrutiny that comes with it.
The criticism landing on Anthropic is pointed. Observers and competitors note that the same argument, “we need power to prevent harm,” is exactly what a company seeking power would say whether or not it was true. There is no clean way to falsify it from the outside.
This creates a structural problem for anyone relying on Anthropic’s safety framing when choosing AI tools or partners. The safety claims are real in the sense that Anthropic does publish research, does run red-teaming programs, and does employ serious researchers. But the claim that Anthropic’s growth is itself the safety strategy asks for a level of institutional trust that has not been independently verified.
A few specific tensions worth watching:
For businesses building on Claude or using Anthropic’s API, this is not an abstract debate. It shapes what the company prioritizes, how it responds to regulation, and how much independence its safety team actually has when it conflicts with the growth team.
We are not dismissing Anthropic’s safety work. The company has researchers doing serious, published work on alignment and interpretability. That is real and it matters.
But the “we must win to keep AI safe” framing is doing a lot of heavy lifting, and it deserves more skepticism than it usually gets in coverage. Every major AI lab now has a version of this story. OpenAI has it. Google DeepMind has it. They cannot all be the necessary steward of safe AI. At least some of them are just building a business and dressing it up in safety language because that language is currently rewarded by investors, regulators, and press.
The honest version of Anthropic’s position is something like: “We think we are better at safety than our competitors, we might be right, and we are betting that racing to the frontier while caring about safety beats ceding that ground to someone who does not.” That is a defensible position. It is also a gamble, not a guarantee, and calling it responsible development does not make it one.
For our clients evaluating AI vendors: the safety branding of a lab tells you something, but not everything. Look at the actual model behavior, the terms of service, the data handling, and the track record on specific harms. Those are things you can actually test.
If your business is building on or evaluating Anthropic’s Claude models, here is a practical checklist:
The bottom line: trust the benchmarks and the contracts, not the mission statement.