“For my team, the cost of [AI] compute is far beyond the costs of the employees.” – Bryan Catanzaro – Vice president of applied deep learning at Nvidia
In the current generation of artificial intelligence deployment, the binding constraint for many organisations is no longer talent but access to affordable compute at scale.1 That inversion of the traditional cost structure is what turns remarks from senior technologists into a broader economic signal: for some cutting-edge teams, the recurring bill for accelerated hardware, cloud instances, networking, and power now exceeds the wage bill for the engineers designing and operating the systems.1,3 This is not just an accounting oddity; it alters how firms evaluate automation, how investors price AI strategies, and how policymakers should interpret predictions of rapid job displacement.
From cheap silicon and expensive people to expensive silicon and leveraged people
For several decades, digital transformation followed a familiar pattern: hardware and basic infrastructure costs per unit of computation fell predictably, while skilled labour remained scarce and expensive. Organisations hired more software engineers to sit atop an increasingly cheap computational substrate. In that world, the canonical argument for automation was straightforward: once a process was codified in software, the marginal cost of running it again was negligible compared with paying an additional human to do the same work.
What has changed with frontier AI systems is the scale and intensity of computation required to deliver competitive performance. Training large language models and vision systems involves running vast numbers of parallel operations across specialised GPUs and associated accelerators, often for weeks, on clusters that can cost tens or hundreds of millions of dollars in capital expenditure for hyperscalers and large enterprises.1,6,8 Even when organisations do not own the hardware, their cloud providers pass through these capital and operating costs via metered pricing. As a result, the unit of analysis for AI economics has shifted from “salary per employee” to metrics such as cost per GPU hour, cost per million tokens, and cost per inference request.2,4
Industry observers now describe AI infrastructure as a new class of heavy industry: data centres designed around specialised accelerators, redundant power feeds, and advanced cooling, with aggregate global spending that consultancy estimates place in the trillions of dollars by 2030.1,6 That capital intensity explains why some AI teams report that, at the margin, paying for more compute to improve a model or scale an application is financially weightier than hiring additional engineers to refine prompts, build interfaces, or clean data.
Why compute dominates the cost stack
There are several mechanisms behind compute eclipsing labour in AI projects.
First, state-of-the-art models are extremely large. Modern large language models and multi-modal systems often contain hundreds of billions or even 1 trillion parameters. Each training run requires repeated passes over large datasets, with backpropagation and optimisation algorithms applied across every parameter. The computation required scales roughly linearly with the number of parameters and data tokens, and in practice teams often run multiple training, fine-tuning, and ablation cycles. That translates into millions of GPU hours, even when using the most efficient hardware and software stacks available.2,4,10
Second, inference – the process of serving model outputs to users – imposes ongoing costs that grow with adoption. Training is a one-off or periodic capital-like expense, but inference is an operational expense that scales with queries. Industry frameworks therefore emphasise cost per token as a central metric: the all-in cost to produce each output token, incorporating hardware depreciation, energy, data centre overheads, networking, and software optimisation.2 Even small differences in cost per million tokens can compound dramatically when applications serve millions of users or integrate AI into high-frequency workflows.
Third, the energy footprint of frontier AI is substantial. High-end GPUs draw significant power, and data centres require additional energy for cooling and ancillary systems. Energy prices vary geographically, but in many locations they are high enough that power constitutes a large share of the total cost of ownership for AI infrastructure. Analysts have therefore started speaking of “intelligence per megawatt” as a key performance dimension.2 When firms compare this stack of costs with the wages of knowledge workers, the balance can tilt unexpectedly toward hardware and energy spending.
Fourth, there is a structural asymmetry between compute and labour costs. Employee salaries are relatively predictable and can be adjusted slowly via hiring freezes, attrition, or compensation changes. Compute costs, by contrast, can spike rapidly if product usage grows or if teams run more experiments than anticipated. In startups, venture capital typically funds both headcount and infrastructure, but some investors report portfolio companies spending more than 80% of their capital raised on compute resources, dwarfing wage bills.8
The empirical picture: AI not yet cheaper than humans
These mechanisms are now visible in empirical work. One 2024 study from MIT examined where AI systems could perform visual tasks at or near human level, and then compared the cost of machine versus human performance.1 The researchers concluded that automation was economically viable in roughly 23% of roles where vision is central to the job, meaning that in about 77% of cases, it remained cheaper to pay humans than to deploy AI.1 The issue was not capability – the models could often do the tasks – but economics: hardware, energy, and infrastructure outweighed labour costs.
Macro-level data on AI expenditure reinforce this picture. Big technology firms alone have announced around 740 billion in capital expenditures in a single year, largely driven by AI data centre build-out, representing a 69% increase over the prior year.1 Other analysis suggests that AI-related expenditures could reach 5,2 trillion by 2030 under central scenarios, and as high as 7,9 trillion under more aggressive build-out, with data centre and IT equipment accounting for the bulk of this spending.1,6 Against those numbers, even generous headcounts of highly paid engineers, researchers, and product staff occupy a smaller share of the cost base than might be expected.
At the same time, external observers note that, despite the scale of these investments, there is still limited aggregate evidence of AI-driven productivity gains across the economy. Budget analysts and academic studies point out that, so far, there is no broad-based data showing AI displacing jobs at scale or dramatically boosting measured output per worker, even as tech sector layoffs have accelerated.1,3 That divergence between spending and measured productivity raises the question of whether the current wave of AI investment is front-loaded – infrastructure built ahead of realised returns – or whether some fraction will prove to be misallocated capital.
Strategic tension: build now, pay later
This leads to a central strategic tension. On one side is the argument that AI is an infrastructure revolution akin to electrification or the early internet, requiring enormous upfront capital before productivity gains show up in statistics. Firms that invest early, this view holds, will establish competitive moats via proprietary data, trained models, and optimised infrastructure. For such firms, the fact that compute currently costs more than labour may be beside the point; they are laying the foundations for future economies of scale, where the amortised cost per unit of AI output falls sharply as utilisation rises.1,2,6
On the other side is a more sceptical view, which emphasises opportunity cost and path dependence. If AI systems are currently more expensive than humans for many tasks, especially outside narrow high-value niches, then replacing workers prematurely may destroy value rather than create it. Companies that chase AI for its own sake, without rigorous cost-benefit analysis, risk saddling themselves with fixed infrastructure commitments and ongoing compute bills that are difficult to roll back. This argument is bolstered by evidence that many firms do not fully understand their AI cost structures, focusing on headline model access fees or GPU rental rates rather than total cost per token or per workflow.8
These perspectives are not mutually exclusive. It is possible that some organisations are overbuilding, while others are rationally investing in infrastructure that will underpin future competitive advantage. What unites them is an underlying bet: that the cost of compute will fall fast enough, and the productivity benefits of AI will rise high enough, to justify today’s discrepancy between machine and labour costs.
Falling unit costs versus rising aggregate spend
Industry roadmaps and analyst reports forecast significant reductions in the unit economics of AI over the next few years. Hardware generations such as Nvidia’s Blackwell architecture promise up to 30× gains in inference performance at similar power budgets compared with earlier accelerators.4,6 Software improvements – better compilers, quantisation techniques like FP4 precision, more efficient attention mechanisms, and mixture-of-experts routing – all work to reduce the computational load per unit of useful output.2,10 Gartner-style forecasts point to the cost of running inference for models with 1 trillion parameters dropping by more than 90% over a four-year horizon.1
If realised, those gains could radically alter the relative cost of compute and labour. A workflow that is uneconomic today because each AI call is expensive might become viable once the cost per million tokens falls below some threshold. In that future, the remark that compute costs more than employees would be overtaken by a new reality in which compute is cheap enough that the main question becomes how to reconfigure organisations to exploit it.
However, even as unit costs fall, aggregate spending may still rise. The classic rebound effect applies: cheaper computation tends to expand the range of feasible applications and increase total usage. Organisations that pay less per token may respond by embedding AI into more products, workflows, and services, multiplying the total number of tokens generated. If spending on AI grows from 5,2 trillion to 7,9 trillion by 2030, a large part of that increase will likely reflect expanded scope, not just higher prices.1,6 The result is a paradox: individually, each unit of compute may become cheaper and more efficient; collectively, compute may remain the largest single line item for AI-heavy firms.
Employment, displacement, and the cost paradox
The fact that compute can cost more than employees complicates narratives about AI-driven job displacement. From a firm’s perspective, automation only makes sense when the total cost of designing, training, deploying, and maintaining an AI system is lower than – or at least justifiable relative to – the wage cost, management overhead, and performance variability of human workers. When AI is more expensive, substituting capital for labour purely on cost grounds is irrational.
This does not mean AI will not change employment. Instead, it suggests a more nuanced pattern of complementarity and selective substitution. In domains where human labour is extremely costly or scarce, such as high-end legal services, algorithmic trading, or complex simulation, even expensive compute may be a bargain. In mass-market customer service or routine back-office work, by contrast, the current cost structure favours augmenting workers with AI tools rather than replacing them outright. The MIT study’s finding that only around 23% of vision-centric jobs are currently economically automatable illustrates how narrow the immediate substitution window may be.1
The paradox is that headlines about companies spending more on AI infrastructure than on salaries coexist with data showing limited net job losses attributable directly to AI.1,3 Part of the explanation is temporal: firms are investing ahead of adoption, building capabilities before fully restructuring their workforces. Another part is strategic: some firms see AI as a growth tool rather than a cost-cutting tool, aiming to enable new products and services rather than simply replacing existing staff.
Pricing models and hidden subsidies
One reason many users and even corporate customers underestimate the true cost of AI compute is the structure of pricing models. A significant portion of the market relies on flat subscription charges or simple usage tiers that do not map transparently to underlying infrastructure costs.1 For light users, this can be attractive: they pay a fixed fee and rarely hit limits. For heavy users, however, the provider may be effectively subsidising usage, especially if the subscription was priced before providers had a clear picture of real-world load.
Reports of AI software fees rising by 20% to 37% over a year indicate providers adjusting to this reality.1 As cost pressures mount – from energy, hardware procurement, and the need to recoup massive capital investments – providers are likely to shift toward more granular, usage-based pricing that reflects cost per token or per request more accurately.2 When that occurs, more enterprises will discover that their apparent labour savings are offset by higher-than-expected compute bills.
This evolution will bring AI closer to other utilities: electricity, cloud storage, and bandwidth. In each case, users ultimately pay for the marginal resource consumed, and efficient usage becomes a competitive advantage. Just as cloud-native firms learned to optimise workloads to reduce compute and storage charges, AI-native firms will need to optimise prompts, context lengths, caching strategies, and model architectures to minimise unnecessary tokens and reduce idle capacity.
Why the remark matters
The observation that compute can be more costly than employees is important for several constituencies.
For executives and boards, it underscores the need for rigorous capital allocation in AI initiatives. Projects should be evaluated not only on potential strategic upside but also on fully burdened compute economics: total cost per workflow, sensitivity to usage spikes, and exposure to future hardware and energy price shifts. In an environment where tech companies have already announced hundreds of billions in AI-related capital expenditure, misjudging these factors can have material consequences for profitability and competitive positioning.1,3,6
For investors, the remark acts as a reminder that not all AI spending is value-creating. A significant share may be speculative or defensive, driven by fear of missing out rather than clear use cases. Distinguishing between firms that translate compute into durable revenue and those that merely accumulate expensive infrastructure will be a central task over the next decade.
For policymakers and labour economists, recognising the current cost structure is essential when interpreting forecasts of rapid, sweeping job automation. If AI is still more expensive than humans for the majority of tasks, then near-term labour market disruption is likely to be more contained and sector-specific than some narratives suggest. This does not eliminate the long-term risk of displacement as unit costs fall, but it introduces a window in which policy can focus on adaptation: training, re-skilling, and ensuring that productivity gains, when they arrive, are broadly shared.
Finally, for engineers and product teams, the remark is a design constraint. It implies that building AI systems is not just a problem of maximising accuracy or capability; it is also a problem of optimising for economic viability. Model selection, quantisation choices, caching, retrieval strategies, and system architecture all affect compute consumption. Teams that learn to treat tokens, GPU seconds, and watts as scarce resources, on par with developer time, will be better positioned to create sustainable AI products.
As AI infrastructure matures, the relative prices of compute and labour will continue to evolve. The present moment, in which a leading practitioner can credibly say that the compute bill dwarfs the wage bill, is a snapshot in a longer trajectory. Whether history records it as an early, capital-intensive stage on the way to widely affordable machine intelligence, or as evidence of an overcapitalised boom, will depend on how effectively organisations turn expensive silicon into genuine productivity.
References
1. Nvidia executive: The cost of AI tools is ‘far beyond’ the … – Fortune – 2026-04-28 – https://fortune.com/2026/04/28/nvidia-executive-cost-of-ai-is-greater-than-cost-of-employees/
2. Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters – 2026-04-15 – https://blogs.nvidia.com/blog/lowest-token-cost-ai-factories/
3. AI can cost more than human workers now – Axios – 2026-04-26 – https://www.axios.com/2026/04/26/ai-cost-human-workers
4. GPU Economics: Understanding AI Infrastructure Costs in Finance – 2026-02-14 – https://chatfin.ai/blog/gpu-economics-understanding-ai-infrastructure-costs-in-finance/
5. Nvidia Executive Admits AI Costs Exceed Employee … – YouTube – 2026-04-29 – https://www.youtube.com/shorts/BMaX-sGXAj4
6. [PDF] Economic Analysis Report of Nvidia – Frontier Scientific Publishing – https://en.front-sci.com/index.php/memf/article/view/4667/4965
7. Nvidia exec says AI is more expensive than actual workers – 2026-04-29 – https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-exec-says-ai-is-more-expensive-than-actual-workers-yet-some-companies-dont-see-the-extra-costs-as-a-negative
8. Navigating the High Cost of AI Compute | Andreessen Horowitz – 2023-04-27 – https://a16z.com/navigating-the-high-cost-of-ai-compute/
9. Nvidia executive says AI is more expensive than paying human … – 2026-04-30 – https://news.ycombinator.com/item?id=47941609
10. AI Inference: Balancing Cost, Latency, and Performance | EBook – 2025-06-27 – https://www.nvidia.com/en-us/solutions/ai/inference/balancing-cost-latency-and-performance-ebook/
