‌

A daily bite-size selection of top business content.

PM edition. Issue number 1348

Latest 10 stories. Click the button for more.

‌

Term: Gen Alpha and Gen Z lexicon

"The Gen Alpha and Gen Z lexicon consists of internet-born slang heavily focused on self-improvement and social dominance, exemplified by the terms mogging and maxxing. Mogging refers..." - Gen Alpha and Gen Z lexicon

Attempts to climb social hierarchies are not new, but the way younger cohorts talk about those efforts has shifted dramatically in the 2020s. Status is now narrated through a dense web of internet slang that turns self-optimisation into a running joke, a competitive sport, and a coping mechanism, all at once. Instead of quietly working on grades, careers, or fitness, many teenagers and young adults frame their lives as a series of live-streamed upgrades and one-upmanship battles.

This linguistic shift matters because it changes how success, failure, and even personality are conceptualised. Rather than seeing improvement as a slow, private process, the language of online youth culture treats self-change as something modular and gamified: individual traits can be "maxxed", others can be "nerfed", and people can be "mogged" or "frame-mogged" simply by standing next to someone who outclasses them in a particular dimension. The vocabulary does not just describe reality; it shapes how young users experience their bodies, friendships, and future prospects.

From everyday slang to a status-obsessed dialect

On the surface, Gen Alpha and Gen Z speech includes many light-hearted expressions of approval and disapproval. Words such as "lit", "slay", "ate", or "yeet" convey enthusiasm, admiration, or energy, while terms such as "mid", "Ohio", "noob" and "L" serve as everyday put-downs. There are also playful insults and praise for charisma and coolness: "rizz" as shorthand for charisma, "rizzler" for someone who is especially charming, and "sigma" for a highly independent or dominant figure. These are the more visible parts of a much larger ecosystem that also includes darker and more technically framed vocabulary.

Below that surface lies a cluster of terms that explicitly link status with self-improvement and hierarchy. "Mogging" describes asserting dominance over someone else, often visually or socially; "maxxing" is the idea of maximising a particular trait or domain; "looksmaxxing" is a form of intensive appearance optimisation that can stretch from skincare and gym work to cosmetic procedures and performance-enhancing drugs. This lexical field constructs social life as a ladder, where every interaction can be read as a win, a loss, or a chance to upgrade.

Mogging: social comparison as a social script

Mogging is the clearest linguistic embodiment of status competition. In online usage, it means outperforming or outclassing another person so decisively that the comparison is humiliating or at least unmistakable. The emphasis is less on objective achievement than on relative impression. One does not simply be taller or stronger; one "frame-mogs" someone by looking more imposing in a photo, or "aura-mogs" them by seeming cooler or more charismatic in a social situation.

This focus on the comparative, rather than absolute, dimension of traits aligns with classic social psychology findings: people evaluate themselves through contrast with salient peers, not absolute standards. The slang simply makes that process explicit and performative. Being "mogged" instantly labels an interaction as a status loss, often used jokingly among friends: a better outfit, a higher test score, or a more successful flirtation can be narrated as a "mog". Yet the joke rests on a real anxiety about inferiority and exclusion that is amplified by algorithmic feeds filled with idealised peers and influencers.

In more hostile corners of the internet, the term has sharper edges. Within communities influenced by incel culture, "mogging" often centres on physical appearance and masculinity, particularly height, facial structure, and muscularity. There, to be "mogged" is not a playful tease but evidence of being biologically or socially doomed. The same word, then, carries both a mainstream, semi-ironic teen usage and a more fatalistic subcultural meaning, which can blur when content crosses platforms.

Maxxing: modular optimisation of the self

Where mogging names the outcome of status competition, maxxing describes the process of trying to improve. The suffix "-maxxing" originates from video games, where to "max out" a stat is to raise it to the highest possible level. Online, the term now attaches to almost any trait or domain: "looksmaxxing" for appearance, "gymmaxxing" for physical strength, "rizmaxxing" for charm, "jestermaxxing" for attention-grabbing silliness, and even more niche or absurd variants.

This modular quality reflects a quasi-engineering view of personality. The self is decomposed into parameters that can, in theory, be tuned independently. In an informal sense, someone might imagine a vector , where each component represents a personal attribute such as strength, attractiveness, income, humour, or social network size. Maxxing then becomes the attempt to increase one or more components subject to constraints of time, energy, and resources. Although this is rarely formalised mathematically in everyday discussion, the underlying logic is optimisation: improve specific coordinates of the self to move up an implicit fitness landscape.

In some communities, that optimisation is taken literally. Looksmaxxing forums discuss detailed regimens ranging from skincare and orthodontics to jawline exercises, bodybuilding routines, and elective surgery. Users share "before" and "after" photos, compare progress, and exchange advice on everything from sleep and diet to more extreme interventions such as anabolic steroids or facial implants. The language of maxxing gives these practices a narrative frame: the body becomes a project, and each intervention a deliberate move toward a better local maximum.

From looksmaxxing to full-spectrum self-engineering

Looksmaxxing is the most documented example of this mindset. It treats physical attractiveness as a multi-factor parameter that can be substantially raised with enough knowledge and effort. Typical domains of intervention include grooming, skincare, dental alignment, body composition, and clothing. More aggressive paths involve surgery on nose, jaw, or eyelids, and pharmacological enhancement through hormones or steroids. The range of practices can be conceptualised as a control vector that influences the evolution of a state variable representing perceived attractiveness or status over time.

Informally, some users think in dynamic terms: if is their current "rating" or status, then consistent improvement strategies aim to shift , where captures the impact of a given set of actions. The discourse of maxxing nudges people to focus on the gradient: what actions yield the steepest increase in visible gains per unit of effort. While not expressed in equations on social platforms, the underlying mentality of incremental, compounding optimisation strongly echoes both self-help literature and quantitative trading or machine learning culture.

Beyond appearance, a broader "selfmaxxing" culture encourages stacking improvements across multiple life domains: fitness, income, social skills, and personal brand. The idea is that aggregate status can be raised by simultaneously nudging several traits upward. To use a simple metaphor, if total social capital is some function , where are individual attributes, then maxxing becomes the project of increasing under constraints. The language makes that project feel game-like and quantifiable, even when underlying realities remain messy and uncertain.

Parameters, signals, and the role of the gaze

The lexicon implicitly distinguishes between internal qualities and external signals. Attributes are valued insofar as they are legible to others: height, frame, jawline, clothing, and online follower counts serve as immediate signals that can produce a "mog" in a single glance. Less visible traits, such as kindness or integrity, rarely feature directly in mogging or maxxing talk because they are harder to observe and compare in short-form content.

This emphasis on signals makes sense in scrolly, image-driven environments. When peers and strangers are mostly encountered through photos and short videos, the parameters that matter most are those that compress well into pixels. The result is a feedback loop: traits that generate visible status differences are named, tracked, and exaggerated in slang; those traits then receive more attention and investment, which further entrenches their centrality. A jawline exercise such as "mewing" appears trivial in isolation, but in a world where selfies, avatars, and video calls mediate social life, such micro-optimisations feel rational to many young users.

Origins in subculture and migration to the mainstream

A striking feature of this vocabulary is its path of diffusion. Many terms now used casually by teenagers originate in highly specific subcultures. Linguistic research and popular glossaries note that a large portion of Gen Z and Gen Alpha slang emerges from African-American Vernacular English and Black queer ball culture, particularly in areas relating to style, shade, and performance. Words such as "slay" or "fam" moved from marginal communities into global youth speech, often losing their original cultural context along the way.

By contrast, the cluster surrounding looksmaxxing, mogging, and related concepts arises from incel forums and adjacent online spaces in the 2010s. There, they were embedded in a grim worldview that framed attractiveness as a quasi-genetic destiny and romantic success as a zero-sum game. Over time, certain terms escaped those environments and were recontextualised by streamers, meme accounts, and mainstream users. A word like "mog" can therefore appear both in deeply misogynistic discussions of genetic lotteries and in light-hearted TikTok comments about who wore an outfit better.

This migration complicates attempts to interpret the lexicon morally. It is possible for a teenager to say they were "mogged" in a video game or school photo without any contact with incel ideology. Yet the structural logic of the language still carries echoes of its origins: a fixation on rankings, fatalistic assumptions about biological limits, and a strong emphasis on visual assessment. Understanding the genealogy of these terms helps educators and parents distinguish between harmless banter and early exposure to more toxic frames.

Competing interpretations: empowerment, irony, or pathology?

Observers disagree on whether this lexicon is primarily harmful, neutral, or even empowering. One interpretation emphasises its motivational role. Framing improvement as "maxxing" can encourage young people to take control of aspects of their lives they can change: learning to dress better, exercising, improving conversational skills, or studying more effectively. In this view, the gamified language makes self-development more engaging, particularly for cohorts raised on role-playing games and progress bars.

A second interpretation focuses on irony and play. Many youths use these words with a clear sense of exaggeration, mocking both hustle culture and doomer fatalism. Calling a friend a "rizzler" or joking about being "Ohio" or "mid" functions as bonding, not serious diagnosis. On this reading, the lexicon allows teenagers to poke fun at the performance pressures they face, creating an in-group code that adults often misunderstand.

A third interpretation, often voiced by clinicians and social critics, highlights the risks. Constantly talking about being mogged or needing to maxx may reinforce body dysmorphia, social comparison, and perfectionism, especially among vulnerable users. When looksmaxxing discussions drift toward surgery and pharmacological enhancement, they can normalise extreme interventions to very young audiences. The vocabulary can also smuggle in zero-sum thinking: if every interaction is framed as a win-loss event, cooperation and mutual support may be harder to cultivate.

These interpretations need not be mutually exclusive. The same words can function as light-hearted memes in one context and as symptoms of deeper distress in another. What matters is less the dictionary definition than the surrounding discourse: who is speaking, to whom, and with what tone.

Tensions and debates within youth culture

Within Gen Alpha and Gen Z themselves, there are internal disagreements about this vocabulary. Some embrace it as a creative and entertaining way to talk about the pressures they face, while others criticise it as reductive or exhausting. The tension mirrors broader debates about hustle culture and wellbeing. On one side, there is celebration of grind, glow-ups, and self-reinvention; on the other, a push towards authenticity, mental health, and acceptance of imperfection.

Another fault line concerns inclusivity. Slang borrowed from marginalised communities can be stripped of its roots, flattening rich cultural histories into catchy phrases. Meanwhile, incel-origin terms may carry misogynistic or fatalistic undertones even when used casually. Some younger users are increasingly aware of these origins and selectively adopt or reject terms based on their perceived baggage. The result is a constantly shifting landscape where meanings are contested and renegotiated.

Why this lexicon still matters

Understanding mogging, maxxing, and adjacent slang is not simply a matter of decoding youth jargon for curiosity's sake. These words are compact models of how many young people experience social life in an era of constant visibility. They encode assumptions about what counts as value, where agency lies, and how relationships should be evaluated. When every interaction can be narrated as a miniature contest, and every trait is a candidate for optimisation, the boundaries between selfhood, performance, and competition blur.

For educators, parents, and employers, attending to this language offers a window into the underlying pressures: fear of being "mid" or "Ohio" in a world of hyper-curated feeds; desire to "maxx" whatever one can control in the face of economic and environmental uncertainty; ambivalence about whether to treat self-improvement as an earnest project or a running gag. For young people themselves, being able to step back from the slang and see its structure can be a first step towards deciding which scripts to inhabit and which to rewrite.

As platforms, aesthetics, and slang inevitably change, the particular words in vogue will shift. Yet the underlying themes - visibility, comparison, optimisation, and belonging - are likely to persist. The current lexicon crystallises how those themes are being worked out in real time by Gen Alpha and Gen Z, revealing not only how they talk but how they are being taught, by algorithms and peers alike, to understand themselves.

‌

Quote: Jamie Dimon - JP Morgan Chase CEO

"I'm not that worried about stable coins... if you have them, they should have the same rules and regulations as us. AML, BSA, KYC, you know, insurance, you know, disclosures, liquidity, transparency, social requirements, which we have, you know, if just a level playing field is all we asking." - Jamie Dimon - JP Morgan Chase CEO

The contest over who issues money, who moves it, and under whose rules, is increasingly fought not between states and banks, but between incumbent financial institutions and crypto-native issuers of digital tokens. Stablecoins sit at the centre of this struggle because they emulate core banking functions - taking in cash and issuing a seemingly riskless liability - while often operating on infrastructure, and under governance arrangements, that developed outside the traditional regulatory perimeter . At stake is not only who captures fee income from payments, but who bears the compliance burden for screening illicit flows, providing disclosures, and standing behind customer funds when markets turn.

Jamie Dimon's intervention, in which he downplays systemic fear about stablecoins but insists that any such instruments should carry the same Anti-Money Laundering (AML), Bank Secrecy Act (BSA), Know Your Customer (KYC), insurance, disclosure and liquidity requirements as regulated banks, expresses the frustration of a sector that sees asymmetric burdens rather than existential threat . The implicit argument is that the core economic activity - taking a dollar and giving the customer a redeemable digital claim - is functionally similar irrespective of whether it happens on a bank ledger, a permissioned blockchain, or a public chain. If so, different regulatory treatment looks less like innovation policy and more like regulatory arbitrage.

What is economically happening inside a stablecoin?

Put in balance-sheet terms, an issuer that accepts customer funds and holds corresponding reserves while issuing a redeemable token is engaged in a form of narrow banking or money-market fund activity. In simple notation, if customer deposits are denoted , stablecoin liabilities , and reserve assets , then a fully backed issuer seeks to maintain and, ignoring fees, . Economically, this is very close to a bank issuing demand deposits backed by high-quality liquid assets, except that the claim is represented as a blockchain token rather than an entry in a core banking system database. The regulatory puzzle is whether the token's technology should change how we treat this liability from the perspectives of prudential oversight, consumer protection, and financial crime.

Traditional banks contend that it should not. They are already subject to comprehensive frameworks that cover AML and counter-terrorist financing, sanctions screening, data retention, suspicious activity reporting, and customer due diligence . They must meet capital ratios, liquidity coverage and leverage constraints, submit to stress testing, and provide deposit insurance or its functional equivalent, all designed to reassure users that one unit of account inside the bank is reliably convertible into cash at par even under stress. From this vantage point, a stablecoin that promises a 1:1 claim on underlying reserves but is not subject to similar scrutiny looks like a synthetic bank account with fewer obligations attached.

JPMorgan's dual posture: user and critic of digital tokens

The stance of JPMorgan is complicated by the fact that it is both a vocal critic of unregulated crypto markets and an active experimenter with blockchain-based payment instruments. The bank has piloted the use of digital tokens for cross-border payments, aiming to make international transfers faster, cheaper and more reliable for institutional clients . Its Kinexys Digital Payments platform uses blockchain-based accounts - including sterling-denominated ledgers from London - to enable real-time settlement for corporates and trading firms . Separately, JPM Coin has been designed as a programmable digital coin that clients can redeem for US dollar balances held at JPMorgan Chase; one unit of JPM Coin is explicitly intended to equal one US dollar credited to the customer's account .

This activity shows that the bank does not object to the architecture of tokenised money per se. Instead, it objects to regimes where functionally similar instruments circulate without the compliance and prudential infrastructure that banks must maintain. The proposed JPM deposit token (JPMD), slated for use on public blockchain infrastructure but available only to pre-screened institutional clients, follows the same pattern . It offers the speed, programmability and interoperability of a blockchain-based token while insisting that users are already within the bank's KYC perimeter and that reserves remain inside the commercial banking system. In other words, if digital tokens are going to reshape payments, JPMorgan wants them to do so inside the existing supervisory framework, not outside it.

Stablecoins and the regulatory perimeter

The core tension is about where to draw the line between innovation and shadow banking. Stablecoins were originally introduced as a convenient way to move value between crypto exchanges and decentralised applications without touching traditional bank rails. Over time, larger issuers began to hold substantial reserves in short-term government securities, bank deposits and cash equivalents, turning them into sizeable holders of money market instruments. At scale, this creates a structure not unlike a money market fund, which in many jurisdictions is subject to dedicated regulation because episodes of stress have demonstrated the risk of runs and the need for backstops.

Regulators therefore face a choice. Treat stablecoin issuers as banks, bringing them fully into deposit-taking regimes; treat them as money market funds with tailored rules on liquidity and asset composition; or create a bespoke category with equivalent outcomes for consumer safety and systemic risk. Dimon's insistence on a level playing field effectively argues against any regime that leaves these issuers lightly regulated or supervised compared with banks that perform similar functions . The list he cites - AML, BSA, KYC, insurance, disclosures, liquidity, transparency and social requirements - maps almost exactly onto the obligations that large banks already shoulder .

Why "not that worried" still implies significant risk

The remark that stablecoins are not a major worry should not be misconstrued as confidence that they pose no problem. It reflects a view about scale, structure and substitutability. Relative to the multi-trillion scale of global bank deposits, the outstanding value of even the largest stablecoins remains modest, and most usage still clusters in speculative crypto trading rather than mainstream retail payments. A systemic crisis in this niche would be painful for participants but unlikely to threaten core banking stability in the way that wholesale funding stresses or sovereign debt shocks might.

Moreover, from the vantage point of a large, diversified bank, any migration of payment flows to well-regulated digital instruments could in principle be internalised by launching in-house tokens or deposit representations, as JPMorgan has done. If regulators force stablecoin issuers into regulatory regimes that mirror bank obligations, incumbent banks could have a competitive advantage: they already run extensive compliance infrastructure, from transaction monitoring engines to sanctions lists and KYC workflows . New entrants would shoulder similar fixed costs without the same scale benefits.

On the other hand, if stablecoins remain loosely regulated, they might erode the information monopoly and fee income that banks derive from their privileged role in payments and settlement. For banks, "not that worried" therefore means "comfortable as long as the regulatory perimeter expands to cover these instruments." It is conditional reassurance, not blanket approval.

AML, BSA and KYC in the token era

Stablecoins raise specific challenges for financial crime compliance. Blockchains provide unprecedented transparency at the transaction level but typically operate with pseudonymous addresses. AML frameworks depend on associating flows with real-world identities, assessing risk profiles, and monitoring behaviour over time. When tokens move peer-to-peer across borders in seconds, outside established correspondent banking networks, the question becomes who is responsible for checking whether a given wallet belongs to a sanctioned entity, a high-risk jurisdiction, or a fraud scheme.

Traditional AML programmes are risk-based: firms allocate more scrutiny to higher-risk relationships and channels . In formal terms, if denotes the risk score of customer , the compliance function seeks to apply controls such that expected residual risk lies below specified thresholds while keeping the cost of controls manageable. Stablecoins complicate this by enabling rapid hops between intermediaries, reducing the time window to intervene, and sometimes allowing users to self-custody tokens without any ongoing relationship with a regulated entity.

Dimon's demand that stablecoins face equivalent AML, BSA and KYC rules is therefore a demand that someone bear responsibility for these frictions . Either the issuers put in place onboarding, monitoring and sanctions screening for their users; or regulated gateways - exchanges, wallet providers, merchant acquirers - are required to apply full controls whenever tokens touch fiat or regulated venues. From the point of view of banks, the risk is that they continue to carry heavy compliance burdens while stablecoin ecosystems free-ride on the assumption that, eventually, conversions back to fiat will be caught by bank-level controls.

Insurance, disclosures and liquidity: the run problem

Another cluster of issues in Dimon's comment concerns consumer protection and run risk. Traditional bank deposits benefit from deposit insurance schemes up to specified limits, as well as from lender-of-last-resort facilities at central banks. Stablecoins usually offer neither. Instead, they promise that reserves are held in conservative instruments and that attestation reports or audits confirm that assets exceed liabilities. The viability of this model depends on the quality, frequency and credibility of disclosures, and on how quickly reserves can be liquidated in a stress scenario.

In formal liquidity terms, if denotes reserves and denotes circulating tokens at time , a fully reserved stablecoin aims to maintain even under large redemption shocks. However, if a substantial portion of reserves is in slightly longer-duration assets, forced liquidation during a panic could crystallise losses, leading to and breaking the peg. Banks are familiar with these dynamics and are forced to hold specific proportions of high-quality liquid assets, undergo stress testing, and prepare contingency funding plans. Dimon's reference to insurance, disclosures and liquidity rules is an argument that anyone offering a par-valued claim redeemable on demand should be subject to analogous requirements .

Critics of this approach respond that over-regulating stablecoins risks cementing the incumbency of existing banks and dampening competition in payments. They argue that a spectrum of risk should be permitted, with fully insured bank deposits at one end and clearly disclosed, uninsured stablecoins at the other. Provided users understand what they are holding, and provided exposure does not become so large as to threaten systemic stability, market discipline could in theory constrain issuers. The counter-argument from bank leaders is that information asymmetries and herd behaviour make such discipline weak in practice, especially for retail users, and that the political cost of letting a large stablecoin fail without backstop would likely be unacceptable.

Cross-border payments and the mCBDC horizon

Part of what drives experimentation with stablecoins is frustration with the inefficiency of cross-border payments. Traditional correspondent banking chains can be slow, expensive and opaque, especially for smaller corporates and remittance corridors. JPMorgan's own research on multi-central-bank digital currencies (mCBDCs) estimates that a coordinated corridor network could unlock tens of billions of value in cross-border flows by reducing frictions and settlement lags . Stablecoins have demonstrated in live markets that near-instant global transfers are technically feasible, even if they currently operate within crypto-centric ecosystems.

This poses a strategic question for banks and central banks: should they allow private stablecoins to dominate tokenised cross-border flows, or should they develop their own infrastructures - deposit tokens, walled-garden stablecoins, or CBDCs - that offer similar speed under tighter control? Dimon's comments suggest a preference for the latter path: harness the efficiency of blockchain-based settlement, but keep issuance, reserves and compliance inside the supervised banking and central banking nexus . In this vision, stablecoins that persist outside that nexus must at least be pulled towards parity in regulatory expectations.

The mCBDC work cited by JPMorgan envisions corridors in which banks and payment providers act as nodes in a shared, programmable settlement layer, allowing instant cross-border transfers while maintaining jurisdictional control and compliance . If such networks mature, the relative advantage of unregulated stablecoins in cross-border payments could shrink, particularly for institutional flows. That, in turn, would strengthen the bargaining position of regulators in demanding higher standards from remaining private issuers.

Debates and objections: innovation versus enclosure

There is, however, a live debate about whether applying full bank-like regulation to stablecoins prematurely encloses an area of innovation that has not yet found its final forms. Proponents of a lighter touch argue that programmable money - tokens that can encode conditions, automate escrow, or interact natively with smart contracts - will spawn new business models for commerce, machine-to-machine payments and decentralised finance. Requiring every such token to be issued within the constraints of large bank compliance and legacy technology could stifle experimentation and entrench incumbents.

Another line of criticism focuses on the notion of a "level playing field." From a narrow perspective, equalising obligations seems fair. But critics point out that incumbents already benefit from implicit subsidies, such as access to central bank liquidity and, in some jurisdictions, perceptions of too-big-to-fail status. If these advantages are maintained while newcomers are forced to shoulder identical compliance costs, the resulting playing field may be formally level but economically tilted. The retort from banks is that those privileges are matched by explicit constraints, such as higher capital requirements, living wills, and intense supervisory oversight.

There are also technical objections. Public blockchains allow open access innovation: anyone can build a wallet, protocol or application around a stablecoin, without seeking permission from a central operator. Bank-issued tokens on permissioned platforms, by contrast, typically restrict participation to vetted institutions and rely on centralised governance. Some technologists warn that forcing private stablecoins into fully permissioned regimes risks losing the very composability and global reach that made them attractive, leaving only a digitised facsimile of existing bank money.

Why the argument matters

Beyond the immediate contest between banks and stablecoin issuers, the argument encapsulated in Dimon's remarks cuts to the future structure of the monetary and payments system. If regulators agree that any instrument that looks and behaves like money must be subject to bank-equivalent rules, then the spectrum of monetary instruments available to households and firms may narrow to insured deposits, CBDCs and tightly controlled bank tokens. Innovation will still occur, but largely within the governance frameworks of major financial institutions and central banks .

If, instead, policymakers carve out space for private stablecoins to operate under lighter but still robust regimes, we may see a more pluralistic monetary landscape, with different tokens competing on features, integrations, and governance models. This carries greater risk of episodes of instability but also greater potential for new forms of financial intermediation, including decentralised lending, automated market-making and programmable trade finance. The boundary lines drawn over the next few years will determine which of these paths dominates.

Dimon's position reflects the pragmatic calculus of a systemically important bank that has already invested heavily in blockchain-based instruments and global compliance infrastructure . He is signalling openness to digital forms of value so long as they compete under the same rulebook that governs his own institution. Whether society ultimately prefers a tightly regulated, bank-centric token ecosystem or a more open, heterogeneous one will depend on how regulators weigh innovation against stability, and how credible they deem the promise that technology alone can substitute for the institutional guarantees banks currently provide.

As stablecoin regulation evolves, the question will not be whether such tokens should exist - markets have already answered that - but on what terms they interact with the rest of the financial system. The insistence on a "level playing field" is best understood as an attempt by incumbents to ensure that whatever the outcome, they are not left shouldering a disproportionate share of obligations while watching rivals monetise similar economic functions with lighter oversight. That debate, rather than any abstract enthusiasm or hostility towards crypto, will shape the eventual accommodation between stablecoins, banks and state-backed money.

‌

Term: Immiserating growth - Economics

"Immiserating growth is an economic paradox in which a country expands production and exports, but becomes worse off because the increase in supply drives down the price of its exports so sharply that the deterioration in its terms of trade outweighs the gains from higher output." - Immiserating growth - Economics

Debates on trade and development usually presume that greater integration into world markets, higher export volumes, and rising output will translate into higher real incomes. Yet there are conditions under which an outward expansion in production and trade can lower a country's welfare, even when measured in aggregate. This tension between more activity and less well-being forces a closer look at how world prices, elasticities of demand, and specialisation patterns interact with growth.

The central issue is the interaction between growth and the terms of trade. For a trading economy, what matters for welfare is not only how much it can produce, but how many imports it can command for a given volume of exports. If growth is heavily biased toward the export sector, world markets may be flooded with that country's exportable good. When foreign demand is not very responsive, the export price can fall sharply. If this deterioration in the terms of trade is strong enough, the country may end up able to purchase fewer imports at world prices, even though it ships more units abroad.

Substantive meaning: growth that makes a country poorer

Substantively, immiserating growth describes a situation where a country produces and exports more, but its real income or social welfare declines. The expansion of economic activity uses more labour, capital, and land, but the goods and services that residents can actually consume, particularly imported goods, become scarcer relative to their opportunity costs. This is most transparent when welfare is defined over consumption possibilities: if the country's budget line in world markets rotates unfavourably as growth occurs, feasible consumption bundles shrink rather than expand.

In trade-theoretic language, the key mechanism is an adverse movement in the terms of trade large enough to offset the positive output or "wealth" effect of growth. Suppose growth is driven by technical progress or factor accumulation in the export sector. Output of the exportable rises. If the country is large enough to influence world prices, this additional supply depresses the world price of the export good. The terms of trade, defined as the relative price of exports to imports, fall. When the deterioration in the terms of trade is sufficiently severe, the country's consumption possibilities at the new world prices lie on a lower indifference curve than before growth: it is worse off, despite producing more.

Practical meaning in trade and development

In practice, the concept is most relevant for large exporters of goods characterised by low demand elasticities, such as certain primary commodities. For a small open economy that takes world prices as given, expansion of exports cannot trigger the global price effects required to generate immiserating growth. By contrast, when an economy is a major supplier of a commodity, shifts in its export volumes can move world prices against it, especially when demand is sluggish or even perverse.

Two sets of real-world concerns illustrate the practical meaning:

- Commodity dependence: Many developing economies are heavily specialised in a narrow set of primary exports, such as coffee, copper, or cotton. Growth driven by expanding these sectors, especially without diversification, can contribute to downward pressure on world prices. If import prices for manufactured goods do not fall correspondingly, the ratio of export to import prices deteriorates. Historical episodes of declining commodity terms of trade have raised worries that producing more of such exports could, in extremis, leave countries poorer.

- Unequal gains from trade: In global value chains, segments with high elasticity of supply and low bargaining power may expand rapidly yet capture a shrinking share of total value added. Some authors have extended the immiserating growth idea beyond the two-good, two-country model to describe cases where firms, sectors, or worker groups see increasing output but declining real earnings or living standards.

Thus, the practical content of the concept is not that growth usually harms welfare, but that particular configurations of export-led growth, market power, and demand conditions can generate this perverse outcome.

Bhagwati's formalisation and the core mechanism

Jagdish Bhagwati's 1958 analysis embeds this paradox within a standard two-country, two-good, full-employment trade model. The growing country exports one good and imports the other. Growth is modelled as either factor accumulation or technical progress that shifts the production possibility frontier outward in a way that is biased toward the export good. Welfare is defined over consumption of the two goods at world prices.

At a high level, Bhagwati decomposes the welfare impact of growth into two components:

- a production (or output) effect, reflecting the outward shift of the production possibility frontier; and

- a terms-of-trade effect, capturing how world prices adjust as the country's net supply changes.

Immiserating growth occurs when the negative terms-of-trade effect dominates the positive output effect. Bhagwati shows that this requires both a sufficiently adverse response of world prices and a growth pattern that increases the country's net export supply.

Mathematical specification and key parameters

Formal treatments typically express the condition for immiserating growth in terms of elasticities and the magnitude of the growth shock. A simplified intuition can be sketched without reproducing the full derivation.

Let denote the relative price of the country's export good in terms of its import good, so the terms of trade are . Let be an indirect utility function, where indexes the country's productive capacity (or a shift parameter capturing growth). Totally differentiating welfare with respect to gives a term representing the direct gain from higher capacity and a term capturing how responds to the growth-induced change in net exports. Immiserating growth corresponds to : welfare falls when productive capacity rises.

Bhagwati's geometric and analytical work emphasises several critical elasticities:

- : the constant-utility demand elasticity for the importable with respect to its price, reflecting how strongly domestic demand adjusts when the importable becomes more expensive;

- : the elasticity of supply of the importable along the production possibility frontier, showing how production shifts between exportable and importable when relative prices change;

- : the rest-of-world offer elasticity, describing how foreign excess demand responds to the terms of trade.

Bhagwati demonstrates that the possibility of immiserating growth is enhanced when the ratio of domestic production to imports of the importable is small, when and are low (implying limited domestic substitution), and when foreign offer is highly inelastic or even backward-bending, so is small or negative. Yet these are only necessary tendencies; for immiserating growth to actually occur, they must combine with either or both of two crucial conditions:

- the rest of the world's offer curve is sufficiently inelastic, possibly because the country's exports are treated as a kind of Giffen good abroad; and/or

- growth reduces domestic production of importables at constant relative prices, a particularly strong export-biased pattern of expansion.

One can summarise the welfare effect schematically as:

Immiserating growth arises when:

The "terms-of-trade loss" depends on the size of the price change induced by growth and on the country's initial trade volume: the larger the country's pre-growth exports, the more damaging a given adverse price shift becomes.

Parameter meanings and economic intuition

The elasticities and ratios that appear in formal conditions have intuitive interpretations:

- Domestic demand elasticity : When domestic consumers are not very responsive to higher import prices, they continue to demand similar quantities despite deterioration in the terms of trade. This raises the import bill in terms of exports, worsening the welfare impact of any given price change.

- Domestic supply elasticity : When producers do not readily shift resources back toward importables as their relative price rises, the country continues to specialise in the export good, amplifying the expansion of net exports and the downward pressure on the export price.

- Rest-of-world offer elasticity : When foreign demand for the export good is inelastic, a relatively small increase in export volume triggers a large fall in price. The growing country effectively faces a steep foreign offer curve, magnifying the terms-of-trade deterioration.

- Scale of growth and trade: Even when elasticities are unfavourable, immiserating growth requires a sufficiently large shift in net exports. Modest growth moves relative prices only slightly, so the output gain dominates. It is only under extreme export expansion that the terms-of-trade loss can become large enough to dominate.

These parameters show that immiserating growth is a knife-edge phenomenon, relying on particular combinations of structural features and large shocks. This is one reason why most empirical work treats it as a theoretical curiosity rather than a pervasive threat.

Major schools of thought and extensions

The original discussion of immiserating growth sits within the neoclassical trade tradition, using smoothly convex production and indifference curves, competitive markets, and full employment. Subsequent literature can be grouped into several strands:

- Refinements within traditional trade theory: Further work has examined more general production structures, multiple goods, and alternative assumptions about preferences and technology. Many analyses confirm that while immiserating growth is theoretically possible, its conditions are restrictive.

- Commodity price pessimism: Classical development economists and later structuralists worried that secular trends in primary commodity prices might lead to a softer form of immiserating growth for resource exporters. Even if welfare does not literally fall with growth, the gains could be extremely small or unequally distributed.

- Micro- and meso-level immiserising growth: More recent work has applied the concept to households, farms, firms, and regions rather than entire countries. Here immiserising growth arises when increased economic activity coincides with falling real living standards for specific groups, for example because of worsening terms of trade between what smallholders sell and what they buy, or due to deteriorating employment conditions.

- Distributional perspectives: Some authors use "immiserising growth" loosely to describe growth that fails to benefit the poor, even if aggregate income rises. This broadens the concept beyond its original aggregate welfare meaning, but captures important political economy concerns about who gains from trade.

While these extensions differ from Bhagwati's precise model, they share a common concern: growth processes that change relative prices in ways that undermine welfare for some unit of analysis, whether a nation or a socio-economic group.

Tensions, critiques, and empirical relevance

Several tensions animate the ongoing debate around immiserating growth.

1. Rarity versus possibility

Most trade economists acknowledge that immiserating growth is logically possible but argue that it is empirically rare. The combination of highly export-biased growth, large country size, very inelastic foreign demand, and limited domestic substitution is unusual. In many observed episodes of export-led growth, terms of trade either improve or deteriorate only modestly, leaving net welfare gains clearly positive.

Critics respond that even if full-fledged immiserating growth is rare at the national level, weaker forms are not. Episodes where rapid export expansion delivers surprisingly small welfare gains, due in part to adverse price movements, are not hard to find in commodity markets. Moreover, if one relaxes the requirement that aggregate welfare must fall, the notion of "immiserising" subsets of the population becomes empirically much more plausible.

2. Static versus dynamic perspectives

Bhagwati's model is static: it compares two equilibria before and after growth. Dynamic considerations complicate the picture. Investing in an export sector that temporarily worsens terms of trade might still be optimal if it generates learning-by-doing, technological upgrading, or market access that raises future productivity. Short-run immiseration could, in principle, buy long-run gains.

On the other hand, path dependence and lock-in are real risks. If adverse terms of trade trap a country in low value-added specialisation, the long-run trajectory may be one of cumulative disadvantage. The immiserating growth framework thus intersects with debates over industrial policy, diversification, and escape from commodity dependence.

3. Market power and bargaining

The classic theory assumes competitive markets, yet in many export sectors multinational buyers wield significant monopsony power. In such contexts, expansion of developing country output may push down not only world prices but also the share of final prices accruing to producers. This can generate immiserising outcomes for farmers or workers even if aggregate national income rises. Here the relevant "terms of trade" are not just between exports and imports, but also between producers and intermediaries along value chains.

Why the concept still matters

Despite its restrictive assumptions, immiserating growth retains analytical and policy relevance for several reasons.

First, it serves as a corrective to any automatic identification of export growth with welfare improvement. Policy strategies that simply advocate "more exports" without regard to price dynamics, demand elasticities, and specialisation patterns risk underestimating potential downsides. The concept underscores the need to consider how growth interacts with world markets, not just how much it enlarges domestic capacity.

Second, it highlights the importance of market structure and power in shaping the gains from trade. Countries or groups that face inelastic demand for what they sell and highly elastic supply for what they buy are structurally disadvantaged. Understanding these asymmetries is crucial for designing trade, industrial, and competition policies that avoid trapping economies in low-welfare equilibria.

Third, in a world of climate constraints and resource limits, the idea problematises growth strategies that rely on ever-expanding extraction and export of natural resources. If heightened exploitation leads to lower world prices and environmental degradation, the net welfare gains may be small or negative. Here the "immiseration" may be ecological as well as economic.

Finally, the broader family of immiserising growth concepts reminds analysts to track distributional outcomes. Growth episodes that leave some groups worse off cannot be evaluated solely by aggregate indicators. Whether at the level of nations, regions, or communities, shifts in relative prices and bargaining positions can make certain forms of growth deeply contentious, even when macro aggregates look favourable.

In this wider sense, immiserating growth is less a prediction about the typical consequences of export expansion and more a warning about specific structural configurations. When a country is large in world markets, heavily specialised in goods facing inelastic demand, and unable to adjust its production or consumption patterns easily, policymakers must pay close attention to the balance between output gains and terms-of-trade movements. Ignoring that balance risks celebrating growth that, once translated through world prices and domestic distribution, leaves people worse off than before.

‌

Quote: Ray Dalio - Bridgewater Capital founder

"When life doesn't give what you want, you can be angry or sad, or you can learn about how reality works and develop principles that will work well to get you what you want." - Ray Dalio - Bridgewater Capital founder

Disappointment is usually experienced first as a feeling and only later, if at all, as a problem to be modelled and solved. Most people stay in the realm of mood: anger at what should have happened, sadness at what did not, resentment at those who seemed to stand in the way. The alternative is far more demanding and far more productive: to treat each frustrated desire as data about how the world operates, to interrogate that data, and then to codify better ways of acting so that similar situations produce different outcomes next time. That pivot from emotion to inquiry, from grievance to design, is the underlying mechanism that separates episodic success from compounding progress.

From reaction to diagnosis

The immediate human response to not getting what we want is typically narrative: someone wronged me; the system is rigged; I am unlucky; I am not good enough. These stories may contain fragments of truth, but they are structurally unhelpful because they are not diagnostic. They do not ask what causal chain produced the result, how incentives and constraints interacted, which assumptions failed, or what was missing from our mental model of reality. A diagnostic response, by contrast, asks: given that this outcome occurred, what does it reveal about how the underlying system works, and what would need to change-about my behaviour, my environment, or my strategy-for a different outcome to be more probable next time?

That shift requires accepting that reality is not obliged to conform to our expectations. Instead, reality is made of patterns, feedback loops, and constraints that can be studied. When outcomes disappoint us, they are not insults but information. Anger and sadness may be unavoidable first responses, but if they remain the final stage, they are wasted. The crucial move is to operationalise setbacks: to treat them as experiments whose results must be analysed and then encoded into practical rules for future decisions.

Dalio, principles, and the economic machine

Ray Dalio built his career and his firm around this idea of turning painful surprises into explicit principles. As the founder of Bridgewater Associates, one of the world's largest hedge funds, he is known not only for his investment performance but for his insistence that decisions should be governed by clear, tested rules rather than by transient feelings or unchecked intuitions. In his investing, he is famous for breaking down the economy into a set of understandable mechanisms-what he calls an "economic machine"-and then using that understanding to guide decisions.

On the investment side, Dalio analyses how credit, productivity, and monetary policy interact over time to produce recurring cycles of inflationary, disinflationary, and deflationary environments. When losses or errors occur, instead of treating them as bad luck, he looks for structural misreadings of these cycles or of debt dynamics. Those insights are then codified into systematic decision rules, increasingly executed by algorithms that reflect his principles about how markets behave. The same pattern underpins his approach to life: observe outcomes, interrogate what they say about reality, and distil them into principles that can be reused.

The quote in question comes from a lecture Dalio gave to graduates of Long Island University, where he urged them to treat setbacks as prompts to understand reality better and to develop principles for dealing with it. He linked this stance to personal responsibility and radical open-mindedness: owning one's life, staying in touch with how the world actually works, and being willing to revise beliefs when evidence demands it. For Dalio, being "in touch with reality" and maintaining "radical open-mindedness" are not slogans; they are operating requirements for anyone trying to achieve ambitious goals in a complex environment.

Pain as data: the engine of improvement

Central to Dalio's framework is the idea that emotional pain is a necessary input to learning. He encapsulates this in a principle often summarised as: "Pain + Reflection = Progress". The raw hurt of failure highlights a gap between our mental map and the territory of reality. Reflection then turns that discomfort into insight. Without the pain, there is no trigger to question our assumptions; without the reflection, pain degenerates into bitterness or avoidance.

Dalio goes further, explicitly advising people to "go to the pain rather than avoid it" because staying at the edge of discomfort accelerates evolution. This is a demanding discipline. It requires noticing the impulse to turn away from criticism, loss, or embarrassment, and instead leaning in: replaying decisions, inviting disagreement, and scrutinising blind spots. In his own organisation, this mentality is embedded in a culture of high standards and constant improvement, where individuals are expected to confront their weaknesses in order to grow.

Bridgewater's culture document describes an "overriding objective" of excellence, defined as continual improvement, and asserts that accuracy in understanding reality is essential to achieving it. Translating that to individual lives, the message is that excellence in any domain depends on being ruthlessly honest about what is and is not working, especially when that honesty is uncomfortable. The alternative is to protect one's self-image and stay within familiar patterns, at the cost of stagnation.

Principles as reusable algorithms

In Dalio's usage, "principles" function like human-readable algorithms for living and working. A principle is more than a vague value; it is a conditional rule that connects situations to actions: if you encounter X pattern in reality, then you should do Y. Over the decades, he catalogued such rules-about hiring, decision-making, risk management, and conflict resolution-and compiled them in his book "Principles: Life & Work". The aim is not to produce a rigid manual but to provide a library of tested responses to recurring problems.

Principles have several advantages over ad hoc reactions. First, they reduce the influence of transient emotions. When you decide in advance that you will, say, seek out the smartest opposing view before making a major decision, you are less likely to be swayed by fear or overconfidence in the moment. Second, they create consistency across time and context; different situations that share the same underlying structure can be handled in similar ways. Third, they are teachable. An organisation that shares the same principles can coordinate action without needing the founder to be involved in every decision.

In investing, this approach is visible in Dalio's emphasis on systematic decision-making and diversification. He advocates creating rule-based processes that can be stress-tested with historical data and executed via algorithms. Diversification across uncorrelated assets becomes a principle: do not rely on a single bet; spread risk so that no one error is catastrophic. When markets move against a position, the question is not primarily "How do I feel about this?" but "What did this movement reveal about the assumptions built into my decision rules, and what principle should be updated?"

Responsibility versus resentment

The quote draws a stark behavioural fork in the road: when frustrated, you can either remain in the emotional loop of anger and sadness or adopt the slower, more demanding posture of student of reality. Dalio explicitly urged his audience to "own" their lives and take responsibility for making them great. Responsibility, in this sense, is not a moral label but a practical stance: it is the decision to treat one's situation as improvable through better understanding and better principles, rather than as fixed by external forces.

Resentment has its own logic. It offers a sense of justification and identity: I am the sort of person to whom unfair things happen. It can also be politically potent, mobilising people against perceived injustices. But as a personal operating system, resentment is paralysing. It interprets every disappointment as confirmation of a hostile world, not as input for revised action. By contrast, the principle-based stance acknowledges constraints and injustice but asks, "Given that these features of reality exist, what is the best strategy for pursuing my goals within them?" That question restores agency without denying difficulty.

Strategic and technological tension

There is a deeper tension embedded in Dalio's approach: the balance between systematising reality and respecting its complexity. On the one hand, he argues that understanding how reality works is essential and that principles can be codified into increasingly sophisticated systems, including algorithms. Bridgewater has long used technology to turn qualitative insights into quantitative decision rules, effectively building models that transform messy market data into structured signals.

On the other hand, excessive faith in models can be dangerous. Economic and social systems are adaptive; participants change their behaviour in response to observed patterns. A trading rule that works today may fail once others imitate it. The more we encode our understanding of reality into rigid systems, the greater the risk that we will be blindsided by novel configurations that fall outside our priors. The strategic challenge is to hold principles strongly enough to provide guidance, yet lightly enough to revise them when evidence accumulates against them.

Dalio attempts to manage this tension through radical open-mindedness and continuous stress-testing of ideas. He encourages "radical transparency" inside Bridgewater, meaning that people at all levels are expected to challenge each other's thinking, including his own. This social technology aims to prevent the ossification of principles into dogma. If someone presents better evidence or a better model, the principle should change. In theory, this keeps the system adaptive: principles are not final truths but current best hypotheses about how reality works.

Debates and objections

Critics raise several objections to this principle-driven pragmatism. One is that it can slide into technocratic coldness. If all setbacks are treated as data and all relationships as arenas for truth-seeking, there is a risk of underweighting the emotional and moral dimensions of human life. Bridgewater's intense culture-of constant feedback, rigorous criticism, and high expectations-has been described as both transformative and punishing, depending on whom you ask. Some thrive in such environments; others experience them as dehumanising.

Another objection is structural. Not everyone has the same degree of control over their circumstances. Systemic inequities, institutional barriers, and sheer luck shape life chances. To tell someone in a constrained environment that they should simply "learn how reality works" and develop principles to get what they want can sound naive or cruel. The valid insight-that focusing on controllable factors is empowering-must be balanced with recognition of structural limits. A realistic principle-based approach should include, as part of its understanding of reality, an analysis of power, institutions, and collective action, not only individual grit.

A further critique questions whether complex human lives can, or should, be governed by explicit principles at all. Much of what makes people effective in relationships, creativity, or leadership is tacit knowledge: pattern recognition that resists codification. Over-formalising behaviour can produce rigid, instrumental ways of being that miss nuance. Dalio's own corpus partly acknowledges this by insisting that others develop their own principles, articulated in their own words, rather than simply copying his. The aim is to make explicit as much as is useful, while accepting that some wisdom remains intuitive.

Why it matters beyond finance

Despite these tensions, the underlying stance has wide relevance beyond hedge funds and business schools. In a world marked by rapid technological change, geopolitical instability, and shifting social norms, individuals and institutions are constantly confronted with outcomes they did not anticipate. Reacting with anger or despair is understandable but strategically inert. Developing a practice of decoding those outcomes into updated models and better principles is one of the few robust responses available.

For a young professional passed over for promotion, the principle-based approach might mean examining feedback, observing who does advance, and inferring what the organisation actually rewards. Perhaps the implicit success criteria differ from the formal job description. That insight can be turned into new principles: prioritise projects that demonstrate impact in ways the leadership values, cultivate specific relationships, or, if misalignment is severe, adopt the principle that one should move to an environment where desired behaviours are recognised.

For someone facing personal adversity-illness, financial setback, relationship breakdown-the same logic applies. The question is not whether the situation is fair; often it is not. The operative questions are: what constraints does this reality impose, what options remain, what skills or supports are missing, and what principles would reduce the probability or severity of similar outcomes in future? That might translate into principles about savings and diversification of income, about communication and boundaries in relationships, or about health habits and early detection.

At a societal level, this mindset underpins effective policy-making. When a programme fails to achieve its stated goals, the productive response is to ask which assumptions about human behaviour or institutional capacity were wrong and to update design principles accordingly. Blame and outrage may have their place in assigning responsibility, but they do not by themselves produce better systems. A society that treats failures as occasions for learning builds institutional memory and resilience; one that treats them only as scandals repeats them.

Building a personal system of principles

Practically, adopting the stance implied by the quote involves building one's own evolving set of principles. That starts with a habit of documentation: after meaningful successes or failures, write down what happened, why you think it happened, and what rule you wish you had followed. Over time, patterns emerge: recurring mistakes point to missing or flawed principles; recurring wins highlight reliable strategies. This is analogous to Dalio's long-term effort to record his insights from managing people and markets and then refine them into the corpus published in "Principles".

Next comes testing. A principle is a hypothesis: "If I do X in situations of type Y, outcomes will generally improve." Applying it consciously in future situations and observing the results either strengthens confidence or triggers revision. Discussion with others-especially those who disagree-serves as further stress-testing, similar to the way Bridgewater uses internal debate to improve decision rules. Over years, this process yields a personalised operating system calibrated to one's goals, strengths, and environment.

Finally, there is the matter of courage. Learning how reality works can be uncomfortable, because it often reveals ways in which we have been self-deceiving, underperforming, or complicit in our own problems. Developing principles that "work well to get you what you want" requires not only intellect but willingness to confront those truths and to change entrenched habits. Dalio's insistence on "going to the pain" is a recognition that, without this courage, the entire project stalls. Most people do not lack access to information about how the world works; they lack the willingness to apply that information consistently when it conflicts with short-term comfort.

The choice framed in the quote is therefore not a one-off decision but a recurrent one, embedded in daily life. Each time reality disappoints, we either deepen the groove of emotional reactivity or strengthen the muscle of inquiry and principle-building. Over years, those choices compound. One path leads to a life organised around narratives of grievance and helplessness; the other leads, imperfectly and with many detours, towards greater effectiveness and alignment between what we want and what we are able to achieve.

"When life doesn't give what you want, you can be angry or sad, or you can learn about how reality works and develop principles that will work well to get you what you want.” - Quote: Ray Dalio - Bridgewater Capital founder

‌

Quote: Victor Khosla - Strategic Value Partners

"There are entire sectors like private equity, like real estate, that are constipated. They can't sell." - Victor Khosla - Founder of credit investor Strategic Value Partners at SuperReturn Berlin 2026

Capital that cannot circulate starts to decay. In private markets, the inability to sell assets on a reasonable timetable is no longer a cyclical annoyance but a systemic constraint shaping strategy, governance, and even the survival prospects of whole franchises. Funds that once relied on brisk exits to prove their worth and raise ever larger vehicles are now confronting portfolios that linger, values that look aspirational rather than realisable, and investors whose patience is being measured in years rather than quarters. The blockage is not confined to a few misjudged deals; it runs through core sectors such as buyouts and real estate, where the exit machinery that underpinned the boom has stalled.

The anatomy of a blocked exit pipeline

The immediate problem is mechanical: there are too many assets that were acquired at peak or near-peak valuations and too few buyers willing to pay prices that preserve those marks. During the era of cheap money, sponsors could rely on abundant leverage, aggressive growth forecasts, and liquid secondary demand to justify paying high multiples for businesses and properties. When policy rates rose and financing costs reset, the arithmetic that once supported these prices no longer added up, but the portfolios built on the old assumptions remained.

Industry data show that the value of private equity sales has dropped by roughly a fifth in the recent period, even as the stock of unsold assets has accumulated. Managers are continuing to hold companies that, according to pre-2022 playbooks, would have been exited via trade sale, sponsor-to-sponsor deal, or IPO after a holding period of four to six years. Instead, those assets are being extended, refinanced, or shifted into continuation vehicles, often without crystallising the type of gains that limited partners had come to expect.

The same blockage is evident in institutional real estate, where higher interest rates compress valuations and increase financing costs, particularly for leveraged, income-producing assets. Buyers demand discounts to compensate for uncertainty and higher yields, while sellers are reluctant to transact at prices that would lock in visible write-downs. The result is a bid-ask spread that is wide enough to choke off volumes, leaving many buildings and portfolios in a state of limbo: not obviously distressed, but not evidently worth prior valuations either.

Victor Khosla and the vantage point of distressed credit

From the perspective of distressed and opportunistic credit investors, the current blockage is as much an opportunity as a diagnosis of systemic strain. Victor Khosla, founder and chief investment officer of Strategic Value Partners (SVP), has built his franchise precisely around capitalising on moments when traditional owners can no longer carry assets or debt structures set up under rosier assumptions. SVP specialises in distressed and deep value situations across corporate credit, private equity, and special situations, giving Khosla a panoramic view of where pressure is building and how it may resolve.

His characterisation of sectors such as private equity and real estate as effectively frozen speaks to more than a cyclical slowdown. It signals a view that many sponsors are trapped by their own historical decisions: valuations that were underwritten in a world of low discount rates and abundant leverage have become psychological anchor points that slow down the recognition of new realities. For opportunistic credit funds, that hesitation creates the potential for negotiated restructurings, discounted secondary trades, and control transactions once forced sellers emerge.

SuperReturn Berlin and the politics of liquidity

The comments arose at SuperReturn, the flagship gathering of the private capital industry in Berlin, where thousands of general partners and limited partners meet to negotiate, market, and assess the state of the asset class. This is not a neutral academic forum; it is where capital providers confront managers about delayed distributions, extended fund lives, and the apparent disconnect between reported valuations and observable exit outcomes. Against that backdrop, describing entire sectors as unable to sell pins the problem firmly on the supply side: sponsors unwilling to adjust, rather than a mysterious absence of buyers.

Several large buyout executives at the event emphasised that private equity will have to start capitulating on valuations to clear the backlog. The idea of capitulation implies not just incremental discounts but a recognition that the previous equilibrium on pricing was artificially supported by low rates and exuberant competition. For LPs, the politics of liquidity are now central. They entered the asset class on the promise of illiquidity premia and diversification, not of indefinite lock-up with opaque marks. Conferences that once celebrated record fundraising now feature sessions dedicated to portfolio liquidity, continuation vehicles, and secondaries as release valves.

From virtuous cycle to negative feedback loop

In the boom era, private equity operated in a self-reinforcing loop. Strong distributions and mark-ups allowed LPs to recycle capital into new funds, which in turn justified higher fundraising targets and more aggressive deal-making. As long as exit markets remained open, the cycle sustained itself, even if leverage and valuations crept steadily upward. When exits slow materially, that virtuous loop can flip into a negative feedback process.

LPs with limited cash inflows from distributions become more selective, committing less to new vehicles or insisting on tighter terms and more conservative pacing. GPs, facing slower fundraising, are pressured to generate liquidity to preserve their franchise value, but they can only do so by selling assets at prices that may require write-downs. If they resist, they risk sliding into the category of so-called zombie funds: vehicles that hold ageing assets, charge fees, but generate little in the way of exits or performance fees.

One can view the situation through a simple cash flow lens. Let represent the net distributions to LPs at time , net of capital calls. In the expansion phase, many funds operated with patterns where for most post-investment years, allowing LPs to maintain or grow commitments without increasing their net exposure. As exits have stalled, has turned negative for many investors, even as their reported net asset value remains high. The resulting denominator effect, where private market allocations swell relative to public portfolios due to market moves and valuation lag, further constrains their capacity to recommit.

Real estate: the silent partner in the blockage

Real estate, especially in sectors affected by structural shifts such as offices and certain retail formats, amplifies this dynamic. Rising financing costs and changing usage patterns have forced investors to reassess long-term income assumptions. Yet many portfolios are still carried at values that assume moderate yield expansion and stable occupancy, rather than a more fundamental repricing. This creates a situation where transactions that would reveal more dramatic value adjustments are avoided, reinforcing the freeze.

For leveraged owners, the core equation linking asset values, loan-to-value ratios, and covenant headroom has shifted. If one denotes the market value of a property as and the outstanding debt as , covenants may require that , where is the maximum permissible leverage ratio. When values fall while debt remains fixed or only slowly amortising, owners can find themselves breaching or approaching breach, even when cash flows have not yet collapsed. This increases dependence on lender forbearance or restructuring and makes it harder to transact at realistic values without triggering technical defaults.

In turn, lenders and debt investors must choose between extending and pretending, injecting additional capital, or forcing asset sales into thin markets. Distressed and opportunistic funds are watching closely, as each of these choices can convert illiquid mark-to-model valuations into executable deals, often at prices that reflect distress rather than orderly value.

Strategic responses: write-downs, secondaries, and continuation funds

One of the major tensions highlighted by the current environment is between short-term reputational pain and long-term franchise survival. General partners who accept material write-downs today may suffer in near-term performance league tables and carried interest prospects, but they regain the ability to sell assets, return capital, and reset expectations. Those who hold onto legacy valuations may postpone the day of reckoning but risk compounding the problem as fund lives lengthen and LP pressure grows.

The secondary market has become a crucial adjustment mechanism. LPs seeking liquidity can sell their fund interests at discounts to net asset value, transferring the risk and potential upside of ageing portfolios to specialised buyers. Similarly, GP-led secondaries and continuation vehicles allow sponsors to extend ownership of particular assets while providing partial liquidity to existing investors. Yet these structures are themselves constrained by the same valuation debate: at what price should stakes be transferred, and to what extent should discounts acknowledge the illiquidity and uncertainty embedded in the asset?

Continuation funds in particular raise governance questions. When a GP sells an asset from one of its funds into a vehicle it also controls, conflicts become inherent. Independent fairness opinions and auction processes are used to mitigate these issues, but if underlying assets are marked aggressively, continuation transactions can be perceived as a way of avoiding the full recognition of market-clearing prices rather than a genuine value-maximising strategy.

Technological and structural drivers behind the blockage

Beyond interest rates and valuations, there are deeper technological and structural forces contributing to the logjam. In some sectors, especially software and digital infrastructure, business models that were once rewarded with very high growth multiples have shifted into slower, more cash-generative phases. Sponsors that underwrote deals on the assumption of sustained hyper-growth and rapid multiple expansion now face businesses that are solid but not explosive, making the exit case less straightforward.

At the same time, the rise of direct investing by large institutions, sovereign wealth funds, and family offices has altered the buyer universe. Some of these investors are less willing to participate in sponsor-to-sponsor trades at high multiples, preferring direct origination or co-investments where fees and governance are more aligned. This reduces one of the key exit routes that underpinned the previous boom, increasing dependence on trade buyers and public markets, both of which are sensitive to macro conditions and sector-specific narratives.

Advances in data and analytics, while powerful, also create more scrutiny. Prospective buyers now have access to richer operational and market datasets, enabling more granular stress-testing of revenue, margin, and cash flow scenarios. In a risk-off environment, such tools often lead to more conservative underwriting rather than more aggressive bidding, as buyers quantify downside scenarios more explicitly. The informational asymmetry that once allowed sellers to market a growth story with limited transparency has narrowed, making it harder to justify top-of-cycle valuations.

Debates and objections: is the problem overstated?

Not everyone accepts the narrative of pervasive constipation. Some argue that the industry is simply adjusting to a new rate regime and that exit volumes, while lower than peak, remain within historical norms when viewed over a longer horizon. They point out that selected sectors, such as energy transition, infrastructure, and certain niches in technology and healthcare, continue to see robust activity at sensible valuations. From this perspective, the blockage is concentrated in specific vintages and strategies that overreached, rather than an indictment of the asset class.

Others emphasise that the illiquidity of private markets is part of their design, arguing that an excessive focus on near-term exits risks undermining the long-term value creation thesis. They highlight cases where patient capital and operational improvement over extended holding periods have yielded strong outcomes, even without rapid flips. According to this view, pressure from LPs for faster distributions may be partly driven by their own allocation and liquidity management issues, rather than by an inherent failure of private equity or real estate strategies.

A further objection is that commentary from distressed and opportunistic investors is not disinterested. Those who stand to benefit from forced selling have an incentive to emphasise the scale of the problem and to urge capitulation on price. While this does not invalidate the diagnosis of a backlog, it suggests that statements need to be interpreted in the context of strategic positioning: what is a blockage for one segment of the market is a source of deal flow for another.

Why the blockage matters for the broader financial system

Despite those objections, the build-up of unsold assets in private equity and real estate has implications that go well beyond individual firms or funds. Institutional portfolios globally have significant exposures to private markets, often via defined benefit pension schemes, insurers, endowments, and sovereign funds. When exit channels narrow, these investors face a mismatch between their assumed liquidity profile and realised cash flows. For pension schemes in particular, the experience of backlogs in other risk-transfer markets, such as bulk annuity transactions, underscores how a crowded pipeline can delay strategic plans.

If LPs find themselves over-allocated to illiquid assets for prolonged periods, they may respond by reducing future commitments, driving a downshift in fundraising across the industry. Managers with strong track records and differentiated strategies may adapt, but marginal players could struggle to survive, leading to consolidation or outright failures. Bank and non-bank lenders with significant exposure to leveraged loans and commercial real estate debt must also manage the risk that delayed exits extend credit risk horizons and increase the probability of restructurings or losses.

There is also a macroeconomic angle. When significant pools of capital are trapped in assets that cannot be repriced or redeployed efficiently, capital allocation across the economy becomes less dynamic. Companies that might benefit from new investment may struggle to attract it if capital remains locked in legacy deals, while real estate that could be repurposed or redeveloped remains under the control of owners reluctant to crystallise losses. Over time, such frictions can dampen productivity, slow the reallocation of resources, and hinder the adjustment to new technological and societal patterns of demand.

Paths to resolution: price discovery and structural change

Ultimately, resolving a systemic backlog in private markets requires price discovery. As more transactions clear at realistic levels, valuation benchmarks will adjust, and the psychological barrier to accepting lower prices will weaken. This process can be painful, particularly for vintages funded at the peak of the cycle, but it is a prerequisite for restoring the circulation of capital. Several mechanisms are likely to play a role.

First, a wave of consensual restructurings and recapitalisations can realign capital structures with current cash generation and asset values, often accompanied by new equity injections from opportunistic investors. Second, distressed sales triggered by covenant breaches, fund-life constraints, or lender pressure will provide transparent reference points for pricing, even if they occur under duress. Third, regulatory and accounting developments that encourage more conservative valuation practices can push managers to update marks closer to observable transaction levels, even in the absence of exits.

On a structural level, the industry may move towards vehicles with more flexible liquidity features, such as evergreen funds, listed private markets vehicles, or hybrid structures that blend open-ended and closed-ended characteristics. While such innovations cannot eliminate the fundamental illiquidity of underlying assets, they may distribute the timing risk more broadly and reduce the concentration of exit pressure at the end of traditional fund lives.

There is also scope for more integration between public and private markets. Dual-track processes, public-to-private cycles, and the use of listed feeders or co-investment vehicles can create additional degrees of freedom for sponsors and investors. However, these tools work only if public markets are themselves receptive and if valuations in public and private domains converge sufficiently to allow arbitrage-free transitions.

Why the metaphor resonates now

The characterisation of private equity and real estate as jammed speaks to a larger anxiety: that a model built on constant motion has met a structural speed limit. Years of easy money encouraged an expansion of private markets that outpaced the development of corresponding exit channels. As the cost of capital reset, the imbalance became visible. Whether the current phase is remembered as a temporary blockage or as the tipping point towards a smaller, more disciplined industry will depend on how quickly sponsors accept new pricing realities and how effectively capital can be redeployed.

For LPs, regulators, and policymakers, the episode is a reminder that illiquidity is not a passive characteristic but an active risk factor. Promises of double-digit returns and diversification benefits cannot be divorced from the practical question of when and how capital comes back. The answer, in the coming years, will depend on the willingness of managers to trade short-term discomfort for long-term viability, and on the capacity of specialised investors to absorb, restructure, and eventually recycle assets that others can no longer afford to hold.

“There are entire sectors like private equity, like real estate, that are constipated. They can’t sell.” - Quote: Victor Khosla - Founder of credit investor Strategic Value Partners at SuperReturn Berlin 2026

‌

Quote: Editorial - Workers' Daily

"The benefits of technological advancement should be shared by society as a whole, rather than becoming a tool for a small number of employers to undermine workers' rights." - Editorial - Workers' Daily - the official mouthpiece of China's umbrella trade union organisation

Across recent waves of automation, the most enduring fault line has not been whether machines can perform particular tasks, but who captures the savings and productivity gains they generate. In the context of artificial intelligence, that question has become sharply political in China, where rapid deployment of generative systems collides with a labour market still shaped by industrial restructuring, a large pool of precarious workers, and an official commitment to social stability. As AI tools move from experimental pilots into core business processes, the distribution of their benefits increasingly determines whether technological change is experienced as opportunity, dispossession or something in between.

The immediate backdrop is a surge in corporate enthusiasm for AI as a way to compress labour costs in sectors ranging from customer service and data labelling to logistics and back-office administration. For firms operating on thin margins, replacing call centre agents or content moderators with AI-based systems can promise savings running to tens of millions of yuan across a workforce of 10 000 people. When those savings are not matched by new protections, retraining or bargaining power for employees, the technology becomes, in practice, a lever to shift income and security away from workers and towards shareholders and senior management.

China's experience with earlier rounds of restructuring provides a powerful memory of how such transitions can play out. The lay-offs of state-owned enterprise workers in the 1990s and early 2000s created a generation marked by insecurity and grievance, even as the broader economy grew. That legacy shapes today's anxiety about AI: mass replacement of workers by automated systems risks not only individual hardship but also heightened social tension, particularly in cities where service jobs have become a crucial absorption mechanism for rural migrants and displaced industrial labour. Against that backdrop, the insistence that technological progress must not become a tool for undermining workers' rights takes on a concrete, rather than abstract, urgency.

From productivity promise to labour displacement risk

AI, especially in its generative form, promises major gains in efficiency by automating routine cognitive tasks: drafting basic responses, triaging customer queries, generating boilerplate documents, and analysing large volumes of text or image data. For employers, the economic logic is clear. If an AI system can handle the work of, say, five junior clerks at a fraction of their combined salary and without social insurance contributions, it becomes hard to resist. Even modest improvements in model accuracy can make it rational, from a firm's perspective, to restructure departments and reduce headcount.

The risk is that this rationality at the firm level is socially destructive when replicated across thousands of enterprises. If many employers simultaneously replace segments of their workforce with AI, displaced workers may find that the new jobs created by the technology are too few, too specialised, or require skills they do not yet possess. The labour market then adjusts not through smooth transitions into higher-value roles but through periods of unemployment, downward wage pressure and an expansion of informal or precarious work.

The tension is sharpened in China, where the state retains a strong directive role and where social stability is treated as a central policy objective. Government advisers have begun warning that AI applications introduced solely to reduce labour costs, without improving services or sustainability, should face close scrutiny or explicit constraints. That stance implicitly challenges a purely market-driven allocation of the benefits of AI: cost savings cannot simply be converted into higher profits while workers absorb all of the transition costs.

Legal red lines and the politics of dismissal

This emerging political stance has started to crystallise in legal outcomes. Chinese courts have ruled that employers cannot justify firing workers simply because AI has rendered their current tasks redundant. In a high-profile case in Hangzhou, a technology company attempted to dismiss an employee after introducing AI tools that took over much of his work, offering a reassignment that involved a steep salary cut. The intermediate court found the dismissal unlawful, holding that an employer's decision to adopt AI does not constitute the kind of external, uncontrollable change that would justify redundancy under labour law.

Crucially, the court also rejected the idea that a substantial pay cut hidden inside a reassignment offer is an acceptable way to shift the costs of technological change onto workers. By treating such a move as effectively a constructive dismissal, the judgment signalled that firms cannot rely on AI as an accounting trick to convert stable jobs into worse-paid, less secure positions while claiming continued compliance with formal employment rules.

This jurisprudence sits on top of a broader legal framework in which the right to continued employment is embedded more robustly than in many market economies. China's Labour Contract Law places strong limits on arbitrary dismissal and treats job security as a social good, not merely a private contract matter. In the AI context, that framework acts as a guardrail: technological adoption is permitted and even encouraged, but not as a justification for offloading adjustment costs onto employees without negotiation or compensation.

Trade union positioning and official narratives

Within this legal and political environment, the rhetoric of China's official trade union structures carries particular weight. As the mass representative body for workers, closely intertwined with state institutions, the umbrella trade union organisation operates less as an adversarial bargaining agent and more as a channel through which policy priorities are communicated and reinforced. When its official newspaper warns against using technological advancement to erode workers' rights, it is not merely expressing a moral view; it is aligning labour discourse with the state's interest in maintaining legitimacy and social stability during a disruptive technological transition.

The editorial stance reflects both defensive and strategic considerations. Defensively, the union apparatus needs to demonstrate that it can protect workers from the most predatory forms of AI-driven restructuring, or risk appearing irrelevant to a younger generation of employees in digital and service sectors. Strategically, it participates in shaping a narrative in which China can pursue technological leadership while claiming a distinctive social model, one that contrasts with images of unregulated automation and gig work precarity elsewhere.

This positioning does not erase underlying tensions. Many workers are not formally unionised or lack genuine bargaining leverage within their workplaces, especially in private tech firms and platform companies. Enforcement of labour protections remains uneven, and there is a large grey zone where AI tools intensify workloads or surveillance without leading to outright dismissals. Nevertheless, the articulation of a principle that technological gains should be socially distributed rather than concentrated provides a reference point for workers, regulators and courts.

Algorithmic control, visibility and hidden labour

Debates about sharing the benefits of AI often focus on headline figures: how many jobs might be replaced, how much productivity might increase, what fraction of GDP growth can be attributed to automation. Yet a substantial part of AI's labour impact lies not in visible job losses but in the reconfiguration of work through algorithmic control and the creation of largely invisible labour behind the systems.

Chinese-developed AI models depend on vast quantities of human-labelled data, content moderation, and platform maintenance work, much of it outsourced, low-paid and weakly protected. These workers, who generate the training data that underpin sophisticated models, often lack clear paths to benefit from the subsequent productivity gains. Meanwhile, for employees in call centres, warehouses or delivery services, AI-powered scheduling and monitoring tools can increase pressure, reduce autonomy and tighten performance metrics, even if headcount stays constant.

In this sense, the risk is twofold. First, AI can be used to directly displace segments of the workforce. Second, even where jobs remain, AI systems can become instruments for intensifying labour, extracting more output per worker without commensurate increases in pay or security. When trade unions and courts insist that AI should not be turned into a tool for undermining rights, they are implicitly contesting both forms of impact: the overt substitution of machines for people and the subtler erosion of working conditions through algorithmic management.

Economic models of sharing technological gains

In economic terms, the argument centres on how the surplus generated by AI is divided between capital and labour. If the introduction of an AI system raises a firm's output or reduces its costs, the additional surplus can be decomposed into higher profits, increased wages, better working conditions, lower prices for consumers, or public revenues via taxation. Absent countervailing power, the default tends to favour capital holders and senior executives.

Some analysts formalise this distribution using production functions in which output depends on both traditional labour and an AI-enhanced component, with bargaining over wages determining how gains are shared. While the specific mathematics may vary, the underlying logic is straightforward: when workers lack the power to insist on a share of productivity improvements, their relative position deteriorates even if overall economic output rises. In that scenario, technological advancement can coexist with stagnant or falling real wages and heightened insecurity for large segments of the workforce.

China's institutional framework gives the state tools to intervene in this distribution. Minimum wage policies, social insurance requirements, and judicial interpretations of labour law all shape the bargaining landscape in which AI adoption occurs. If courts repeatedly rule that firms cannot dismiss workers or cut salaries under the pretext of technological change, employers may be pushed to find ways to use AI that complement, rather than replace, existing staff: reassigning employees to higher-value tasks, investing in reskilling programmes, or reducing working hours without cutting pay.

Such complementarity is not automatic and may be more feasible in some sectors than others. High-skill knowledge work, for instance, may lend itself to human-AI collaboration in which professionals use generative tools to enhance their output, while retaining control over final decisions. Low-skill routine work, by contrast, may be more susceptible to straightforward automation. The underlying normative claim - that benefits should be shared - thus implies a need for targeted policies to prevent sectors at greatest risk from bearing disproportionate costs.

Counterarguments and employer perspectives

Critics of strong labour protections in the context of AI often argue that constraining firms' ability to restructure will slow innovation, reduce competitiveness and ultimately harm workers by limiting growth and job creation. From this perspective, allowing employers to freely adopt AI and adjust their workforce is necessary to ensure that domestic firms can keep pace with international rivals, especially in sectors where global competition is fierce.

Employers may also contend that attempts to legislate or adjudicate against AI-driven redundancies misunderstand the reality that technological change inherently involves creative destruction. Protecting existing roles too rigidly, they argue, risks locking labour into obsolete tasks and freezing the economy in less efficient configurations. Some worry that courts declaring AI-driven restructuring illegitimate could create uncertainty, deter investment and encourage firms to relocate to jurisdictions with more permissive regimes.

These objections highlight a genuine tension: how to maintain dynamism and openness to innovation while preventing technology from becoming a one-way conduit for transferring risks and costs onto workers. Proponents of stronger protections respond that the choice is not between technological stagnation and unfettered automation. Rather, it is between different institutional arrangements for managing transitions - some of which spread adjustment burdens across firms, workers and the state, and others of which concentrate them on those with the least bargaining power.

International comparisons and distinctive features

Globally, debates over AI and labour rights span a spectrum. Some advanced economies emphasise data protection and algorithmic transparency, while leaving employment relationships largely governed by existing redundancy and discrimination rules. Others focus on reskilling and social safety nets, aiming to ease transitions without imposing strong constraints on firms' restructuring choices. China's emerging approach, with its combination of strong formal limits on arbitrary dismissal and an official labour narrative emphasising social stability, occupies a distinctive position on this spectrum.

One distinctive feature lies in the central role of the state in both promoting AI development and arbitrating its labour impacts. Government plans and industrial policies actively support AI research, infrastructure and deployment, while the same state apparatus, through courts and labour agencies, signals boundaries for how far employers can go in using AI to cut jobs. This dual role can generate tensions - for instance, between local governments eager to attract high-tech investment and national-level concerns over social unrest - but it also means that labour impacts are not treated as an afterthought.

Another distinguishing element is the prominence of social stability as an explicit policy objective. In a political system highly sensitive to large-scale unemployment or visible worker protests, there is a pragmatic incentive to ensure that AI does not trigger sudden waves of dispossession. Protecting workers from being summarily replaced by AI can thus be understood not only as a matter of justice but also as a tool of risk management for the state.

Why the distributional question matters now

The urgency of clarifying how AI's benefits are shared arises from the speed and breadth of current deployment. Unlike earlier waves of automation targeted mainly at manufacturing or narrowly defined clerical tasks, contemporary AI systems reach into creative industries, legal and financial services, health care administration and even elements of management decision-making. As more layers of the economy become susceptible to algorithmic substitution or augmentation, the number of workers whose roles are reshaped by AI expands far beyond a single sector.

In this context, a clear public stance that technological advances should not be repurposed as mechanisms for eroding labour rights performs several functions. It sets expectations for employers considering AI-driven restructuring, warning that they may face legal and reputational risks if they treat workers as disposable inputs in a cost-minimising exercise. It gives workers and their representatives a discursive and legal basis to challenge unfair practices, from unjustified lay-offs to exploitative algorithmic management. And it positions the broader society to debate, rather than passively accept, the terms under which AI is integrated into everyday economic life.

There is no guarantee that such a stance will fully prevent the concentration of technological gains in the hands of a small group of actors. Powerful firms with access to capital, data and engineering talent remain well-placed to dominate AI markets and capture outsized returns. However, by embedding labour-protective principles in law, judicial practice and official discourse, China is attempting to tilt the playing field away from a pure race to the bottom in labour costs. Whether this experiment succeeds will depend not only on rulings and editorials but on continuous enforcement, worker organisation and the willingness of policymakers to adjust course as new challenges emerge.

What is already clear is that the contest over AI is not only about whose models are most powerful or whose infrastructure is most advanced. It is also about whose lives improve, whose become more precarious, and who has a say in that process. Framing technological advancement as something to be shared by society at large, rather than wielded as a tool to weaken the bargaining power of those who perform the work, draws a line between two possible futures: one in which AI deepens existing hierarchies of power, and another in which its benefits are mediated through institutions that recognise workers not as expendable inputs, but as stakeholders with rights that do not vanish when a new machine comes online.

“The benefits of technological advancement should be shared by society as a whole, rather than becoming a tool for a small number of employers to undermine workers’ rights.” - Quote: Editorial - Workers' Daily - the official mouthpiece of China's umbrella trade union organisation

‌

Term: Perplexity - Artificial intelligence

"In computer science and natural language processing, 'perplexity' is a mathematical measurement of uncertainty. It evaluates how effectively an AI model predicts the next word in a sequence or sentence." - Perplexity - Artificial intelligence

Human language is structured enough to be predictable yet rich enough to surprise. Any system that tries to generate or understand text must therefore manage a tension between recognising familiar patterns and handling rare or novel expressions. The central technical question is how to measure whether such a system is predicting linguistic continuations in a way that aligns with real usage, rather than merely memorising or guessing.

Perplexity enters at precisely this point as a quantitative lens on predictive behaviour. It converts a model's entire probability distribution over possible next words into a single scalar that captures how uncertain, or "confused", the model is when faced with real data. Low values indicate that the model assigns high probability to what humans actually say; high values indicate that the model spreads probability mass thinly or places it on implausible options. Because this single figure is computed systematically across large corpora, it has become deeply embedded in how researchers train, compare, and refine language models.

Uncertainty, surprise, and predictive distributions

Any predictive text system maintains, implicitly or explicitly, a probability distribution over possible next tokens given the context. If we denote the context by (for "history") and the next token by , the system internally represents a conditional distribution . This distribution encodes both what the system believes is likely and how strongly it believes it. When the actual next word appears, the value is a direct measure of how well its expectations matched reality.

Entropy and related information-theoretic quantities provide a way to aggregate these local assessments. The Shannon entropy of a distribution over a discrete vocabulary is defined as . This quantity grows when the distribution is flatter (more uncertain) and shrinks when it is sharply peaked around a few options (more certain). However, entropy is expressed in bits or nats depending on the logarithm base, which is not immediately intuitive. Perplexity bridges this gap by re-expressing entropy as an effective number of equally likely choices.

Perplexity in mathematical terms

Formally, perplexity is defined as the exponential of the average negative log probability that a model assigns to a sequence of tokens. Suppose we have a sequence drawn from a test corpus, and a model that assigns probability to each token given its history . The average negative log-likelihood per token is

Perplexity is then defined as

If natural logarithms are used, perplexity corresponds to raised to the entropy rate; if base-2 logarithms are used, perplexity is , where is the entropy in bits. Intuitively, a perplexity of, say, 50 means that the model behaves as if it were choosing among about 50 equally likely options at each step, even though in reality its distribution may be uneven.

Several properties follow immediately from this definition. Perplexity is always at least 1, with 1 corresponding to a model that assigns probability 1 to the correct next word at every position. It is minimised when the model represents the true data-generating distribution and grows as the model's predictions deviate from the empirical distribution. Because it aggregates over the entire sequence, it penalises systematic miscalibration rather than occasional errors.

Relation to log-likelihood and cross-entropy

Perplexity is closely linked to standard statistical objectives used in training language models. Maximum likelihood training attempts to find parameters that maximise the log-likelihood . Minimising the average negative log-likelihood is equivalent to minimising cross-entropy between the empirical data distribution and the model's distribution. Because perplexity is a monotonic transformation of , minimising perplexity is identical to maximising likelihood or minimising cross-entropy.

This equivalence is practically important. During optimisation, models are updated to reduce loss, but when results are reported to other researchers or stakeholders, perplexity provides a more interpretable metric. A drop in perplexity from, say, 80 to 40 is easy to understand as halving the effective number of equally likely options, which in turn suggests much sharper predictions. This interpretability makes perplexity a convenient benchmark when comparing architectures, training regimes, or datasets.

Practical meaning for model quality

In applied natural language processing, perplexity is used in two broad ways: as a training objective proxy and as a diagnostic for model behaviour. During development of language models, improvements in architecture or training data selection are often evaluated by their effect on perplexity on held-out corpora. Lower perplexity generally correlates with better performance in generative tasks such as language modelling, text completion, and machine translation.

For example, when an organisation transitions from a smaller recurrent neural network to a transformer-based model, it typically observes a substantial drop in perplexity on standard benchmarks. This reduction is evidence that the new model captures longer-range dependencies and richer linguistic structure. In turn, this tends to yield more fluent text generation and more accurate predictions of rare but grammatically and semantically appropriate tokens.

Perplexity also functions as a sanity check on training dynamics. Sudden spikes can indicate numerical instability, data pipeline corruption, or misconfigured learning rates. Gradual stagnation of perplexity during training suggests that the model has reached the limits of what can be extracted from the current data with the given capacity. Monitoring perplexity across domains or languages can reveal where a model is under-exposed or miscalibrated, guiding targeted data collection or fine-tuning.

Local versus global uncertainty

Although perplexity is defined as a global average over a test set, it is often insightful to inspect its local contributions. The term can be interpreted as the surprise associated with observing given the context. High local surprise may stem from genuinely rare words, idiomatic expressions, abrupt topic shifts, or areas where the training data was thin. By examining segments with high average surprise, practitioners can diagnose specific weaknesses such as poor handling of code-switching, domain-specific jargon, or unusual syntactic patterns.

This local view is crucial for understanding that models with similar overall perplexity may fail in different ways. Two systems might achieve comparable averages yet differ sharply in how they allocate uncertainty: one may be consistently moderately uncertain, while another is confident most of the time but catastrophically wrong in certain regimes. Perplexity alone does not distinguish these patterns, prompting complementary analyses such as calibration curves, error typologies, and task-specific evaluations.

Parameter meanings and modelling choices

In the classical statistical language modelling literature, perplexity often appears in the context of n-gram models. An n-gram model approximates the conditional distribution by considering only the previous tokens: . Parameters in such models are counts and smoothing coefficients that adjust for sparsity. Perplexity provides a direct way to quantify how well these approximations capture real sequences and how much improvement is obtained by increasing or introducing better smoothing.

In modern neural language models, parameters are continuous weights in deep architectures. Although there is no closed-form mapping from individual parameters to perplexity, some structural choices have predictable effects. Increasing model width and depth, enlarging context windows, and using richer positional encodings typically reduce perplexity up to a point, after which overfitting and diminishing returns appear. Likewise, training on larger and more diverse corpora tends to lower perplexity, but domain mismatch between training and evaluation data can negate these gains.

Temperature and related decoding parameters, which control randomness in sampling, do not affect perplexity directly because perplexity is calculated on the underlying distribution, not on generated samples. However, severe miscalibration in the distribution - for example, distributions that are too flat or too peaked - will show up as elevated perplexity relative to an ideal model.

Competing and complementary evaluation metrics

Despite its widespread use, perplexity is not a universal proxy for downstream task performance. Many benchmarks in natural language processing involve structured prediction, reasoning, or interaction with users, for which task-specific metrics are more appropriate. Accuracy, F1 score, BLEU, ROUGE, and human judgements of fluency or relevance often provide a more direct assessment of practical utility.

There are two main limitations in interpreting perplexity. First, it is sensitive to the tokenisation scheme: models that operate on different vocabularies - words, subwords, or characters - are difficult to compare directly. A model with a finer-grained tokenisation may have higher perplexity per token but similar or better performance when measured per character or per word. Second, perplexity ignores meaning: assigning very high probability to fluent but semantically inappropriate continuations can still yield favourable perplexity scores if they match surface statistics.

These limitations have led to a view in which perplexity is necessary but not sufficient. It remains valuable as a basic measure of language modelling quality, especially for comparing variants of the same model family under identical tokenisation and data conditions. However, it is supplemented by application-specific evaluations that capture semantics, factual accuracy, and robustness.

Perplexity in the broader AI landscape

As large language models have moved from research prototypes to widely deployed systems, perplexity has acquired a dual role. It continues to serve as a core internal metric during pretraining on massive corpora, where its reduction signals better compression of linguistic regularities and more efficient representation learning. At the same time, its direct visibility to end users has diminished, replaced by qualitative assessments of helpfulness, harmlessness, and reliability.

Behind the scenes, however, perplexity still matters for engineering decisions. Model distillation, where a large model trains a smaller one, often relies on matching probability distributions and thus on controlling perplexity gaps between teacher and student. Domain adaptation, where a general model is fine-tuned on specialised text such as legal or medical documents, is evaluated by domain-specific perplexity improvements. Even in retrieval-augmented systems, where external information is fetched at query time, perplexity on the combined context-plus-document input informs how well the model integrates retrieved evidence.

In interactive settings, such as conversational agents and AI-powered search tools, perplexity can be monitored as a proxy for the model's comfort level with a query. High perplexity on user instructions or on retrieved content may indicate that the model is extrapolating far beyond its training distribution, which correlates with greater risk of hallucination or misinterpretation. This has motivated research into using perplexity-like measures to trigger fallback behaviours, such as requesting clarification, restricting outputs, or escalating to human review.

Debates and tensions around its use

The reliance on perplexity has sparked several debates in the research community. One line of argument holds that over-optimising for perplexity can encourage models that excel at shallow pattern matching but underperform on compositional reasoning or factual consistency. Since perplexity is indifferent to whether a prediction is logically grounded, it may reward models that memorise long-range patterns in the training data without learning general principles.

Another concern is distributional shift. Perplexity is usually measured on static test sets, but deployed systems face evolving language, emerging topics, and changing user behaviour. A model with strong perplexity numbers on past news articles, for example, may exhibit much higher perplexity on discussions of novel technologies, slang, or events that occurred after its training cut-off. This gap underscores the need for continual evaluation and possibly continual training, as well as for metrics that better track real-world performance.

There is also a methodological tension between comparing models on standardised benchmarks and tailoring evaluation to specific deployment contexts. Standard corpora facilitate apples-to-apples perplexity comparisons across architectures and research groups, but they may not reflect the specialised domains where a model will actually operate. Conversely, domain-specific corpora provide more relevant perplexity estimates but reduce comparability. Balancing these considerations remains an active area of practice rather than a solved theoretical issue.

Why perplexity still matters

Despite critiques, perplexity continues to occupy a central position because it aligns naturally with how generative language models are trained and used. It connects the statistical foundations of probability distributions and entropy with practical questions about predictive performance. It is simple enough to compute and interpret, yet sensitive enough to reveal meaningful differences when architectures, datasets, or optimisation strategies change.

Moreover, perplexity reflects a deeper conceptual challenge in artificial intelligence: representing and managing uncertainty in complex, structured domains. Human communicators constantly navigate uncertainty about what will be said next, what listeners already know, and how a conversation might unfold. Language models, though purely computational, must grapple with the same uncertainty at scale. Perplexity does not capture everything about this process, but it provides a disciplined way to quantify how well models anticipate linguistic realities.

As AI systems expand beyond text into multimodal settings and more interactive applications, analogous questions arise about predicting the next frame in a video, the next action in a sequence, or the next user response in a dialogue. Extensions of perplexity and its underlying cross-entropy framework are likely to remain part of the evaluative toolkit in these areas as well. In that sense, understanding perplexity is not only about language models but about a broader approach to measuring how effectively artificial systems handle uncertainty in open-ended environments.

‌

Quote: Bryan Catanzaro - Vice president of applied deep learning at Nvidia

"For my team, the cost of [AI] compute is far beyond the costs of the employees." - Bryan Catanzaro - Vice president of applied deep learning at Nvidia

In the current generation of artificial intelligence deployment, the binding constraint for many organisations is no longer talent but access to affordable compute at scale. That inversion of the traditional cost structure is what turns remarks from senior technologists into a broader economic signal: for some cutting-edge teams, the recurring bill for accelerated hardware, cloud instances, networking, and power now exceeds the wage bill for the engineers designing and operating the systems. This is not just an accounting oddity; it alters how firms evaluate automation, how investors price AI strategies, and how policymakers should interpret predictions of rapid job displacement.

From cheap silicon and expensive people to expensive silicon and leveraged people

For several decades, digital transformation followed a familiar pattern: hardware and basic infrastructure costs per unit of computation fell predictably, while skilled labour remained scarce and expensive. Organisations hired more software engineers to sit atop an increasingly cheap computational substrate. In that world, the canonical argument for automation was straightforward: once a process was codified in software, the marginal cost of running it again was negligible compared with paying an additional human to do the same work.

What has changed with frontier AI systems is the scale and intensity of computation required to deliver competitive performance. Training large language models and vision systems involves running vast numbers of parallel operations across specialised GPUs and associated accelerators, often for weeks, on clusters that can cost tens or hundreds of millions of dollars in capital expenditure for hyperscalers and large enterprises. Even when organisations do not own the hardware, their cloud providers pass through these capital and operating costs via metered pricing. As a result, the unit of analysis for AI economics has shifted from "salary per employee" to metrics such as cost per GPU hour, cost per million tokens, and cost per inference request.

Industry observers now describe AI infrastructure as a new class of heavy industry: data centres designed around specialised accelerators, redundant power feeds, and advanced cooling, with aggregate global spending that consultancy estimates place in the trillions of dollars by 2030. That capital intensity explains why some AI teams report that, at the margin, paying for more compute to improve a model or scale an application is financially weightier than hiring additional engineers to refine prompts, build interfaces, or clean data.

Why compute dominates the cost stack

There are several mechanisms behind compute eclipsing labour in AI projects.

First, state-of-the-art models are extremely large. Modern large language models and multi-modal systems often contain hundreds of billions or even 1 trillion parameters. Each training run requires repeated passes over large datasets, with backpropagation and optimisation algorithms applied across every parameter. The computation required scales roughly linearly with the number of parameters and data tokens, and in practice teams often run multiple training, fine-tuning, and ablation cycles. That translates into millions of GPU hours, even when using the most efficient hardware and software stacks available.

Second, inference - the process of serving model outputs to users - imposes ongoing costs that grow with adoption. Training is a one-off or periodic capital-like expense, but inference is an operational expense that scales with queries. Industry frameworks therefore emphasise cost per token as a central metric: the all-in cost to produce each output token, incorporating hardware depreciation, energy, data centre overheads, networking, and software optimisation. Even small differences in cost per million tokens can compound dramatically when applications serve millions of users or integrate AI into high-frequency workflows.

Third, the energy footprint of frontier AI is substantial. High-end GPUs draw significant power, and data centres require additional energy for cooling and ancillary systems. Energy prices vary geographically, but in many locations they are high enough that power constitutes a large share of the total cost of ownership for AI infrastructure. Analysts have therefore started speaking of "intelligence per megawatt" as a key performance dimension. When firms compare this stack of costs with the wages of knowledge workers, the balance can tilt unexpectedly toward hardware and energy spending.

Fourth, there is a structural asymmetry between compute and labour costs. Employee salaries are relatively predictable and can be adjusted slowly via hiring freezes, attrition, or compensation changes. Compute costs, by contrast, can spike rapidly if product usage grows or if teams run more experiments than anticipated. In startups, venture capital typically funds both headcount and infrastructure, but some investors report portfolio companies spending more than 80% of their capital raised on compute resources, dwarfing wage bills.

The empirical picture: AI not yet cheaper than humans

These mechanisms are now visible in empirical work. One 2024 study from MIT examined where AI systems could perform visual tasks at or near human level, and then compared the cost of machine versus human performance. The researchers concluded that automation was economically viable in roughly 23% of roles where vision is central to the job, meaning that in about 77% of cases, it remained cheaper to pay humans than to deploy AI. The issue was not capability - the models could often do the tasks - but economics: hardware, energy, and infrastructure outweighed labour costs.

Macro-level data on AI expenditure reinforce this picture. Big technology firms alone have announced around 740 billion in capital expenditures in a single year, largely driven by AI data centre build-out, representing a 69% increase over the prior year. Other analysis suggests that AI-related expenditures could reach 5,2 trillion by 2030 under central scenarios, and as high as 7,9 trillion under more aggressive build-out, with data centre and IT equipment accounting for the bulk of this spending. Against those numbers, even generous headcounts of highly paid engineers, researchers, and product staff occupy a smaller share of the cost base than might be expected.

At the same time, external observers note that, despite the scale of these investments, there is still limited aggregate evidence of AI-driven productivity gains across the economy. Budget analysts and academic studies point out that, so far, there is no broad-based data showing AI displacing jobs at scale or dramatically boosting measured output per worker, even as tech sector layoffs have accelerated. That divergence between spending and measured productivity raises the question of whether the current wave of AI investment is front-loaded - infrastructure built ahead of realised returns - or whether some fraction will prove to be misallocated capital.

Strategic tension: build now, pay later

This leads to a central strategic tension. On one side is the argument that AI is an infrastructure revolution akin to electrification or the early internet, requiring enormous upfront capital before productivity gains show up in statistics. Firms that invest early, this view holds, will establish competitive moats via proprietary data, trained models, and optimised infrastructure. For such firms, the fact that compute currently costs more than labour may be beside the point; they are laying the foundations for future economies of scale, where the amortised cost per unit of AI output falls sharply as utilisation rises.

On the other side is a more sceptical view, which emphasises opportunity cost and path dependence. If AI systems are currently more expensive than humans for many tasks, especially outside narrow high-value niches, then replacing workers prematurely may destroy value rather than create it. Companies that chase AI for its own sake, without rigorous cost-benefit analysis, risk saddling themselves with fixed infrastructure commitments and ongoing compute bills that are difficult to roll back. This argument is bolstered by evidence that many firms do not fully understand their AI cost structures, focusing on headline model access fees or GPU rental rates rather than total cost per token or per workflow.

These perspectives are not mutually exclusive. It is possible that some organisations are overbuilding, while others are rationally investing in infrastructure that will underpin future competitive advantage. What unites them is an underlying bet: that the cost of compute will fall fast enough, and the productivity benefits of AI will rise high enough, to justify today's discrepancy between machine and labour costs.

Falling unit costs versus rising aggregate spend

Industry roadmaps and analyst reports forecast significant reductions in the unit economics of AI over the next few years. Hardware generations such as Nvidia's Blackwell architecture promise up to 30× gains in inference performance at similar power budgets compared with earlier accelerators. Software improvements - better compilers, quantisation techniques like FP4 precision, more efficient attention mechanisms, and mixture-of-experts routing - all work to reduce the computational load per unit of useful output. Gartner-style forecasts point to the cost of running inference for models with 1 trillion parameters dropping by more than 90% over a four-year horizon.

If realised, those gains could radically alter the relative cost of compute and labour. A workflow that is uneconomic today because each AI call is expensive might become viable once the cost per million tokens falls below some threshold. In that future, the remark that compute costs more than employees would be overtaken by a new reality in which compute is cheap enough that the main question becomes how to reconfigure organisations to exploit it.

However, even as unit costs fall, aggregate spending may still rise. The classic rebound effect applies: cheaper computation tends to expand the range of feasible applications and increase total usage. Organisations that pay less per token may respond by embedding AI into more products, workflows, and services, multiplying the total number of tokens generated. If spending on AI grows from 5,2 trillion to 7,9 trillion by 2030, a large part of that increase will likely reflect expanded scope, not just higher prices. The result is a paradox: individually, each unit of compute may become cheaper and more efficient; collectively, compute may remain the largest single line item for AI-heavy firms.

Employment, displacement, and the cost paradox

The fact that compute can cost more than employees complicates narratives about AI-driven job displacement. From a firm's perspective, automation only makes sense when the total cost of designing, training, deploying, and maintaining an AI system is lower than - or at least justifiable relative to - the wage cost, management overhead, and performance variability of human workers. When AI is more expensive, substituting capital for labour purely on cost grounds is irrational.

This does not mean AI will not change employment. Instead, it suggests a more nuanced pattern of complementarity and selective substitution. In domains where human labour is extremely costly or scarce, such as high-end legal services, algorithmic trading, or complex simulation, even expensive compute may be a bargain. In mass-market customer service or routine back-office work, by contrast, the current cost structure favours augmenting workers with AI tools rather than replacing them outright. The MIT study's finding that only around 23% of vision-centric jobs are currently economically automatable illustrates how narrow the immediate substitution window may be.

The paradox is that headlines about companies spending more on AI infrastructure than on salaries coexist with data showing limited net job losses attributable directly to AI. Part of the explanation is temporal: firms are investing ahead of adoption, building capabilities before fully restructuring their workforces. Another part is strategic: some firms see AI as a growth tool rather than a cost-cutting tool, aiming to enable new products and services rather than simply replacing existing staff.

Pricing models and hidden subsidies

One reason many users and even corporate customers underestimate the true cost of AI compute is the structure of pricing models. A significant portion of the market relies on flat subscription charges or simple usage tiers that do not map transparently to underlying infrastructure costs. For light users, this can be attractive: they pay a fixed fee and rarely hit limits. For heavy users, however, the provider may be effectively subsidising usage, especially if the subscription was priced before providers had a clear picture of real-world load.

Reports of AI software fees rising by 20% to 37% over a year indicate providers adjusting to this reality. As cost pressures mount - from energy, hardware procurement, and the need to recoup massive capital investments - providers are likely to shift toward more granular, usage-based pricing that reflects cost per token or per request more accurately. When that occurs, more enterprises will discover that their apparent labour savings are offset by higher-than-expected compute bills.

This evolution will bring AI closer to other utilities: electricity, cloud storage, and bandwidth. In each case, users ultimately pay for the marginal resource consumed, and efficient usage becomes a competitive advantage. Just as cloud-native firms learned to optimise workloads to reduce compute and storage charges, AI-native firms will need to optimise prompts, context lengths, caching strategies, and model architectures to minimise unnecessary tokens and reduce idle capacity.

Why the remark matters

The observation that compute can be more costly than employees is important for several constituencies.

For executives and boards, it underscores the need for rigorous capital allocation in AI initiatives. Projects should be evaluated not only on potential strategic upside but also on fully burdened compute economics: total cost per workflow, sensitivity to usage spikes, and exposure to future hardware and energy price shifts. In an environment where tech companies have already announced hundreds of billions in AI-related capital expenditure, misjudging these factors can have material consequences for profitability and competitive positioning.

For investors, the remark acts as a reminder that not all AI spending is value-creating. A significant share may be speculative or defensive, driven by fear of missing out rather than clear use cases. Distinguishing between firms that translate compute into durable revenue and those that merely accumulate expensive infrastructure will be a central task over the next decade.

For policymakers and labour economists, recognising the current cost structure is essential when interpreting forecasts of rapid, sweeping job automation. If AI is still more expensive than humans for the majority of tasks, then near-term labour market disruption is likely to be more contained and sector-specific than some narratives suggest. This does not eliminate the long-term risk of displacement as unit costs fall, but it introduces a window in which policy can focus on adaptation: training, re-skilling, and ensuring that productivity gains, when they arrive, are broadly shared.

Finally, for engineers and product teams, the remark is a design constraint. It implies that building AI systems is not just a problem of maximising accuracy or capability; it is also a problem of optimising for economic viability. Model selection, quantisation choices, caching, retrieval strategies, and system architecture all affect compute consumption. Teams that learn to treat tokens, GPU seconds, and watts as scarce resources, on par with developer time, will be better positioned to create sustainable AI products.

As AI infrastructure matures, the relative prices of compute and labour will continue to evolve. The present moment, in which a leading practitioner can credibly say that the compute bill dwarfs the wage bill, is a snapshot in a longer trajectory. Whether history records it as an early, capital-intensive stage on the way to widely affordable machine intelligence, or as evidence of an overcapitalised boom, will depend on how effectively organisations turn expensive silicon into genuine productivity.

"For my team, the cost of [AI] compute is far beyond the costs of the employees." - Quote: Bryan Catanzaro - Vice president of applied deep learning at Nvidia

‌

Quote: Dario Amodei - Founder and CEO, Anthropic

"Those who don't kind of see what's coming, who don't identify the moats they have, they're going to have a really hard time." - Dario Amodei - Founder and CEO, Anthropic

The practical problem is not simply that AI capability is improving quickly, but that the basis of competitive advantage is shifting faster than many organisations can recognise it. In a market where model quality, distribution, infrastructure, data access, and trust are all being repriced at once, the winners are likely to be those who can identify which advantages are durable before the rest of the field realises that the old ones have decayed.

That is the deeper force behind Dario Amodei's warning. It is not a casual remark about caution; it reflects a strategic view that many participants in the AI economy are mistaking visible momentum for defensible position. Anthropic's own public profile has made Amodei one of the clearest voices on the concentration of power in the AI era, and his concern appears to be that speed alone can create a false sense of security.

What a moat means in the AI context

A moat in the classic business sense is any structural feature that makes profits more resistant to competition. In AI, that can mean several different things at once: proprietary distribution, developer loyalty, enterprise integration, talent depth, compute access, safety credibility, regulatory readiness, or a feedback loop that improves the product faster than rivals can copy it. The difficulty is that AI moats are often unstable early on. A capability that looks like an enduring edge one year may be commoditised the next by better open models, cheaper inference, or a rival's stronger distribution channel.

That instability matters because many companies have been tempted to treat AI as a feature layer rather than a strategic reordering. But the recent market conversation has increasingly moved towards the idea that AI will not merely be sold into existing businesses; it will reorganise them from the inside. Commentary around the so-called AI rollup thesis argues that investors are buying labour-intensive businesses and rebuilding them around AI so that economics begin to resemble software rather than services. If that thesis proves correct, then the old source of value is not just under pressure; it is being redefined.

Why speed makes identification harder

Amodei's warning lands because the pace of improvement is itself part of the competitive landscape. He has said that cognitive ability in frontier systems can be doubling every four to 12 months, a pace that would make conventional strategic planning dangerously slow. When the underlying capability curve is that steep, the shelf life of an advantage shortens. What looks like a moat may actually be a temporary lead created by timing, capital, or first-mover publicity.

"Those who don’t kind of see what’s coming, who don’t identify the moats they have, they’re going to have a really hard time." - Quote: Dario Amodei - Founder and CEO, Anthropic

‌

Quote: Andrej Karpathy - Anthropic (Openai Founder, formerly head of Tesla AI)

"You can give [Claude Fable 5, the same underlying model as Mythos but with added safeguards] a lot more ambitious tasks than what you're used to, the model 'gets it' and it will just go, and it's never felt this tempting to stop looking at the code at all." - Andrej Karpathy - Anthropic (Openai Founder, formerly head of Tesla AI)

Modern software development is being pulled towards a regime where human programmers increasingly specify intent while machines decide the details. That shift is most visible not in abstract benchmarks but in the psychological moment when an expert developer realises they can hand over a far more ambitious task than before, watch an AI system autonomously decompose and implement it, and feel genuine temptation to stop inspecting every line it produces. For someone steeped in traditional notions of craftsmanship and code review, that temptation is both intoxicating and alarming.

The move from instructions to intent

The underlying transition is from imperative programming, where humans micromanage every step, to a declarative style where they specify success criteria and let an AI agent find a path. Historically, a senior engineer might spend hours designing architecture, writing scaffolding, and orchestrating tools; now, high-capability models can generate coherent multi-file projects, manage dependencies, restructure modules, and even propose test suites to validate their own work. In that world, the bottleneck shifts from typing speed or API recall to the clarity and completeness of the human specification.

This is what makes the ability to give a model substantially more ambitious tasks so significant. When an AI system can handle not just a function or a bug fix but an end-to-end feature, a migration, or a refactor across tens of files, the human role changes. The developer becomes more of a product and safety architect: deciding goals, constraints, and trade-offs, then auditing whether the agent met them. The quote speaks directly to this: work that once had to be decomposed into micro-prompts can now be expressed as a single high-level directive, with the model reliably filling in the operational gaps.

Why this particular endorsement matters

The significance of that shift is amplified by who is speaking. Andrej Karpathy is not a casual user experimenting with consumer tooling but a foundational figure in modern deep learning and applied AI. He was a founding member of OpenAI and later headed Tesla Autopilot, leading large teams building vision and planning systems for real-world safety-critical autonomy. He has also been one of the most visible educators in the field, teaching tens of thousands of practitioners how neural networks work and how to reason about their failure modes.

That background makes his sense of surprise at feeling "behind" as a programmer, and the ego hit of giving more of the work to AI, noteworthy. When someone with deep understanding of model limitations and training quirks reports that a code-focused model "gets it" on larger, more complex tasks, it is not naive enthusiasm but a statement grounded in long experience of what usually goes wrong. It suggests a step change in practical reliability, especially on extended coding sessions.

Claude Fable 5, Mythos and safeguards

The specific model involved, Claude Fable 5, sits in a deliberately structured product family. Anthropic describes Fable 5 as sharing the same underlying model as Claude Mythos, but with additional safeguards and alignment layers. In practice, that means the core capability for long, multi-step reasoning and coding is retained, while the system has tighter policies around potentially harmful outputs and more conservative behaviours in ambiguous domains. Mythos is aimed at frontier, less constrained exploration; Fable, at general-purpose work where safety, compliance, and predictability matter.

Karpathy himself characterises Fable 5 as a "major-version-bump-deserving step change", particularly on long and difficult tasks. Reports and demos around the release highlight stronger performance not only on standard coding benchmarks but on real-world development flows: navigating large repositories, performing multi-file edits, and maintaining context over extended sessions. The result is a system that feels less like an autocomplete gadget and more like a junior engineer who can hold a problem in mind over hours of work.

Crucially, the "added safeguards" do not just refer to refusal policies. They also encompass training and inference-time measures that make the model more robust to prompt injection, reduce hallucinations, and bias it toward verifiable operations like running tests or inspecting diffs instead of bluffing. That combination - high capability plus strong guardrails - is what makes handing over more ambitious tasks psychologically viable. The user is not simply trusting a stochastic parrot; they are interacting with a toolchain engineered for cautious autonomy.

Karpathy's journey into agentic coding

To understand the deeper significance of the quote, it helps to situate it within Karpathy's broader ideas about software. In recent years he has described a transition towards what he calls "Software 3.0" or "agentic coding". Earlier eras could be caricatured as follows: in Software 1.0, humans wrote explicit logic; in Software 2.0, humans trained models but still wrote the surrounding infrastructure; in Software 3.0, AI systems increasingly write, test, and maintain significant portions of the codebase themselves, guided by human intent and oversight.

Within that frame, he has promoted practices like "vibe coding", where a developer converses with an AI assistant, iteratively refining prompts and reading outputs rather than manually hand-crafting every function. The point is not laziness but bandwidth: by offloading boilerplate and low-level wiring, humans can spend more time on product thinking, architecture, and evaluation. Yet he has also been candid about the danger of "brain atrophy" if humans stop engaging deeply with technical substance and become mere prompt routers.

His move to Anthropic, announced publicly on X, is explicitly about pushing this paradigm further. He is joining the Claude pre-training team, with a mandate to build a sub-team focused on using Claude itself to accelerate pre-training research. That is, he is not only using AI to write ordinary application code but using AI agents to help design, run, and analyse the experiments that produce the next generation of AI models. Some observers describe this as laying the groundwork for recursive self-improvement, where systems contribute directly to their own advancement.

The temptation to stop reading the code

The most charged part of the quote is not the praise for task capability but the admission that "it's never felt this tempting to stop looking at the code at all". That sentence crystallises a new risk frontier. Up to now, cautious practitioners have recommended heavy human inspection of AI-generated code: checking logic, scanning for security flaws, reviewing for maintainability. Those practices are time-consuming, but they preserve a culture where humans remain accountable for what ships.

As model quality improves, the marginal benefit of reading every line may appear to shrink. When the output often looks clean, idiomatic, and passes tests, the pressure to skim rather than scrutinise grows stronger. That temptation is exacerbated by business incentives. If an AI agent can implement a feature in 1 hour that would take a human 10 hours, organisations will be driven to capture that 9-hour gain, especially under competitive pressure. Deep review may be cast as optional overhead rather than mandatory safety.

This dynamic is not unique to coding. In aviation, pilots became less hands-on as autopilots grew more reliable, leading to worries about skill decay; yet in rare edge cases, human intervention remained vital. The same pattern looms in software: as AI-generated code becomes the default, there is a risk that fewer engineers retain the ability to reason from first principles when the system fails in a novel way.

Strategic and technological tension

The tension, then, is between speed and scrutiny, between trusting an increasingly competent agent and insisting on human understanding. On one side lies the productivity windfall: AI can manage dependency graphs, propose architecture refactors, and generate regression tests at a pace that would overwhelm any human team. On the other side lies epistemic opacity: large language models generate code via pattern completion, not explicit formal derivation, and even when the code passes tests, it may encode subtle bugs, non-obvious security weaknesses, or performance pathologies.

In safety-conscious organisations, this tension will likely be addressed with layered controls. For critical systems, one can imagine a workflow where an AI agent proposes changes, another independent agent attempts to break or exploit them, and human reviewers arbitrate. For less critical contexts, teams may accept a higher degree of automated autonomy, using telemetry and canary deployments to catch regressions in production.

Technologically, the quote points to a world where coding models are integrated deeply into development environments as persistent agents rather than stateless assistants. In that world, the system remembers project history, tracks unresolved issues, and maintains a map of the codebase. This is already visible in the way tools like Claude Code are embedded into full IDE surfaces where generation, testing, and git operations happen in one loop. The practical question is not whether such agents will exist but what guardrails and observability layers they will carry.

Anthropic's safety-first positioning

Anthropic has invested heavily in a brand and research agenda built around "constitutional" AI and safety. That approach involves specifying normative guidelines that models are trained to follow, and then auditing behaviour against those guidelines. For coding, that can be extended into concrete policies: refuse to write insecure patterns, prefer constant-time implementations in cryptographic contexts, suggest mitigation when encountering user-supplied input.

Fable 5's positioning as "Mythos but safe" reflects a belief that potential harms can be reduced without sacrificing too much capability. Karpathy's enthusiasm suggests that, at least in his workflows, the safeguards are not experienced as a hindrance but as a trust multiplier. He can instruct the model more ambitiously precisely because he expects it to act conservatively when it encounters sensitive operations and to avoid reckless actions like deleting large portions of a repository without confirmation.

Yet there remains an unresolved debate over how far safety techniques can go in mitigating risks that emerge from sheer scale and generality. Even a strongly aligned model may generate exploitable code when given innocuous prompts, simply because the space of correct-looking but vulnerable implementations is vast. Critics argue that this cannot be fully addressed by refusal policies and that deep formal methods or language-level safety guarantees will be necessary. The temptation to "stop looking at the code" must be evaluated against that backdrop.

Debates and objections

There are at least four major lines of objection or concern surrounding the world implied by the quote.

First, there is the professional identity and labour market concern. If AI tools can handle an increasing share of coding, especially the more routine or boilerplate-heavy parts, junior roles may shrink, making it harder for new developers to gain experience. Karpathy himself acknowledges a crossroads between "brain atrophy" and skill evolution, where humans must decide whether to re-skill towards higher-level system design and evaluation or risk being displaced.

Second, there is the epistemic reliability concern. Benchmarks can show impressive averages, but systems are still brittle on rare edge cases, poorly specified tasks, or ambiguous requirements. A sense that "the model gets it" can mask the fact that its understanding is statistical, not semantic in a human sense. Critics worry that as trust grows, organisations will deploy AI-generated code beyond domains where its failure modes are well characterised.

Third, there is the self-referential risk of using AI to build the next generation of AI. The work Karpathy is taking on at Anthropic involves using Claude to accelerate pre-training research itself, potentially moving towards recursive self-improvement. Enthusiasts argue that this is necessary to make progress at the current frontier, where experiments are too numerous and complex for purely human pipelines. Skeptics warn that errors, biases, or misalignments may be amplified if AI-driven research loops are not carefully constrained and audited.

Fourth, there is the cultural concern. Software engineering has long valued code readability not only for maintainability but as a vehicle for knowledge sharing. If more of the codebase is generated and fewer humans read it deeply, tacit knowledge may concentrate in the behaviour of models rather than in the minds of engineers. Some fear a loss of craftsmanship and a drift towards opaque systems even within a single organisation.

Why this moment matters

Despite these concerns, the practical direction of travel is clear. Developers are already wiring multiple frontier models into a single development surface, choosing per-task which to call, whether Claude, GPT, or others, based on performance and cost rather than vendor loyalty. Tools that bundle coding, testing, and version control into agentic workflows are proliferating. The quote captures a threshold where these tools no longer feel like experimental sidekicks but like the primary engine of implementation.

From a strategic perspective, this changes how organisations think about their software capability. Instead of asking how many engineers they can hire, they will ask how effectively they can orchestrate AI coding capacity: prompt libraries, evaluation harnesses, and safety procedures become as important as hiring pipelines. Companies that embrace this shift thoughtfully will invest in engineers who are excellent at specifying intent, designing tests, and auditing AI proposals - a different profile from traditional full-stack roles.

For individual developers, it poses a challenge and an invitation. The challenge is to resist the laziness of unexamined trust while also resisting nostalgia for a world where writing every line oneself was feasible. The invitation is to climb the abstraction ladder: to become better at defining product goals, at thinking in systems, at debugging not just functions but entire AI-assisted workflows.

Karpathy's experience with Claude Fable 5 illustrates that frontier models are now strong enough to make this shift emotionally palpable. When a veteran practitioner feels tempted to stop reading the code, that is not a signal to give up scrutiny, but it is evidence that the agent has crossed a qualitative threshold. The world of software will be shaped by how we respond to that feeling: whether by surrendering to it, ignoring it, or deliberately building new practices, tools, and norms that harness its power without abandoning responsibility.

"You can give [Claude Fable 5, the same underlying model as Mythos but with added safeguards] a lot more ambitious tasks than what you're used to, the model 'gets it' and it will just go, and it's never felt this tempting to stop looking at the code at all." - Quote: Andrej Karpathy - Anthropic (Openai Founder, formerly head of Tesla AI)

‌