Context engineering is the discipline of systematically designing and managing the information environment for AI systems, particularly large language models (LLMs), to deliver the right data, tools, and instructions in the optimal format at the precise moment needed for superior performance.1,3,5
Comprehensive Definition
Context engineering extends beyond traditional prompt engineering, which focuses on crafting individual instructions, by orchestrating comprehensive systems that integrate diverse elements into an LLM’s context window—the limited input space (measured in tokens) that the model processes during inference.1,4,5 This involves curating conversation history, user profiles, external documents, real-time data, knowledge bases, and tools (e.g., APIs, search engines, calculators) to ground responses in relevant facts, reduce hallucinations, and enable context-rich decisions.1,2,3
Key components include:
- Data sources and retrieval: Fetching and filtering tailored information from databases, sensors, or vector stores to match user intent.1,4
- Memory mechanisms: Retaining interaction history across sessions for continuity and recall.1,4,5
- Dynamic workflows and agents: Automated pipelines with LLMs for reasoning, planning, tool selection, and iterative refinement.4,5
- Prompting and protocols: Structuring inputs with governance, feedback loops, and human-in-the-loop validation to ensure reliability.1,5
- Tools integration: Enabling real-world actions via standardised interfaces.1,3,4
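The components above converge in a single assembly step: packing instructions, retrieved documents, and conversation history into one prompt that fits the model's context window. The following is a minimal sketch of that step; the function names, the token budget, and the four-characters-per-token heuristic are illustrative assumptions, not any specific framework's API.

```python
# Sketch of context assembly: combine a system instruction, retrieved
# documents, and conversation history into one prompt under a token budget.
# The 4-chars-per-token estimate is a crude stand-in for a real tokenizer.

def estimate_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token for English text.
    return len(text) // 4 + 1

def assemble_context(system: str, docs: list[str], history: list[str],
                     query: str, budget: int = 4096) -> str:
    parts = [system]
    used = estimate_tokens(system) + estimate_tokens(query)
    # Most recent history first, so older turns are dropped when budget is tight.
    for turn in reversed(history):
        cost = estimate_tokens(turn)
        if used + cost > budget:
            break
        parts.insert(1, turn)  # restores chronological order after the system msg
        used += cost
    # Then retrieved documents, highest-ranked first.
    for doc in docs:
        cost = estimate_tokens(doc)
        if used + cost > budget:
            break
        parts.append(f"[context] {doc}")
        used += cost
    parts.append(query)
    return "\n\n".join(parts)

prompt = assemble_context(
    system="You are a support assistant.",
    docs=["Refund policy: 30 days.", "Shipping: 2-5 business days."],
    history=["User: Hi", "Assistant: Hello!"],
    query="Can I return an item after three weeks?",
)
```

The key design choice is the drop order: when the budget is exhausted, the oldest history and lowest-ranked documents are sacrificed first, keeping the system instruction and current query intact.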
Gartner defines it as “designing and structuring the relevant data, workflows and environment so AI systems can understand intent, make better decisions and deliver contextual, enterprise-aligned outcomes—without relying on manual prompts.”1 In practice, context engineering treats the model as one component of an integrated application, addressing the brittleness that complex tasks such as code synthesis or enterprise analytics expose in prompt-only approaches.1
The Six Pillars of Context Engineering
As outlined in technical frameworks, these interdependent elements form the core architecture:4
- Agents: Orchestrate tasks, decisions, and tool usage.
- Query augmentation: Refine inputs for precision.
- Retrieval: Connect to external knowledge bases.
- Prompting: Guide model reasoning.
- Memory: Preserve history and state.
- Tools: Facilitate actions beyond generation.
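Of the six pillars, retrieval is the most mechanical to illustrate: embed the query, score it against stored documents, and return the top matches. The sketch below uses stdlib-only bag-of-words vectors and cosine similarity purely to show the shape of the pipeline; production systems use learned embeddings and a vector store, and all names here are assumptions of this example.

```python
# Toy illustration of the "retrieval" pillar: rank documents by cosine
# similarity of bag-of-words vectors (embed -> score -> select top-k).
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Bag-of-words "embedding": token -> count.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

docs = [
    "The refund policy allows returns within 30 days.",
    "Our office is closed on public holidays.",
    "Refunds are issued to the original payment method.",
]
top = retrieve("how do refunds work", docs)
```

Note how brittle the word-overlap version is: "refund" and "refunds" do not match, which is precisely why real retrieval pipelines use semantic embeddings rather than surface tokens.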
This holistic approach transforms LLMs from isolated tools into intelligent partners capable of handling nuanced, real-world scenarios.1,3
Best Related Strategy Theorist: Christian Szegedy
Christian Szegedy, a pioneering deep-learning researcher, is arguably the theorist most closely associated with the foundations of context engineering: his work on deep network architectures and, later, on grounding models in external formal knowledge anticipated the discipline’s central concern of supplying a model with the right information at inference time.1
Biography
Born in Hungary, Szegedy earned a PhD in applied mathematics from the University of Bonn before moving into industrial research and, ultimately, Google Research. There he co-authored several landmark deep-learning papers: “Intriguing properties of neural networks” (2013), which first described adversarial examples; “Going Deeper with Convolutions” (CVPR 2015), which introduced the Inception architecture and its multi-scale processing of contextual hierarchies in images; and “Batch Normalization” (with Sergey Ioffe, ICML 2015), a training technique now ubiquitous in deep networks.
The attention mechanism itself, and the Transformer architecture built on it, are credited to Vaswani et al.’s 2017 paper “Attention Is All You Need”, not to Szegedy. His own later research at Google turned to grounding models in external formal knowledge: neural theorem proving and, in “Autoformalization with Large Language Models” (2022), the automatic translation of informal mathematics into formal systems. That line of work treats what a model is given, not merely how it is prompted, as the decisive design question.
Relationship to Context Engineering
Attention mechanisms underpin context engineering by letting LLMs prioritise “the right information at the right time” within token limits, scaling from static prompts to dynamic systems with retrieval, memory, and tools.3,4,5 In agentic workflows, attention operates over evolving contexts (e.g., curated agent trajectories), as seen in Anthropic’s strategies.5 Szegedy’s research programme points the same way: grounding a model in external, verifiable knowledge rather than relying on its parameters alone, which is also the premise behind retrieval-augmented generation (RAG) frameworks such as those from Weaviate and LangChain.4,7 In that sense his work treats context as a first-class design element, anticipating the shift from prompt engineering to the systemic discipline now termed context engineering.1 Szegedy left Google in 2023 to become a founding member of xAI, where he continues to work on formally grounded, scalable AI.
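The attention mechanism discussed above can be sketched in a few lines. This is scaled dot-product attention (Vaswani et al., 2017) reduced to pure Python for illustration: each query produces a weighted mix of the value vectors, with weights given by softmax of the query-key dot products; the toy vectors below are invented for the example.

```python
# Scaled dot-product attention: weights = softmax(q . k / sqrt(d)),
# output = weighted sum of value vectors. This is the operation that lets
# an LLM dynamically emphasise the relevant parts of its context window.
import math

def softmax(xs: list[float]) -> list[float]:
    m = max(xs)                       # subtract max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(query: list[float], keys: list[list[float]],
              values: list[list[float]]) -> tuple[list[float], list[float]]:
    d = len(query)
    scores = [sum(qi * ki for qi, ki in zip(query, k)) / math.sqrt(d)
              for k in keys]
    weights = softmax(scores)
    output = [sum(w * v[i] for w, v in zip(weights, values))
              for i in range(len(values[0]))]
    return output, weights

# The query aligns with the first key, so the first value dominates the mix.
out, weights = attention(
    query=[1.0, 0.0],
    keys=[[1.0, 0.0], [0.0, 1.0]],
    values=[[10.0, 0.0], [0.0, 10.0]],
)
```

Context engineering works one level up from this operation: it decides which tokens are in the window at all, while attention decides how heavily each of them is weighted.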
References
1. https://intuitionlabs.ai/articles/what-is-context-engineering
2. https://ramp.com/blog/what-is-context-engineering
3. https://www.philschmid.de/context-engineering
4. https://weaviate.io/blog/context-engineering
5. https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents
6. https://www.llamaindex.ai/blog/context-engineering-what-it-is-and-techniques-to-consider
7. https://blog.langchain.com/context-engineering-for-agents/