Select Page

Global Advisors | Quantified Strategy Consulting

AI
Quote: Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

Quote: Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

“I do think countries all should invest in their own human capital, invest in partnerships and invest in their own technological stack as well as the business ecosystem… I think not investing in AI would be macroscopically the wrong thing to do.” – Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

The statement was delivered during a high-stakes panel discussion on artificial superintelligence, convened at the Future Investment Initiative in Riyadh, where nation-state leaders, technologists, and investors gathered to assess their strategic positioning in the emerging AI era. Her words strike at the heart of a dilemma facing governments worldwide: how to build national AI capabilities whilst avoiding the trap of isolationism, and why inaction would be economically and strategically untenable.

Context: The Geopolitical Stakes of AI Investment

The Historical Moment

Dr. Li’s statement comes at a critical juncture. By late 2024 and into 2025, artificial intelligence had transitioned from speculative technology to demonstrable economic driver. Estimates suggested AI could generate between $15 trillion and $20 trillion in economic value globally by 2030—a figure larger than the current gross domestic product of most nations. This windfall is not distributed evenly; rather, it concentrates among early movers with capital, infrastructure, and talent. The race is on, and the stakes are existential for national competitiveness, employment, and geopolitical influence.

In this landscape, a nation that fails to invest in AI capabilities risks profound economic displacement. Yet Dr. Li is equally clear: isolation is counterproductive. The most realistic path forward combines three pillars:

  • Human Capital: The talent to conceive, build, and deploy AI systems
  • Partnerships: Strategic alliances, particularly with leading technological ecosystems (the US hyperscalers, for instance)
  • Domestic Technological Infrastructure: The local research bases, venture capital, regulatory frameworks, and business ecosystems that enable sustained innovation

This is not a counsel of surrender to Silicon Valley hegemony, but rather a sophisticated argument about comparative advantage and integration within global technological networks.

Dr. Fei-Fei Li: The Person and Her Arc

Early Life and Foundational Values

Dr. Fei-Fei Li’s perspective is shaped by her personal trajectory. Born in Chengdu, China, she emigrated to the United States at age fifteen, settling in New Jersey where her parents ran a small business. This background infuses her thinking: she understands both the promise of technological mobility and the structural barriers that constrain developing economies. She obtained her undergraduate degree in physics from Princeton University in 1999, with high honours, before pursuing doctoral studies at the California Institute of Technology, where she worked across computer science, electrical engineering, and cognitive neuroscience, earning her PhD in 2005.

The ImageNet Revolution

In 2007, whilst at Princeton, Dr. Li embarked on a project that would reshape artificial intelligence. Observing that cognitive psychologist Irving Biederman estimated humans recognise approximately 30,000 object categories, Li conceived ImageNet: a massive, hierarchically organised visual database. Colleagues dismissed the scale as impractical. Undeterred, she led a team (including Princeton professors Jia Deng, Kai Li, and Wei Dong) that leveraged Amazon Mechanical Turk to label over 14 million images across 22,000 categories.

By 2009, ImageNet was published. More critically, the team created the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), an annual competition that invited researchers worldwide to develop algorithms for image classification. This contest became the crucible in which modern deep learning was forged. When Geoffrey Hinton’s group achieved a breakthrough using convolutional neural networks in 2012, winning the competition by a decisive margin, the deep learning revolution was catalysed. ImageNet is now recognised as one of the three foundational forces in the birth of modern AI.

What is instructive here is that Dr. Li’s contribution was not merely technical but infrastructural: she created a shared resource that democratised AI research globally. Academic groups from universities across continents—not just Silicon Valley—could compete on equal footing. This sensibility—that progress depends on enabling distributed talent—runs through her subsequent work.

Career Architecture and Strategic Leadership

Following her Princeton years, Dr. Li joined Stanford University in 2009, eventually becoming the Sequoia Capital Professor of Computer Science—a title of singular prestige. From 2013 to 2018, she directed Stanford’s Artificial Intelligence Lab (SAIL), one of the world’s premier research institutes. Her publications exceed 400 papers in top-tier venues, and she remains one of the most cited computer scientists of her generation.

During a sabbatical from Stanford (January 2017 to September 2018), Dr. Li served as Vice President and Chief Scientist of AI/ML at Google Cloud. Her mandate was to democratise AI technology, lowering barriers for businesses and developers—work that included advancing products like AutoML, which enabled organisations without deep AI expertise to deploy machine learning systems.

Upon returning to Stanford in 2019, she became the founding co-director of the Stanford Institute for Human-Centered Artificial Intelligence (HAI), an explicitly multidisciplinary initiative spanning computer science, social sciences, humanities, law, and medicine—all united by the conviction that AI must serve human flourishing, not vice versa.

Current Work and World Labs

Most recently, Dr. Li co-founded and serves as chief executive officer of World Labs, an AI company focused on spatial intelligence and generative world models. This venture extends her intellectual agenda: if large language models learn patterns over text, world models learn patterns over 3D environments, enabling machines to understand, simulate, and reason about physical and virtual spaces. For robotics, healthcare simulation, autonomous systems, and countless other domains, this represents the next frontier.

Recognition and Influence

Her standing is reflected in numerous accolades: election to the National Academy of Engineering, the National Academy of Medicine (2020), and the American Academy of Arts and Sciences (2021); the Intel Lifetime Achievement Innovation Award in 2023; and inclusion in Time magazine’s 100 Most Influential People in AI. She is colloquially known as the “Godmother of AI.” In 2023, she published a memoir, The Worlds I See: Curiosity, Exploration and Discovery at the Dawn of AI, which chronicles her personal journey and intellectual evolution.

Leading Theorists and Strategic Thinkers: The Landscape of AI and National Strategy

The backdrop to Dr. Li’s statement includes several strands of thought about technology, development, and national strategy:

Economic and Technological Diffusion

  • Erik Brynjolfsson and Andrew McAfee (The Second Machine Age, Machine Platform Crowd): These MIT researchers have articulated how technological revolutions create winners and losers, and that policy choices—not technology alone—determine whether gains are broadly shared. They underscore that without intentional intervention, automation and AI tend to concentrate wealth and opportunity.
  • Dani Rodrik (Harvard economist): Rodrik’s work on “premature demonetarisation” and structural transformation highlights the risks faced by developing economies when technological progress accelerates faster than institutions can adapt. His analysis supports Dr. Li’s argument: countries must actively build capacity or risk being left behind.
  • Mariana Mazzucato (University College London): Mazzucato’s research on the entrepreneurial state emphasises that breakthrough innovations—including AI—depend on public investment in foundational research, education, and infrastructure. Her work buttresses the case for public and private sector partnership.

Artificial Intelligence and Cognition

  • Geoffrey Hinton, Yann LeCun, and Yoshua Bengio: The triumvirate of deep learning pioneers recognised that neural networks could scale to superhuman performance in perception and pattern recognition, yet have increasingly stressed that current approaches may be insufficient for general intelligence. Their candour about limitations supports a measured, long-term investment view.
  • Stuart Russell (UC Berkeley): Russell has been a prominent voice calling for AI safety and governance frameworks to accompany capability development. His framing aligns with Dr. Li’s insistence that human-centred values must guide AI research and deployment.

Geopolitics and Technology Competition

  • Michael Mazarr (RAND Corporation): Mazarr and colleagues have analysed great-power competition in emerging technologies, emphasising that diffusion of capability is inevitable but the pace and terms of diffusion are contestable. Nations that invest in talent pipelines and partnerships will sustain influence; those that isolate will atrophy.
  • Kai-Fu Lee: The Taiwanese-American venture capitalist and author (AI Superpowers) has articulated how the US and China are in a competitive race, but also how smaller nations and regions can position themselves through strategic partnerships and focus on applied AI problems relevant to their economies.
  • Eric Schmidt (former Google CEO): Schmidt, who participated in the same FII panel as Dr. Li, has emphasised that geopolitical advantage flows to nations with capital markets, advanced chip fabrication (such as Taiwan’s TSMC), and deep talent pools. Yet he has also highlighted pathways for other nations to benefit through partnerships and focused investment in particular domains.

Human-Centred Technology and Inclusive Growth

  • Timnit Gebru and Joy Buolamwini: These AI ethics researchers have exposed how AI systems can perpetuate bias and harm marginalised communities. Their work reinforces Dr. Li’s emphasis on human-centred design and inclusive governance. For developing nations, this implies that AI investment must account for local contexts, values, and risks of exclusion.
  • Turing Award recipients and foundational figures (such as Barbara Liskov on systems reliability, and Leslie Valiant on learning theory): Their sustained emphasis on rigour, safety, and verifiability underpins the argument that sustainable AI development requires not just speed but also deep technical foundations—something that human capital investment cultivates.

Development Economics and Technology Transfer

  • Paul Romer (Nobel laureate): Romer’s work on endogenous growth emphasises that ideas and innovation are the drivers of long-term prosperity. For developing nations, this implies that investment in research capacity, education, and institutional learning—not merely adopting foreign technologies—is essential.
  • Ha-Joon Chang: The heterodox development economist has critiqued narratives of “leapfrogging” technology. His argument suggests that nations building indigenous technological ecosystems—through domestic investment in research, venture capital, and entrepreneurship—are more resilient and capable of adapting innovations to local needs.

The Three Pillars: An Unpacking

Dr. Li’s framework is sophisticated precisely because it avoids two traps: technological nationalism (the fantasy that any nation can independently build world-leading AI from scratch) and technological fatalism (the resignation that small and medium-sized economies cannot compete).

Human Capital

The most portable, scalable asset a nation can develop is talent. This encompasses:

  • Education pipelines: From primary through tertiary education, with emphasis on mathematics, computer science, and critical thinking
  • Research institutions: Universities, national laboratories, and research councils capable of contributing to fundamental and applied AI knowledge
  • Retention and diaspora engagement: Policies to keep talented individuals from emigrating, and mechanisms to attract expatriate expertise
  • Diversity and inclusion: As Dr. Li has emphasised through her co-founding of AI4ALL (a nonprofit working to increase diversity in AI), innovation benefits from diverse perspectives and draws from broader talent pools

Partnerships

Rather than isolating, Dr. Li advocates for strategic alignment:

  • North-South partnerships: Developed nations’ hyperscalers and technology firms partnering with developing economies to establish data centres, training programmes, and applied research initiatives. Saudi Arabia and the UAE have pursued this model successfully
  • South-South cooperation: Peer learning and knowledge exchange among developing nations facing similar challenges
  • Academic and research collaborations: Open-source tools, shared benchmarks (as exemplified by ImageNet), and collaborative research that diffuse capability globally
  • Technology licensing and transfer agreements: Mechanisms by which developing nations can access cutting-edge tools and methods at affordable terms

Technological Stack and Ecosystem

A nation cannot simply purchase AI capability; it must develop home-grown institutional and commercial ecosystems:

  • Open-source communities: Participation in and contribution to open-source AI frameworks (PyTorch, TensorFlow, Hugging Face) builds local expertise and reduces dependency on proprietary systems
  • Venture capital and startup ecosystems: Policies fostering entrepreneurship in AI applications suited to local economies (agriculture, healthcare, manufacturing)
  • Regulatory frameworks: Balanced approaches to data governance, privacy, and AI safety that neither stifle innovation nor endanger citizens
  • Domain-specific applied AI: Rather than competing globally in large language models, nations can focus on AI applications addressing pressing local challenges: medical diagnostics, precision agriculture, supply-chain optimisation, or financial inclusion

Why Inaction Is “Macroscopically the Wrong Thing”

Dr. Li’s assertion that not investing in AI would be fundamentally mistaken rests on several converging arguments:

Economic Imperatives

AI is reshaping productivity across sectors. Nations that fail to develop internal expertise will find themselves dependent on foreign technology, unable to adapt solutions to local contexts, and vulnerable to supply disruptions or geopolitical pressure. The competitive advantage flows to early movers and sustained investors.

Employment and Social Cohesion

While AI will displace some jobs, it will create others—particularly for workers skilled in AI-adjacent fields. Nations that invest in reskilling and education can harness these transitions productively. Those that do not risk deepening inequality and social fracture.

Sovereignty and Resilience

Over-reliance on foreign AI systems limits national agency. Whether in healthcare, defence, finance, or public administration, critical systems should rest partly on domestic expertise and infrastructure to ensure resilience and alignment with national values.

Participation in Global Governance

As AI governance frameworks emerge—whether through the UN, regional bodies, or multilateral forums—nations with substantive technical expertise and domestic stakes will shape the rules. Those without will have rules imposed upon them.

The Tension and Its Resolution

Implicit in Dr. Li’s statement is a tension worth articulating: the world cannot support 200 competing AI superpowers, each building independent foundational models. Capital and talent are finite. Yet neither is the world a binary of a few AI leaders and many followers. The resolution lies in specialisation and integration:

  • A nation may not lead in large language models but excel in robotics for agriculture
  • It may not build chips but pioneer AI applications in healthcare or education
  • It may not host hyperscaler data centres but contribute essential research in AI safety or fairness
  • It will necessarily depend on global partnerships whilst developing sovereign capacity in domains critical to its citizens

This is neither capitulation nor isolation, but rather a mature acceptance of global interdependence coupled with strategic autonomy in domains of national importance.

Conclusion: The Compass for National Strategy

Dr. Li’s counsel, grounded in decades of research leadership, industrial experience, and global perspective, offers a compass for policymakers navigating the AI era. Investment in human capital, strategic partnerships, and home-grown technological ecosystems is not a luxury or academic exercise—it is fundamental to national competitiveness, prosperity, and agency. The alternative—treating AI as an external force to be passively absorbed—is indeed “macroscopically” mistaken, foreclosing decades of economic opportunity and surrendering the right to shape how this powerful technology serves human flourishing.

read more
Quote: Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

Quote: Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

“I think robotics has a long way to go… I think the ability, the dexterity of human-level manipulation is something we have to wait a lot longer to get. ” – Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

While AI has made dramatic progress in perception and reasoning, the physical manipulation and dexterity seen in human hands is far from being matched by machines.

Context of the Quote: The State and Limitations of Robotics

Dr. Li’s comment was made against the backdrop of accelerating investment and hype in artificial intelligence and robotics. While AI systems now master complex games, interpret medical scans, and facilitate large-scale automation, the field of robotics—especially with respect to dexterous manipulation and embodied interaction in the real world—remains restricted by hardware limitations, incomplete world models, and a lack of general adaptability.

  • Human dexterity involves fine motor control, real-time feedback, and a deep understanding of spatial and causal relationships. As Dr. Li emphasises, current robots struggle with tasks that are mundane for humans: folding laundry, pouring liquids, assembling diverse objects, or improvising repairs in unpredictable environments.
  • Even state-of-the-art robot arms and hands, controlled by advanced machine learning, manage select tasks in highly structured settings. Scaling to unconstrained, everyday environments has proven exceedingly difficult.
  • The launch of benchmarks such as the BEHAVIOR Challenge by Stanford, led by Dr. Li’s group, is a direct response to these limitations. The challenge simulates 1,000 everyday tasks across varied household environments, aiming to catalyse progress by publicly measuring how far the field is from truly general-purpose, dexterous robots.

Dr. Fei-Fei Li: Biography and Impact

Dr. Fei-Fei Li is a world-renowned authority in artificial intelligence, best known for foundational contributions to computer vision and the promotion of “human-centred AI”. Her career spans:

  • Academic Leadership: Professor of Computer Science at Stanford University; founding co-director of the Stanford Institute for Human-Centered AI (HAI).
  • ImageNet: Li created the ImageNet dataset, which transformed machine perception by enabling deep neural networks to outperform previous benchmarks and catalysed the modern AI revolution. This advance shaped progress in visual recognition, autonomous systems, and accessibility technologies.
  • Human-Centred Focus: Dr. Li is recognised for steering the field towards responsible, inclusive, and ethical AI, ensuring research aligns with societal needs and multidisciplinary perspectives.
  • Spatial Intelligence and Embodied AI: A core strand of her current work is in spatial intelligence—teaching machines to understand, reason about, and interact with the physical world with flexibility and safety. Her venture World Labs is pioneering this next frontier, aiming to bridge the gap from words to worlds.
  • Recognition: She was awarded the Queen Elizabeth Prize for Engineering in 2025—alongside fellow AI visionaries—honouring transformative contributions to computing, perception, and human-centred innovation.
  • Advocacy: Her advocacy spans diversity, education, and AI governance. She actively pushes for multidisciplinary, transparent approaches to technology that are supportive of human flourishing.

Theoretical Foundations and Leading Figures in Robotic Dexterity

The quest for human-level dexterity in machines draws on several fields—robotics, neuroscience, machine learning—and builds on the insights of leading theorists:

Name
Contributions
Relevance to Dexterity Problem
Rodney Brooks
Developed subsumption architecture for mobile robots; founded iRobot and Rethink Robotics
Emphasised embodied intelligence: physical interaction is central; argued autonomous robots must learn in the real world and adapt to uncertainty.
Yoshua Bengio, Geoffrey Hinton, Yann LeCun
Deep learning pioneers; applied neural networks to perception
Led the transformation in visual perception and sensorimotor learning; current work extends to robotic learning but recognises that perception alone is insufficient for dexterity.
Pieter Abbeel
Expert in reinforcement learning and robotics (UC Berkeley)
Advanced algorithms for robotic manipulation, learning from demonstration, and real-world transfer; candid about the gulf between lab demonstrations and robust household robots.
Jean Ponce, Dieter Fox, Ken Goldberg
Leading researchers in computer vision and robot manipulation
Developed grasping algorithms and modelling for manipulation, but acknowledge that even “solved” tasks in simulation often fail in the unpredictable real world.
Dr. Fei-Fei Li
Computer vision, spatial intelligence, embodied AI
Argues spatial understanding and physical intelligence are critical, and that world models must integrate perception, action, and context to approach human-level dexterity.
Demis Hassabis
DeepMind CEO; led breakthroughs in deep reinforcement learning
AlphaZero and related systems have shown narrow superhuman performance, but the physical control and manipulation necessary for robotics remains unsolved.
Chris Atkeson
Humanoid and soft robotics pioneer
Developed advanced dexterous hands and whole-body motion, but highlights the vast gap between the best machines and human adaptability.

The Challenge: Why Robotics Remains “a Long Way to Go”

  • Embodiment: Unlike pure software, robots operate under real-world physical constraints. Variability in object geometry, materials, lighting, and external force must be mastered for consistent human-like manipulation.
  • Generalisation: A robot that succeeds at one task often fails catastrophically at another, even if superficially similar. Human hands, with sensory feedback and innate flexibility, effortlessly adapt.
  • World Modelling: Spatial intelligence—anticipating the consequences of actions, integrating visual, tactile, and proprioceptive data—is still largely unsolved. As Dr. Li notes, machines must “understand, navigate, and interact” with complex, dynamic environments.
  • Benchmarks and Community Efforts: The BEHAVIOR Challenge and open-source simulators aim to provide transparent, rigorous measurement and accelerate community progress, but there is consensus that true general dexterity is likely years—if not decades—away.

Conclusion: Where Theory Meets Practice

While AI and robotics have delivered astonishing advances in perception, narrowly focused automation, and simulation, the dexterity, adaptability, and common-sense reasoning required for robust, human-level robotic manipulation remain an unsolved grand challenge. Dr. Fei-Fei Li’s work and leadership define the state of the art—and set the aspirational vision for the next wave: embodied, spatially conscious AI, built with a profound respect for the complexity of human life and capability. Those who follow in her footsteps, across academia and industry, measure their progress not against hype or isolated demonstrations, but against the demanding reality of everyday human tasks.

read more
Quote: Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

Quote: Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

“That ability that humans have, it’s the combination of creativity and abstraction. I do not see today’s AI or tomorrow’s AI being able to do that yet.” – Dr. Fei-Fei Li – Stanford Professor – world-renowned authority in artificial intelligence

Dr. Li’s statement came amid wide speculation about the near-term prospects for artificial general intelligence (AGI) and superintelligence. While current AI already exceeds human capacity in specific domains (such as language translation, memory recall, and vast-scale data analysis), Dr. Li draws a line at creative abstraction—the human ability to form new concepts and theories that radically change our understanding of the world. She underscores that, despite immense data and computational resources, AI does not demonstrate the generative leap that allowed Newton to discover classical mechanics or Einstein to reshape physics with relativity. Dr. Li insists that, absent fundamental conceptual breakthroughs, neither today’s nor tomorrow’s AI can replicate this synthesis of creativity and abstract reasoning.

About Dr. Fei-Fei Li

Dr. Fei-Fei Li holds the title of Sequoia Capital Professor of Computer Science at Stanford University and is a world-renowned authority in artificial intelligence, particularly in computer vision and human-centric AI. She is best known for creating ImageNet, the dataset that triggered the deep learning revolution in computer vision—a cornerstone of modern AI systems. As the founding co-director of Stanford’s Institute for Human-Centered Artificial Intelligence (HAI), Dr. Li has consistently championed the need for AI that advances, rather than diminishes, human dignity and agency. Her research, with over 400 scientific publications, has pioneered new frontiers in machine learning, neuroscience, and their intersection.

Her leadership extends beyond academia: she served as chief scientist of AI/ML at Google Cloud, sits on international boards, and is deeply engaged in policy, notably as a special adviser to the UN. Dr. Li is acclaimed for her advocacy in AI ethics and diversity, notably co-founding AI4ALL, a non-profit enabling broader participation in the AI field. Often described as the “godmother of AI,” she is an elected member of the US National Academy of Engineering and the National Academy of Medicine. Her personal journey—from emigrating from Chengdu, China, to supporting her parents’ small business in New Jersey, to her trailblazing career—is detailed in her acclaimed 2023 memoir, The Worlds I See.

Remarks on Creativity, Abstraction, and AI: Theoretical Roots

The distinction Li draws—between algorithmic pattern-matching and genuine creative abstraction—addresses a foundational question in AI: What constitutes intelligence, and is it replicable in machines? This theme resonates through the works of several canonical theorists:

  • Alan Turing (1912–1954): Regarded as the father of computer science, Turing posed the question of machine intelligence in his pivotal 1950 paper, “Computing Machinery and Intelligence”. He proposed what we call the Turing Test: if a machine could converse indistinguishably from a human, could it be deemed intelligent? Turing hinted at the limits but also the theoretical possibility of machine abstraction.
  • Herbert Simon and Allen Newell: Pioneers of early “symbolic AI”, Simon and Newell framed intelligence as symbol manipulation; their experiments (the Logic Theorist and General Problem Solver) made some progress in abstract reasoning but found creative leaps elusive.
  • Marvin Minsky (1927–2016): Co-founder of the MIT AI Lab, Minsky believed creativity could in principle be mechanised, but anticipated it would require complex architectures that integrate many types of knowledge. His work, especially The Society of Mind, remained vital but speculative.
  • John McCarthy (1927–2011): While he named the field “artificial intelligence” and developed the LISP programming language, McCarthy was cautious about claims of broad machine creativity, viewing abstraction as an open challenge.
  • Geoffrey Hinton, Yann LeCun, Yoshua Bengio: Fathers of deep learning, these researchers demonstrated that neural networks can match or surpass humans in perception and narrow problem-solving but have themselves highlighted the gap between statistical learning and the ingenuity seen in human discovery.
  • Nick Bostrom: In Superintelligence (2014), Bostrom analysed risks and trajectories for machine intelligence exceeding humans, but acknowledged that qualitative leaps in creativity—paradigm shifts, theory building—remain a core uncertainty.
  • Gary Marcus: An outspoken critic of current AI, Marcus argues that without genuine causal reasoning and abstract knowledge, current models (including the most advanced deep learning systems) are far from truly creative intelligence.

Synthesis and Current Debates

Across these traditions, a consistent theme emerges: while AI has achieved superhuman accuracy, speed, and recall in structured domains, genuine creativity—the ability to abstract from prior knowledge to new paradigms—is still uniquely human. Dr. Fei-Fei Li, by foregrounding this distinction, not only situates herself within this lineage but also aligns her ongoing research on “large world models” with an explicit goal: to design AI tools that augment—but do not seek to supplant—human creative reasoning and abstract thought.

Her caution, rooted in both technical expertise and a broader philosophical perspective, stands as a rare check on techno-optimism. It articulates the stakes: as machine intelligence accelerates, the need to centre human capabilities, dignity, and judgement—especially in creativity and abstraction—becomes not just prudent but essential for responsibly shaping our shared future.

read more
Quote: Dr Eric Schmidt – Ex-Google CEO

Quote: Dr Eric Schmidt – Ex-Google CEO

“I worry a lot about … Africa. And the reason is: how does Africa benefit from [AI]? There’s obviously some benefit of globalisation, better crop yields, and so forth. But without stable governments, strong universities, major industrial structures – which Africa, with some exceptions, lacks – it’s going to lag.” – Dr Eric Schmidt – Former Google CEO

Dr Eric Schmidt’s observation stems from his experience at the highest levels of the global technology sector and his acute awareness of both the promise and the precariousness of the coming AI age. His warning about Africa’s risk of lagging in AI adoption and benefit is rooted in today’s uneven technological landscape and long-standing structural challenges facing the continent.

About Dr Eric Schmidt

Dr Eric Schmidt is one of the most influential technology executives of the 21st century. As CEO of Google from 2001 to 2011, he oversaw Google’s transformation from a Silicon Valley start-up into a global technology leader. Schmidt provided the managerial and strategic backbone that enabled Google’s explosive growth, product diversification, and a culture of robust innovation. After Google, he continued as Executive Chairman and Technical Advisor through Google’s restructuring into Alphabet, before transitioning to philanthropic and strategic advisory work. Notably, Schmidt has played significant roles in US national technology strategy, chairing the US National Security Commission on Artificial Intelligence and founding the bipartisan Special Competitive Studies Project, which advises on the intersections of AI, security, and economic competitiveness.

With a background encompassing leading roles at Sun Microsystems, Novell, and advisory positions at Xerox PARC and Bell Labs, Schmidt’s career reflects deep immersion in technology and innovation. He is widely regarded as a strategic thinker on the global opportunities and risks of technology, regularly offering perspective on how AI, digital infrastructure, and national competitiveness are shaping the future economic order.

Context of the Quotation

Schmidt’s remark appeared during a high-level panel at the Future Investment Initiative (FII9), in conversation with Dr Fei-Fei Li of Stanford and Peter Diamandis. The discussion centred on “What Happens When Digital Superintelligence Arrives?” and explored the likely economic, social, and geopolitical consequences of rapid AI advancement.

In this context, Schmidt identified a core risk: that AI’s benefits will accrue unevenly across borders, amplifying existing inequalities. He emphasised that while powerful AI tools may drive exceptional economic value and efficiencies—potentially in the trillions of dollars—these gains are concentrated by network effects, investment, and infrastructure. Schmidt singled out Africa as particularly vulnerable: absent stable governance, strong research universities, or robust industrial platforms—critical prerequisites for technology absorption—Africa faces the prospect of deepening relative underdevelopment as the AI era accelerates. The comment reflects a broader worry in technology and policy circles: global digitisation is likely to amplify rather than repair structural divides unless deliberate action is taken.

Leading Theorists and Thinking on the Subject

The dynamics Schmidt describes are at the heart of an emerging literature on the “AI divide,” digital colonialism, and the geopolitics of AI. Prominent thinkers in these debates include:

  • Professor Fei-Fei Li
    A leading AI scientist, Dr Li has consistently framed AI’s potential as contingent on human-centred design and equitable access. She highlights the distinction between the democratisation of access (e.g., cheaper healthcare or education via AI) and actual shared prosperity—which hinges on local capacity, policy, and governance. Her work underlines that technical progress does not automatically result in inclusive benefit, validating Schmidt’s concerns.
  • Kate Crawford and Timnit Gebru
    Both have written extensively on the risks of algorithmic exclusion, surveillance, and the concentration of AI expertise within a handful of countries and firms. In particular, Crawford’s Atlas of AI and Gebru’s leadership in AI ethics foreground how global AI development mirrors deeper resource and power imbalances.
  • Nick Bostrom and Stuart Russell
    Their theoretical contributions address the broader existential and ethical challenges of artificial superintelligence, but they also underscore risks of centralised AI power—technically and economically.
  • Ndubuisi Ekekwe, Bitange Ndemo, and Nanjira Sambuli
    These African thought leaders and scholars examine how Africa can leapfrog in digital adoption but caution that profound barriers—structural, institutional, and educational—must be addressed for the continent to benefit from AI at scale.
  • Eric Schmidt himself has become a touchstone in policy/tech strategy circles, having co-chaired the US National Security Commission on Artificial Intelligence. The Commission’s reports warned of a bifurcated world where AI capabilities—and thus economic and security advantages—are ever more concentrated.

Structural Elements Behind the Quote

Schmidt’s remark draws attention to a convergence of factors:

  • Institutional robustness
    Long-term AI prosperity requires stable governments, responsive regulatory environments, and a track record of supporting investment and innovation. This is lacking in many, though not all, of Africa’s economies.
  • Strong universities and research ecosystems
    AI innovation is talent- and research-intensive. Weak university networks limit both the creation and absorption of advanced technologies.
  • Industrial and technological infrastructure
    A mature industrial base enables countries and companies to adapt AI for local benefit. The absence of such infrastructure often results in passive consumption of foreign technology, forgoing participation in value creation.
  • Network effects and tech realpolitik
    Advanced AI tools, data centres, and large-scale compute power are disproportionately located in a few advanced economies. The ability to partner with these “hyperscalers”—primarily in the US—shapes national advantage. Schmidt argues that regions which fail to make strategic investments or partnerships risk being left further behind.

Summary

Schmidt’s statement is not simply a technical observation but an acute geopolitical and developmental warning. It reflects current global realities where AI’s arrival promises vast rewards, but only for those with the foundational economic, political, and intellectual capital in place. For policy makers, investors, and researchers, the implication is clear: bridging the digital-structural gap requires not only technology transfer but also building resilient, adaptive institutions and talent pipelines that are locally grounded.

read more
Quote: Trevor McCourt – Extropic CTO

Quote: Trevor McCourt – Extropic CTO

“We need something like 10 terawatts in the next 20 years to make LLM systems truly useful to everyone… Nvidia would need to 100× output… You basically need to fill Nevada with solar panels to provide 10 terawatts of power, at a cost around the world’s GDP. Totally crazy.” – Trevor McCourt – Extropic CTO

Trevor McCourt, Chief Technology Officer and co-founder of Extropic, has emerged as a leading voice articulating a paradox at the heart of artificial intelligence advancement: the technology that promises to democratise intelligence across the planet may, in fact, be fundamentally unscalable using conventional infrastructure. His observation about the terawatt imperative captures this tension with stark clarity—a reality increasingly difficult to dismiss as speculative.

Who Trevor McCourt Is

McCourt brings a rare convergence of disciplinary expertise to his role. Trained in mechanical engineering at the University of Waterloo (graduating 2015) and holding advanced credentials from the Massachusetts Institute of Technology (2020), he combines rigorous physical intuition with deep software systems architecture. Prior to co-founding Extropic, McCourt worked as a Principal Software Engineer, establishing a track record of delivering infrastructure at scale: he designed microservices-based cloud platforms that improved deployment speed by 40% whilst reducing operational costs by 30%, co-invented a patented dynamic caching algorithm for distributed systems, and led open-source initiatives that garnered over 500 GitHub contributors.

This background—spanning mechanical systems, quantum computation, backend infrastructure, and data engineering—positions McCourt uniquely to diagnose what others in the AI space have overlooked: that energy is not merely a cost line item but a binding physical constraint on AI’s future deployment model.

Extropic, which McCourt co-founded alongside Guillaume Verdon (formerly a quantum technology lead at Alphabet’s X division), closed a $14.1 million Series Seed funding round in 2023, led by Kindred Ventures and backed by institutional investors including Buckley Ventures, HOF Capital, and OSS Capital. The company now stands at approximately 15 people distributed across integrated circuit design, statistical physics research, and machine learning—a lean team assembled to pursue what McCourt characterises as a paradigm shift in compute architecture.

The Quote in Strategic Context

McCourt’s assertion that “10 terawatts in the next 20 years” is required for universal LLM deployment, coupled with his observation that this would demand filling Nevada with solar panels at a cost approaching global GDP, represents far more than rhetorical flourish. It is the product of methodical back-of-the-envelope engineering calculation.

His reasoning unfolds as follows:

From Today’s Baseline to Mass Deployment:
A text-based assistant operating at today’s reasoning capability (approximating GPT-5-Pro performance) deployed to every person globally would consume roughly 20% of the current US electrical grid—approximately 100 gigawatts. This is not theoretical; McCourt derives this from first principles: transformer models consume roughly 2 × (parameters × tokens) floating-point operations; modern accelerators like Nvidia’s H100 operate at approximately 0.7 picojoules per FLOP; population-scale deployment implies continuous, always-on inference at scale.

Adding Modalities and Reasoning:
Upgrade that assistant to include video capability at just 1 frame per second (envisioning Meta-style augmented-reality glasses worn by billions), and the grid requirement multiplies by approximately 10×. Enhance the reasoning capability to match models working on the ARC AGI benchmark—problems of human-level reasoning difficulty—and the text assistant alone requires a 10× expansion: 5 terawatts. Push further to expert-level systems capable of solving International Mathematical Olympiad problems, and the requirement reaches 100× the current grid.

Economic Impossibility:
A single gigawatt data centre costs approximately $10 billion to construct. The infrastructure required for mass-market AI deployment rapidly enters the hundreds of trillions of dollars—approaching or exceeding global GDP. Nvidia’s current manufacturing capacity would itself require a 100-fold increase to support even McCourt’s more modest scenarios.

Physical Reality Check:
Over the past 75 years, US grid capacity has grown remarkably consistently—a nearly linear expansion. Sam Altman’s public commitment to building one gigawatt of data centre capacity per week alone would require 3–5× the historical rate of grid growth. Credible plans for mass-market AI acceleration push this requirement into the terawatt range over two decades—a rate of infrastructure expansion that is not merely economically daunting but potentially physically impossible given resource constraints, construction timelines, and raw materials availability.

McCourt’s conclusion: the energy path is not simply expensive; it is economically and physically untenable. The paradigm must change.

Intellectual Foundations: Leading Theorists in Energy-Efficient Computing and Probabilistic AI

Understanding McCourt’s position requires engagement with the broader intellectual landscape that has shaped thinking about computing’s physical limits and probabilistic approaches to machine learning.

Geoffrey Hinton—Pioneering Energy-Based Models and Probabilistic Foundations:
Few figures loom larger in the theoretical background to Extropic’s work than Geoffrey Hinton. Decades before the deep learning boom, Hinton developed foundational theory around Boltzmann machines and energy-based models (EBMs)—the conceptual framework that treats learning as the discovery and inference of complex probability distributions. His work posits that machine learning, at its essence, is about fitting a probability distribution to observed data and then sampling from it to generate new instances consistent with that distribution. Hinton’s recognition with the 2023 Nobel Prize in Physics for “foundational discoveries and inventions that enable machine learning with artificial neural networks” reflects the deep prescience of this probabilistic worldview. More than theoretical elegance, this framework points toward an alternative computational paradigm: rather than spending vast resources on deterministic matrix operations (the GPU model), a system optimised for efficient sampling from complex distributions would align computation with the statistical nature of intelligence itself.

Michael Frank—Physics of Reversible and Adiabatic Computing:
Michael Frank, a senior scientist now at Vaire (a near-zero-energy chip company), has spent decades at the intersection of physics and computing. His research programme, initiated at MIT in the 1990s and continued at the University of Florida, Florida State, and Sandia National Laboratories, focuses on reversible computing and adiabatic CMOS—techniques aimed at reducing the fundamental energy cost of information processing. Frank’s work addresses a deep truth: in conventional digital logic, information erasure is thermodynamically irreversible and expensive, dissipating energy as heat. By contrast, reversible computing minimises such erasure, thereby approaching theoretical energy limits set by physics rather than by engineering convention. Whilst Frank’s trajectory and Extropic’s diverge in architectural detail, both share the conviction that energy efficiency must be rooted in physical first principles, not merely in engineering optimisation of existing paradigms.

Yoshua Bengio and Chris Bishop—Probabilistic Learning Theory:
Leading researchers in deep generative modelling—including Bengio, Bishop, and others—have consistently advocated for probabilistic frameworks as foundational to machine learning. Their work on diffusion models, variational inference, and sampling-based approaches has legitimised the view that efficient inference is not about raw compute speed but about statistical appropriateness. This theoretical lineage underpins the algorithmic choices at Extropic: energy-based models and denoising thermodynamic models are not novel inventions but rather a return to first principles, informed by decades of probabilistic ML research.

Richard Feynman—Foundational Physics of Computing:
Though less directly cited in contemporary AI discourse, Feynman’s 1982 lectures on the physics of computation remain conceptually foundational. Feynman observed that computation’s energy cost is ultimately governed by physical law, not engineering ingenuity alone. His observations on reversibility and the thermodynamic cost of irreversible operations informed the entire reversible-computing movement and, by extension, contemporary efforts to align computation with physics rather than against it.

Contemporary Systems Thinkers (Sam Altman, Jensen Huang):
Counterintuitively, McCourt’s critique is sharpened by engagement with the visionary statements of industry leaders who have perhaps underestimated energy constraints. Altman’s commitment to building one gigawatt of data centre capacity per week, and Huang’s roadmaps for continued GPU scaling, have inadvertently validated McCourt’s concern: even the most optimistic industrial plans require infrastructure expansion at rates that collide with physical reality. McCourt uses their own projections as evidence for the necessity of paradigm change.

The Broader Strategic Narrative

McCourt’s remarks must be understood within a convergence of intellectual and practical pressures:

The Efficiency Plateau:
Digital logic efficiency, measured as energy per operation, has stalled. Transistor capacitance plateaued around the 10-nanometre node; operating voltage is thermodynamically bounded near 300 millivolts. Architectural optimisations (quantisation, sparsity, tensor cores) improve throughput but do not overcome these physical barriers. The era of “free lunch” efficiency gains from Moore’s Law miniaturisation has ended.

Model Complexity Trajectory:
Whilst small models have improved at fixed benchmarks, frontier AI systems—those solving novel, difficult problems—continue to demand exponentially more compute. AlphaGo required ~1 exaFLOP per game; AlphaCode required ~100 exaFLOPs per coding problem; the system solving International Mathematical Olympiad problems required ~100,000 exaFLOPs. Model miniaturisation is not offsetting capability ambitions.

Market Economics:
The AI market has attracted trillions in capital precisely because the economic potential is genuine and vast. Yet this same vastness creates the energy paradox: truly universal AI deployment would consume resources incompatible with global infrastructure and economics. The contradiction is not marginal; it is structural.

Extropic’s Alternative:
Extropic proposes to escape this local minimum through radical architectural redesign. Thermodynamic Sampling Units (TSUs)—circuits architected as arrays of probabilistic sampling cells rather than multiply-accumulate units—would natively perform the statistical operations that diffusion and generative AI models require. Early simulations suggest energy efficiency improvements of 10,000× on simple benchmarks compared to GPU-based approaches. Hybrid algorithms combining TSUs with compact neural networks on conventional hardware could deliver intermediate gains whilst establishing a pathway toward a fundamentally different compute paradigm.

Why This Matters Now

The quote’s urgency reflects a dawning recognition across technical and policy circles that energy is not a peripheral constraint but the central bottleneck determining AI’s future trajectory. The choice, as McCourt frames it, is stark: either invest in a radically new architecture, or accept that mass-market AI remains perpetually out of reach—a luxury good confined to the wealthy and powerful rather than a technology accessible to humanity.

This is not mere speculation or provocation. It is engineering analysis grounded in physics, economics, and historical precedent, articulated by someone with the technical depth to understand both the problem and the extraordinary difficulty of solving it.

read more
Quote: Trevor McCourt – Extropic CTO

Quote: Trevor McCourt – Extropic CTO

“If you upgrade that assistant to see video at 1 FPS – think Meta’s glasses… you’d need to roughly 10× the grid to accommodate that for everyone. If you upgrade the text assistant to reason at the level of models working on the ARC AGI benchmark… even just the text assistant would require around a 10× of today’s grid.” – Trevor McCourt – Extropic CTO

The quoted remark by Trevor McCourt, CTO of Extropic, underscores a crucial bottleneck in artificial intelligence scaling: energy consumption outpaces technological progress in compute efficiency, threatening the viability of universal, always-on AI. The quote translates hard technical extrapolation into plain language—projecting that if every person were to have a vision-capable assistant running at just 1 video frame per second, or if text models achieved a level of reasoning comparable to ARC AGI benchmarks, global energy infrastructure would need to multiply several times over, amounting to many terawatts—figures that quickly reach into economic and physical absurdity.

Backstory and Context of the Quote & Trevor McCourt

Trevor McCourt is the co-founder and Chief Technology Officer of Extropic, a pioneering company targeting the energy barrier limiting mass-market AI deployment. With multidisciplinary roots—a blend of mechanical engineering and quantum programming, honed at the University of Waterloo and Massachusetts Institute of Technology—McCourt contributed to projects at Google before moving to the hardware-software frontier. His leadership at Extropic is defined by a willingness to challenge orthodoxy and champion a first-principles, physics-driven approach to AI compute architecture.

The quote arises from a keynote on how present-day large language models and diffusion AI models are fundamentally energy-bound. McCourt’s analysis is rooted in practical engineering, economic realism, and deep technical awareness: the computational demands of state-of-the-art assistants vastly outstrip what today’s grid can provide if deployed at population scale. This is not merely an engineering or machine learning problem, but a macroeconomic and geopolitical dilemma.

Extropic proposes to address this impasse with Thermodynamic Sampling Units (TSUs)—a new silicon compute primitive designed to natively perform probabilistic inference, consuming orders of magnitude less power than GPU-based digital logic. Here, McCourt follows the direction set by energy-based probabilistic models and advances it both in hardware and algorithm.

McCourt’s career has been defined by innovation at the technical edge: microservices in cloud environments, patented improvements to dynamic caching in distributed systems, and research in scalable backend infrastructure. This breadth, from academic research to commercial deployment, enables his holistic critique of the GPU-centred AI paradigm, as well as his leadership at Extropic’s deep technology startup.

Leading Theorists & Influencers in the Subject

Several waves of theory and practice converge in McCourt’s and Extropic’s work:

1. Geoffrey Hinton (Energy-Based and Probabilistic Models):
Long before deep learning’s mainstream embrace, Hinton’s foundational work on Boltzmann machines and energy-based models explored the idea of learning and inference as sampling from complex probability distributions. These early probabilistic paradigms anticipated both the difficulties of scaling and the algorithmic challenges that underlie today’s generative models. Hinton’s recognition—including the Nobel Prize for work on energy-based models—cements his stature as a theorist whose footprints underpin Extropic’s approach.

2. Michael Frank (Reversible Computing)
Frank is a prominent physicist in reversible and adiabatic computing, having led major advances at MIT, Sandia National Laboratories, and others. His research investigates how the physics of computation can reduce the fundamental energy cost—directly relevant to Extropic’s mission. Frank’s focus on low-energy information processing provides a conceptual environment for approaches like TSUs to flourish.

3. Chris Bishop & Yoshua Bengio (Probabilistic Machine Learning):
Leaders like Bishop and Bengio have shaped the field’s probabilistic foundations, advocating both for deep generative models and for the practical co-design of hardware and algorithms. Their research has stressed the need to reconcile statistical efficiency with computational tractability—a tension at the core of Extropic’s narrative.

4. Alan Turing & John von Neumann (Foundations of Computing):
While not direct contributors to modern machine learning, the legacies of Turing and von Neumann persist in every conversation about alternative architectures and the physical limits of computation. The post-von Neumann and post-Turing trajectory, with a return to analogue, stochastic, or sampling-based circuitry, is directly echoed in Extropic’s work.

5. Recent Industry Visionaries (e.g., Sam Altman, Jensen Huang):
Contemporary leaders in the AI infrastructure space—such as Altman of OpenAI and Huang of Nvidia—have articulated the scale required for AGI and the daunting reality of terawatt-scale compute. Their business strategies rely on the assumption that improved digital hardware will be sufficient, a view McCourt contests with data and physical models.

Strategic & Scientific Context for the Field

  • Core problem: The energy that powers AI is reaching non-linear scaling—mass-market AI could consume a significant fraction or even multiples of the entire global grid if naively scaled with today’s architectures.
  • Physics bottlenecks: Improvements in digital logic are limited by physical constants: capacitance, voltage, and the energy required for irreversible computation. Digital logic has plateaued at the 10nm node.
  • Algorithmic evolution: Traditional deep learning is rooted in deterministic matrix computations, but the true statistical nature of intelligence calls for sampling from complex distributions—as foregrounded in Hinton’s work and now implemented in Extropic’s TSUs.
  • Paradigm shift: McCourt and contemporaries argue for a transition to native hardware–software co-design where the core computational primitive is no longer the multiply–accumulate (MAC) operation, but energy-efficient probabilistic sampling.

Summary Insight

Trevor McCourt anchors his cautionary prognosis for AI’s future on rigorous cross-disciplinary insights—from physical hardware limits to probabilistic learning theory. By combining his own engineering prowess with the legacy of foundational theorists and contemporary thinkers, McCourt’s perspective is not simply one of warning but also one of opportunity: a new generation of probabilistic, thermodynamically-inspired computers could rewrite the energy economics of artificial intelligence, making “AI for everyone” plausible—without grid-scale insanity.

read more
Quote: Alex Karp – Palantir CEO

Quote: Alex Karp – Palantir CEO

“The idea that chips and ontology is what you want to short is batsh*t crazy.” – Alex Karp -Palantir CEO

Alex Karp, co-founder and CEO of Palantir Technologies, delivered the now widely-circulated statement, “The idea that chips and ontology is what you want to short is batsh*t crazy,” in response to famed investor Michael Burry’s high-profile short positions against both Palantir and Nvidia. This sharp retort came at a time when Palantir, an enterprise software and artificial intelligence (AI) powerhouse, had just reported record earnings and was under intense media scrutiny for its meteoric stock rise and valuation.

Context of the Quote

The remark was made in early November 2025 during a CNBC interview, following public disclosures that Michael Burry—of “The Big Short” fame—had taken massive short positions in Palantir and Nvidia, two companies at the heart of the AI revolution. Burry’s move, reminiscent of his contrarian bets during the 2008 financial crisis, was interpreted by the market as both a challenge to the soaring “AI trade” and a critique of the underlying economics fueling the sector’s explosive growth.

Karp’s frustration was palpable: not only was Palantir producing what he described as “anomalous” financial results—outpacing virtually all competitors in growth, cash flow, and customer retention—but it was also emerging as the backbone of data-driven operations across government and industry. For Karp, Burry’s short bet went beyond traditional market scepticism; it targeted firms, products (“chips” and “ontology”—the foundational hardware for AI and the architecture for structuring knowledge), and business models proven to be both technically indispensable and commercially robust. Karp’s rejection of the “short chips and ontology” thesis underscores his belief in the enduring centrality of the technologies underpinning the modern AI stack.

Backstory and Profile: Alex Karp

Alex Karp stands out as one of Silicon Valley’s true iconoclasts:

  • Background and Education: Born in New York City in 1967, Karp holds a philosophy degree from Haverford College, a JD from Stanford, and a PhD in social theory from Goethe University Frankfurt, where he studied under and wrote about the influential philosopher Jürgen Habermas. This rare academic pedigree—blending law, philosophy, and critical theory—deeply informs both his contrarian mindset and his focus on the societal impact of technology.
  • Professional Arc: Before founding Palantir in 2004 with Peter Thiel and others, Karp had forged a career in finance, running the London-based Caedmon Group. At Palantir, he crafted a unique culture and business model, combining a wellness-oriented, sometimes spiritual corporate environment with the hard-nosed delivery of mission-critical systems for Western security, defence, and industry.
  • Leadership and Philosophy: Karp is known for his outspoken, unconventional leadership. Unafraid to challenge both Silicon Valley’s libertarian ethos and what he views as the groupthink of academic and financial “expert” classes, he publicly identifies as progressive—yet separates himself from establishment politics, remaining both a supporter of the US military and a critic of mainstream left and right ideologies. His style is at once brash and philosophical, combining deep skepticism of market orthodoxy with a strong belief in the capacity of technology to deliver real-world, not just notional, value.
  • Palantir’s Rise: Under Karp, Palantir grew from a niche contractor to one of the world’s most important data analytics and AI companies. Palantir’s products are deeply embedded in national security, commercial analytics, and industrial operations, making the company essential infrastructure in the rapidly evolving AI economy.

Theoretical Background: ‘Chips’ and ‘Ontology’

Karp’s phrase pairs two of the foundational concepts in modern AI and data-driven enterprise:

  • Chips: Here, “chips” refers specifically to advanced semiconductors (such as Nvidia’s GPUs) that provide the computational horsepower essential for training and deploying cutting-edge machine learning models. The AI revolution is inseparable from advances in chip design, leading to historic demand for high-performance hardware.
  • Ontology: In computer and information science, “ontology” describes the formal structuring and categorising of knowledge—making data comprehensible, searchable, and actionable by algorithms. Robust ontologies enable organisations to unify disparate data sources, automate analytical reasoning, and achieve the “second order” efficiencies of AI at scale.

Leading theorists in the domain of ontology and AI include:

  • John McCarthy: A founder of artificial intelligence, McCarthy’s foundational work on formal logic and semantics laid groundwork for modern ontological structures in AI.
  • Tim Berners-Lee: Creator of the World Wide Web, Berners-Lee developed the Semantic Web, championing knowledge structuring via ontologies—enabling data to be machine-readable and all but indispensable for AI’s next leap.
  • Thomas Gruber: Known for his widely cited definition of ontology in AI as “a specification of a conceptualisation,” Gruber’s research shaped the field’s approach to standardising knowledge representations for complex applications.

In the chip space, the pioneering work of:

  • Jensen Huang: CEO and co-founder of Nvidia, drove the company’s transformation from graphics to AI acceleration, cementing the centrality of chips as the hardware substrate for everything from generative AI to advanced analytics.
  • Gordon Moore and Robert Noyce: Their early explorations in semiconductor fabrication set the stage for the exponential hardware progress that enabled the modern AI era.

Insightful Context for the Modern Market Debate

The “chips and ontology” remark reflects a deep divide in contemporary technology investing:

  • On one side, sceptics like Burry see signs of speculative excess, reminiscent of prior bubbles, and bet against companies with high valuations—even when those companies dominate core technologies fundamental to AI.
  • On the other, leaders like Karp argue that while the broad “AI trade” risks pockets of overvaluation, the engine—the computational hardware (chips) and data-structuring logic (ontology)—are not just durable, but irreplaceable in the digital economy.

With Palantir and Nvidia at the centre of the current AI-driven transformation, Karp’s comment captures not just a rebuttal to market short-termism, but a broader endorsement of the foundational technologies that define the coming decade. The value of “chips and ontology” is, in Karp’s eyes, anchored not in market narrative but in empirical results and business necessity—a perspective rooted in a unique synthesis of philosophy, technology, and radical pragmatism.

read more
Quote: David Solomon – Goldman Sachs CEO

Quote: David Solomon – Goldman Sachs CEO

“Generally speaking people hate change. It’s human nature. But change is super important. It’s inevitable. In fact, on my desk in my office I have a little plaque that says ‘Change or die.’ As a business leader, one of the perspectives you have to have is that you’ve got to constantly evolve and change.” – David Solomon – Goldman Sachs CEO

The quoted insight comes from David M. Solomon, Chief Executive Officer and Chairman of Goldman Sachs, a role he has held since 2018. It was delivered during a high-profile interview at The Economic Club of Washington, D.C., 30 October 2025, as Solomon reflected on the necessity of adaptability both personally and as a leader within a globally significant financial institution.

“We have very smart people, and we can put these [AI] tools in their hands to make them more productive… By using AI to reimagine processes, we can create operating efficiencies that give us a scaled opportunity to reinvest in growth.” – David Solomon – Goldman Sachs CEO

David Solomon, Chairman and CEO of Goldman Sachs, delivered the quoted remarks during an interview at the HKMA Global Financial Leaders’ Investment Summit on 4 November 2025, articulating Goldman’s strategic approach to integrating artificial intelligence across its global franchise. His comments reflect both personal experience and institutional direction: leveraging new technology to drive productivity, reimagine workflows, and reinvest operational gains in sustainable growth, rather than pursuing simplistic headcount reductions or technological novelty for its own sake.

Backstory and Context of the Quote

David Solomon’s statement arises from Goldman Sachs’ current transformation—“Goldman Sachs 3.0”—centred on AI-driven process re-engineering. Rather than employing AI simply as a cost-cutting device, Solomon underscores its strategic role as an enabler for “very smart people” to magnify their productivity and impact. This perspective draws on his forty-year career in finance, where successive waves of technological disruption (from Lotus 1-2-3 spreadsheets to cloud computing) have consistently shifted how talent is leveraged, but have not diminished its central value.

The immediate business context is one of intense change: regulatory uncertainty in cross-border transactions, rebounding capital flows into China post-geopolitical tension, and a high backlog of M&A activity, particularly for large-cap US transactions. In this environment, efficiency gains from AI allow frontline teams to refocus on advisory, origination, and growth while adjusting operational models at a rapid pace. Solomon’s leadership style—pragmatic, unsentimental, and data-driven—favours process optimisation, open collaboration, and the breakdown of legacy silos.

About David Solomon

Background:

  • Born in Hartsdale, New York, in 1962; educated at Hamilton College with a BA in political science, then entered banking.
  • Career progression: Held senior roles at Irving Trust, Drexel Burnham, Bear Stearns; joined Goldman Sachs in 1999 as partner, eventually leading the Financing Group and serving as co-head of the Investment Banking Division for a decade.
  • Appointed President and COO in 2017, then CEO in October 2018 and Chairman in January 2019, succeeding Lloyd Blankfein.
  • Brought a reputation for transformative leadership, advocating modernisation, flattening hierarchies, and integrating technology across every aspect of the firm’s operations.

Leadership and Culture:

  • Solomon is credited with pushing through “One Goldman Sachs,” breaking down internal silos and incentivising cross-disciplinary collaboration.
  • He has modernised core HR and management practices: implemented real-time performance reviews, loosened dress codes, and raised compensation for programmers.
  • Personal interests—such as his sideline as DJ D-Sol—underscore his willingness to defy convention and challenge the insularity of Wall Street leadership.

Institutional Impact:

  • Under his stewardship, Goldman has accelerated its pivot to technology—automating trading operations, consolidating platforms, and committing substantial resources to digital transformation.
  • Notably, the current “GS 3.0” agenda focuses on automating six major workflows to direct freed capacity into growth, consistent with a multi-decade productivity trend.

Leading Theorists and Intellectual Lineage of AI-Driven Productivity in Business

Solomon’s vision is shaped and echoed by several foundational theorists in economics, management science, and artificial intelligence:

1. Clayton Christensen

  • Theory: Disruptive Innovation—frames how technological change transforms industries not through substitution but by enabling new business models and process efficiencies.
  • Relevance: Goldman Sachs’ approach to using AI to reimagine workflows and create new capabilities closely mirrors Christensen’s insights on sustaining versus disruptive innovation.

2. Erik Brynjolfsson & Andrew McAfee

  • Theory: Race Against the Machine, The Second Machine Age—chronicled how digital automation augments human productivity and reconfigures the labour market, not just replacing jobs but reshaping roles and enhancing output.
  • Relevance: Solomon’s argument for enabling smart people with better tools directly draws on Brynjolfsson’s proposition that the best organisational outcomes occur when firms successfully combine human and machine intelligence.

3. Michael Porter

  • Theory: Competitive Advantage—emphasised how operational efficiency and information advantage underpin sustained industry leadership.
  • Relevance: Porter’s ideas connect to Goldman’s agenda by showing that AI integration is not just about cost, but about improving information processing, strategic agility, and client service.

4. Herbert Simon

  • Theory: Bounded Rationality and Decision Support Systems—pioneered the concept that decision-making can be dramatically improved by systems that extend the cognitive capabilities of professionals.
  • Relevance: Solomon’s claim that AI puts better tools in the hands of talented staff traces its lineage to Simon’s vision of computers as skilled assistants, vital to complex modern organisations.

5. Geoffrey Hinton, Yann LeCun, Yoshua Bengio

  • Theory: Deep Learning—established the contemporary AI revolution underpinning business process automation, language models, and data analysis at enterprise scale.
  • Relevance: Without the breakthroughs made by these theorists, AI’s current generation—capable of augmenting financial analysis, risk modelling, and operational management—could not be applied as Solomon describes.

 

Synthesis and Strategic Implications

Solomon’s quote epitomises the intersection of pragmatic executive leadership and theoretical insight. His advocacy for AI-integrated productivity reinforces a management consensus: sustainable competitive advantage hinges not just on technology, but on empowering skilled individuals to unlock new modes of value creation. This approach is echoed by leading researchers who situate automation as a catalyst for role evolution, scalable efficiency, and the ability to redeploy resources into higher-value growth opportunities.

Goldman Sachs’ specific AI play is therefore neither a defensive move against headcount nor a speculative technological bet, but a calculated strategy rooted in both practical business history and contemporary academic theory—a paradigm for how large organisations can adapt, thrive, and lead in the face of continual disruption.

read more
Quote: Satya Nadella – Microsoft CEO

Quote: Satya Nadella – Microsoft CEO

“At scale, nothing is a commodity. We have to have our cost structure, supply-chain efficiency, and software efficiencies continue to compound to ensure margins. Scale – and one of the things I love about the OpenAI partnership – is it’s gotten us to scale. This is a scale game.” – Satya Nadella – Microsoft CEO

Satya Nadella has been at the helm of Microsoft since 2014, overseeing its transformation into one of the world’s most valuable technology companies. Born in Hyderabad, India, and educated in electrical engineering and computer science, Nadella joined Microsoft in 1992, quickly rising through the ranks in technical and business leadership roles. Prior to becoming CEO, he was best known for driving the rapid growth of Microsoft Azure, the company’s cloud infrastructure platform—a business now central to Microsoft’s global strategy.

Nadella’s leadership style is marked by systemic change—he has shifted Microsoft away from legacy, siloed software businesses and repositioned it as a cloud-first, AI-driven, and highly collaborative tech company. He is recognised for his ability to anticipate secular shifts—most notably, the move to hyperscale cloud computing and, more recently, the integration of advanced AI into core products such as GitHub Copilot and Microsoft 365 Copilot. His background—combining deep technical expertise with rigorous business training (MBA, University of Chicago)—enables him to bridge both the strategic and operational dimensions of global technology.

This quote was delivered in the context of Nadella’s public discussion on the scale economics of AI, hyperscale cloud, and the transformative partnership between Microsoft and OpenAI (the company behind ChatGPT, Sora, and GPT-4/5/6) on the BG2 podcast, 1st November 2025 In this conversation, Nadella outlines why, at the extreme end of global tech infrastructure, nothing remains a “commodity”: system costs, supply chain and manufacturing agility, and relentless software optimisation all become decisive sources of competitive advantage. He argues that scale—meaning not just size, but the compounding organisational learning and cost improvement unlocked by operating at frontier levels—determines who captures sustainable margins and market leadership.

The OpenAI partnership is, from Nadella’s perspective, a practical illustration of this thesis. By integrating OpenAI’s frontier models deeply (and at exclusive scale) within Azure, Microsoft has driven exponential increases in compute utilisation, data flows, and the learning rate of its software infrastructure. This allowed Microsoft to amortise fixed investments, rapidly reduce unit costs, and create a loop of innovation not accessible to smaller or less integrated competitors. In Nadella’s framing, scale is not a static achievement, but a perpetual game—one where the winners are those who compound advantages across the entire stack: from chip supply chains through to application software and business model design.

Theoretical Foundations and Key Thinkers

The quote’s themes intersect with multiple domains: economics of platforms, organisational learning, network effects, and innovation theory. Key theoretical underpinnings and thinkers include:

Scale Economics and Competitive Advantage

  • Alfred Chandler (1918–2007): Chandler’s work on the “visible hand” and the scale and scope of modern industrial firms remains foundational. He showed how scale, when coupled with managerial coordination, allows firms to achieve durable cost advantages and vertical integration.
  • Bruce Greenwald & Judd Kahn: In Competition Demystified (2005), they argue sustainable competitive advantage stems from barriers to entry—often reinforced by scale, especially via learning curves, supply chains, and distribution.

Network Effects and Platform Strategy

  • Jean Tirole & Marcel Boyer: Tirole’s work on platform economics shows how scale-dependent markets (like cloud and AI) naturally concentrate—network effects reinforce the value of leading platforms, and marginal cost advantage compounds alongside user and data scale.
  • Geoffrey Parker, Marshall Van Alstyne, Sangeet Paul Choudary: In their research and Platform Revolution, these thinkers elaborate how the value in digital markets accrues disproportionately to platforms that achieve scale—because transaction flows, learning, and innovation all reinforce one another.

Learning Curves and Experience Effects

  • The Boston Consulting Group (BCG): In the 1960s, Bruce Henderson’s concept of the “experience curve” formalised the insight that unit costs fall as cumulative output grows—the canonical explanation for why scale delivers persistent cost advantage.
  • Clayton Christensen: In The Innovator’s Dilemma, Christensen illustrates how technological discontinuities and learning rates enable new entrants to upend incumbent advantage—unless those incumbents achieve scale in the new paradigm.

Supply Chain and Operations

  • Taiichi Ohno and Shoichiro Toyoda (Toyota Production System): The industrial logic that relentless supply chain optimisation and compounding process improvements, rather than static cost reduction, underpin long-run advantage, especially during periods of rapid demand growth or supply constraint.

Economics of Cloud and AI

  • Hal Varian (Google, UC Berkeley): Varian’s analyses of cloud economics demonstrate the massive fixed-cost base and “public utility” logic of hyperscalers. He has argued that AI and cloud converge when scale enables learning (data/usage) to drive further cost and performance improvements.
  • Andrew Ng, Yann LeCun, Geoffrey Hinton: Pioneer practitioners in deep learning and large language models, whose work established the “scaling laws” now driving the AI infrastructure buildout—i.e., that model capability increases monotonically with scale of data, compute, and parameter count.

Why This Matters Now

Organisations at the digital frontier—notably Microsoft and OpenAI—are now locked in a scale game that is reshaping both industry structure and the global economy. The cost, complexity, and learning rate needed to operate at hyperscale mean that “commodities” (compute, storage, even software itself) cease to be generic. Instead, they become deeply differentiated by embedded knowledge, utilisation efficiency, supply-chain integration, and the ability to orchestrate investments across cycles of innovation.

Nadella’s observation underscores a reality that now applies well beyond technology: the compounding of competitive advantage at scale has become the critical determinant of sector leadership and value capture. This logic is transforming industries as diverse as finance, logistics, pharmaceuticals, and manufacturing—where the ability to build, learn, and optimise at scale fundamentally redefines what was once considered “commodity” business.

In summary: Satya Nadella’s words reflect not only Microsoft’s strategy but a broader economic and technological transformation, deeply rooted in the theory and practice of scale, network effects, and organisational learning. Theorists and practitioners—from Chandler and BCG to Christensen and Varian—have analysed these effects for decades, but the age of AI and cloud has made their insights more decisive than ever. At the heart of it: scale—properly understood and operationalised—remains the ultimate competitive lever.

read more
Quote: David Solomon – Goldman Sachs CEO

Quote: David Solomon – Goldman Sachs CEO

“Generally speaking people hate change. It’s human nature. But change is super important. It’s inevitable. In fact, on my desk in my office I have a little plaque that says ‘Change or die.’ As a business leader, one of the perspectives you have to have is that you’ve got to constantly evolve and change.” – David Solomon – Goldman Sachs CEO

The quoted insight comes from David M. Solomon, Chief Executive Officer and Chairman of Goldman Sachs, a role he has held since 2018. It was delivered during a high-profile interview at The Economic Club of Washington, D.C., 30 October 2025, as Solomon reflected on the necessity of adaptability both personally and as a leader within a globally significant financial institution.

His statement is emblematic of the strategic philosophy that has defined Solomon’s executive tenure. He uses the ‘Change or die’ principle to highlight the existential imperative for renewal in business, particularly in the context of technological transformation, competitive dynamics, and economic disruption.

Solomon’s leadership at Goldman Sachs has been characterised by deliberate modernisation. He has overseen the integration of advanced technology, notably in artificial intelligence and fintech, implemented culture and process reforms, adapted workforce practices, and expanded strategic initiatives in sustainable finance. His approach blends operational rigour with entrepreneurial responsiveness – a mindset shaped both by his formative years in high-yield credit markets at Drexel Burnham and Bear Stearns, and by his rise through leadership roles at Goldman Sachs.

His remark on change was prompted by questions of business resilience and the need for constant adaptation amidst macroeconomic uncertainty, regulatory flux, and the competitive imperatives of Wall Street. For Solomon, resisting change is an instinct, but enabling it is a necessity for long-term health and relevance — especially for institutions in rapidly converging markets.

About David M. Solomon

  • Born 1962, Hartsdale, New York.
  • Hamilton College graduate (BA Political Science).
  • Early career: Irving Trust, Drexel Burnham, Bear Stearns.
  • Joined Goldman Sachs as a partner in 1999, advancing through financing and investment banking leadership.
  • CEO from October 2018, Chairman from January 2019.
  • Known for a modernisation agenda, openness to innovation and talent, commitment to client service and culture reform.
  • Outside finance: Philanthropy, board service, and a second career as electronic dance music DJ “DJ D-Sol”, underscoring a multifaceted approach to leadership and personal renewal.

Theoretical Backstory: Leading Thinkers on Change and Organisational Adaptation

Solomon’s philosophy echoes decades of foundational theory in business strategy and organisational behaviour:

Charles Darwin (1809–1882)
While not a business theorist, Darwin’s principle of “survival of the fittest” is often cited in strategic literature to emphasise the adaptive imperative — those best equipped to change, survive.

Peter Drucker (1909–2005)
Drucker, regarded as the father of modern management, wrote extensively on innovation, entrepreneurial management and the need for “planned abandonment.” He argued, “The greatest danger in times of turbulence is not the turbulence; it is to act with yesterday’s logic.” Drucker’s legacy forms a pillar of contemporary change management, advising leaders not only to anticipate change but to institutionalise it.

John Kotter (b. 1947)
Kotter’s model for Leading Change remains a classic in change management. His eight-step framework starts with establishing a sense of urgency and is grounded in the idea that successful transformation is both necessary and achievable only with decisive leadership, clear vision, and broad engagement. Kotter demonstrated that people’s resistance to change is natural, but can be overcome through structured actions and emotionally resonant leadership.

Clayton Christensen (1952-2020)
Christensen’s work on disruptive innovation clarified how incumbents often fail by ignoring, dismissing, or underinvesting in change — even when it is inevitable. His concept of the “Innovator’s Dilemma” remains seminal, showing that leaders must embrace change not as an abstract imperative but as a strategic necessity, lest they be replaced or rendered obsolete.

Rosabeth Moss Kanter
Kanter’s work focuses on the human dynamics of change, the importance of culture, empowerment, and the “innovation habit” in organisations. She holds that the secret to business success is “constant, relentless innovation” and that resistance to change is deeply psychological, calling for leaders to engineer positive environments for innovation.

Integration: The Leadership Challenge

Solomon’s ethos channels these frameworks into practical executive guidance. For business leaders, particularly in financial services and Fortune 500 firms, the lesson is clear: inertia is lethal; organisational health depends on reimagining processes, culture, and client engagement for tomorrow’s challenges. The psychological aversion to change must be managed actively at all levels — from the boardroom to the front line.

In summary, the context of Solomon’s quote reflects not only a personal credo but also the consensus of generations of theoretical and practical leadership: only those prepared to “change or die” can expect to thrive and endure in an era defined by speed, disruption, and relentless unpredictability.

read more
Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

“[With AI] we’re not building animals. We’re building ghosts or spirits.” – Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Andrej Karpathy, renowned for his leadership roles at OpenAI and Tesla’s Autopilot programme, has been at the centre of advances in deep learning, neural networks, and applied artificial intelligence. His work traverses both academic research and industrial deployment, granting him a panoramic perspective on the state and direction of AI.

When Karpathy refers to building “ghosts or spirits,” he is drawing a conceptual line between biological intelligence—the product of millions of years of evolution—and artificial intelligence as developed through data-driven, digital systems. In his view, animals are “baked in” with instincts, embodiment, and innate learning capacities shaped by evolution, a process unfolding over geological timeframes. By contrast, today’s AI models are “ghosts” in the sense that they are ethereal, fully digital artefacts, trained to imitate human-generated data rather than to evolve or learn through direct interaction with the physical world. They lack bodily instincts and the evolutionary substrate that endows animals with survival strategies and adaptation mechanisms.

Karpathy describes the pre-training process that underpins large language models as a form of “crappy evolution”—a shortcut that builds digital entities by absorbing the statistical patterns of internet-scale data without the iterative adaptation of embodied beings. Consequently, these models are not “born” into the world like animals with built-in survival machinery; instead, they are bootstrapped as “ghosts,” imitating but not experiencing life.

 

The Cognitive Core—Karpathy’s Vision for AI Intelligence

Karpathy’s thinking has advanced towards the critical notion of the “cognitive core”: the kernel of intelligence responsible for reasoning, abstraction, and problem-solving, abstracted away from encyclopaedic factual knowledge. He argues that the true magic of intelligence is not in the passive recall of data, but in the flexible, generalisable ability to manipulate ideas, solve problems, and intuit patterns—capabilities that a system exhibits even when deprived of pre-programmed facts or exhaustive memory.

He warns against confusing memorisation (the stockpiling of internet facts within a model) with general intelligence, which arises from this cognitive core. The most promising path, in his view, is to isolate and refine this core, stripping away the accretions of memorised data, thereby developing something akin to a “ghost” of reasoning and abstraction rather than an “animal” shaped by instinct and inheritance.

This approach entails significant trade-offs: a cognitive core lacks the encyclopaedic reach of today’s massive models, but gains in adaptability, transparency, and the capacity for compositional, creative thought. By foregrounding reasoning machinery, Karpathy posits that AI can begin to mirror not the inflexibility of animals, but the open-ended, reflective qualities that characterise high-level problem-solving.

 

Karpathy’s Journey and Influence

Karpathy’s influence is rooted in a career spent on the frontier of AI research and deployment. His early proximity to Geoffrey Hinton at the University of Toronto placed him at the launch-point of the convolutional neural networks revolution, which fundamentally reshaped computer vision and pattern recognition.

At OpenAI, Karpathy contributed to an early focus on training agents to master digital environments (such as Atari games), a direction in retrospect he now considers premature. He found greater promise in systems that could interact with the digital world through knowledge work—precursors to today’s agentic models—a vision he is now helping to realise through ongoing work in educational technology and AI deployment.

Later, at Tesla, he directed the transformation of autonomous vehicles from demonstration to product, gaining hard-won appreciation for the “march of nines”—the reality that progressing from system prototypes that work 90% of the time to those that work 99.999% of the time requires exponentially more effort. This experience informs his scepticism towards aggressive timelines for “AGI” and his insistence on the qualitative differences between robust system deployment and controlled demonstrations.

 

The Leading Theorists Shaping the Debate

Karpathy’s conceptual framework emerges amid vibrant discourse within the AI community, shaped by several seminal thinkers:

Theorist
Core Idea
Relation to Karpathy’s Ghosts vs. Animals Analogy
Richard Sutton
General intelligence emerges through learning algorithms honed by evolution (“bitter lesson”)
Sutton advocates building “animals” via RL and continual learning; Karpathy sees modern AI as ghosts—data-trained, not evolved.
Geoffrey Hinton
Neural networks model learning and perception as statistical pattern discovery
Hinton’s legacy underpins the digital cortex, but Karpathy stresses what’s missing: embodied instincts, continual memory.
Yann LeCun
Convolutional neural networks and representation learning for perceptual tasks
LeCun’s work forms part of the “cortex”, but Karpathy highlights the missing brain structures and instincts for full generality.

Sutton’s “bitter lesson” posits that scale and generic algorithms, rather than domain-specific tricks, ultimately win—suggesting a focus on evolving animal-like intelligence. Karpathy, however, notes that current development practices, with their reliance on dataset imitation, sidestep the deep embodiment and evolutionary learning that define animal cognition. Instead, AI today creates digital ghosts—entities whose minds are not grounded in physical reality, but in the manifold of internet text and data.

Hinton and LeCun supply the neural and architectural foundations—the “cortex” and reasoning traces—while both Karpathy and their critics note the absence of rich, consolidated memory (the hippocampus analogue), instincts (amygdala), and the capacity for continual, self-motivated world interaction.

Why “Ghosts,” Not “Animals”?

The distinction is not simply philosophical. It carries direct consequences for:

  • Capabilities: AI “ghosts” excel at pattern reproduction, simulation, and surface reasoning but lack the embodied, instinctual grounding (spatial navigation, sensorimotor learning) of animals.
  • Limitations: They are subject to model collapse, producing uniform, repetitive outputs, lacking the spontaneous creativity and entropy seen in human (particularly child) cognition.
  • Future Directions: The field is now oriented towards distilling this cognitive core, seeking a scalable, adaptable reasoning engine—compact, efficient, and resilient to overfitting—rather than continuing to bloat models with ever more static memory.

This lens sharpens expectations: the way forward is not to mimic biology in its totality, but to pursue the unique strengths and affordances of a digital, disembodied intelligence—a spirit of the datasphere, not a beast evolved in the forest.

 

Broader Significance

Karpathy’s “ghosts” metaphor crystallises a critical moment in the evolution of AI as a discipline. It signals a turning point: the shift from brute-force memorisation of the internet to intelligent, creative algorithms capable of abstraction, reasoning, and adaptation.

This reframing is shaping not only the strategic priorities of the most advanced labs, but also the philosophical and practical questions underpinning the next decade of AI research and deployment. As AI becomes increasingly present in society, understanding its nature—not as an artificial animal, but as a digital ghost—will be essential to harnessing its strengths and mitigating its limitations.

read more
Quote: Sholto Douglas – Anthropic

Quote: Sholto Douglas – Anthropic

“People have said we’re hitting a plateau every month for three years… I look at how models are produced and every part could be improved. The training pipeline is primitive, held together by duct tape, best efforts, and late nights. There’s so much room to grow everywhere.” – Sholto Douglas – Anthropic

Sholto Douglas made the statement during a major public podcast interview in October 2025, coinciding with Anthropic’s release of Claude Sonnet 4.5—at the time, the world’s strongest and most “agentic” AI coding model. The comment specifically rebuts repeated industry and media assertions that large AI models have reached a ceiling or are slowing in progress. Douglas argues the opposite: that the field is in a phase of accelerating advancement, driven both by transformative hardware investment (“compute super-cycle”), new algorithmic techniques (particularly reinforcement learning and test-time compute), and the persistent “primitive” state of today’s AI engineering infrastructure.

He draws an analogy with early-stage, improvisational systems: the models are held together “by duct tape, best efforts, and late nights,” making clear that immense headroom for improvement remains at every level, from training data pipelines and distributed infrastructure to model architecture and reward design. As a result, every new benchmark and capability reveals further unrealised opportunity, with measurable progress charted month after month.

Douglas’s deeper implication is that claims of a plateau often arise from surface-level analysis or the “saturation” of public benchmarks, not from a rigorous understanding of what is technically possible or how much scale remains untapped across the technical stack.

Sholto Douglas: Career Trajectory and Perspective

Sholto Douglas is a leading member of Anthropic’s technical staff, focused on scaling reinforcement learning and agentic AI. His unconventional journey illustrates both the new talent paradigm and the nature of breakthrough AI research today:

  • Early Life and Mentorship: Douglas grew up in Australia, where he benefited from unusually strong academic and athletic mentorship. His mother, an accomplished physician frustrated by systemic barriers, instilled discipline and a systemic approach; his Olympic-level fencing coach provided a first-hand experience of how repeated, directed effort leads to world-class performance.
  • Academic Formation: He studied computer science and robotics as an undergraduate, with a focus on practical experimentation and a global mindset. A turning point was reading the “scaling hypothesis” for AGI, convincing him that progress on artificial general intelligence was feasible within a decade—and worth devoting his career to.
  • Independent Innovation: As a student, Douglas built “bedroom-scale” foundation models for robotics, working independently on large-scale data collection, simulation, and early adoption of transformer-based methods. This entrepreneurial approach—demonstrating initiative and technical depth without formal institutional backing—proved decisive.
  • Google (Gemini and DeepMind): His independent work brought him to Google, where he joined just before the release of ChatGPT, in time to witness and help drive the rapid unification and acceleration of Google’s AI efforts (Gemini, Brain, DeepMind). He co-designed new inference infrastructure that reduced costs and worked at the intersection of large-scale learning, reinforcement learning, and applied reasoning.
  • Anthropic (from 2025): Drawn by Anthropic’s focus on measurable, near-term economic impact and deep alignment work, Douglas joined to lead and scale reinforcement learning research—helping push the capability frontier for agentic models. He values a culture where every contributor understands and can articulate how their work advances both capability and safety in AI.

Douglas is distinctive for his advocacy of “taste” in AI research, favouring mechanistic understanding and simplicity over clever domain-specific tricks—a direct homage to Richard Sutton’s “bitter lesson.” This perspective shapes his belief that the greatest advances will come not from hiding complexity with hand-crafted heuristics, but from scaling general algorithms and rigorous feedback loops.

 

Intellectual and Scientific Context: The ‘Plateau’ Debate and Leading Theorists

The debate around the so-called “AI plateau” is best understood against the backdrop of core advances and recurring philosophical arguments in machine learning.

The “Bitter Lesson” and Richard Sutton

  • Richard Sutton (University of Alberta, DeepMind), one of the founding figures in reinforcement learning, crystallised the field’s “bitter lesson”: that general, scalable methods powered by increased compute will eventually outperform more elegant, hand-crafted, domain-specific approaches.
  • In practical terms, this means that the field’s recent leaps—from vision to language to coding—are powered less by clever new inductive biases, and more by architectural simplicity plus massive compute and data. Sutton has also maintained that real progress in AI will come from reinforcement learning with minimal task-specific assumptions and maximal data, computation, and feedback.

Yann LeCun and Alternative Paradigms

  • Yann LeCun (Meta, NYU), a pioneer of deep learning, has maintained that the transformer paradigm is limited and that fundamentally novel architectures are necessary for human-like reasoning and autonomy. He argues that unsupervised/self-supervised learning and new world-modelling approaches will be required.
  • LeCun’s disagreement with Sutton’s “bitter lesson” centres on the claim that scaling is not the final answer: new representation learning, memory, and planning mechanisms will be needed to reach AGI.

Shane Legg, Demis Hassabis, and DeepMind

  • DeepMind’s approach has historically been “science-first,” tackling a broad swathe of human intelligence challenges (AlphaGo, AlphaFold, science AI), promoting a research culture that takes long-horizon bets on new architectures (memory-augmented neural networks, world models, differentiable reasoning).
  • Demis Hassabis and Shane Legg (DeepMind co-founders) have advocated for testing a diversity of approaches, believing that the path to AGI is not yet clear—though they too acknowledge the value of massive scale and reinforcement learning.

The Scaling Hypothesis: GW’s Essay and the Modern Era

  • The so-called “scaling hypothesis”—the idea that simply making models larger and providing more compute and data will continue yielding improvements—has become the default “bet” for Anthropic, OpenAI, and others. Douglas refers directly to this intellectual lineage as the critical “hinge” moment that set his trajectory.
  • This hypothesis is now being extended into new areas, including agentic systems where long context, verification, memory, and reinforcement learning allow models to reliably pursue complex, multi-step goals semi-autonomously.
 

Summing Up: The Current Frontier

Today, researchers like Douglas are moving beyond the original transformer pre-training paradigm, leveraging multi-axis scaling (pre-training, RL, test-time compute), richer reward systems, and continuous experimentation to drive model capabilities in coding, digital productivity, and emerging physical domains (robotics and manipulation).

Douglas’s quote epitomises the view that not only has performance not plateaued—every “limitation” encountered is a signpost for further exponential improvement. The modest, “patchwork” nature of current AI infrastructure is a competitive advantage: it means there is vast room for optimisation, iteration, and compounding gains in capability.

As the field races into a new era of agentic AI and economic impact, his perspective serves as a grounded, inside-out refutation of technological pessimism and a call to action grounded in both technical understanding and relentless ambition.

read more
Quote: Julian Schrittwieser – Anthropic

Quote: Julian Schrittwieser – Anthropic

“The talk about AI bubbles seemed very divorced from what was happening in frontier labs and what we were seeing. We are not seeing any slowdown of progress.” – Julian Schrittwieser – Anthropic

Those closest to technical breakthroughs are witnessing a pattern of sustained, compounding advancement that is often underestimated by commentators and investors. This perspective underscores both the power and limitations of conventional intuitions regarding exponential technological progress.

 

Context of the Quote

Schrittwieser delivered these remarks in a 2025 interview on the MAD Podcast, prompted by widespread discourse on the so-called ‘AI bubble’. His key contention is that debate around an AI investment or hype “bubble” feels disconnected from the lived reality inside the world’s top research labs, where the practical pace of innovation remains brisk and outwardly undiminished. He outlines that, according to direct observation and internal benchmarks at labs such as Anthropic, progress remains on a highly consistent exponential curve: “every three to four months, the model is able to do a task that is twice as long as before completely on its own”.

He draws an analogy to the early days of COVID-19, where exponential growth was invisible until it became overwhelming; the same mathematical processes, Schrittwieser contends, apply to AI system capabilities. While public narratives about bubbles often reference the dot-com era, he highlights a bifurcation: frontier labs sustain robust, revenue-generating trajectories, while the wider AI ecosystem might experience bubble-like effects in valuation. But at the core—the technology itself continues to improve at a predictably exponential rate well supported by both qualitative experience and benchmark data.

Schrittwieser’s view, rooted in immediate, operational knowledge, is that the default expectation of a linear future is mistaken: advances in autonomy, reasoning, and productivity are compounding. This means genuinely transformative impacts—such as AI agents that function at expert level or beyond for extended, unsupervised tasks—are poised to arrive sooner than many anticipate.

 

Profile: Julian Schrittwieser

Julian Schrittwieser is one of the world’s leading artificial intelligence researchers, currently based at Anthropic, following a decade as a core scientist at Google DeepMind. Raised in rural Austria, Schrittwieser’s journey from an adolescent fascination with game programming to the vanguard of AI research exemplifies the discipline’s blend of curiosity, mathematical rigour, and engineering prowess. He studied computer science at the Vienna University of Technology, before interning at Google.

Schrittwieser was a central contributor to several historic machine learning milestones, most notably:

 
  • AlphaGo, the first program to defeat a world champion at Go, combining deep neural networks with Monte Carlo Tree Search.
  • AlphaGo Zero and AlphaZero, which generalised the approach to achieve superhuman performance without human examples, through self-play—demonstrating true generality in reinforcement learning.
  • MuZero (as lead author), solving the challenge of mastering environments without even knowing the rules in advance, by enabling the system to learn its own internal, predictive world models—an innovation bringing RL closer to complex, real-world domains.
  • Later work includes AlphaCode (code synthesis), AlphaTensor (algorithmic discovery), and applied advances in Gemini and AlphaProof.

At Anthropic, Schrittwieser is at the frontier of research into scaling laws, reinforcement learning, autonomous agents, and novel techniques for alignment and safety in next-generation AI. True to his pragmatic ethos, he prioritises what directly raises capability and reliability, and advocates for careful, data-led extrapolation rather than speculation.

 

Theoretical Backstory: Exponential AI Progress and Key Thinkers

Schrittwieser’s remarks situate him within a tradition of AI theorists and builders focused on scaling laws, reinforcement learning (RL), and emergent capabilities:

Leading Theorists and Historical Perspective

Name
Notable Ideas and Contributions
Relevance to Quote
Demis Hassabis
Founder of DeepMind; architect of the AlphaGo programme. Emphasised general intelligence and the power of RL plus planning.
Schrittwieser’s mentor and DeepMind leader. Pioneered RL paradigms beyond games.
David Silver
Developed many of the breakthroughs underlying AlphaGo, AlphaZero, MuZero. Advanced RL and model-based search methods.
Collaborator with Schrittwieser; together, demonstrated practical scaling of RL.
Richard Sutton
Articulated reinforcement learning’s centrality: “The Bitter Lesson” (general methods, scalable computation, not handcrafted). Advanced temporal difference methods and RL theory.
Mentioned by Schrittwieser as a thought leader shaping the RL paradigm at scale.
Alex Ray, Jared Kaplan, Sam McCandlish, OpenAI Scaling Team
Quantified AI’s “scaling laws”: empirical tendencies for model performance to improve smoothly with compute, data, and parameter scaling.
Schrittwieser echoes this data-driven, incrementalist philosophy.
Ilya Sutskever
Co-founder of OpenAI; central to deep learning breakthroughs, scaling, and forecasting emergent capabilities.
OpenAI’s work on benchmarks (GDP-Val) and scaling echoes these insights.

These thinkers converge on several key observations directly reflected in Schrittwieser’s view:

  • Exponential Capability Curves: Consistent advances in performance often surprise those outside the labs due to our poor intuitive grasp of exponentiality—what Schrittwieser terms a repeated “failure to understand the exponential”.
  • Scaling Laws and Reinforcement Learning: Improvements are not just about larger models, but ever-better training, more reliable reinforcement learning, agentic architecture, and robust reward systems—developments Schrittwieser’s work epitomises.
  • Novelty and Emergence: Historically, theorists doubted whether neural models could go beyond sophisticated mimicry; the “Move 37” moment (AlphaGo’s unprecedented move in Go) was a touchstone for true machine creativity, a theme Schrittwieser stresses remains highly relevant today.
  • Bubbles, Productivity, and Market Cycles: Mainstream financial and social narratives may oscillate dramatically, but real capability growth—observable in benchmarks and direct use—has historically marched on undeterred by speculative excesses.
 

Synthesis: Why the Perspective Matters

The quote foregrounds a gap between external perceptions and insider realities. Pioneers like Schrittwieser and his cohort stress that transformative change will not follow a smooth, linear or hype-driven curve, but an exponential, data-backed progression—one that may defy conventional intuition, but is already reshaping productivity and the structure of work.

This moment is not about “irrational exuberance”, but rather the compounding product of theoretical insight, algorithmic audacity, and relentless engineering: the engine behind the next wave of economic and social transformation.

read more
Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

“AI is so wonderful because there have been a number of seismic shifts where the entire field has suddenly looked a different way. I’ve maybe lived through two or three of those. I still think there will continue to be some because they come with almost surprising regularity.” – Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Andrej Karpathy, one of the most recognisable figures in artificial intelligence, has spent his career at the epicentre of the field’s defining moments in both research and large-scale industry deployment.

Karpathy’s background is defined by deep technical expertise and a front-row seat to AI’s rapid evolution. Having completed his PhD at Stanford and held pivotal research positions, he worked alongside Geoffrey Hinton at the University of Toronto during the early surge of deep learning. His career encompasses key roles at Tesla, where he led the Autopilot vision team, and at OpenAI, contributing to some of the world’s most prominent large language models and generative AI systems. This vantage point has allowed him to participate in, and reflect upon, the discipline’s “seismic shifts”.

Karpathy’s narrative has been shaped by three inflection points:

  • The emergence of deep neural networks from a niche field to mainstream AI, spearheaded by the success of AlexNet and the subsequent shift of the research community toward neural architectures.
  • The drive towards agent-based systems, with early enthusiasm for reinforcement learning (RL) and game-based environments (such as Atari and Go). Karpathy himself was cautious about the utility of games as the true path to intelligence, focusing instead on agents acting within the real digital world.
  • The rise of large language models (LLMs)—transformers trained on vast internet datasets, shifting the locus of AI from task-specific systems to general-purpose models with the ability to perform a broad suite of tasks, and in-context learning.

His reflection on these ‘regular’ paradigm shifts arises from lived experience: “I’ve maybe lived through two or three of those. I still think there will continue to be some because they come with almost surprising regularity.” These moments recalibrate assumptions, redirect research priorities, and set new benchmarks for capability. Karpathy’s practical orientation—building “useful things” rather than targeting biological intelligence or pure AGI—shapes his approach to both innovation and scepticism about hype.

Context of the Quote
In his conversation with podcaster Dwarkesh Patel, Karpathy elaborates on the recurring nature of breakthroughs. He contrasts AI’s rapid, transformative leaps with other scientific fields, noting that in machine learning, scaling up data, compute, and novel architectures can yield abrupt improvements—yet each wave often triggers both excessive optimism and later recalibration. A major point he raises is the lack of linearity: the field does not “smoothly” approach AGI, but rather proceeds via discontinuities, often catalysed by new ideas or techniques that were previously out of favour or overlooked.

Karpathy relates how, early in his career, neural networks were a marginal interest and large-scale “representation learning” was only beginning to be considered viable by a minority in the community. With the advent of AlexNet, the landscape shifted overnight, rapidly making previous assumptions obsolete. Later, the pursuit of RL-driven agents led to a phase where entire research agendas were oriented toward gameplay and synthetic environments—another phase later superseded by the transformer revolution and language models. Karpathy reflects candidly on earlier missteps, as well as the discipline’s collective tendency to over- or under-predict the timetable and trajectory of progress.

Leading Theorists and Intellectual Heritage
The AI revolutions Karpathy describes are inseparable from the influential figures and ideas that have shaped each phase:

  • Geoffrey Hinton: Hailed as the “godfather of AI”, Hinton was instrumental in deep learning’s breakthrough, advancing techniques for training multilayered neural networks and championing representation learning against prevailing orthodoxy.
  • Yann LeCun: Developed convolutional neural networks (CNNs), foundational for computer vision and the 2010s wave of deep learning success.
  • Yoshua Bengio: Co-architect of the deep learning movement and a key figure in developing unsupervised and generative models.
  • Richard Sutton: Principal proponent of reinforcement learning, Sutton articulated the value of “animal-like” intelligence: learning from direct interaction with environments, reward, and adaptation. Sutton’s perspective frequently informs debates about the relationship between model architectures and living intelligence, encouraging a focus on agents and lifelong learning.

Karpathy’s own stance is partly a pragmatic response to this heritage: rather than pursuing analogues of biological brains, he views the productive path as building digital “ghosts”—entities that learn by imitation and are shaped by patterns in data, rather than evolutionary processes.

Beyond individual theorists, the field’s quantum leaps are rooted in a culture of intellectual rivalry and rapid intellectual cross-pollination:

  • The convolutional and recurrent networks of the 2010s pushed the boundaries of what neural networks could do.
  • The development and scaling of transformer-based architectures (as in Google’s “Attention is All You Need”) dramatically changed both natural language processing and the structure of the field itself.
  • The introduction of algorithms for in-context learning and large-scale unsupervised pre-training marked a break with hand-crafted representation engineering.

The Architecture of Progress: Seismic Shifts and Pragmatic Tension
Karpathy’s insight is that these shifts are not just about faster hardware or bigger datasets; they reflect the field’s unique ecology—where new methods can rapidly become dominant and overturn accumulated orthodoxy. The combination of open scientific exchange, rapid deployment, and intense commercialisation creates fertile ground for frequent realignment.

His observation on the “regularity” of shifts also signals a strategic realism: each wave brings both opportunity and risk. New architectures (such as transformers or large reinforcement learning agents) frequently overshoot expectations before their real limitations become clear. Karpathy remains measured on both promise and limitation—anticipating continued progress, but cautioning against overpredictions and hype cycles that fail to reckon with the “march of nines” needed to reach true reliability and impact.

Closing Perspective
The context of Karpathy’s quote is an AI ecosystem that advances not through steady accretion, but in leaps—each driven by conceptual, technical, and organisational realignments. As such, understanding progress in AI demands both technical literacy and historical awareness: the sharp pivots that have marked past decades are likely to recur, with equally profound effects on how intelligence is conceived, built, and deployed.

read more
Quote: Jonathan Ross – CEO Groq

Quote: Jonathan Ross – CEO Groq

“The countries that control compute will control AI. You cannot have compute without energy.” – Jonathan Ross – CEO Groq

Jonathan Ross stands at the intersection of geopolitics, energy economics, and technological determinism. As founder and CEO of Groq, the Silicon Valley firm challenging Nvidia’s dominance in AI infrastructure, Ross articulated a proposition of stark clarity during his September 2025 appearance on Harry Stebbings’ 20VC podcast: “The countries that control compute will control AI. You cannot have compute without energy.”

This observation transcends technical architecture. Ross is describing the emergence of a new geopolitical currency—one where computational capacity, rather than traditional measures of industrial might, determines economic sovereignty and strategic advantage in the 21st century. His thesis rests on an uncomfortable reality: artificial intelligence, regardless of algorithmic sophistication or model architecture, cannot function without the physical substrate of compute. And compute, in turn, cannot exist without abundant, reliable energy.

The Architecture of Advantage

Ross’s perspective derives from direct experience building the infrastructure that powers modern AI. At Google, he initiated what became the Tensor Processing Unit (TPU) project—custom silicon that allowed the company to train and deploy machine learning models at scale. This wasn’t academic research; it was the foundation upon which Google’s AI capabilities were built. When Amazon and Microsoft attempted to recruit him in 2016 to develop similar capabilities, Ross recognised a pattern: the concentration of advanced AI compute in too few hands represented a strategic vulnerability.

His response was to establish Groq in 2016, developing Language Processing Units optimised for inference—the phase where trained models actually perform useful work. The company has since raised over $3 billion and achieved a valuation approaching $7 billion, positioning itself as one of Nvidia’s most credible challengers in the AI hardware market. But Ross’s ambitions extend beyond corporate competition. He views Groq’s mission as democratising access to compute—creating abundant supply where artificial scarcity might otherwise concentrate power.

The quote itself emerged during a discussion about global AI competitiveness. Ross had been explaining why European nations, despite possessing strong research talent and model development capabilities (Mistral being a prominent example), risk strategic irrelevance without corresponding investment in computational infrastructure and energy capacity. A brilliant model without compute to run it, he argued, will lose to a mediocre model backed by ten times the computational resources. This isn’t theoretical—it’s the lived reality of the current AI landscape, where rate limits and inference capacity constraints determine what services can scale and which markets can be served.

The Energy Calculus

The energy dimension of Ross’s statement carries particular weight. Modern AI training and inference require extraordinary amounts of electrical power. The hyperscalers—Google, Microsoft, Amazon, Meta—are each committing tens of billions of dollars annually to AI infrastructure, with significant portions dedicated to data centre construction and energy provision. Microsoft recently announced it wouldn’t make certain GPU clusters available through Azure because the company generated higher returns using that compute internally rather than renting it to customers. This decision, more than any strategic presentation, reveals the economic value density of AI compute.

Ross draws explicit parallels to the early petroleum industry: a period of chaotic exploration where a few “gushers” delivered extraordinary returns whilst most ventures yielded nothing. In this analogy, compute is the new oil—a fundamental input that determines economic output and strategic positioning. But unlike oil, compute demand doesn’t saturate. Ross describes AI demand as “insatiable”: if OpenAI or Anthropic received twice their current inference capacity, their revenue would nearly double within a month. The bottleneck isn’t customer appetite; it’s supply.

This creates a concerning dynamic for nations without indigenous energy abundance or the political will to develop it. Ross specifically highlighted Europe’s predicament: impressive AI research capabilities undermined by insufficient energy infrastructure and regulatory hesitance around nuclear power. He contrasted this with Norway’s renewable capacity (80% wind utilisation) or Japan’s pragmatic reactivation of nuclear facilities—examples of countries aligning energy policy with computational ambition. The message is uncomfortable but clear: technical sophistication in model development cannot compensate for material disadvantage in energy and compute capacity.

Strategic Implications

The geopolitical dimension becomes more acute when considering China’s position. Ross noted that whilst Chinese models like DeepSeek may be cheaper to train (through various optimisations and potential subsidies), they remain more expensive to run at inference—approximately ten times more costly per token generated. This matters because inference, not training, determines scalability and market viability. China can subsidise AI deployment domestically, but globally—what Ross terms the “away game”—cost structure determines competitiveness. Countries cannot simply construct nuclear plants at will; energy infrastructure takes decades to build.

This asymmetry creates opportunity for nations with existing energy advantages. The United States, despite higher nominal costs, benefits from established infrastructure and diverse energy sources. However, Ross’s framework suggests this advantage is neither permanent nor guaranteed. Control over compute requires continuous investment in both silicon capability and energy generation. Nations that fail to maintain pace risk dependency—importing not just technology, but the capacity for economic and strategic autonomy.

The corporate analogy proves instructive. Ross predicts that every major AI company—OpenAI, Anthropic, Google, and others—will eventually develop proprietary chips, not necessarily to outperform Nvidia technically, but to ensure supply security and strategic control. Nvidia currently dominates not purely through superior GPU architecture, but through control of high-bandwidth memory (HBM) supply chains. Building custom silicon allows organisations to diversify supply and avoid allocation constraints that might limit their operational capacity. What applies to corporations applies equally to nations: vertical integration in compute infrastructure is increasingly a prerequisite for strategic autonomy.

The Theorists and Precedents

Ross’s thesis echoes several established frameworks in economic and technological thought, though he synthesises them into a distinctly contemporary proposition.

Harold Innis, the Canadian economic historian, developed the concept of “staples theory” in the 1930s and 1940s—the idea that economies organised around the extraction and export of key commodities (fur, fish, timber, oil) develop institutional structures, trade relationships, and power dynamics shaped by those materials. Innis later extended this thinking to communication technologies in works like Empire and Communications (1950) and The Bias of Communication (1951), arguing that the dominant medium of a society shapes its political and social organisation. Ross’s formulation applies Innisian logic to computational infrastructure: the nations that control the “staples” of the AI economy—energy and compute—will shape the institutional and economic order that emerges.

Carlota Perez, the Venezuelan-British economist, provided a framework for understanding technological revolutions in Technological Revolutions and Financial Capital (2002). Perez identified how major technological shifts (steam power, railways, electricity, mass production, information technology) follow predictable patterns: installation phases characterised by financial speculation and infrastructure building, followed by deployment phases where the technology becomes economically productive. Ross’s observation about current AI investment—massive capital expenditure by hyperscalers, uncertain returns, experimental deployment—maps cleanly onto Perez’s installation phase. The question, implicit in his quote, is which nations will control the infrastructure when the deployment phase arrives and returns become tangible.

W. Brian Arthur, economist and complexity theorist, articulated the concept of “increasing returns” in technology markets through works like Increasing Returns and Path Dependence in the Economy (1994). Arthur demonstrated how early advantages in technology sectors compound through network effects, learning curves, and complementary ecosystems—creating winner-take-most dynamics rather than the diminishing returns assumed in classical economics. Ross’s emphasis on compute abundance follows this logic: early investment in computational infrastructure creates compounding advantages in AI capability, which drives economic returns, which fund further compute investment. Nations entering this cycle late face escalating barriers to entry.

Joseph Schumpeter, the Austrian-American economist, introduced the concept of “creative destruction” in Capitalism, Socialism and Democracy (1942)—the idea that economic development proceeds through radical innovation that renders existing capital obsolete. Ross explicitly invokes Schumpeterian dynamics when discussing the risk that next-generation AI chips might render current hardware unprofitable before it amortises. This uncertainty amplifies the strategic calculus: nations must invest in compute infrastructure knowing that technological obsolescence might arrive before economic returns materialise. Yet failing to invest guarantees strategic irrelevance.

William Stanley Jevons, the 19th-century English economist, observed what became known as Jevons Paradox in The Coal Question (1865): as technology makes resource use more efficient, total consumption typically increases rather than decreases because efficiency makes the resource more economically viable for new applications. Ross applies this directly to AI compute, noting that as inference becomes cheaper (through better chips or more efficient models), demand expands faster than costs decline. This means the total addressable market for compute grows continuously—making control over production capacity increasingly valuable.

Nicholas Georgescu-Roegen, the Romanian-American economist, pioneered bioeconomics and introduced entropy concepts to economic analysis in The Entropy Law and the Economic Process (1971). Georgescu-Roegen argued that economic activity is fundamentally constrained by thermodynamic laws—specifically, that all economic processes dissipate energy and cannot be sustained without continuous energy inputs. Ross’s insistence that “you cannot have compute without energy” is pure Georgescu-Roegen: AI systems, regardless of algorithmic elegance, are bound by physical laws. Compute is thermodynamically expensive—training large models requires megawatts, inference at scale requires sustained power generation. Nations without access to abundant energy cannot sustain AI economies, regardless of their talent or capital.

Mancur Olson, the American economist and political scientist, explored collective action problems and the relationship between institutional quality and economic outcomes in works like The Rise and Decline of Nations (1982). Olson demonstrated how established interest groups can create institutional sclerosis that prevents necessary adaptation. Ross’s observations about European regulatory hesitance and infrastructure underinvestment reflect Olsonian dynamics: incumbent energy interests, environmental lobbies, and risk-averse political structures prevent the aggressive nuclear or renewable expansion required for AI competitiveness. Meanwhile, nations with different institutional arrangements (or greater perceived strategic urgency) act more decisively.

Paul Romer, the American economist and Nobel laureate, developed endogenous growth theory, arguing in works like “Endogenous Technological Change” (1990) that economic growth derives from deliberate investment in knowledge and technology rather than external factors. Romer’s framework emphasises the non-rivalry of ideas (knowledge can be used by multiple actors simultaneously) but the rivalry of physical inputs required to implement them. Ross’s thesis fits perfectly: AI algorithms can be copied and disseminated, but the computational infrastructure to deploy them at scale cannot. This creates a fundamental asymmetry that determines economic power.

The Historical Pattern

History provides sobering precedents for resource-driven geopolitical competition. Britain’s dominance in the 19th century rested substantially on coal abundance that powered industrial machinery and naval supremacy. The United States’ 20th-century ascendance correlated with petroleum access and the industrial capacity to refine and deploy it. Oil-dependent economies in the Middle East gained geopolitical leverage disproportionate to their population or industrial capacity purely through energy reserves.

Ross suggests we are witnessing the emergence of a similar dynamic, but with a critical difference: AI compute is both resource-intensive (requiring enormous energy) and productivity-amplifying (making other economic activity more efficient). This creates a multiplicative effect where compute advantages compound through both direct application (better AI services) and indirect effects (more efficient production of goods and services across the economy). A nation with abundant compute doesn’t just have better chatbots—it has more efficient logistics, agricultural systems, manufacturing processes, and financial services.

The “away game” concept Ross introduced during the podcast discussion adds a critical dimension. China, despite substantial domestic AI investment and capabilities, faces structural disadvantages in global competition because international customers cannot simply replicate China’s energy subsidies or infrastructure. This creates opportunities for nations with more favourable cost structures or energy profiles, but only if they invest in both compute capacity and energy generation.

The Future Ross Envisions

Throughout the podcast, Ross painted a vision of AI-driven abundance that challenges conventional fears of technological unemployment. He predicts labour shortages, not mass unemployment, driven by three mechanisms: deflationary pressure (AI makes goods and services cheaper), workforce opt-out (people work less as living costs decline), and new industry creation (entirely new job categories emerge, like “vibe coding”—programming through natural language rather than formal syntax).

This optimistic scenario depends entirely on computational abundance. If compute remains scarce and concentrated, AI benefits accrue primarily to those controlling the infrastructure. Ross’s mission with Groq—creating faster deployment cycles (six months versus two years for GPUs), operating globally distributed data centres, optimising for cost efficiency rather than margin maximisation—aims to prevent that concentration. But the same logic applies at the national level. Countries without indigenous compute capacity will import AI services, capturing some productivity benefits but remaining dependent on external providers for the infrastructure that increasingly underpins economic activity.

The comparison Ross offers—LLMs as “telescopes of the mind”—is deliberately chosen. Galileo’s telescope revolutionised human understanding but required specific material capabilities to construct and use. Nations without optical manufacturing capacity could not participate in astronomical discovery. Similarly, nations without computational and energy infrastructure cannot participate fully in the AI economy, regardless of their algorithmic sophistication or research talent.

Conclusion

Ross’s statement—”The countries that control compute will control AI. You cannot have compute without energy”—distils a complex geopolitical and economic reality into stark clarity. It combines Innisian materialism (infrastructure determines power), Schumpeterian dynamism (innovation renders existing capital obsolete), Jevonsian counterintuition (efficiency increases total consumption), and Georgescu-Roegen’s thermodynamic constraints (economic activity requires energy dissipation).

The implications are uncomfortable for nations unprepared to make the necessary investments. Technical prowess in model development provides no strategic moat if the computational infrastructure to deploy those models remains controlled elsewhere. Energy abundance, or the political will to develop it, becomes a prerequisite for AI sovereignty. And AI sovereignty increasingly determines economic competitiveness across sectors.

Ross occupies a unique vantage point—neither pure academic nor disinterested observer, but an operator building the infrastructure that will determine whether his prediction proves correct. Groq’s valuation and customer demand suggest the market validates his thesis. Whether nations respond with corresponding urgency remains an open question. But the framework Ross articulates will likely define strategic competition for the remainder of the decade: compute as currency, energy as prerequisite, and algorithmic sophistication as necessary but insufficient for competitive advantage.

read more
Quote: Jensen Huang – CEO Nvidia

Quote: Jensen Huang – CEO Nvidia

“Oftentimes, if you reason about things from first principles, what’s working today incredibly well — if you could reason about it from first principles and ask yourself on what foundation that first principle is built and how that would change over time — it allows you to hopefully see around corners.” – Jensen Huang – CEO Nvidia

Jensen Huang’s quote was delivered in the context of an in-depth dialogue with institutional investors on the trajectory of Nvidia, the evolution of artificial intelligence, and strategies for anticipating and shaping the technological future.

Context of the Quote

The quote was made during an interview at a Citadel Securities event in October 2025, hosted by Konstantine Buhler, a partner at Sequoia Capital. The dialogue’s audience consisted of leading institutional investors, all seeking avenues for sustainable advantage or ‘edge’. The conversation explored the founding moments of Nvidia in the early 1990s, through the reinvention of the graphics processing unit (GPU), the creation of new computing markets, and the subsequent rise of Nvidia as the platform underpinning the global AI boom. The question of how to ‘see around corners’ — to anticipate technology and industry shifts before they crystallise for others — was at the core of the discussion. Huang’s answer, invoking first-principles reasoning, linked Nvidia’s success to its ability to continually revisit and challenge foundational assumptions, and to methodically project how they will be redefined by progress in science and technology.

Jensen Huang: Profile and Approach

Jensen Huang, born in Tainan, Taiwan in 1963, immigrated to the United States as a child, experiencing the formative challenges of cultural dislocation, financial hardship, and adversity. He obtained his undergraduate degree in electrical engineering from Oregon State University and a master’s from Stanford University. After working at AMD and LSI Logic, he co-founded Nvidia in 1993 at 30, reportedly at a Denny’s restaurant. From the outset, the company faced daunting odds — neither established market nor assured funding, and frequent existential risk in the initial years.

Huang is distinguished not only by technical fluency — he is deeply involved in hardware and software architecture — but also by an ability to translate complexity for diverse audiences. He eschews corporate formality in favour of trademark leather jackets and a focus on product. His leadership style is marked by humility, a willingness to bet on emerging ideas, and what he describes as “urgent innovation” born of early near-failure. This disposition has been integral to Nvidia’s progress, especially as the company repeatedly “invented markets” and defined entirely new categories, such as accelerated computing and AI infrastructure.

By 2024, Nvidia became the world’s most valuable public company, with its GPUs foundational to gaming, scientific computing, and, critically, the rise of AI. Huang’s awards — from the IEEE Founder’s Medal to listing among Time Magazine’s 100 most influential — underscore his reputation as a technologist and strategic thinker. He is widely recognised for being able to establish technical direction well before it becomes market consensus, an approach reflected in the quote.

First-Principles Thinking: Theoretical Foundations

Huang’s endorsement of “first principles” echoes a method of problem-solving and innovation associated with thinkers as diverse as Aristotle, Isaac Newton, and, in the modern era, entrepreneurs and strategists such as Elon Musk. The essence of first-principles thinking is to break down complex systems to their most fundamental truths — concepts that cannot be deduced from anything simpler — and to reason forward from those axioms, unconstrained by traditional assumptions, analogies, or received wisdom.

  • Aristotle was the first to coin the term “first principles”, distinguishing knowledge derived from irreducible foundational truths from knowledge obtained through analogy or precedent.
  • René Descartes advocated for systematic doubt and logical rebuilding of knowledge from foundational elements.
  • Richard Feynman, the physicist, was famous for urging students to “understand from first principles”, encouraging deep understanding and avoidance of rote memorisation or mere pattern recognition.
  • Elon Musk is often cited as a contemporary example, applying first-principles thinking to industries as varied as automotive (Tesla), space (SpaceX), and energy. Musk has described the technique as “boiling things down to the most fundamental truths and then reasoning up from there,” directly influencing not just product architectures but also cost models and operational methods.

Application in Technology and AI

First-principles thinking is particularly powerful in periods of technological transition:

  • In computing, first principles were invoked by Carver Mead and Lynn Conway, who reimagined the semiconductor industry in the 1970s by establishing the foundational laws for microchip design, known as Mead-Conway methodology. This approach was cited by Huang as influential for predicting the physical limitations of transistor miniaturisation and motivating Nvidia’s focus on accelerated computing.
  • Clayton Christensen, cited by Huang as an influence, introduced the idea of disruptive innovation, arguing that market leaders must question incumbent logic and anticipate non-linear shifts in technology. His books on disruption and innovation strategy have shaped how leaders approach structural shifts and avoid the “innovator’s dilemma”.
  • The leap from von Neumann architectures to parallel, heterogenous, and ultimately AI-accelerated computing frameworks — as pioneered by Nvidia’s CUDA platform and deep learning libraries — was possible because leaders at Nvidia systematically revisited underlying assumptions about how computation should be structured for new workloads, rather than simply iterating on the status quo.
  • The AI revolution itself was catalysed by the “deep learning” paradigm, championed by Geoffrey Hinton, Yann LeCun, and Andrew Ng. Each demonstrated that previous architectures, which had reached plateaus, could be superseded by entirely new approaches, provided there was willingness to reinterpret the problem from mathematical and computational fundamentals.

Backstory of the Leading Theorists

The ecosystem that enabled Nvidia’s transformation is shaped by a series of foundational theorists:

  • Mead and Conway: Their 1979 textbook and methodologies codified the “first-principles” approach in chip design, allowing for the explosive growth of Silicon Valley’s fabless innovation model.
  • Gordon Moore: Moore’s Law, while originally an empirical observation, inspired decades of innovation, but its eventual slow-down prompted leaders such as Huang to look for new “first principles” to govern progress, beyond mere transistor scaling.
  • Clayton Christensen: His disruption theory is foundational in understanding why entire industries fail to see the next shift — and how those who challenge orthodoxy from first principles are able to “see around corners”.
  • Geoffrey Hinton, Yann LeCun, Andrew Ng: These pioneers directly enabled the deep learning revolution by returning to first principles on how learning — both human and artificial — could function at scale. Their work with neural networks, widely doubted after earlier “AI winters”, was vindicated with landmark results like AlexNet (2012), enabled by Nvidia GPUs.

Implications

Jensen Huang’s quote is neither idle philosophy nor abstract advice — it is a methodology proven repeatedly by his own journey and by the history of technology. It is a call to scrutinise assumptions, break complex structures to their most elemental truths, and reconstruct strategy consciously from the bedrock of what is not likely to change, but also to ask: on what foundation do these principles rest, and how will these foundations themselves evolve.

Organisations and individuals who internalise this approach are equipped not only to compete in current markets, but to invent new ones — to anticipate and shape the next paradigm, rather than reacting to it.

read more
Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

“What I think we have to do going forward…is figure out ways to remove some of the knowledge and to keep what I call this cognitive core. It’s this intelligent entity that is stripped from knowledge but contains the algorithms and contains the magic of intelligence and problem-solving and the strategies of it and all this stuff.” – Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Andrej Karpathy’s observation about the need to “strip away knowledge whilst retaining the cognitive core” represents one of the most penetrating insights into contemporary artificial intelligence development. Speaking on Dwarkesh Patel’s podcast in October 2025, Karpathy—formerly a leading figure at both OpenAI and Tesla’s Autopilot programme—articulated a fundamental tension at the heart of modern AI: the current generation of large language models have become prodigious memorisers, yet this very capability may be constraining their potential for genuine intelligence.

The Paradox of Pre-training

To comprehend Karpathy’s thesis, one must first understand the architecture of contemporary AI systems. Large language models are trained on vast corpora—often 15 trillion tokens or more—through a process called pre-training. During this phase, models learn to predict the next token in a sequence, effectively compressing the entire internet into their neural networks. Karpathy describes this compressed representation as only “0.07 bits per token” for a model like Llama 3 70B, highlighting the extraordinary degree of compression occurring.

This compression serves two distinct functions, which Karpathy carefully delineates. First, models accumulate factual knowledge—the content of Wikipedia articles, the specifics of historical events, the details of scientific papers. Second, and more crucially, they develop what Karpathy terms “algorithmic patterns”—the capacity for in-context learning, the ability to recognise and complete patterns, the fundamental mechanisms of reasoning itself.

The problem, as Karpathy sees it, is that contemporary models have become too adept at the former whilst the latter remains the true seat of intelligence. When a model can regurgitate passages verbatim or recite obscure facts, it demonstrates remarkable memory. But this same capability creates what he calls a “distraction”—the model becomes reliant on its hazy recollections of training data rather than developing robust reasoning algorithms that could operate independently of specific factual knowledge.

The Cognitive Core Concept

Karpathy’s proposed solution is to isolate and preserve what he terms the “cognitive core”—an intelligent entity stripped of encyclopaedic knowledge but retaining the fundamental algorithms of problem-solving, the strategies of thought, and what he describes as “the magic of intelligence.” This concept represents a profound shift in how we conceptualise artificial intelligence.

Consider the analogy to human cognition. Humans are remarkably poor memorisers compared to AI systems. Present a human with a random sequence of numbers, and they’ll struggle after seven or eight digits. Yet this apparent limitation forces humans to develop robust pattern-recognition capabilities and abstract reasoning skills. We’re compelled to “see the forest for the trees” precisely because we cannot memorise every individual tree.

Karpathy suggests that AI systems would benefit from similar constraints. A model with less memory but stronger reasoning capabilities would be forced to look up factual information whilst maintaining sophisticated algorithms for processing that information. Such a system would more closely resemble human intelligence—not in its limitations, but in the way those limitations drive the development of generalisable cognitive strategies.

The implications extend beyond mere technical architecture. Karpathy envisions cognitive cores as compact as one billion parameters—potentially even smaller—that could operate as genuine reasoning engines rather than glorified databases. These systems would “know that they don’t know” when confronted with factual questions, prompting them to retrieve information whilst applying sophisticated analysis. The result would be AI that thinks more than it remembers, that reasons rather than recites.

From Evolution to Engineering: The Path Not Taken

Karpathy’s perspective on AI development diverges sharply from what he calls the “Richard Sutton viewpoint”—the notion that we should build AI systems analogous to biological intelligence, learning from scratch through reinforcement learning in the manner of animals. Instead, Karpathy argues we’re building what he evocatively describes as “ghosts” or “spirit entities”—ethereal intelligences that emerge from imitating human-generated text rather than evolving through environmental interaction.

This distinction illuminates a crucial divergence in AI philosophy. Biological intelligence, as embodied in animals, emerges from evolution operating over millions of years, with vast amounts of capability “baked in” to neural circuitry. A zebra foal runs within minutes of birth not through reinforcement learning but through evolutionary encoding. Humans similarly arrive with substantial cognitive machinery pre-installed, with lifetime learning representing maturation and refinement rather than learning from first principles.

By contrast, contemporary AI systems learn through what Karpathy terms “crappy evolution”—pre-training on internet documents serves as a compressed, accelerated alternative to evolutionary optimisation. This process creates entities fundamentally different from biological intelligence, optimised for different tasks through different mechanisms. The current approach imitates the products of human intelligence (text, code, conversations) rather than replicating the developmental process that creates intelligence.

The Limits of Current Learning Paradigms

Karpathy’s critique extends to reinforcement learning, which he describes with characteristic bluntness as “terrible.” His concerns illuminate deep problems in how AI systems currently learn from experience. In reinforcement learning, a model generates hundreds of solution attempts, and those that arrive at correct answers have every intermediate step up-weighted, whilst failed attempts are down-weighted. Karpathy calls this “sucking supervision through a straw”—extracting minimal learning signal from vast amounts of computational work.

The fundamental issue is noise. When a solution works, not every step along the way was necessarily correct or optimal. The model may have taken wrong turns, pursued dead ends, or stumbled upon the answer despite flawed reasoning. Yet reinforcement learning broadcasts the final reward across the entire trajectory, reinforcing both good and bad reasoning indiscriminately. Karpathy notes that “you may have gone down the wrong alleys until you arrived at the right solution,” yet every mistaken step gets marked as something to do more of.

Humans, by contrast, engage in sophisticated post-hoc analysis. After solving a problem, we reflect on which approaches worked, which didn’t, and why. We don’t simply label an entire problem-solving session as “good” or “bad”—we dissect our reasoning, identify productive and unproductive strategies, and refine our approach. Current AI systems lack this reflective capacity entirely.

This limitation connects to broader questions about how AI systems might achieve continual learning—the ability to genuinely learn from ongoing experience rather than requiring massive retraining. Karpathy suggests that humans engage in a nightly “distillation phase” during sleep, processing the day’s experiences and integrating insights into long-term knowledge. AI systems have no equivalent mechanism. They simply restart from the same state each time, unable to evolve based on individual experiences.

Model Collapse and the Entropy Problem

A subtle but critical concern in Karpathy’s analysis is what he terms “model collapse”—the tendency of AI systems to produce outputs that occupy “a very tiny manifold of the possible space of thoughts.” Ask ChatGPT to tell a joke repeatedly, and you’ll receive the same three jokes. Request reflection on a topic multiple times, and you’ll observe striking similarity across responses. The models are “silently collapsed,” lacking the entropy and diversity that characterises human thought.

This phenomenon creates profound challenges for synthetic data generation, a technique labs use to create additional training material. If models generate training data for themselves or subsequent models, this collapsed distribution gradually dominates the training corpus. Training on one’s own outputs creates a dangerous feedback loop—each generation becomes less diverse, more stereotyped, more “collapsed” than the last. Karpathy suggests this may not even be a solvable problem, noting that humans similarly “collapse over time,” becoming more rigid and less creative as they age, revisiting the same thoughts and patterns with decreasing learning rates.

The contrast with children is illuminating. Young minds, not yet “overfitted” to the world, produce shocking, creative, unexpected responses precisely because they haven’t collapsed into standard patterns of thought. This freshness, this maintenance of high entropy in cognitive processes, may be essential to genuine intelligence. Yet our current training paradigms actively work against it, rewarding convergence towards common patterns and penalising deviation.

The Decade of Agents: Why Progress Takes Time

When Karpathy states this will be “the decade of agents” rather than “the year of agents,” he draws on hard-won experience from five years leading Tesla’s Autopilot programme. His insights into why artificial intelligence deployment takes far longer than demonstrations suggest carry particular weight given this background.

The central concept is what Karpathy calls “the march of nines.” Getting something to work 90% of the time—the level typically showcased in demonstrations—represents merely the first nine in “99.9%.” Each additional nine requires equivalent effort. During his tenure at Tesla, the team progressed through perhaps two or three nines over five years. More crucially, numerous nines remain before self-driving cars achieve true autonomy at scale.

This pattern isn’t unique to autonomous vehicles. Karpathy argues it applies across safety-critical domains, including software engineering. When code errors can leak millions of Social Security numbers or create critical security vulnerabilities, the cost of failure becomes prohibitively high. The demo-to-product gap widens dramatically. What works impressively in controlled conditions fails in countless edge cases when confronting reality’s full complexity.

Waymo’s experience illustrates this challenge. Despite providing “perfect drives” as early as 2014, the company still operates limited deployments requiring elaborate teleoperation infrastructure and supervision. Humans haven’t been removed; they’ve been rendered invisible, beaming in remotely to handle edge cases. The technology lives in a “pulled-back future”—functional but not yet economical, capable but not yet scalable.

Contemporary AI agents face analogous challenges. Whilst Claude and GPT-5 Pro demonstrate remarkable capabilities, they remain what Karpathy characterises as “elementary school students”—savants with perfect memory but lacking robust reasoning across all necessary dimensions. They’re “cognitively deficient” in ways users intuitively recognise even if they can’t articulate precisely what’s missing.

The Software Engineering Puzzle

Perhaps no domain better illustrates the puzzling contours of current AI capabilities than software engineering. Karpathy notes, somewhat ruefully, that whilst these systems were meant to enable “any economically valuable task,” API revenue remains “dominated by coding.” This supposedly general intelligence overwhelmingly excels at one specific domain.

This concentration isn’t accidental. Code enjoys unique properties that make it ideal for current AI architectures. Software development has always operated through text—terminals, editors, version control systems all manipulate textual representations. LLMs, trained on internet text, encounter code as a native format. Moreover, decades of infrastructure exist for handling code textually: diff tools for showing changes, IDEs for navigation, testing frameworks for verification.

Contrast this with domains lacking such infrastructure. Creating presentations involves spatial arrangement and visual design—there’s no “diff” for slides that elegantly shows modifications. Many knowledge work tasks involve physical documents, in-person interactions, or tacit knowledge that resists textual representation. These domains haven’t been pre-optimised for AI interaction in the way software development has.

Yet even in coding, Karpathy remains sceptical of current capabilities for cutting-edge work. When building nanoChat, a repository implementing a complete ChatGPT clone in simplified form, he found AI tools valuable for autocomplete and handling familiar patterns but inadequate for novel architectural decisions. The models kept trying to impose standard approaches when he deliberately chose non-standard implementations. They couldn’t comprehend his custom solutions, constantly suggesting deprecated APIs and bloating code with unnecessary defensive programming.

This points to a deeper truth: current models excel at reproducing common patterns from their training data but struggle with code “that has never been written before”—precisely the domain of frontier AI research itself. The recursive self-improvement that some forecast, where AI systems rapidly enhance their own capabilities, founders on this limitation. Models can accelerate work within established paradigms but cannot yet pioneer truly novel approaches.

The Trajectory of Intelligence Explosion

Karpathy’s perspective on potential intelligence explosions diverges sharply from both pessimistic and optimistic extremes. He sees AI not as a discrete, alien technology but as a continuation of computing’s evolution—part of an ongoing automation trend stretching back through compilers, high-level programming languages, and computer-aided design tools. From this view, the “intelligence explosion” has already been occurring for decades, visible in the exponential GDP growth curve that represents accumulated automation across countless domains.

This framing leads to counterintuitive predictions. Rather than expecting AI to suddenly accelerate economic growth from 2% annually to 20%, Karpathy suggests it will enable continued progress along the existing exponential trajectory. Just as computers, the internet, and mobile phones transformed society without producing visible discontinuities in aggregate growth statistics, AI will diffuse gradually across industries, maintaining rather than disrupting established growth patterns.

This gradualism doesn’t imply insignificance. The compounding effects of sustained exponential growth produce extraordinary transformation over time. But it does suggest that simple extrapolations from impressive demonstrations to imminent superintelligence misunderstand how technology integrates into society. There will be no discrete moment when “AGI” arrives and everything changes. Instead, we’ll experience continuous advancement in capabilities, continuous expansion of automation, and continuous adaptation of society to new technological possibilities.

The analogy to the Industrial Revolution proves instructive. That transformation didn’t occur through a single breakthrough but through cascading improvements across multiple technologies and practices, gradually shifting society from 0.2% annual growth to 2%. Similarly, AI’s impact will emerge through countless incremental deployments, each automating specific tasks, enabling new workflows, and creating feedback loops that accelerate subsequent progress.

The Human Element: Education in an AI Future

Karpathy’s work on Eureka, his educational initiative, reveals his deepest concerns about AI’s trajectory. He fears not that AI will fail but that “humanity gets disempowered by it,” relegated to the sidelines like the portly, passive citizens of WALL-E. His solution lies in radically reimagining education around the principle that “pre-AGI education is useful; post-AGI education is fun.”

The analogy to fitness culture illuminates this vision. Nobody needs physical strength to manipulate heavy objects—we have machines for that. Yet gyms proliferate because exercise serves intrinsic human needs: health, aesthetics, the satisfaction of physical mastery. Similarly, even in a world where AI handles most cognitive labour, humans will pursue learning for its inherent rewards: the pleasure of understanding, the status of expertise, the deep satisfaction of mental cultivation.

But achieving this vision requires solving a technical problem: making learning genuinely easy and rewarding. Currently, most people abandon learning because they encounter material that’s too difficult or too trivial, bouncing between frustration and boredom. Karpathy describes the experience of working with an expert language tutor who maintained a perfect calibration—always presenting challenges at the edge of current capability, never boring, never overwhelming. This created a state where “I was the only constraint to learning,” with knowledge delivery perfectly optimised.

Replicating this experience at scale represents what Karpathy sees as education’s great technical challenge. Current AI tutors, despite their sophistication, remain far from this standard. They can answer questions but cannot probe understanding, identify gaps, or sequence material to create optimal learning trajectories. The capability exists in exceptional human tutors; the challenge lies in encoding it algorithmically.

Yet Karpathy sees this challenge as tractable. Just as AI has transformed coding through autocomplete and code generation, it will eventually transform education through personalised, responsive tutoring. When learning becomes “trivial”—not in the sense of requiring no effort but in the sense of encountering no artificial obstacles—humans will pursue it enthusiastically. Not everyone will become an expert in everything, but the ceiling on human capability will rise dramatically as the floor on accessibility descends.

The Physics of Understanding: Karpathy’s Pedagogical Philosophy

Karpathy’s approach to teaching reveals principles applicable far beyond AI. His background in physics instilled what he describes as finding “first-order terms”—identifying the essential, dominant factors in any system whilst recognising that second and third-order effects exist but matter less. This habit of abstraction, of seeing spherical cows where others see only messy complexity, enables the creation of minimal, illustrative examples that capture phenomena’s essence.

MicroGrad exemplifies this approach perfectly. In 100 lines of Python, Karpathy implements backpropagation—the fundamental algorithm underlying all neural network training. Everything else in modern deep learning frameworks, he notes, is “just efficiency”—optimisations for speed, memory management, numerical stability. But the intellectual core, the actual mechanism by which networks learn, fits in 100 comprehensible lines. This distillation makes the previously arcane accessible.

The broader principle involves “untangling knowledge”—reorganising understanding so each concept depends only on what precedes it. This creates “ramps to knowledge” where learners never encounter gaps or leaps that would require them to take claims on faith. The famous transformer tutorial embodies this, beginning with a simple bigram model (literally a lookup table) and progressively adding components, each motivated by solving a specific limitation of what came before.

This approach contrasts sharply with the standard academic practice of presenting solutions before establishing problems, or introducing abstractions before concrete examples. Karpathy sees such approaches as, in his words, “a dick move”—they rob learners of the opportunity to grapple with challenges themselves, to develop intuition about what solutions might work, and to appreciate why particular approaches succeed where alternatives fail. The pedagogical crime isn’t challenging students; it’s presenting answers without first establishing questions.

Leading Theorists: The Intellectual Lineage

Richard Sutton and the Bitter Lesson

Richard Sutton, a pioneering reinforcement learning researcher, articulated what became known as “the bitter lesson”—the observation that simple, scalable methods leveraging computation consistently outperform approaches incorporating human knowledge or structural assumptions. His perspective suggests that the path to artificial general intelligence lies through learning algorithms powerful enough to discover structure from experience, much as evolution discovered biological intelligence.

Sutton’s famous assertion that “if you got to the squirrel, you’d be most of the way to AGI” reflects this view. Animal intelligence, in his framework, represents the core achievement—the fundamental learning algorithms that enable organisms to navigate environments, solve problems, and adapt to challenges. Human language and culture, whilst impressive, represent relatively minor additions to this foundation.

Karpathy respectfully dissents. His “we’re building ghosts, not animals” formulation captures the divergence: current AI systems don’t replicate the learning processes that create biological intelligence. They imitate the products of human intelligence (text, code, reasoning traces) rather than replicating its developmental origins. This distinction matters profoundly for predicting AI’s trajectory and understanding its capabilities and limitations.

Geoffrey Hinton and the Neural Network Renaissance

Geoffrey Hinton, often termed the “godfather of AI,” pioneered the neural network approaches that underpin contemporary systems. His persistence through decades when neural networks were unfashionable, his development of backpropagation techniques, and his later work on capsule networks and other architectures established the foundation for today’s large language models.

Karpathy studied directly under Hinton at the University of Toronto, experiencing firsthand the intellectual ferment as deep learning began its ascent to dominance. Hinton’s influence appears throughout Karpathy’s thinking—the emphasis on learning from data rather than hand-crafted rules, the focus on representation learning, the conviction that scale and simplicity often trump elaborate architectural innovations.

Yet Karpathy’s view extends beyond his mentor’s. Where Hinton focused primarily on perception (particularly computer vision), Karpathy grapples with the full scope of intelligence—reasoning, planning, continual learning, multi-agent interaction. His work synthesises Hinton’s foundational insights with broader questions about cognitive architecture and the nature of understanding itself.

Yann LeCun and Convolutional Networks

Yann LeCun’s development of convolutional neural networks in 1989 represented the first successful application of gradient descent to real-world pattern recognition. His work on handwritten digit recognition established core principles: the power of hierarchical feature learning, the importance of translation invariance, the value of specialised architectures for specific domains.

Karpathy’s reconstruction of LeCun’s 1989 network, time-travelling 33 years of algorithmic improvements, reveals his appreciation for this lineage. He found that pure algorithmic advances—modern optimisers, better architectures, regularisation techniques—could halve error rates. But achieving further gains required more data and more computation. This trinity—algorithms, data, compute—advances in lockstep, with no single factor dominating.

This lesson shapes Karpathy’s predictions about AI’s future. He expects continued progress across all three dimensions, with the next decade bringing better algorithms, vaster datasets, more powerful hardware, and more efficient software. But no breakthrough in any single dimension will produce discontinuous acceleration. Progress emerges from the intersection of many incremental improvements.

The Broader Intellectual Context

The debate Karpathy engages extends beyond specific individuals to fundamental questions about intelligence itself. Does intelligence arise primarily from general learning algorithms (the Sutton view) or from accumulated structure and innate mechanisms (the evolutionary perspective)? Can we build intelligence by imitating its products (the current LLM approach) or must we replicate its developmental processes? Will artificial intelligence remain fundamentally tool-like, augmenting human capability, or evolve into genuinely autonomous agents pursuing their own goals?

These questions connect to century-old debates in psychology and cognitive science between behaviourists emphasising learning and nativists emphasising innate structure. They echo discussions in evolutionary biology about the relative roles of genetic determination and developmental plasticity. They parallel arguments in philosophy of mind about whether intelligence requires embodiment or can exist as pure information processing.

Karpathy’s position threads between extremes. He acknowledges both the power of learning from data and the necessity of architectural structure. He recognises both the distinctiveness of AI systems and their illuminating analogies to biological intelligence. He balances optimism about AI’s potential with realism about current limitations and the difficulty of translating demonstrations into robust, deployed systems.

The Cognitive Core in Context: A New Paradigm for Intelligence

The concept of a cognitive core stripped of factual knowledge represents more than a technical proposal—it’s a reconceptualisation of what intelligence fundamentally is. Rather than viewing intelligence as encompassing both reasoning algorithms and accumulated knowledge, Karpathy proposes treating these as separate, with reasoning capability as the essence and factual knowledge as external resources to be accessed rather than internalised.

This separation mirrors certain aspects of human cognition whilst diverging in others. Humans do maintain a distinction between knowing how to think and knowing specific facts—we can reason about novel situations without direct experience, applying general problem-solving strategies learned in one domain to challenges in another. Yet our factual knowledge isn’t purely external; it shapes the very structure of our reasoning, creating rich semantic networks that enable rapid, intuitive judgement.

The proposal to strip AI systems down to cognitive cores involves accepting tradeoffs. Such systems would need to perform external lookups for factual information, introducing latency and dependency on knowledge bases. They would lack the pattern-matching capabilities that arise from vast memorisation, potentially missing connections between superficially unrelated domains. They might struggle with tasks requiring seamless integration of many small facts, where lookup costs would dominate processing time.

Yet the gains could prove transformative. A genuine cognitive core—compact, efficient, focused on algorithmic reasoning rather than fact retrieval—could operate in settings where current models fail. Edge deployment becomes feasible when models don’t require storing terabytes of parameters. Personalisation becomes practical when core reasoning engines can be fine-tuned or adapted without retraining on entire knowledge corpora. Interpretability improves when reasoning processes aren’t obscured by retrieval of memorised patterns.

Most profoundly, genuine cognitive cores might avoid the collapse and loss of entropy that plagues current models. Freed from the burden of maintaining consistency with vast memorised datasets, such systems could explore more diverse solution spaces, generate more varied outputs, and maintain the creative flexibility that characterises human cognition at its best.

Implications for the Decade Ahead

Karpathy’s decade-long timeline for agentic AI reflects hard-earned wisdom about technology deployment. His experience with autonomous vehicles taught him that impressive demonstrations represent merely the beginning of a long productisation journey. Each additional “nine” of reliability—moving from 90% to 99% to 99.9% accuracy—requires comparable effort. Safety-critical domains demand many nines before deployment becomes acceptable.

This reality shapes expectations for AI’s economic impact. Rather than sudden disruption, we’ll witness gradual diffusion across domains with varying characteristics. Tasks that are repetitive, well-defined, purely digital, and allowing high error rates will automate first. Call centre work exemplifies this profile—short interaction horizons, clear success criteria, limited context requirements, tolerance for occasional failures that human supervisors can catch.

More complex knowledge work will resist automation longer. Radiologists, consultants, accountants—professionals whose work involves lengthy timescales, subtle judgements, extensive context, and high costs of error—will see AI augmentation before replacement. The pattern will resemble Waymo’s current state: AI handling routine cases whilst humans supervise, intervene in edge cases, and maintain ultimate responsibility.

This graduated deployment creates an “autonomy slider”—a continuous spectrum from pure human operation through various degrees of AI assistance to eventual full automation. Most jobs won’t flip discretely from human to machine. Instead, they’ll slide along this spectrum as AI capabilities improve and organisations develop confidence in delegation. This process will unfold over years or decades, not months.

The economic implications differ from both optimistic and pessimistic extremes. We won’t see overnight mass unemployment—the gradual nature of deployment, the persistence of edge cases requiring human judgement, and society’s adaptation through creating new roles all mitigate disruption. But neither will we see disappointing underutilisation—the compound effect of many small automations across countless tasks will produce genuine transformation.

The Path Forward: Research Priorities

Karpathy’s analysis suggests several critical research directions for developing robust, capable AI systems. First, developing methods to isolate cognitive cores from memorised knowledge whilst maintaining reasoning capability. This might involve novel training objectives that penalise rote memorisation whilst rewarding generalisation, or architectural innovations that separate knowledge storage from reasoning mechanisms.

Second, creating effective continual learning systems that can distil experience into lasting improvements without catastrophic forgetting or model collapse. This requires moving beyond simple fine-tuning toward something more akin to the reflection and consolidation humans perform during sleep—identifying patterns in experience, extracting lessons, and integrating insights whilst maintaining diversity.

Third, advancing beyond current reinforcement learning to richer forms of learning from experience. Rather than broadcasting sparse reward signals across entire trajectories, systems need sophisticated credit assignment that identifies which reasoning steps contributed to success and which didn’t. This might involve explicit review processes where models analyse their own problem-solving attempts, or meta-learning approaches that learn how to learn from experience.

Fourth, developing multi-agent systems with genuine culture—shared knowledge bases that agents collectively maintain and evolve, self-play mechanisms that drive capability improvement through competition, and organisational structures that enable collaboration without centralized control. Current systems remain fundamentally solitary; genuine agent economies will require breakthroughs in coordination and communication.

Fifth, and perhaps most ambitiously, maintaining entropy in AI systems—preventing the collapse toward stereotyped outputs that currently plagues even frontier models. This might involve explicit diversity penalties, adversarial training to prevent convergence, or inspiration from biological systems that maintain variation through mechanisms like mutation and recombination.

Conclusion: Intelligence as Engineering Challenge

Andrej Karpathy’s vision of the cognitive core represents a mature perspective on artificial intelligence—neither breathlessly optimistic about imminent superintelligence nor dismissively pessimistic about current limitations. He sees AI as an engineering challenge rather than a mystical threshold, requiring patient work across multiple dimensions rather than awaiting a single breakthrough.

This perspective derives from direct experience with the messy reality of deploying AI systems at scale. Self-driving cars that work perfectly in demonstrations still require years of refinement before handling edge cases reliably. Coding agents that generate impressive solutions for common problems still struggle with novel architectural challenges. Educational AI that answers questions adequately still falls far short of expert human tutors’ adaptive responsiveness.

Yet within these limitations lies genuine progress. Models continue improving along multiple dimensions simultaneously. Infrastructure for deploying and managing AI systems grows more sophisticated. Understanding of these systems’ capabilities and constraints becomes more nuanced. The path forward is visible, even if it stretches further than optimists anticipated.

The concept of stripping knowledge to reveal the cognitive core captures this mature vision perfectly. Rather than pursuing ever-larger models memorising ever-more data, we might achieve more capable intelligence through subtraction—removing the crutch of memorisation to force development of robust reasoning algorithms. Like humans compelled to abstract and generalise because we cannot remember everything, AI systems might benefit from similar constraints.

This vision offers hope not for sudden transformation but for steady progress—the kind that compounds over decades into revolutionary change. It suggests that the hard technical problems of intelligence remain tractable whilst acknowledging their genuine difficulty. Most importantly, it positions humans not as passive observers of AI’s ascent but as active participants in shaping its development and ensuring its integration enhances rather than diminishes human flourishing.

The decade ahead will test these ideas. We’ll discover whether cognitive cores can be effectively isolated, whether continual learning mechanisms can be made robust, whether the demo-to-product gap can be bridged across diverse domains. The answers will shape not just the trajectory of AI technology but the future of human society in an increasingly automated world. Karpathy’s contribution lies in framing these questions with clarity, drawing on hard-won experience to guide expectations, and reminding us that the most profound challenges often yield to patient, disciplined engineering rather than waiting for miraculous breakthroughs.

read more
Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Quote: Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

“I feel like the [ AI ] problems are tractable, they’re surmountable, but they’re still difficult. If I just average it out, it just feels like a decade [ to AGI ] to me.” – Andrej Karpathy – Ex-OpenAI, Ex-Tesla AI

Andrej Karpathy’s reflection—“I feel like the [ AI ] problems are tractable, they’re surmountable, but they’re still difficult. If I just average it out, it just feels like a decade [ to AGI ] to me.”—encapsulates both a grounded optimism and a caution honed through years at the forefront of artificial intelligence research. Understanding this statement requires context about the speaker, the evolution of the field, and the intellectual landscape that shapes contemporary thinking on artificial general intelligence (AGI).

Andrej Karpathy: Technical Leadership and Shaping AI’s Trajectory

Karpathy is recognised as one of the most influential figures in modern AI. With a doctorate under Geoffrey Hinton, the so-called “godfather” of deep learning, Karpathy’s early career put him at the confluence of academic breakthroughs and industrial deployment. At Stanford, he helped launch the seminal CS231n course, which became a training ground for a generation of practitioners. He subsequently led critical efforts at OpenAI and Tesla, where he served as Director of AI, architecting large-scale deep learning systems for both language and autonomous driving.

From the earliest days of deep learning, Karpathy has witnessed—and helped drive—several “seismic shifts” that have periodically redefined the field. He recalls, for example, the transition from neural networks being considered a niche topic to their explosive relevance with the advent of AlexNet. At OpenAI, he observed the limitations of reinforcement learning when applied too soon to general agent-building and became an early proponent of focusing on practical, useful systems rather than chasing abstract analogies with biological evolution.

Karpathy’s approach is self-consciously pragmatic. He discounts analogies between AI and animal evolution, preferring to frame current efforts as “summoning ghosts,” i.e., building digital entities trained by imitation, not evolved intelligence. His career has taught him to discount industry hype cycles and focus on the “march of nines”—the painstaking work required to close the gap between impressive demos and robust, trustworthy products. This stance runs through his entire philosophy on AI progress.

Context for the Quote: Realism amidst Exponential Hype

The statement about AGI’s timeline emerges from Karpathy’s nuanced position between the extremes of utopian accelerationism and excessive scepticism. Against a backdrop of industry figures claiming near-term transformative breakthroughs, Karpathy advocates for a middle path: current models represent significant progress, but numerous “cognitive deficits” persist. Key limitations include the lack of robust continual learning, difficulties generalising out-of-distribution, and the absence of key memory and reasoning capabilities seen in human intelligence.

Karpathy classifies present-day AI systems as “competent, but not yet capable agents”—useful in narrow domains, such as code generation, but unable to function autonomously in open-ended, real-world contexts. He highlights how models exhibit an uncanny ability to memorise, yet often lack the generalisation skills required for truly adaptive behaviour; they’re powerful, but brittle. The hard problems left are not insurmountable, but solving them—including integrating richer memory, developing agency, and building reliable, context-sensitive learning—will take sustained, multi-year effort.

AGI and the Broader Field: Dialogue with Leading Theorists

Karpathy’s thinking exists in dialogue with several foundational theorists:

  • Geoffrey Hinton: Pioneered deep learning and neural network approaches that underlie all current large-scale AI. His early conviction in neural networks, once seen as fringe, is now mainstream, but Hinton remains open to new architectural breakthroughs.

  • Richard Sutton: A major proponent of reinforcement learning as a route to general intelligence. Sutton’s vision focuses on “building animals”—systems capable of learning from scratch via trial and error in complex environments—whereas Karpathy now sees this as less immediately relevant than imitation-based, practically grounded approaches.

  • Yann LeCun: Another deep learning pioneer, LeCun has championed the continuous push toward self-supervised learning and innovations within model architecture.

  • The Scaling Optimists: The school of thought, including some in the OpenAI and DeepMind circles, who argue that simply increasing model size and data, within current paradigms, will inexorably deliver AGI. Karpathy explicitly distances himself from this view, arguing for the necessity of algorithmic innovation and socio-technical integration.

Karpathy sees the arc of AI progress as analogous to general trends in automation and computing: exponential in aggregate, but marked by periods of over-prediction, gradual integration, and non-linear deployment. He draws lessons from the slow maturation of self-driving cars—a field he led at Tesla—where early demos quickly gave way to years of incremental improvement, ironing out “the last nines” to reach real-world reliability.

He also foregrounds the human side of the equation: as AI’s technical capability increases, the question becomes as much about organisational integration, legal and social adaptation, as it does about raw model performance.

In Summary: Surmountable Yet Difficult

Karpathy’s “decade to AGI” estimate is anchored in a sober appreciation of both technical tractability and practical difficulty. He is neither pessimistic nor a hype-driven optimist. Instead, he projects that AGI—defined as machines able to deliver the full spectrum of knowledge work at human levels—will require another decade of systematic progress spanning model architecture, algorithmic innovation, memory, continual learning, and above all, integration with the complex realities of the real world.

His perspective stands out for its blend of technical rigour, historical awareness, and humility in the face of both engineering constraints and the unpredictability of broader socio-technical systems. In this, Karpathy situates himself in conversation with a lineage of thinkers who have repeatedly recalibrated the AI field’s ambitions—and whose own varied predictions continue to shape the ongoing march toward general intelligence.

read more
Quote: David Solomon – Goldman Sachs CEO

Quote: David Solomon – Goldman Sachs CEO

“If the firm grows and you expand and you can invest in other areas for growth, we’ll wind up with more jobs… we have at every step along the journey for the last forty years as technology has made us more productive. I don’t think it’s different this time [with AI].” – David Solomon – Goldman Sachs CEO

David Michael Solomon, born in 1962 in Hartsdale, New York, is an American investment banker and DJ, currently serving as the CEO and Chairman of Goldman Sachs. His journey into the financial sector began after he graduated with a BA in political science from Hamilton College. Initially, Solomon worked at Irving Trust Company and Drexel Burnham before joining Bear Stearns. In 1999, he moved to Goldman Sachs as a partner and became co-head of the High Yield and Leveraged Loan Business.

Solomon’s rise within Goldman Sachs was swift and strategic. He became the co-head of the Investment Banking Division in 2006 and held this role for a decade. In 2017, he was appointed President and Chief Operating Officer, and by October 2018, he succeeded Lloyd Blankfein as CEO. He became Chairman in January 2019.

Beyond his financial career, Solomon is known for his passion for music, producing electronic dance music under the alias “DJ D-Sol”. He has performed at various venues, including nightclubs and music festivals in New York, Miami, and The Bahamas.

Context of the Quote

The quote highlights Solomon’s perspective on technology and job creation in the financial sector. He suggests that while technology, particularly AI, can enhance productivity and potentially lead to job reductions in certain areas, the overall growth of the firm will create more opportunities for employment. This view is rooted in his experience observing how technological advancements have historically led to increased productivity and growth for Goldman Sachs.

Leading Theorists on AI and Employment

Several leading theorists have explored the impact of AI on employment, with divergent views:

  • Joseph Schumpeter is famous for his theory of “creative destruction,” which suggests that technological innovations often lead to the destruction of existing jobs but also create new ones. This cycle is seen as essential for economic growth and innovation.

  • Klaus Schwab, founder of the World Economic Forum, has discussed the Fourth Industrial Revolution, emphasizing how AI and automation will transform industries. However, he also highlights the potential for new job creation in emerging sectors.

  • Economists Erik Brynjolfsson and Andrew McAfee have written extensively on how technology can lead to both job displacement and creation. They argue that while AI may reduce certain types of jobs, it also fosters economic growth and new opportunities.

These theorists provide a backdrop for understanding Solomon’s optimistic view on AI’s impact on employment, focusing on the potential for growth and innovation to offset job losses.

Conclusion

David Solomon’s quote encapsulates his optimism about the interplay between technology and job creation. Focusing on the strategic growth of Goldman Sachs, he believes that technological advancements will enhance productivity and create opportunities for expansion, ultimately leading to more employment opportunities. This perspective aligns with broader discussions among economists and theorists on the transformative role of AI in the workplace.

read more
Quote: David Solomon – Goldman Sachs CEO

Quote: David Solomon – Goldman Sachs CEO

“Markets run in cycles, and whenever we’ve historically had a significant acceleration in a new technology that creates a lot of capital formation and therefore lots of interesting new companies around it, you generally see the market run ahead of the potential. Are there going to be winners and losers? There are going to be winners and losers.” – David Solomon – Goldman Sachs CEO

The quote, “Markets run in cycles, and whenever we’ve historically had a significant acceleration in a new technology that creates a lot of capital formation and therefore lots of interesting new companies around it, you generally see the market run ahead of the potential. Are there going to be winners and losers? There are going to be winners and losers,” comes from a public discussion with David Solomon, CEO of Goldman Sachs, during Italian Tech Week in October 2025. This statement was made in the context of a wide-ranging interview that addressed the state of the US and global economy, the impact of fiscal stimulus and technology infrastructure spending, and, critically, the current investment climate surrounding artificial intelligence (AI) and other emergent technologies.

Solomon’s comments were prompted by questions around the record-breaking rallies in US and global equity markets and specifically the extraordinary market capitalisations reached by leading tech firms. He highlighted the familiar historical pattern: periods of market exuberance often occur when new technologies spur rapid capital formation, leading to the emergence of numerous new companies around a transformative theme. Solomon drew parallels with the Dot-com boom to underscore the cyclical nature of markets and to remind investors that dramatic phases of growth inevitably produce both outsized winners and significant casualties.

His insight reflects a seasoned banker’s view, grounded in empirical observation: while technological waves can drive periods of remarkable wealth creation and productivity gains, they also tend to attract speculative excesses. Market valuations in these periods often disconnect from underlying fundamentals, setting the stage for later corrections. The resulting market shake-outs separate enduring companies from those that fail to deliver sustainable value.

About David Solomon

David M. Solomon is one of the most prominent figures in global finance, serving as the CEO and Chairman of Goldman Sachs since 2018. Raised in New York and a graduate of Hamilton College, Solomon has built his reputation over four decades in banking—rising through leadership positions at Irving Trust, Drexel Burnham, and Bear Stearns before joining Goldman Sachs in 1999 as a partner. He subsequently became global head of the Financing Group, then co-head of the Investment Banking Division, playing a central role in shaping the firm’s capital markets strategy.

Solomon is known for his advocacy of organisational modernisation and culture change at Goldman Sachs—prioritising employee well-being, increasing agility, and investing heavily in technology. He combines traditional deal-making acumen with an openness to digital transformation. Beyond banking, Solomon has a notable side-career as a DJ under the name DJ D-Sol, performing electronic dance music at high-profile venues.

Solomon’s career reflects both the conservatism and innovative ambition associated with modern Wall Street leadership: an ability to see risk cycles clearly, and a willingness to pivot business models to suit shifts in technological and regulatory environments. His net worth in 2025 is estimated between $85 million and $200 million, owing to decades of compensation, equity, and investment performance.

Theoretical Foundations: Cycles, Disruptive Innovation, and Market Dynamics

Solomon’s perspective draws implicitly on a lineage of economic theory and market analysis concerning cycles of innovation, capital formation, and asset bubbles. Leading theorists and their contributions include:

  • Joseph Schumpeter: Schumpeter’s theory of creative destruction posited that economic progress is driven by cycles of innovation, where new technologies disrupt existing industries, create new market leaders, and ultimately cause the obsolescence or failure of firms unable to adapt. Schumpeter emphasised how innovation clusters drive periods of rapid growth, investment surges, and, frequently, speculative excess.

  • Carlota Perez: In Technological Revolutions and Financial Capital (2002), Perez advanced a model of techno-economic paradigms, proposing that every major technological revolution (e.g., steam, electricity, information technology) proceeds through phases: an initial installation period—characterised by exuberant capital inflows, speculation, and bubble formation—followed by a recessionary correction, and, eventually, a deployment period, where productive uses of the technology diffuse more broadly, generating deep-seated economic gains and societal transformation. Perez’s work helps contextualise Solomon’s caution about markets running ahead of potential.

  • Charles Kindleberger and Hyman Minsky: Both scholars examined the dynamics of financial bubbles. Kindleberger, in Manias, Panics, and Crashes, and Minsky, through his Financial Instability Hypothesis, described how debt-fuelled euphoria and positive feedback loops of speculation can drive financial markets to overshoot the intrinsic value created by innovation, inevitably resulting in busts.

  • Clayton Christensen: Christensen’s concept of disruptive innovation explains how emergent technologies, initially undervalued by incumbents, can rapidly upend entire industries—creating new winners while displacing former market leaders. His framework helps clarify Solomon’s points about the unpredictability of which companies will ultimately capture value in the current AI wave.

  • Benoit Mandelbrot: Applying his fractal and complexity theory to financial markets, Mandelbrot challenged the notion of equilibrium and randomness in price movement, demonstrating that markets are prone to extreme events—outlier outcomes that, while improbable under standard models, are a recurrent feature of cyclical booms and busts.

Practical Relevance in Today’s Environment

The patterns stressed by Solomon, and their theoretical antecedents, are especially resonant given the current environment: massive capital allocations into AI, cloud infrastructure, and adjacent technologies—a context reminiscent of previous eras where transformative innovations led markets both to moments of extraordinary wealth creation and subsequent corrections. These cycles remain a central lens for investors and business leaders navigating this era of technological acceleration.

By referencing both history and the future, Solomon encapsulates the balance between optimism over the potential of new technology and clear-eyed vigilance about the risks endemic to all periods of market exuberance.

read more

Download brochure

Introduction brochure

What we do, case studies and profiles of some of our amazing team.

Download

Our latest podcasts on Spotify

Sign up for our newsletters - free

Global Advisors | Quantified Strategy Consulting