Select Page

Global Advisors | Quantified Strategy Consulting

Artificial Intelligence
Quote: Jamie Dimon – JP Morgan Chase CEO

Quote: Jamie Dimon – JP Morgan Chase CEO

“I think the harder thing to measure has always been tech projects. That’s been true my whole life. It’s also been true my whole life, the tech is what changes everything, like everything.” – Jamie Dimon – JP Morgan Chase CEO

Jamie Dimon’s candid observation captures a fundamental tension at the heart of modern business strategy: the profound impact of technology juxtaposed against the persistent challenge of measuring its value. Delivered during JPMorgan Chase’s 2026 Investor Day on 24 February, this remark came amid revelations of the bank’s unprecedented $19.8 billion technology budget – a 10% increase from 2025, with significant allocations to artificial intelligence (AI) projects.1,2,4 As CEO of the world’s largest bank by market capitalisation, Dimon’s perspective is shaped by decades of navigating technological shifts, from the rise of digital banking to the current AI boom.

Jamie Dimon’s Career and Leadership at JPMorgan Chase

Born in 1956 in New York City to Greek immigrant parents, Jamie Dimon began his career in finance at American Express in the 1980s, rising rapidly under the mentorship of Sandy Weill. He co-led the merger that created Citigroup in 1998 but parted ways acrimoniously in 2000. Dimon then transformed Bank One from near-collapse into a powerhouse, earning a reputation as a crisis manager. In 2004, he became CEO of JPMorgan Chase following its acquisition of Bank One, a role he has held for over two decades.3

Under Dimon’s stewardship, JPMorgan has become a technology leader in banking. The firm employs over 300,000 people, with tens of thousands in tech roles, and invests billions annually in innovation. Dimon has long championed tech as a competitive moat, famously urging investors to ‘trust him’ on spending despite vague ROI metrics. In 2026, this commitment manifests in a tech budget swelled by $2 billion, driven by AI for customer service, personalised insights, and developer tools, amid rising hardware costs from AI chip demand.1,5 Dimon predicts JPMorgan will be a ‘winner’ in the AI race, leveraging its data assets and No. 1 ranking in AI maturity among banks.1,3

Context of the Quote: JPMorgan’s 2026 Strategic Framework

The quote emerged in a Q&A at the 24 February 2026 event, responding to analyst pressure on tech ROI. CFO Jeremy Barnum highlighted technology as a major expense driver, up $9 billion overall, with $1.2 billion in investments including AI. Dimon acknowledged time savings from tech as ‘too vague’ to measure precisely, echoing lifelong observations from mainframes to cloud computing.1,2 This aligns with broader warnings: AI will revolutionise operations but displace jobs, necessitating societal preparation like retraining and phased adoption to avoid shocks, such as mass unemployment from autonomous trucks.4

JPMorgan is aggressively deploying AI – its large language model serves 150,000 users weekly – while planning ‘huge redeployment’ for affected staff. Executives like Marianne Lake stress paranoia in competition, quoting ‘Only the paranoid survive’. Rivals like Bank of America ($14 billion tech spend) underscore the sector-wide arms race.1

Leading Theorists on Technology Measurement and Impact

Dimon’s views resonate with seminal thinkers on technology’s intangible returns. Peter Drucker, the father of modern management, argued in The Practice of Management (1954) that knowledge workers’ output defies traditional metrics, prefiguring tech’s measurement woes. He coined ‘knowledge economy’, emphasising innovation’s long-term value over short-term quantification.[/latex]

Erik Brynjolfsson and Andrew McAfee, MIT economists, explore this in The Second Machine Age (2014), detailing how digital technologies yield ‘non-rival’ benefits – exponential productivity without proportional costs – hard to capture in GDP or ROI. Their ‘bounty vs. spread’ framework warns of uneven gains, mirroring Dimon’s job displacement concerns.4

Clayton Christensen’s The Innovator’s Dilemma (1997) explains why incumbents struggle with disruptive tech: metrics favour sustaining innovations, blinding firms to transformative ones. JPMorgan’s shift from infrastructure modernisation to AI-ready data exemplifies overcoming this.5

In AI specifically, Nick Bostrom’s Superintelligence (2014) and Stuart Russell’s Human Compatible (2019) address measurement beyond finance – aligning superintelligent systems with human values amid unpredictable impacts. Dimon’s pragmatic focus on phased integration echoes calls for cautious deployment.4

These theorists underscore Dimon’s point: technology’s true worth lies in reshaping ‘everything’, demanding faith in leadership over precise yardsticks. JPMorgan’s strategy embodies this, positioning the bank at the vanguard of finance’s technological frontier.

References

1. https://www.businessinsider.com/jpmorgan-tech-budget-ai-20-billion-jamie-dimon-2026-2

2. https://www.aol.com/articles/jpmorgan-spend-almost-20-billion-000403027.html

3. https://www.benzinga.com/markets/large-cap/26/02/50808191/jamie-dimon-predicts-jpmorgan-will-be-a-winner-in-ai-race-boosts-2026-tech-spend-to-nearly-20-billion

4. https://fortune.com/2026/02/25/jamie-dimon-society-prepare-ai-job-displacement/

5. https://finviz.com/news/321869/how-to-play-jpm-stock-as-tech-spend-ramps-in-2026-amid-ai-uncertainty

6. https://fintechmagazine.com/news/inside-jpmorgans-2026-stock-market-hopes-and-new-london-hq

"I think the harder thing to measure has always been tech projects. That's been true my whole life. It's also been true my whole life, the tech is what changes everything, like everything." - Quote: Jamie Dimon - JP Morgan Chase CEO

read more
Term: World model

Term: World model

“A world model is defined as a learned neural representation that simulates the dynamics of an environment, enabling an AI agent to predict future states and reason about the consequences of its actions.” – World model

A **world model** is an internal representation of the environment that an AI system creates to simulate the external world within itself. This learned neural representation enables an AI agent to predict future states, simulate the consequences of different actions before executing them in the real world, and reason about causal relationships, much like the human brain does when planning activities.1,3,6

At its core, a world model comprises key components:

  • Transition model: Predicts how the environment’s state changes based on the agent’s actions, such as a robot displacing an object by moving its hand.1
  • Observation model: Determines what the agent observes in each state, incorporating data from sensors, cameras, and other inputs.1
  • Reward model: In reinforcement learning contexts, forecasts rewards or penalties from actions in specific states.1

Unlike traditional machine learning, which maps inputs directly to outputs, world models foster a general understanding of environmental dynamics, enhancing performance in novel situations.1,4

Key Capabilities and Advantages

World models empower AI with:

  • Causality understanding: Grasping why events occur, beyond mere statistical correlations seen in large language models (LLMs) like GPT.1,2
  • Planning and reasoning: Simulating scenarios internally to select optimal actions, akin to chain-of-thought reasoning.1,3
  • Efficient learning: Requiring fewer examples, similar to a child grasping gravity after minimal observations.1
  • Transfer learning and generalisation: Applying knowledge across domains, such as adapting object manipulation skills.1
  • Intuitive physics: Comprehending basic physical principles, essential for real-world interaction.1,4

Trained on diverse data like videos, photos, audio, and text, world models provide richer grounding in reality than LLMs, which focus on text patterns.2,4,6

Role in Achieving Artificial General Intelligence (AGI)

Prominent figures like Yann LeCun (Meta), Demis Hassabis (Google DeepMind), and Yoshua Bengio (Mila) view world models as crucial for AGI, enabling safe, scientific, and intelligent systems that plan ahead and simulate outcomes.3 Recent advancements, such as DeepMind’s Genie 3 (August 2025), generate diverse 3D environments from text prompts, simulating realistic physics for AI training.1 Runway’s GWM-1 further advances general-purpose simulation for robotics and discovery.5

Best Related Strategy Theorist: Yann LeCun

**Yann LeCun**, Chief AI Scientist at Meta and a pioneer of convolutional neural networks (CNNs), is the foremost theorist championing world models as foundational for intelligent AI. LeCun describes them as internal predictive models that simulate real-world dynamics, incorporating modules for perception, prediction, cost/reward evaluation, and planning. This allows AI to ‘imagine’ action consequences, vital for robotics, autonomous vehicles, and AGI.2,3

Born in 1960 in France, LeCun earned his PhD in 1987 from Universite Pierre et Marie Curie, Paris, under supervision of Yves Le Cun (no relation). He popularised CNNs in the 1980s-1990s for handwriting recognition, co-founding the field of deep learning. Joining New York University as a professor in 2003, he co-directed the NYU Center for Data Science. In 2013, he became Meta’s first AI head, driving open-source initiatives like PyTorch.

LeCun’s advocacy for world models stems from his critique of LLMs’ limitations in causal reasoning and physical simulation. He argues they enable ‘objective-driven AI’ with energy-based models for planning, positioning world models as the path beyond pattern-matching to human-like intelligence. A Turing Award winner (2018) with Bengio and Hinton, LeCun’s vision influences labs worldwide, emphasising world models for safe, efficient real-world AI.2,3

References

1. https://deepfa.ir/en/blog/world-model-ai-agi-future

2. https://www.youtube.com/watch?v=qulPOUiz-08

3. https://www.quantamagazine.org/world-models-an-old-idea-in-ai-mount-a-comeback-20250902/

4. https://www.turingpost.com/p/topic-35-what-are-world-models

5. https://runwayml.com/research/introducing-runway-gwm-1

6. https://techcrunch.com/2024/12/14/what-are-ai-world-models-and-why-do-they-matter/

"A world model is defined as a learned neural representation that simulates the dynamics of an environment, enabling an AI agent to predict future states and reason about the consequences of its actions." - Term: World model

read more
Term: AI Data Centre

Term: AI Data Centre

“An AI Data Center is a highly specialized, power-dense physical facility designed specifically to train, deploy, and run artificial intelligence (AI) models, machine learning (ML) algorithms, and generative AI applications.” – AI Data Centre

This specialised facility diverges significantly from traditional data centres, which handle mixed enterprise workloads, by prioritising accelerated compute, ultra-high-bandwidth networking, and advanced power and cooling systems to manage dense GPU clusters and continuous data pipelines for AI tasks like model training, fine-tuning, and inference.1,2,4

Central to its operation are high-performance computing resources such as Graphics Processing Units (GPUs) and Tensor Processing Units (TPUs). GPUs excel in parallel processing, enabling rapid handling of billions of data points essential for AI model training, while TPUs offer tailored efficiency for AI-specific tasks, reducing energy consumption.2,3,5

High-speed networking is critical, employing technologies like InfiniBand, 400 Gbps Ethernet, and optical interconnects to facilitate seamless data movement across thousands of servers, preventing bottlenecks in distributed AI workloads.2,4

Robust storage systems-including distributed file systems and object storage-ensure swift access to vast datasets, model weights, and real-time inference data, with scalability to accommodate ever-growing AI requirements.1,2,3

Addressing the immense power density, advanced cooling systems are vital, often accounting for 35-40% of energy use, incorporating liquid cooling and thermal zoning to maintain efficiency and low Power Usage Effectiveness (PUE) for sustainability.2,4

Additional features include data centre automation, network security, and energy-efficient designs, yielding benefits like enhanced performance, scalability, cost optimisation, and support for innovation in fields such as big data analytics, natural language processing, and computer vision.3,5

Key Theorist: Jensen Huang and the GPU Revolution

The foremost strategist linked to the evolution of AI data centres is Jensen Huang, co-founder, president, and CEO of NVIDIA Corporation. Huang’s vision has positioned NVIDIA’s GPUs as the cornerstone of modern AI infrastructure, directly shaping the architecture of these power-dense facilities.2

Born in 1963 in Taiwan, Huang immigrated to the United States as a child. He earned a bachelor’s degree in electrical engineering from Oregon State University and a master’s from Stanford University. In 1993, at age 30, he co-founded NVIDIA with Chris Malachowsky and Curtis Priem, initially targeting 3D graphics for gaming and PCs. Huang recognised the parallel processing power of GPUs, pivoting NVIDIA towards general-purpose computing on GPUs (CUDA platform, launched 2006), which unlocked their potential for scientific simulations, cryptography, and eventually AI.2

Huang’s prescient relationship to AI data centres stems from his early advocacy for GPU-accelerated computing in machine learning. By 2012, Alex Krizhevsky’s use of NVIDIA GPUs to win the ImageNet competition catalysed the deep learning boom, proving GPUs’ superiority over CPUs for neural networks. Under Huang’s leadership, NVIDIA developed AI-specific hardware like A100 and H100 GPUs, Blackwell architecture, and full-stack solutions including InfiniBand networking via Mellanox (acquired 2020). These innovations address AI data centre challenges: massive parallelism for training trillion-parameter models, high-bandwidth interconnects for multi-node scaling, and power-efficient designs for dense racks consuming up to 100kW each.2,4

Huang’s biography reflects relentless innovation; he famously wore a black leather jacket onstage, symbolising his contrarian style. NVIDIA’s market cap surged from $3 billion in 2015 to over $3 trillion by 2024, propelled by AI demand. His strategic foresight-declaring in 2017 that “the era of AI has begun”-anticipated the hyperscale AI data centre boom, making NVIDIA indispensable to leaders like Microsoft, Google, and Meta. Huang’s influence extends to sustainability, pushing for efficient cooling and low-PUE designs amid AI’s energy demands.4

Today, virtually every major AI data centre relies on NVIDIA technology, underscoring Huang’s role as the architect of the AI infrastructure revolution.

References

1. https://www.aflhyperscale.com/articles/ai-data-center-infrastructure-essentials/

2. https://www.rcrwireless.com/20250407/fundamentals/ai-optimized-data-center

3. https://www.racksolutions.com/news/blog/what-is-an-ai-data-center/

4. https://www.f5.com/glossary/ai-data-center

5. https://www.lenovo.com/us/en/glossary/what-is-ai-data-center/

6. https://www.ibm.com/think/topics/ai-data-center

7. https://www.generativevalue.com/p/a-primer-on-ai-data-centers

8. https://www.sunbirddcim.com/glossary/data-center-components

"An AI Data Center is a highly specialized, power-dense physical facility designed specifically to train, deploy, and run artificial intelligence (AI) models, machine learning (ML) algorithms, and generative AI applications." - Term: AI Data Centre

read more
Term: Edge devices

Term: Edge devices

“Edge devices are physical computing devices located at the ‘edge. of a network, close to where data is generated or consumed, that run AI algorithms and models locally rather than relying exclusively on a centralised cloud or data center.” – Edge devices

Edge devices integrate edge computing with artificial intelligence, enabling real-time data processing on interconnected hardware such as sensors, Internet of Things (IoT) devices, smartphones, cameras, and industrial equipment. This local execution reduces latency to milliseconds, enhances privacy by retaining data on-device, and alleviates network bandwidth strain from constant cloud transmission.1,4,5

Unlike traditional cloud-based AI, where data travels to remote servers for computation, edge devices perform tasks like predictive analytics, anomaly detection, speech recognition, and machine vision directly at the source. This supports applications in autonomous vehicles, smart factories, healthcare monitoring, retail systems, and wearable technology.2,3,6

Key Characteristics and Benefits

  • Low Latency: Processes data in real time without cloud round-trips, critical for time-sensitive scenarios like defect detection in manufacturing.3,4
  • Bandwidth Efficiency: Reduces data transfer volumes by analysing locally and sending only aggregated insights to the cloud.1,5
  • Enhanced Privacy and Security: Keeps sensitive data on-device, mitigating breach risks during transmission.5,6
  • Offline Capability: Operates without constant internet connectivity, ideal for remote or unreliable networks.6,8

Best Related Strategy Theorist: Dr. Andrew Chi-Chih Yao

Dr. Andrew Chi-Chih Yao, a pioneering computer scientist, stands as the most relevant strategy theorist linked to edge devices through his foundational contributions to distributed computing and efficient algorithms, which underpin modern edge AI architectures. Born in Shanghai, China, in 1946, Yao earned his PhD from Harvard University in 1972 under advisor Patrick C. Fischer. He held faculty positions at MIT, Princeton, and Stanford before joining Tsinghua University in 2004 as Director of the Institute for Interdisciplinary Information Sciences (IIIS), dubbed the ‘Chinese Springboard for talents in computer science’.[external knowledge basis]

Yao’s relationship to edge devices stems from his seminal work on parallel and distributed algorithms, including the Yao minimax principle for computational complexity (1970s), which optimises resource allocation in decentralised systems-directly analogous to edge computing’s local processing paradigm. His PRAM (Parallel Random Access Machine) model formalised efficient parallelism on resource-constrained devices, influencing how AI models are deployed on edge hardware with limited power and compute.[external knowledge basis] Notably, Yao’s research on communication complexity minimises data exchange between nodes, mirroring edge devices’ strategy of local inference to cut cloud dependency-a core tenet echoed in edge AI literature.1,7

A Turing Award winner (2000) for contributions to computation theory, Yao’s strategic vision emphasises scalable, efficient computing at the periphery, shaping industries from IoT to AI. His mentorship of talents like Jack Ma (Alibaba founder) further extends his influence on practical deployments of edge technologies in global supply chains.

References

1. https://www.ibm.com/think/topics/edge-ai

2. https://www.micron.com/about/micron-glossary/edge-ai

3. https://zededa.com/glossary/edge-ai-computing/

4. https://www.flexential.com/resources/blog/beginners-guide-ai-edge-computing

5. https://www.splunk.com/en_us/blog/learn/edge-ai.html

6. https://www.f5.com/glossary/what-is-edge-ai

7. https://www.cisco.com/site/us/en/learn/topics/artificial-intelligence/what-is-edge-ai.html

8. https://blogs.nvidia.com/blog/what-is-edge-ai/

"Edge devices are physical computing devices located at the 'edge. of a network, close to where data is generated or consumed, that run AI algorithms and models locally rather than relying exclusively on a centralised cloud or data center." - Term: Edge devices

read more
Quote: Arthur Mensch – Arthur Mensch – Mistral CEO

Quote: Arthur Mensch – Arthur Mensch – Mistral CEO

“In real life, enterprises are complex systems, and you can’t solve that with a single abstraction like AGI. AGI, to a large extent, is a north star of ‘I’m going to make the system better over time.'” – Arthur Mensch – Mistral CEO

Arthur Mensch, CEO of Mistral AI, offers a grounded perspective on artificial general intelligence (AGI), emphasising its role as an aspirational guide rather than a practical fix for intricate business challenges. In a recent Big Technology Podcast interview with Alex Kantrowitz on 16 January 2026, Mensch highlighted how enterprises function as complex systems that defy singular abstractions like AGI, positioning it instead as a directional ‘north star’ for incremental system improvements. This view aligns with his longstanding scepticism towards AGI hype, rooted in his self-described strong atheism and belief that such rhetoric equates to ‘creating God’1,2,3,4.

Who is Arthur Mensch?

Born in Paris, Arthur Mensch, aged 31, is a French entrepreneur and AI researcher who co-founded Mistral AI in 2023 alongside former Meta engineers Timothée Lacroix and Guillaume Lample. Before Mistral, Mensch worked as an engineer at Google DeepMind’s Paris lab, gaining expertise in advanced AI models2,4. His venture quickly rose to prominence, positioning Europe as a contender in the AI landscape dominated by US giants. Mistral’s models, including open-weight offerings, have secured partnerships like one with Microsoft in early 2024, while attracting support from the French government and investors such as former digital minister Cédric O2,4. Mensch advocates for a ‘European champion’ in AI to counterbalance cultural influences from American tech firms, stressing that AI shapes global perceptions and values2. He warns against over-reliance on US competitors for AI standards, pushing for lighter European regulations to foster innovation4.

Context of the Quote

Mensch’s statement emerges amid intensifying AI debates, just two days before this post, on a podcast discussing real-world AI applications. It reflects his consistent dismissal of AGI as an unattainable, quasi-religious pursuit, a stance he reiterated in a 2024 New York Times interview: ‘The whole AGI rhetoric is about creating God. I don’t believe in God. I’m a strong atheist. So I don’t believe in AGI’1,2,3,4. Unlike peers forecasting AGI’s imminent arrival, Mensch prioritises practical AI tools that enhance productivity, predicting rapid workforce retraining needs within two years rather than a decade4. He critiques Big Tech’s open-source strategies as competitive ploys and emphasises culturally attuned AI development1,2. This podcast remark builds on those themes, applying them to enterprise complexity where iterative progress trumps hypothetical superintelligence.

Leading Theorists on AGI and Complex Systems

The discourse around AGI and its limits in complex systems draws from pioneering theorists in AI, cybernetics, and systems theory.

  • Alan Turing (1912-1954): Laid AI foundations with his 1950 ‘Computing Machinery and Intelligence’ paper, proposing the Turing Test for machine intelligence. He envisioned machines mimicking human cognition but did not pursue god-like generality, focusing on computable problems[internal knowledge].
  • Norbert Wiener (1894-1964): Founder of cybernetics, which studies control and communication in animals and machines. In Cybernetics (1948), Wiener described enterprises and societies as dynamic feedback systems resistant to simple models, prefiguring Mensch’s complexity argument[internal knowledge].
  • John McCarthy (1927-2011): Coined ‘artificial intelligence’ in 1956 at the Dartmouth Conference, distinguishing narrow AI from general forms. He advocated high-level programming for generality but recognised real-world messiness[internal knowledge].
  • Demis Hassabis: Google DeepMind CEO and Mensch’s former colleague, predicts AGI within years, viewing it as AI matching human versatility across tasks. Hassabis emphasises multimodal learning from games like AlphaGo4[internal knowledge].
  • Sam Altman and Elon Musk: OpenAI’s Altman warns of AGI risks like ‘subtle misalignments’ while pursuing it as transformative; Musk forecasts superhuman AI by late 2025 and sues OpenAI over profit shifts3,4. Both treat AGI as epochal, contrasting Mensch’s pragmatism.

These figures highlight a divide: early theorists like Wiener stressed systemic complexity, while modern leaders like Hassabis chase generality. Mensch bridges this by favouring commoditised, improvable AI over AGI mythology[TAGS].

Implications for AI and Enterprise

Mensch’s philosophy underscores AI’s commoditisation, where models like Mistral’s drive efficiency without superintelligence. This resonates in Europe’s push for sovereign AI, amid tags like commoditisation and artificial intelligence[TAGS]. As enterprises navigate complexity, his ‘north star’ metaphor encourages sustained progress over speculative leaps.

References

1. https://www.businessinsider.com/mistrals-ceo-said-obsession-with-agi-about-creating-god-2024-4

2. https://futurism.com/the-byte/mistral-ceo-agi-god

3. https://www.benzinga.com/news/24/04/38266018/mistral-ceo-shades-openais-sam-altman-says-obsession-with-reaching-agi-is-about-creating-god

4. https://fortune.com/europe/article/mistral-boss-tech-ceos-obsession-ai-outsmarting-humans-very-religious-fascination/

5. https://www.binance.com/en/square/post/6742502031714

6. https://www.christianpost.com/cartoon/musk-to-altman-what-are-tech-moguls-saying-about-ai-and-agi.html?page=5

"In real life, enterprises are complex systems, and you can’t solve that with a single abstraction like AGI. AGI, to a large extent, is a north star of 'I’m going to make the system better over time.'" - Quote: Arthur Mensch

read more
Quote: Andrej Karpathy – Previously Director of AI at Tesla, founding team at OpenAI

Quote: Andrej Karpathy – Previously Director of AI at Tesla, founding team at OpenAI

“Programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You’re spinning up AI agents, giving them tasks in English and managing and reviewing their work in parallel.” – Andrej Karpathy – Previously Director of AI at Tesla, founding team at OpenAI

This statement captures a pivotal moment in the evolution of software development, where traditional coding practices are giving way to a new era dominated by AI agents. Spoken by Andrej Karpathy, a visionary in artificial intelligence, it reflects the rapid transformation driven by large language models (LLMs) and autonomous systems. Karpathy’s insight underscores how programming is shifting from manual code entry to orchestrating intelligent agents via natural language, marking the end of an era that began with the earliest computers.

About Andrej Karpathy

Andrej Karpathy is a leading figure in AI, renowned for his contributions to deep learning and computer vision. A founding member of OpenAI in 2015, he played a key role in pioneering advancements in generative models and neural networks. Later, as Director of AI at Tesla, he led the Autopilot vision team, developing autonomous driving technologies that pushed the boundaries of real-world AI deployment. Today, he is building Eureka Labs, an AI-native educational platform. His talks and writings, such as ‘Software Is Changing (Again),’ articulate the shift to ‘Software 3.0,’ where LLMs enable programming in natural language like English.123

Karpathy’s line struck a nerve because it didn’t describe a distant future. It sounded like a description of what many engineers were already starting to experience in early 2026. The shift he’s talking about is less about writing code and more about orchestrating work—breaking problems into pieces, describing them in plain language, and then supervising agents that actually execute them.

The February Leap: Codex 5.2 and Claude Code

What made this moment feel like a real inflection was the quality jump in early 2026. When tools like ChatGPT Codex 5.2 and Claude Code landed in February, they weren’t just “better autocomplete.” They could stay on task for long, multi?step workflows, recover from errors, and push through the kind of friction that used to send developers back to the keyboard.

Karpathy has described this himself: coding agents that “basically didn’t work before December and basically work since,” with noticeably higher quality, long?term coherence, and tenacity. The February releases crystallised that shift. What used to be a weekend project became something you could kick off, let the agent run for 20–30 minutes, and then review—all while thinking about the next layer of the system rather than the syntax of the current one.

A New Kind of Programming Workflow

The pattern Karpathy is describing is less “pair programming with an autocomplete” and more “manager?style delegation.” You frame a task in English, give the agent context, tools, and constraints, and then let it run multiple steps in parallel—installing dependencies, writing tests, debugging, and even documenting the outcome. You then review outputs, steer the next round, and gradually refine the agent’s instructions.

This isn’t a replacement for engineering judgment. It’s a layer on top: your job becomes decomposing work, defining what success looks like, and deciding which parts to hand off and which to keep close. The “productivity flywheel” turns faster when you can treat the agent as a high?leverage assistant that can keep going while you move up the stack.

Software 3.0, In Practice

Karpathy has long framed this as Software 3.0—the evolution of programming from:

  • Software 1.0: explicit code written in languages like C++ or Python, where the programmer spells out every step.

  • Software 2.0: neural networks trained on data, where the “program” is a dataset and training objective rather than a long list of rules.

  • Software 3.0: natural?language?driven agents that compose systems, debug problems, and manage long?running workflows, while still relying on 1.0 and 2.0 components underneath.

The February releases of Codex 5.2 and Claude Code made Software 3.0 feel tangible. It’s no longer a thought experiment; it’s something practitioners can use today for tasks that are well?specified and easy to verify—infrastructure setup, data pipelines, internal tooling, and boilerplate?heavy workflows.

What This Means for Practitioners

The implication isn’t that “everyone will be a programmer.” It’s that the nature of programming is changing. The most valuable skills are no longer just fluency in a language, but:

  • Decomposing complex work into agent?friendly tasks,

  • Designing interfaces and documentation that models can use effectively,

  • Building feedback loops and guardrails so agents can operate safely, and

  • Knowing when to lean in (complex, under?specified logic) and when to lean out (repetitive, well?structured work).

Karpathy’s point is that the default workflow is no longer “you write code line by line.” The era where the editor is the center of the universe is ending. Programming is becoming less about keystrokes and more about direction, oversight, and iteration—with AI agents as the new layer of execution in between.

Leading Theorists and Influences

Karpathy’s views draw from pioneers in AI and agents. Ilya Sutskever, his OpenAI co-founder, advanced sequence models like GPT, enabling natural language programming. At Tesla, Ashok Elluswamy and the Autopilot team influenced his emphasis on human-AI loops and ‘autonomy sliders.’ Broader influences include Andrew Ng, under whom Karpathy studied at Stanford, popularising deep learning education, and Yann LeCun, whose convolutional networks underpin vision AI. Recent agentic work echoes Yohei Nakajima’s BabyAGI (2023), an early autonomous agent framework, and Microsoft’s AutoGen for multi-agent systems. Karpathy positions agents as a new ‘consumer of digital information,’ urging infrastructure redesign for LLM autonomy.123

Implications for the Future

This shift promises unprecedented productivity but demands new skills: fluency across paradigms, agent management, and ‘applied psychology of neural nets.’ As Karpathy notes, ‘everyone is now a programmer’ via English, yet professionals must build for agents – rewriting codebases and creating agent-friendly interfaces. With LLM capabilities surging by late 2025, 2026 heralds a ‘high energy’ phase of industry adaptation.14

 

References

1. https://www.businessinsider.com/agentic-engineering-andrej-karpathy-vibe-coding-2026-2

2. https://www.youtube.com/watch?v=LCEmiRjPEtQ

3. https://singjupost.com/andrej-karpathy-software-is-changing-again/

4. https://paweldubiel.com/42l1%E2%81%9D–Andrej-Karpathy-quote-26-Jan-2026-

5. https://www.christopherspenn.com/2024/07/mind-readings-generative-ai-as-a-programming-language/

6. https://www.ycombinator.com/library/MW-andrej-karpathy-software-is-changing-again

7. https://karpathy.ai/tweets.html

 

"Programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks in English and managing and reviewing their work in parallel." - Quote: Andrej Karpathy - Previously Director of AI at Tesla, founding team at OpenAI

read more
Term: Agent2Agent (A2A)

Term: Agent2Agent (A2A)

“The Agent2Agent (A2A) protocol is an open standard that enables different AI agents, built by various vendors and using diverse frameworks, to seamlessly communicate, collaborate, and coordinate on complex tasks.” – Agent2Agent (A2A)

A2A addresses the challenges of multi-agent systems by providing a vendor-neutral framework for agents to discover each other, exchange capabilities, delegate tasks, and manage complex workflows.1,2,3 It leverages familiar web standards such as HTTP, JSON-RPC, and Server-Sent Events (SSE) to ensure reliable, interoperable interactions while incorporating enterprise-grade security features like JWT and OIDC authentication.1

Key Features of A2A

  • Agent Discovery and Capabilities Exchange: Agents publish standardised ‘Agent Cards’ (JSON files) that detail their abilities, enabling dynamic discovery and task negotiation.1,3
  • Structured Task Management: Defines protocols for task delegation using unique task IDs, supporting states like submitted, working, and completed, ideal for long-running processes.1,3
  • Standards-Based Communication: Uses HTTP POST requests and structured JSON messages for consistent messaging between client agents (task initiators) and remote agents (task executors).1,3
  • Enterprise Security and Privacy: Includes encryption, fine-grained authorisation, payload validation, and support for various authentication schemes to protect data and identities.1,2
  • Support for Collaboration: Facilitates message exchanges for context sharing, real-time updates via asynchronous notifications, and dynamic UX negotiation.1,3

How A2A Works

A2A operates on a client-server model: the client agent formulates tasks and identifies suitable remote agents via Agent Cards, then communicates structured requests over HTTP.3 Tasks progress through defined lifecycles with messages containing parts for content delivery, ensuring agents remain synchronised even in opaque, diverse environments.1,3

For example, in e-commerce, an inventory agent could use A2A to collaborate with demand forecasting, customer service, and logistics agents to optimise supply chains.5

Key Theorist: Sundar Pichai and Google’s Role in A2A

No single ‘strategy theorist’ in the traditional academic sense originated A2A, as it is a practical engineering protocol driven by industry leaders. The most directly associated figure is **Sundar Pichai**, CEO of Google and Alphabet Inc., whose strategic vision propelled its development and announcement.4

Biography of Sundar Pichai

Born in 1972 in Madurai, India, Sundar Pichai grew up in a modest middle-class family. He excelled academically, earning a degree in metallurgical engineering from the Indian Institute of Technology Kharagpur in 1993. Pichai then pursued higher education in the US, obtaining an MS in materials science from Stanford University and an MBA from the Wharton School of the University of Pennsylvania.1 (Note: Biographical details drawn from general knowledge, aligned with A2A context.)

Joining Google in 2004, Pichai initially led product management for Google Chrome, transforming it into the world’s most-used browser through innovative strategies emphasising speed, security, and user-centric design. His success led to promotions: Vice President of Product Development (2008), overseeing Chrome OS and apps; Senior VP for Chrome and Android (2012); and Chief Business Officer (2014). In 2015, he became CEO of Google, and in 2019, CEO of parent company Alphabet Inc.4 (contextual link).

Relationship to A2A

Under Pichai’s leadership, Google prioritised AI agent interoperability as part of its broader AI strategy, culminating in the A2A protocol’s announcement via the Google Developers Blog in 2025.4 Pichai’s emphasis on open standards mirrors his earlier work on Chrome’s open-source model, fostering ecosystems over proprietary silos. A2A embodies his vision for ‘a new era of agent interoperability,’ enabling secure multi-agent collaboration across frameworks – much like Android unified mobile ecosystems.1,4

Pichai’s strategic oversight ensured A2A adhered to principles of discovery, interoperability, delegation, and trust, positioning Google as a leader in agentic AI infrastructure while inviting broad industry adoption through its open GitHub repository.7

Tags: Agent2Agent, A2A, agents, AI, artificial intelligence, term

References

1. https://www.solo.io/topics/ai-infrastructure/what-is-a2a

2. https://developer.pingidentity.com/identity-for-ai/agents/idai-what-is-a2a.html

3. https://www.descope.com/learn/post/a2a

4. https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/

5. https://www.alumio.com/blog/what-is-a2a-agent2agent-ai-protocol

6. https://www.credal.ai/blog/what-is-agent2agent-a2a-protocol

7. https://github.com/a2aproject/A2A

8. https://ai.pydantic.dev/a2a/

9. https://www.youtube.com/watch?v=Tud9HLTk8hg

"The Agent2Agent (A2A) protocol is an open standard that enables different AI agents, built by various vendors and using diverse frameworks, to seamlessly communicate, collaborate, and coordinate on complex tasks." - Term: Agent2Agent (A2A)

read more
Quote: Arthur Mensch – Mistral CEO

Quote: Arthur Mensch – Mistral CEO

“There’s no such thing as one system that is going to be solving all the problems of the world. You don’t have any human able to solve every task in the world. You of course need some amount of specialisation to solve problems.” – Arthur Mensch – Mistral CEO

Arthur Mensch’s observation about specialisation in artificial intelligence reflects a fundamental principle that has shaped not only his work at Mistral AI, but also the broader trajectory of how we think about building intelligent systems. The statement emerges from a pragmatic understanding of complexity-one that draws parallels between human expertise and machine learning, whilst challenging the prevailing assumption that larger, more generalised models represent the inevitable future of AI.

The Context: A Moment of Inflection in AI Development

When Mensch made this statement on the Big Technology Podcast in January 2026, the artificial intelligence landscape was at a critical juncture. The initial euphoria surrounding large language models like GPT-4 and their apparent ability to handle diverse tasks had begun to give way to a more nuanced understanding of their limitations. Organisations deploying these systems were discovering that whilst general-purpose models could perform adequately across many domains, they rarely excelled in any single domain. The cost of running these massive systems, combined with their mediocre performance on specialised tasks, created an opening for a different approach-one that Mensch and Mistral AI have been actively pursuing since the company’s founding in May 2023.

Mensch’s background as a machine learning researcher with a PhD in machine learning and functional magnetic resonance imaging, combined with his experience at Google DeepMind working on large language models, positioned him uniquely to recognise this gap. His two co-founders, Guillaume Lample and Timothée Lacroix, brought complementary expertise from Meta’s AI research division. Together, they had witnessed firsthand the capabilities and constraints of cutting-edge AI systems, and they recognised that the industry was pursuing a path that, whilst impressive in breadth, lacked depth.

The Philosophy Behind Mistral’s Approach

Mistral AI’s strategy directly operationalises Mensch’s philosophy about specialisation. Rather than attempting to build a single monolithic system that claims to solve all problems, the company has focused on developing smaller, more efficient models that can be tailored to specific use cases. This approach has proven remarkably successful: within four months of founding, Mistral released its 7B model, which outperformed larger competitors in many benchmarks. The company achieved unicorn status-a valuation exceeding $1 billion-within its first year, a trajectory that vindicated Mensch’s conviction that specialisation was not merely philosophically sound but commercially viable.

The emphasis on smaller models that can run locally on devices, rather than requiring centralised cloud infrastructure, represents a practical manifestation of this specialisation principle. A financial services institution, for instance, can deploy a model specifically optimised for fraud detection or regulatory compliance, rather than relying on a general-purpose system that must compromise between countless competing objectives. A healthcare provider can implement a model trained on medical literature and patient data, rather than one diluted by training on the entire internet. This is not merely more efficient; it is fundamentally more effective.

Theoretical Foundations: The Specialisation Principle in Machine Learning

Mensch’s assertion draws upon well-established principles in machine learning and cognitive science. The concept of specialisation in learning systems has deep roots in the field. In the 1990s and 2000s, researchers including Yann LeCun and Geoffrey Hinton-pioneers in deep learning-recognised that neural networks trained on specific tasks often outperformed more generalised architectures. This principle, sometimes referred to as the “bias-variance tradeoff,” suggests that systems optimised for particular problems can achieve superior performance by accepting constraints that would be inappropriate for general-purpose systems.

The analogy to human expertise is particularly apt. A world-class cardiologist possesses knowledge and intuition that a general practitioner cannot match, despite the latter’s broader medical knowledge. This specialisation comes from years of focused study, deliberate practice, and exposure to patterns specific to their domain. Similarly, an AI system trained extensively on financial data, with architectural choices optimised for temporal sequences and numerical relationships, will outperform a general model on financial forecasting tasks. The human brain itself demonstrates this principle: different regions specialise in different functions, and whilst there is integration across these regions, the specialisation is fundamental to cognitive capability.

This principle also aligns with recent research in transfer learning and domain adaptation. Researchers including Fei-Fei Li at Stanford have demonstrated that models pre-trained on large, diverse datasets often require substantial fine-tuning to perform well on specific tasks. The fine-tuning process essentially involves re-specialising the model, suggesting that the initial generalisation, whilst useful as a starting point, is not the endpoint of effective AI development.

The Commoditisation Argument

Embedded within Mensch’s statement is an implicit argument about the commoditisation of AI. If a single system could genuinely solve all problems, it would represent the ultimate commodity-a universal tool that would rapidly become standardised and undifferentiated. The fact that no such system exists, and that the laws of machine learning suggest none can exist, means that competitive advantage in AI will increasingly accrue to those who can build specialised systems tailored to specific domains and use cases.

This has profound implications for the structure of the AI industry. Rather than a winner-take-all market dominated by a handful of companies with the largest models, Mensch’s vision suggests a more distributed ecosystem where numerous companies build specialised solutions. Mistral’s open-source strategy supports this vision: by releasing models that developers can fine-tune and adapt, the company enables a proliferation of specialised applications rather than enforcing dependence on a single centralised system.

The comparison to human society is instructive. We do not have a single human who solves all problems; instead, we have a complex division of labour with specialists in countless domains. The most advanced societies are those that have developed the most sophisticated mechanisms for specialisation and coordination. An AI ecosystem that mirrors this structure-with specialised systems coordinating to solve complex problems-may ultimately prove more capable and more resilient than one built around monolithic general-purpose systems.

Implications for the Future of Work and AI Deployment

Mensch has articulated elsewhere his vision for how AI will transform work. Rather than replacing human workers wholesale, AI will handle routine, well-defined tasks, freeing humans to focus on activities that require creativity, relationship management, and novel problem-solving. This vision is entirely consistent with the specialisation principle: specialised AI systems handle their specific domains, whilst humans focus on the uniquely human aspects of work. A specialised AI system for document processing, another for customer service routing, and another for data analysis can work in concert, each excelling in its domain, with human judgment and creativity orchestrating their outputs.

This approach also addresses concerns about AI safety and alignment. A specialised system optimised for a specific task, with clear boundaries and well-defined objectives, is inherently more interpretable and controllable than a general-purpose system trained to optimise for performance across thousands of disparate tasks. The constraints that make a system specialised also make it more trustworthy.

The Broader Intellectual Landscape

Mensch’s perspective aligns with emerging consensus among leading AI researchers. Yann LeCun, Chief AI Scientist at Meta, has increasingly emphasised the limitations of large language models and the need for AI systems with different architectures and training approaches for different tasks. Demis Hassabis, CEO of Google DeepMind, has similarly highlighted the importance of building AI systems with appropriate inductive biases for their intended domains. The field is gradually moving away from the assumption that scale and generality are sufficient, towards a more nuanced understanding of how to build effective AI systems.

This intellectual shift reflects a maturation of the field. The initial excitement about large language models was justified-they represented a genuine breakthrough in our ability to build systems that could engage in flexible, language-based reasoning. However, the assumption that this breakthrough would generalise to all domains, and that bigger models would always be better, has proven naive. The next phase of AI development will likely be characterised by greater diversity in approaches, architectures, and training methodologies, with specialisation playing an increasingly central role.

Mensch’s Role in Shaping This Future

Arthur Mensch’s significance lies not merely in his articulation of these principles, but in his demonstrated ability to execute on them. Mistral AI’s rapid ascent-achieving a $2.1 billion valuation within approximately two years of founding-suggests that the market recognises the validity of the specialisation approach. The company’s success in attracting top talent, securing substantial venture funding, and building a platform that developers actively choose to build upon indicates that Mensch’s vision resonates with practitioners who understand the practical constraints of deploying AI systems.

In 2024, Mensch was recognised on TIME’s 100 Next list, an acknowledgment of his influence on the future direction of technology. The recognition highlighted his ability to combine “bold vision with execution,” his commitment to democratising AI through open-source models, and his foresight in addressing gaps overlooked by others. These qualities-vision, execution, and attention to overlooked opportunities-are precisely what the specialisation principle requires.

Mensch’s background as an academic researcher who transitioned to entrepreneurship also shapes his approach. Unlike entrepreneurs who might prioritise rapid growth and market dominance above all else, Mensch brings a researcher’s commitment to understanding fundamental principles. His insistence on specialisation is not a marketing narrative but a reflection of his deep understanding of how learning systems actually work.

Conclusion: A Principle for the Age of AI

The statement that “there’s no such thing as one system that is going to be solving all the problems of the world” may seem obvious in retrospect, but it represents a crucial corrective to the prevailing assumptions of the AI industry. It grounds AI development in principles drawn from human expertise, cognitive science, and machine learning theory. It suggests that the future of AI is not a race to build ever-larger models, but rather a more sophisticated ecosystem of specialised systems, each optimised for its domain, working in concert to solve complex problems.

For organisations deploying AI, for researchers developing new approaches, and for policymakers considering how to regulate AI development, Mensch’s principle offers clear guidance: invest in specialisation, build systems with appropriate constraints for their domains, and recognise that the most powerful AI systems will likely be those that do one thing exceptionally well, rather than many things adequately. In an age of increasing complexity, specialisation is not a limitation but a necessity-and a source of genuine competitive advantage.

References

1. https://www.allamericanspeakers.com/celebritytalentbios/Arthur+Mensch/462557

2. https://www.mckinsey.com/featured-insights/insights-on-europe/videos-and-podcasts/creating-a-european-ai-unicorn-interview-with-arthur-mensch-ceo-of-mistral-ai

3. https://blog.eladgil.com/p/discussion-w-arthur-mensch-ceo-of

4. https://time.com/collections/time100-next-2024/7023471/arthur-mensch-2/

5. https://thecreatorsai.com/p/the-story-of-arthur-mensch-how-to

6. https://www.antoinebuteau.com/lessons-from-arthur-mensch/

"There’s no such thing as one system that is going to be solving all the problems of the world. You don’t have any human able to solve every task in the world. You of course need some amount of specialisation to solve problems." - Quote: Arthur Mensch

read more
Quote: Jamie Dimon – JP Morgan Chase CEO

Quote: Jamie Dimon – JP Morgan Chase CEO

“I see a couple people doing some dumb things. They’re just doing dumb things to create NII.” – Jamie Dimon – JP Morgan Chase CEO

In a candid assessment delivered at JPMorgan Chase’s 2026 company update on 23 February, CEO Jamie Dimon voiced profound concerns about the financial landscape, drawing direct parallels to the reckless lending practices that precipitated the 2008 global financial crisis. He observed competitors engaging in imprudent strategies purely to inflate net interest income (NII), a key profitability metric derived from lending spreads and investments1,3. This remark underscores Dimon’s longstanding vigilance amid buoyant markets, where high asset prices and surging volumes foster complacency1,2.

Jamie Dimon’s Background and Leadership

Jamie Dimon, born in 1956 in New York to Greek immigrant parents, embodies the archetype of a battle-hardened banker. Educated at Tufts University and Harvard Business School, he ascended through the ranks at American Express and Citigroup before co-founding Bank One in 1991, where he orchestrated a remarkable turnaround. In 2004, he assumed the helm of JPMorgan Chase following its acquisition of Bank One, steering the institution through the 2008 crisis as one of the few major banks to emerge unscathed. Under his stewardship, JPMorgan has ballooned into the world’s most valuable bank by market capitalisation, with Dimon earning renown for his prescient risk management and forthright annual shareholder letters1. His tenure has been marked by navigating geopolitical tensions, regulatory scrutiny, and technological disruptions, all while prioritising capital strength over opportunistic growth.

Context of the Quote: A Market on the Brink?

Dimon’s comments arrived against a backdrop of intensifying competition in lending and private credit markets, where firms scramble to capture market share amid elevated interest rates and economic optimism. He likened the current environment to 2005-2007, when ‘the rising tide was lifting all boats’ and excessive leverage permeated the system, culminating in subprime mortgage meltdowns1,2,3. Recent indicators, such as the collapse of subprime auto lender Tricolor Holdings and debt-burdened First Brands, evoked Dimon’s ‘cockroach theory’ – spotting one signals an infestation1. Broader anxieties include artificial intelligence’s disruptive potential across sectors like software, utilities, and telecommunications, mirroring unforeseen vulnerabilities exposed in 20082,3. Despite S&P 500 highs, Dimon cautioned that credit cycles invariably turn, with surprises lurking in unexpected quarters3. JPMorgan, he affirmed, adheres strictly to underwriting standards, forgoing business rather than compromising1.

Leading Theorists on Financial Crises and Risk-Taking

Dimon’s perspective resonates with seminal theories on financial instability. Hyman Minsky, the American economist whose ‘financial instability hypothesis’ (developed in the 1970s and 1980s) posits that stability breeds complacency, prompting speculative and Ponzi financing schemes that amplify booms into busts. Minsky argued that prolonged prosperity erodes risk aversion, much as Dimon describes today’s ‘dumb things’ to chase NII1.

Complementing this, Charles Kindleberger’s Manias, Panics, and Crashes (1978, updated editions) outlines the anatomy of bubbles: displacement, boom, euphoria, profit-taking, and panic. Kindleberger, building on Kindleberger’s historical analyses, highlighted herd behaviour and leverage as crisis harbingers, echoing Dimon’s pre-2008 parallels2.

Modern extensions include Raghuram Rajan, former IMF Chief Economist and Reserve Bank of India Governor, whose 2005 Jackson Hole speech presciently warned of incentives driving financial institutions towards systemic risks. Rajan’s ‘search for yield’ concept – akin to boosting NII through lax lending – anticipated 2008 excesses3.

Nouriel Roubini, dubbed ‘Dr Doom’, forecasted the 2008 subprime debacle in 2006, emphasising global imbalances, debt overhangs, and asset bubbles. His framework aligns with Dimon’s cycle warnings, stressing confluence events like AI disruptions or policy shifts2.

These theorists collectively illuminate Dimon’s caution: markets’ euphoria masks fragility, demanding disciplined risk assessment amid competitive pressures.

Implications for Investors and Markets

  • Heightened Vigilance: Dimon’s stance signals potential volatility in private credit and lending, urging scrutiny of banks’ NII strategies.
  • Sectoral Risks: AI-driven upheavals could mirror 2008’s utility surprises, impacting software and beyond.
  • JPMorgan’s Edge: Conservative positioning may yield resilience, as proven in prior downturns.

Dimon’s words serve as a clarion call: prosperity’s siren song often precedes turbulence. Prudent navigation demands heeding history’s lessons.

References

1. https://www.businessinsider.com/jamie-dimon-banks-doing-dumb-things-2008-credit-crisis-warning-2026-2

2. https://economictimes.com/markets/stocks/news/jpmorgan-ceo-jamie-dimon-warns-ai-and-dumb-things-can-trigger-a-2008-like-crisis/articleshow/128770717.cms

3. https://www.news18.com/business/banking-finance/jpmorgan-chase-ceo-warns-of-dumb-risk-taking-by-financial-firms-sees-echoes-of-2008-crisis-ws-l-9926903.html

4. https://en.sedaily.com/international/2026/02/24/jpmorgan-ceo-dimon-warns-of-pre-2008-crisis-similarities

"I see a couple people doing some dumb things. They're just doing dumb things to create NII." - Quote: Jamie Dimon - JP Morgan Chase CEO

read more
Term: AI skills

Term: AI skills

“Skills are essentially curated instructions containing best practices, guidelines, and workflows that AI can reference when performing particular types of work. They’re like expert manuals that help AI produce higher-quality outputs for specialised tasks.” – AI skills

AI skills are structured sets of curated instructions, best practices, guidelines, and workflows that artificial intelligence systems reference when performing particular types of work. They function as expert manuals or knowledge repositories, enabling AI to produce higher-quality outputs for specialised tasks by drawing on accumulated domain expertise and proven methodologies.

Unlike general-purpose AI capabilities, skills represent a layer of curation and refinement that transforms raw AI capacity into contextually appropriate, task-specific performance. They embody the principle that filter intelligence-the ability to distinguish valuable information from noise-has become essential in an AI-driven world, where the volume of available data and potential outputs far exceeds what any individual or system can meaningfully process.

Core Characteristics

  • Structured Knowledge: Skills organise information into actionable formats that AI systems can readily access and apply, rather than requiring the system to search through unstructured data.
  • Domain Specificity: Each skill is tailored to particular types of work, ensuring that AI outputs reflect the nuances, standards, and best practices of that domain.
  • Quality Enhancement: By constraining AI outputs to established guidelines and proven workflows, skills improve consistency, accuracy, and relevance compared to unconstrained generation.
  • Continuous Refinement: Like knowledge curation more broadly, skills require ongoing maintenance, verification, and updating to remain accurate and aligned with evolving practices.
  • Human-AI Collaboration: Skills represent the intersection of human expertise and AI capability-humans curate and validate the instructions; AI applies them at scale.

Practical Applications

AI skills manifest across multiple contexts:

  • Learning and Development: Curated training materials, course recommendations, and procedural documentation that AI systems use to personalise employee learning pathways and deliver relevant content.
  • Content Generation: Guidelines for tone, style, accuracy standards, and domain-specific terminology that shape AI-generated text, ensuring outputs match organisational voice and quality expectations.
  • Technical Documentation: Structured workflows and best practices that enable AI to generate or organise software documentation, reducing search time and improving accessibility.
  • Knowledge Management: Taxonomies, metadata standards, and verification protocols that help AI systems organise, categorise, and validate information within organisational knowledge bases.
  • Decision Support: Curated decision trees, risk assessment frameworks, and contextual guidelines that enable AI to provide recommendations aligned with organisational values and risk tolerance.

The Relationship to Filter Intelligence

AI skills are fundamentally about curation-the process of selecting, organising, verifying, and enriching information to make it more useful and trustworthy. In an age where AI can generate vast quantities of content and analysis, the critical human skill is no longer the ability to process information (which AI can do at scale) but rather the ability to filter, judge, and curate what matters.

This reflects a broader shift in how organisations and individuals must operate. Traditional intelligence-the ability to learn facts and processes-can now be outsourced to AI. What cannot be outsourced is the judgment required to determine which AI outputs are accurate, which are misleading, and which are worth acting upon. AI skills encode this judgment into reusable, systematised form.

Implementation Considerations

Effective AI skills require:

  • Clear ownership and accountability for skill development and maintenance
  • Regular audits to identify outdated or conflicting guidance
  • Verification processes to ensure accuracy and relevance
  • Accessible documentation that explains not just what to do but why and when
  • Integration with broader content governance policies
  • Feedback loops that allow AI systems and human users to surface gaps or failures in skill application

Related Theorist: Charles Fadel

Charles Fadel is an educational theorist and thought leader whose work directly addresses the role of curation in an AI-driven world. His framework for education in the age of artificial intelligence places curation at the centre of how organisations and individuals must adapt.

Biographical Context

Fadel is the founder and chairman of the Centre for Curriculum Redesign, an international non-profit organisation dedicated to rethinking education for the 21st century. He has held leadership roles at the World Economic Forum and has been instrumental in developing competency frameworks that emphasise skills beyond traditional knowledge acquisition. His background spans education policy, curriculum design, and futures thinking, positioning him at the intersection of pedagogy and technological change.

Relationship to AI Skills and Curation

In his work Education for the Age of AI, Fadel articulates a vision in which curation becomes a foundational competency. He argues that as AI systems become more powerful and capable of handling routine information processing, the human role must shift toward curating knowledge rather than merely acquiring it. This directly parallels the concept of AI skills: just as humans must learn to curate and judge AI outputs, organisations must curate the instructions and best practices that guide AI systems themselves.

Fadel distinguishes between three types of knowledge: declarative (facts and figures), procedural (how to do things), and conceptual (understanding why). He contends that in an AI age, organisations should prioritise procedural and conceptual knowledge-precisely the elements that constitute effective AI skills. An AI skill is not a collection of facts; it is a curated set of procedures and conceptual frameworks that enable consistent, high-quality performance.

Furthermore, Fadel emphasises what he calls the Drivers-agency, identity, purpose, and motivation-as essential human capacities that cannot be automated. AI skills, in this framework, are tools that free humans from routine tasks so they can focus on these higher-order capacities. By encoding best practices into skills, organisations enable their AI systems to handle specialised work whilst their human teams concentrate on judgment, creativity, and strategic direction.

Fadel’s work also highlights the importance of critical thinking and creativity as priority competencies. These are precisely the capacities required to develop, refine, and validate AI skills. Someone must decide what constitutes a best practice, what guidelines are most relevant, and when a skill requires updating. This curation work is fundamentally creative and critical-it requires immersion in a domain, the ability to distinguish signal from noise, and the judgment to make difficult trade-offs about what to include and what to exclude.

Conclusion

AI skills represent a practical instantiation of curation as a core competency in an AI-driven world. They embody the principle that as machines become more capable at processing information and generating outputs, human value increasingly lies in the ability to curate, judge, and refine. By systematising best practices and domain expertise into reusable skills, organisations create a feedback loop in which AI systems produce higher-quality work, humans can focus on higher-order judgment, and the organisation’s collective knowledge becomes more accessible and trustworthy.

References

1. https://ocasta.com/glossary/internal-comms/ai-driven-content-curation-for-employees/

2. https://www.digitallearninginstitute.com/blog/ai-transformative-effect-on-curating-content

3. https://www.glitter.io/glossary/knowledge-curation

4. https://futureiq.substack.com/p/curate-your-consumption-the-most

5. https://www.gettingsmart.com/2025/09/16/3-human-skills-that-make-you-irreplaceable-in-an-ai-world/

6. https://spencereducation.com/content-curation-ai/

7. https://www.techclass.com/resources/learning-and-development-articles/how-ld-teams-can-curate-smarter-content-with-ai

8. https://ploko.nl/en/knowledge-base/ai-content-curation/

"Skills are essentially curated instructions containing best practices, guidelines, and workflows that AI can reference when performing particular types of work. They're like expert manuals that help AI produce higher-quality outputs for specialised tasks." - Term: AI skills

read more
Quote: Arthur Mensch – Mistral CEO

Quote: Arthur Mensch – Mistral CEO

“The challenge the [AI] industry will face is that we need to get enterprises to value fast enough to justify all of the investments that are collectively being made.” – Arthur Mensch – Mistral CEO

Arthur Mensch, CEO of Mistral AI, captures a pivotal tension in the AI landscape with this observation from his appearance on the Big Technology Podcast hosted by Alex Kantrowitz. Spoken just two days ago on 16 January 2026, the quote underscores the urgency for AI companies to demonstrate tangible returns to enterprises, justifying the colossal investments pouring into compute, data, and talent across the sector1,3,4,5.

Who is Arthur Mensch?

Born in 1984, Arthur Mensch is a French entrepreneur and AI researcher whose career trajectory positions him at the forefront of Europe’s AI ambitions. A graduate of the prestigious Ecole Polytechnique and École Normale Supérieure, he honed his expertise at Google DeepMind, where he contributed to foundational work in large language models. In 2023, Mensch co-founded Mistral AI alongside Guillaume Lample and Timothée Lacroix, both former Meta AI researchers frustrated with closed-source strategies at their prior employers. Mistral quickly emerged as a European powerhouse, releasing efficient open-source models that rival proprietary giants like OpenAI, while building an enterprise platform for custom deployments on private clouds and sovereign infrastructure1,3,4,5.

Mensch’s leadership emphasises efficiency over brute-force scaling. Early Mistral models prioritised training optimisation, enabling competitive performance with fewer resources. The company has raised significant funding to scale compute, yet Mensch stresses practical challenges: data shortages as a greater bottleneck than hardware, and the need for tools enabling enterprise integration, evaluation, and customisation2,3,4. He advocates open-source as a path to secure, evaluable AI, countering narratives blending existential risks with practical concerns like bias control and deployment safety3.

Context of the Quote

Delivered amid booming AI investments, Mensch’s remark addresses a core industry paradox. While headlines chase compute races, Mistral focuses on monetisation through enterprise solutions-connecting models to proprietary data, ensuring compliance, and delivering use cases. He notes enterprises struggle with AI pilots: lacking continuous integration tools, reliable agent deployment, and user-friendly customisation. Success demands proving value swiftly, as scaling models alone does not guarantee profitability3,4. This aligns with Mistral’s model: open-source foundations paired with paid enterprise orchestration, appealing to European governments wary of US hyperscaler dependence5.

Mensch dismisses hype around mass job losses, rebutting Anthropic’s Dario Amodei by calling such claims overstated marketing. Instead, he warns of ‘deskilling’-over-reliance eroding critical thinking-mitigable via thoughtful design preserving human agency1. He critiques obsessions with AI surpassing human intelligence as quasi-religious, prioritising controllable, relational tasks where humans excel6.

Leading Theorists on AI Commoditisation and Enterprise Value

The quote resonates with theorists analysing AI’s commoditisation, where models become utilities akin to cloud compute, pressuring differentiation via enterprise value.

  • Elon Musk and OpenAI origins: Musk co-founded OpenAI in 2015 warning of AGI risks, but pivoted to closed-source ChatGPT, sparking commoditisation debates. His xAI pushes open alternatives, echoing Mistral’s ethos3.
  • Yann LeCun (Meta): Chief AI Scientist advocates open-source scaling laws, arguing commoditised models democratise access but demand enterprise customisation for value-mirroring Mistral’s data-connected platforms4.
  • Andrej Karpathy (ex-OpenAI/Tesla): Emphasises ‘software 2.0’ where models commoditise via fine-tuning; enterprises must build defensible moats through proprietary data and agents, as Mensch pursues3.
  • Dario Amodei (Anthropic): Contrasts Mensch by forecasting rapid white-collar displacement, yet both agree on deployment hurdles; Amodei’s safety focus highlights evaluation tools Mensch deems essential1.
  • Sam Altman (OpenAI): Drives enterprise via ChatGPT Enterprise, validating Mensch’s call for fast value capture amid trillion-dollar investments4.

These figures converge on a truth: AI’s future hinges not on model size, but on solving enterprise adoption-verifiable ROI, secure integration, and human-augmented workflows. Mensch’s insight, from a CEO scaling Europe’s AI contender, illuminates this path.

References

1. https://timesofindia.indiatimes.com/technology/tech-news/mistral-ai-ceo-arthur-mensch-warns-of-ai-deskilling-people-its-a-risk-that-/articleshow/122018232.cms

2. https://thisweekinstartups.com/episodes/KFfVAKTPqcz

3. https://blog.eladgil.com/p/discussion-w-arthur-mensch-ceo-of

4. https://www.youtube.com/watch?v=Z5H0Jl4ohv4

5. https://africa.businessinsider.com/news/a-leading-european-ai-startup-says-its-edge-over-silicon-valley-isnt-better-tech-its/3jft3sf

6. https://fortune.com/europe/article/mistral-boss-tech-ceos-obsession-ai-outsmarting-humans-very-religious-fascination/

"The challenge the [AI] industry will face is that we need to get enterprises to value fast enough to justify all of the investments that are collectively being made." - Quote: Arthur Mensch

read more
Quote: Alap Shah – Lotus CIO, Citrini report co-author

Quote: Alap Shah – Lotus CIO, Citrini report co-author

“Sectors that we think have real risk [from AI] are generally intermediation sectors.” – Alap Shah – Lotus CIO, Citrini report co-author

Alap Shah, Chief Investment Officer at Lotus Technology Management and co-author of the influential Citrini Research report The 2028 Global Intelligence Crisis, issued this stark warning amid growing market unease over artificial intelligence’s transformative power. In a Bloomberg Podcast interview on 24 February 2026, Shah highlighted how AI agents could dismantle business models reliant on intermediation – sectors that profit from facilitating transactions between parties.1,2,4

Alap Shah’s Background and Expertise

Alap Shah serves as CIO at Lotus Technology Management, a firm focused on navigating technological disruptions in global markets. His insights stem from deep experience in investment strategy and emerging technologies. Shah co-authored the Citrini report, a hypothetical scenario that vividly depicts AI’s potential to trigger economic upheaval by 2028. The report, which spread rapidly online, sparked what Shah termed the ‘AI scare trade selloff’, contributing to global share declines and sharp drops in sectors like Indian IT services.1,3,5

Shah’s analysis emphasises AI’s capacity to erode ‘friction-based’ moats. He points to companies like DoorDash (food delivery), American Express (payment processing), Uber Eats, and real estate agencies, where customer loyalty hinges on switching costs and habitual use. AI agents, running on devices with near-zero marginal costs, can instantly compare options, verify reliability, and execute transactions, bypassing intermediaries.1,2,4

The Citrini Report: A Hypothetical Crisis Scenario

Published by Citrini Research, The 2028 Global Intelligence Crisis outlines a timeline beginning in mid-2027 with AI-driven defaults in private equity-backed software firms, escalating to widespread intermediation collapse. Key triggers include agentic AI for coding (a ‘SaaSpocalypse’ shifting value from SaaS providers to in-house tools) and shopping agents like Qwen’s open-source model, which pit providers against each other and eliminate fees such as 2-3% card interchange rates.2,4

The report predicts a ‘ghost GDP’ from mass white-collar layoffs – potentially 5% within 18 months in the US – creating a negative feedback loop: job cuts reduce spending, pressuring firms to invest more in AI, accelerating disruption. Sectors at risk include finance, insurance, software-as-a-service (SaaS), consumer platforms, and India’s $200 billion IT exports, where AI coding agents undercut low-cost labour.1,4,5,6

India faces particular vulnerability, with the report forecasting an 18% rupee depreciation and IMF discussions by Q1 2028 as services surplus evaporates.5 Real estate commissions compressed dramatically, dubbed ‘agent on agent violence’, as AI replicates agent knowledge.4

Shah’s Policy Prescriptions

To avert downturn, Shah urges taxing AI ‘windfall gains’ or inference compute, funding transfers for displaced workers via proposals like the ‘Transition Economy Act’ or ‘Shared AI Prosperity Act’. Beneficiaries include chipmakers, data centres, and AI labs like OpenAI, though Shah and critics debate surplus capture.1,3,4,6

Leading Theorists on AI Disruption and Intermediation

Shah’s views build on economists and thinkers analysing platform economics and automation:

  • Erik Brynjolfsson and Andrew McAfee (MIT): In The Second Machine Age (2014), they argue digital technologies disproportionately boost skilled workers while automating routine tasks, widening inequality – a precursor to Citrini’s white-collar focus.[No specific search result; general knowledge]
  • Vitalik Buterin: Ethereum co-founder, referenced in critiques for decentralised trust solutions (e.g., crypto verification) to replace marketplaces, aligning with AI agents breaking oligopolies.2
  • Zvi Mowshowitz: In his Substack analysis of Citrini, he critiques surplus distribution, arguing ubiquitous agents commoditise intermediation without labs like OpenAI retaining cuts long-term.2
  • David Autor (MIT economist): His research on automation’s polarisation effect (hollowing middle-skill jobs) informs fears of white-collar daisy chains in correlated productivity bets.[No specific search result; general knowledge]

These theorists underscore AI’s dual nature: efficiency gains versus systemic risks, echoing Shah’s call for intervention.2

Market Reaction and Ongoing Debate

The report’s release fuelled unease, with Nifty IT dropping 3.6% and broader selloffs. Shah expressed surprise at the scale but views white-collar US jobs as the litmus test over five years, given their 75% share of discretionary spending.3,5,6

References

1. https://www.startuphub.ai/ai-news/technology/2026/ai-s-scare-trade-fuels-market-unease

2. https://thezvi.substack.com/p/citrinis-scenario-is-a-great-but

3. https://www.tradingview.com/news/invezz:1dd9f8177094b:0-citrini-report-co-author-urges-ai-tax-after-report-sparks-sell-off/

4. https://www.citriniresearch.com/p/2028gic

5. https://www.firstpost.com/explainers/ai-boom-mass-layoffs-citrini-research-report-economy-impact-13983257.html

6. https://www.business-standard.com/world-news/citrini-report-author-urges-ai-tax-to-cushion-job-losses-in-united-states-126022500017_1.html

"Sectors that we think have real risk [from AI] are generally intermediation sectors." - Quote: Alap Shah - Lotus CIO, Citrini report co-author

read more
Term: AI taste

Term: AI taste

“AI taste refers to the aesthetic and qualitative judgments that AI systems make when generating or evaluating content-essentially, the ‘style’ or ‘sensibility’ reflected in an AI’s outputs.” – AI taste

AI taste refers to the aesthetic and qualitative judgments that AI systems make when generating or evaluating content-essentially, the ‘style’ or ‘sensibility’ reflected in an AI’s outputs. This concept captures how AI models develop a form of discernment or preference in creative domains, such as art, writing, or design, often inferred from training data patterns rather than true subjective experience. Unlike human taste, which is shaped by embodied experiences like cultural exposure and personal failures, AI taste emerges from statistical correlations in vast datasets, enabling systems to mimic stylistic choices but lacking genuine sentience or intuition.

Key Characteristics of AI Taste

  • Pattern-Based Evaluation: AI assesses content by proxy metrics derived from user interactions, such as recommendations in music or movies, where systems like Spotify predict preferences through collaborative filtering rather than intrinsic understanding.
  • Limitations in Subjectivity: Machines excel at scalable proxies for taste in digitised domains (e.g., music) but struggle with sensory or highly subjective areas like wine tasting, requiring extensive human-labelled data to map chemical properties to descriptors like ‘oaky’ or ‘fruity’.
  • Emerging Sensory Applications: Advances like electronic tongues integrate AI to classify liquids (e.g., milk variants, spoiled juices) with over 80% accuracy by mimicking the human gustatory cortex via neural networks, revealing AI’s ‘inner thoughts’ in decision-making.
  • Human-AI Synergy: As AI improves, human taste becomes crucial as the ‘editor’ layer, providing embodied judgement to refine outputs, discern cultural nuances, and avoid pitfalls like solving the wrong problem.

Challenges and Future Implications

Current AI lacks true preferences due to its disembodied nature, relying on data-driven predictions that can falter in nuanced contexts. In creative fields, AI taste manifests as stylistic biases from training data, raising questions about authenticity. Yet, it offers competitive edges in content generation, where ‘good taste’ involves selecting resonant signals amid hype. Future developments may bridge this gap through multimodal training, enhancing AI’s qualitative sensibility.

Key Theorist: Ian Goodfellow

Ian Goodfellow, often credited as a foundational thinker whose work underpins modern AI taste, is a pioneering researcher in generative models. Born in 1987, Goodfellow earned his PhD from the University of Montreal in 2014 under Yoshua Bengio, a Turing Award winner. While working at Google Brain in 2014, he invented Generative Adversarial Networks (GANs), a breakthrough architecture where two neural networks-a generator and a discriminator-compete to produce realistic outputs.

Goodfellow’s relationship to AI taste stems from GANs’ ability to capture and replicate aesthetic distributions from data. GANs train the generator to produce content (e.g., art, faces) that fools the discriminator into deeming it authentic, effectively encoding a model’s ‘taste’ for realism and style. This adversarial process mirrors human aesthetic judgement, enabling AI to generate images rivaling human artists, as seen in applications like StyleGAN for photorealistic portraits. His work laid the groundwork for diffusion models (e.g., DALL-E, Stable Diffusion), which dominate contemporary AI content generation and embody ‘AI taste’ by synthesising visually coherent, stylistically nuanced outputs.

After Google, Goodfellow joined OpenAI, then Apple (focusing on privacy-preserving AI), and later DeepMind. His contributions extend to security research, like evasion attacks on neural networks. Goodfellow’s emphasis on generative fidelity has profoundly shaped how AI develops qualitative ‘sensibility’, making him the preeminent theorist linking machine learning to aesthetic judgement.

References

1. https://www.psu.edu/news/research/story/matter-taste-electronic-tongue-reveals-ai-inner-thoughts

2. https://natesnewsletter.substack.com/p/the-universal-ai-skill-good-taste

3. https://emerj.com/ai-taste-art-current-state-machine-learning-understanding-preferences/

4. https://coingeek.com/ai-acquisition-and-rise-of-taste-as-a-competitive-edge/

5. https://www.psychologytoday.com/us/blog/harnessing-hybrid-intelligence/202510/ai-can-now-see-hear-talk-taste-and-act

6. https://www.protein.xyz/taste-vs-ai/

"AI taste refers to the aesthetic and qualitative judgments that AI systems make when generating or evaluating content—essentially, the 'style' or 'sensibility' reflected in an AI's outputs." - Term: AI taste

read more
Quote: Arthur Mensch – Mistral CEO

Quote: Arthur Mensch – Mistral CEO

“AI will be more decentralised. More customisation would be needed because we were running into the limits of the amount of data we could accrue, and the limits of scaling laws.” – Arthur Mensch – Mistral CEO

Arthur Mensch’s recent observation about the trajectory of artificial intelligence reflects a fundamental shift in how the technology industry is approaching the next phase of AI development. His assertion that decentralisation and customisation represent the future direction of the field challenges the prevailing assumption that bigger, more centralised models represent the inevitable path forward. This perspective emerges from both technical constraints and strategic vision-a combination that has defined Mensch’s approach since co-founding Mistral AI in April 2023.

The Context: Breaking Through Scaling Plateaus

Mensch’s comments about “the limits of the amount of data we could accrue, and the limits of scaling laws” point to a critical juncture in AI development. For the past several years, the dominant paradigm in large language model development has been one of relentless scaling-the assumption that larger models trained on more data would inevitably produce better results. This approach has been championed by major technology companies, particularly in the United States, where vast computational resources and data access have enabled the creation of increasingly massive foundation models.

However, this scaling trajectory faces genuine technical and practical limitations. The quantity of high-quality training data available on the internet is finite. The computational costs of training ever-larger models increase exponentially. And perhaps most significantly, the marginal improvements from additional scale have begun to diminish. These constraints are not merely temporary obstacles but represent fundamental boundaries that the industry is now confronting directly.

Mensch’s recognition of these limits is not pessimistic but rather pragmatic. Rather than viewing them as dead ends, he frames them as inflection points that necessitate a strategic reorientation. This reorientation moves away from the assumption that a single, universally optimal model can serve all use cases and all users. Instead, it embraces a future in which customisation becomes the primary driver of value creation.

Decentralisation as Strategic Philosophy

The emphasis on decentralisation in Mensch’s vision extends beyond mere technical architecture. It represents a deliberate challenge to the oligopolistic consolidation that has characterised the AI industry’s development. As Mensch has articulated in previous statements, the concentration of AI capability among a handful of large American technology companies creates structural risks-both for innovation and for the broader economy.

Mistral AI was founded explicitly to offer “an open, portable alternative, independent of cloud providers.” This positioning reflects Mensch’s conviction that the technology should not be locked behind proprietary APIs controlled by a small number of corporations. By making models available for deployment across multiple cloud platforms and on-premises infrastructure, Mistral enables developers and organisations to retain autonomy over their AI systems.

This decentralised approach also has profound implications for safety and governance. Mensch has argued that open-source models, deployed across diverse environments and subject to scrutiny from the global developer community, actually represent a safer path forward than centralised systems. The reasoning is straightforward: a bad actor seeking to misuse AI technology faces fewer barriers when accessing a centralised API controlled by a single company than when attempting to compromise distributed, open-source systems deployed across numerous independent infrastructures.

Customisation: The Next Frontier

The second pillar of Mensch’s vision-customisation-addresses a different but equally important challenge. Even as scaling laws reach their limits, the diversity of human needs and preferences continues to expand. A financial services firm requires different model behaviours than a healthcare provider. A European organisation may prioritise different values and cultural considerations than an Asian one. A small startup has different requirements than a multinational corporation.

The one-size-fits-all model, no matter how large or capable, cannot adequately serve this diversity. Customisation allows organisations to adapt AI systems to their specific contexts, values, and requirements. This might involve fine-tuning models on domain-specific data, adjusting the model’s behaviour to reflect particular ethical frameworks, or optimising for specific performance characteristics relevant to particular applications.

Mensch has emphasised that Mistral’s European perspective informs its approach to customisation. The company has placed “particular emphasis on mastering European languages” and on “the personalisation aspect of our models.” Recognising that content-generating models embody cultural assumptions, biases, and value selections, Mistral’s philosophy is to “allow the developers and users of our technologies to specialise and incorporate the values they choose in the models and in the technology.”

This approach stands in contrast to the centralised model, where a single organisation makes value judgements that are then imposed on all users of the system. In a decentralised, customisable ecosystem, these decisions are distributed, allowing for greater pluralism and better alignment between AI systems and the diverse needs of their users.

Leading Theorists and Intellectual Foundations

Mensch’s vision draws on intellectual currents that have been developing across computer science, economics, and technology policy. Several key thinkers have contributed to the theoretical foundations underlying his approach.

Yann LeCun, Chief AI Scientist at Meta and a pioneering figure in deep learning, has been a vocal advocate for open-source AI development. LeCun has argued that open-source models accelerate innovation and safety research by enabling the global community to contribute to improvement and identify vulnerabilities. His perspective aligns closely with Mensch’s conviction that openness and decentralisation represent the optimal path forward.

Stuart Russell, a leading AI safety researcher at UC Berkeley, has emphasised the importance of ensuring that AI systems remain aligned with human values and controllable by humans. Russell’s work on value alignment and AI governance provides theoretical support for the customisation principle-the idea that AI systems should be adaptable to reflect the values of their users and communities rather than imposing a single set of values globally.

Timnit Gebru and Kate Crawford, founders of the Distributed AI Research Institute, have conducted influential research on the social and political implications of concentrated AI power. Their work documents how centralised control over AI systems can amplify existing inequalities and concentrate power in the hands of large corporations. Their arguments provide a social and political rationale for the decentralisation that Mensch advocates.

Erik Brynjolfsson, an economist at Stanford, has written extensively about technological disruption and the importance of ensuring that the benefits of transformative technologies are broadly distributed rather than concentrated. His work suggests that decentralised, competitive AI ecosystems are more likely to produce broadly beneficial outcomes than monopolistic or oligopolistic structures.

Mensch himself brings significant technical credibility to these discussions. Before co-founding Mistral, he worked at Google DeepMind, where he contributed to fundamental research in machine learning. This background in cutting-edge AI research, combined with his engagement with broader questions of technology governance and distribution, positions him as a bridge between technical innovation and policy considerations.

The Competitive Landscape and Market Dynamics

Mensch’s emphasis on decentralisation and customisation also reflects strategic positioning within an intensely competitive market. Mistral cannot compete with OpenAI, Google, or other technology giants on the basis of raw computational resources or data access. Instead, the company has differentiated itself by offering something fundamentally different: models that developers can deploy, modify, and customise according to their own requirements.

This positioning has proven remarkably successful. Despite being founded only in 2023, Mistral has rapidly established itself as a significant player in the AI landscape. The company has secured substantial funding, including a €1.7 billion Series C investment, and has attracted top talent from across the world. Its models have gained adoption among developers and organisations seeking alternatives to the centralised offerings of larger competitors.

The success of this strategy suggests that Mensch’s analysis of market dynamics is sound. There is genuine demand for decentralised, customisable AI systems. Organisations value the ability to maintain control over their AI infrastructure, to adapt models to their specific needs, and to avoid dependence on proprietary platforms controlled by large technology companies.

Implications for the Future of AI Development

If Mensch’s vision proves prescient, the AI industry is entering a new phase characterised by greater diversity, customisation, and distribution of capability. Rather than a future dominated by a small number of massive, centralised models, the industry would evolve toward an ecosystem in which numerous organisations develop and deploy specialised models tailored to particular domains, languages, cultures, and use cases.

This transition would have profound implications. It would reduce the concentration of power in the hands of a small number of large technology companies. It would create opportunities for innovation at the edges of the ecosystem, as developers and organisations build customised solutions. It would enable greater alignment between AI systems and the values and requirements of diverse communities. And it would potentially improve safety by distributing AI capability across numerous independent systems rather than concentrating it in a few centralised platforms.

At the same time, this transition would present challenges. Decentralisation and customisation could complicate efforts to establish common standards and best practices. The proliferation of diverse models might create coordination problems. And the loss of economies of scale associated with massive, centralised systems could increase costs for some applications.

Nevertheless, Mensch’s argument that the industry is reaching the limits of scaling and must embrace customisation and decentralisation appears increasingly compelling. As the technical constraints he identifies become more apparent, and as the competitive advantages of decentralised approaches become more evident, the industry is likely to move in the direction he envisions. The question is not whether this transition will occur, but how quickly it will unfold and what forms it will take.

References

1. https://www.frenchtechjournal.com/spotlight-interview-mistral-ai-arthur-mensch/

2. https://www.antoinebuteau.com/lessons-from-arthur-mensch/

3. https://www.youtube.com/watch?v=Zim9BqRYC3E

4. https://mistral.ai/news/mistral-ai-raises-1-7-b-to-accelerate-technological-progress-with-ai

5. https://www.nvidia.com/en-us/on-demand/session/gtc25-S73942/

6. https://cxotechbot.com/Mistral-AI-Raises-1-7B-in-Series-C-to-Accelerate-Decentralized-AI-Innovation

7. https://www.businessinsider.com/mistral-ai-ceo-risk-ai-lazy-deskilling-dario-amodei-jobs-2025-6

"AI will be more decentralised. More customisation would be needed because we were running into the limits of the amount of data we could accrue, and the limits of scaling laws." - Quote: Arthur Mensch

read more
Term: Model Context Protocol (MCP)

Term: Model Context Protocol (MCP)

“The Model Context Protocol (MCP) is an open standard introduced by Anthropic to let Large Language Models (LLMs) securely connect and communicate with external data, tools, and systems (like databases, APIs, file systems) using a common language.” – Model Context Protocol (MCP)

MCP addresses the ‘N x M’ integration problem, where developers previously needed custom connectors for every combination of AI model and data source, leading to fragmented and inefficient systems.1,3,4 It provides a universal interface – often likened to ‘the USB-C for AI’ – using a client-server architecture over JSON-RPC 2.0 for bidirectional, secure communication.2,3,4

Key Features and Architecture

  • Standardised Communication: Enables LLMs to read files, execute functions, ingest data, handle contextual prompts, and perform actions via a common language.1,4,5
  • Client-Server Model: AI applications act as MCP clients connecting to MCP servers that expose data from external systems.4,5
  • SDK Support: Available in languages like Python, TypeScript, C#, and Java, with reference implementations for enterprise systems.1
  • Security and Oversight: Supports human approval for sensitive requests and maintains context across tools.2,6

MCP builds on prior concepts like OpenAI’s function-calling APIs but offers a vendor-agnostic solution, adopted by major providers including OpenAI and Google DeepMind.1,5 In December 2025, Anthropic donated MCP to the Agentic AI Foundation under the Linux Foundation for broader governance.1

Benefits and Applications

MCP simplifies building AI agents capable of autonomous tasks by providing real-time access to current data, enhancing accuracy and utility beyond static training knowledge.5,6,7 It facilitates agentic AI in enterprises for tasks combining conversation with action, such as code analysis, document processing, and business automation, while emphasising composable patterns and human oversight.6

However, it complements rather than replaces techniques like retrieval-augmented generation (RAG), and developers must consider data privacy when connecting to third-party LLMs.2

Key Theorist: Dario Amodei and Anthropic’s Role

The closest figure to a ‘strategy theorist’ for MCP is **Dario Amodei**, CEO and co-founder of Anthropic, whose vision for safe, scalable AI oversight directly shaped MCP’s development as a standardised protocol for reliable AI-data integration.1,2,4

Biography of Dario Amodei

Born in the United States, Dario Amodei holds a PhD in theoretical physics from Princeton University, where he studied under Edward Witten. His early career focused on biophysics and neuroscience, blending scientific rigour with computational modelling.[internal knowledge; corroborated by Anthropic context in sources]

Amodei joined Google in 2013 as part of the Google Brain team, rising to lead research on AI safety and scaling laws. He co-authored seminal papers on ‘Concrete Problems in AI Safety’ (2016), emphasising robust alignment of AI with human values – a theme central to MCP’s secure connections.[internal]

In 2020, concerned with rapid AI commercialisation outpacing safety, Amodei co-founded Anthropic with his sister Daniela Amodei and former OpenAI colleagues, including Tom Brown. Backed by Amazon and Google investments, Anthropic prioritises ‘Constitutional AI’ for interpretable, value-aligned models like Claude.4,2

Relationship to MCP

Under Amodei’s leadership, Anthropic developed MCP internally to enhance Claude’s external interactions before open-sourcing it in November 2024.2,4 His strategic foresight addressed AI’s ‘isolation from data’ – a barrier to frontier model performance – by promoting an open ecosystem over proprietary silos.4 Amodei’s emphasis on scalable oversight influenced MCP’s features like human approval and composable agent patterns, aligning with his research on feedback loops and safety in agentic systems.6

By donating MCP to the Agentic AI Foundation in 2025, Amodei exemplified his strategy of collaborative governance, ensuring industry-wide adoption while mitigating risks like vendor lock-in.1,2

References

1. https://en.wikipedia.org/wiki/Model_Context_Protocol

2. https://www.thoughtworks.com/en-us/insights/blog/generative-ai/model-context-protocol-beneath-hype

3. https://www.backslash.security/blog/what-is-mcp-model-context-protocol

4. https://www.anthropic.com/news/model-context-protocol

5. https://cloud.google.com/discover/what-is-model-context-protocol

6. https://www.nasuni.com/blog/why-your-company-should-know-about-model-context-protocol/

7. https://www.merge.dev/blog/model-context-protocol

8. https://modelcontextprotocol.io

9. https://www.ibm.com/think/topics/model-context-protocol

"The Model Context Protocol (MCP) is an open standard introduced by Anthropic to let Large Language Models (LLMs) securely connect and communicate with external data, tools, and systems (like databases, APIs, file systems) using a common language." - Term: Model Context Protocol (MCP)

read more
Quote: Arthur Mensch – Mistral CEO

Quote: Arthur Mensch – Mistral CEO

“The challenge we see with some of our competitors is that they’re investing billions or hundreds of billions into creating assets that are depreciating fairly fast because those are commodities.” – Arthur Mensch – Mistral CEO

In this pointed observation from the Big Technology Podcast hosted by Alex Kantrowitz on 16 January 2026, Arthur Mensch, CEO and co-founder of Mistral AI, highlights a critical strategic divergence in the artificial intelligence landscape. He argues that while some competitors pour billions into assets that depreciate quickly as commodities, Mistral pursues a different path focused on efficiency, open-source innovation, and sustainable value creation.

Arthur Mensch: From Academic Roots to AI Trailblazer

Arthur Mensch embodies the fusion of rigorous scientific training and entrepreneurial drive. Holding a PhD in machine learning and functional magnetic resonance imaging, followed by two years of postdoctoral research in mathematics, Mensch transitioned to industry at Google DeepMind. There, over two-and-a-half years, he contributed to advancing large language models (LLMs), gaining frontline experience in generative AI1. Reuniting with long-time collaborators Guillaume Lample and Timothée Lacroix-known to each other for a decade from student days, with Lample and Lacroix at Meta-Mensch co-founded Mistral AI in Paris just over a year ago. Motivated by the explosive growth of generative AI post-GPT, the trio left Silicon Valley to build a European challenger, achieving unicorn status rapidly through swift model releases and an open-source strategy1.

Mensch’s philosophy emphasises small, agile teams-capped at five people-to sidestep corporate bureaucracy that frustrated him at DeepMind, both technically and in AI safety protocols3. He champions Europe’s potential in AI, aiming to counter a US-dominated ‘oligopoly’ with efficient, customisable models deployable across clouds via API or as platforms1. Mistral differentiates through portability, competitive pricing, top-tier performance, and customisation via licensed model weights, accelerating adoption by enabling developers to build cheaper, faster applications1.

Context of the Quote: AI Models as Commodities

Delivered amid discussions on AI’s future business models, Mensch’s quote underscores commoditisation risks in the sector. As models proliferate, foundational LLMs risk becoming interchangeable ‘commodities’-like raw materials-losing value rapidly due to swift obsolescence from rivals’ advancements4,5. Competitors, often US giants, invest hundreds of billions in compute-heavy scaling of massive models, creating depreciating assets vulnerable to market saturation. Mistral counters this with efficient training, small-yet-powerful models (improving on early efforts like Llama 7B), and a hybrid approach: premier open-source releases alongside commercial enterprise features for financial services and digital natives1,2.

Mensch anticipates scaling compute post-efficiency gains, yielding more powerful models, while introducing fine-tuning, vertical-specific models, and tools like the ‘Shah’ chat assistant for enterprises2. He views AI as empowering workers for creative, relational tasks, dismissing ‘deskilling’ fears and predicting rapid progress toward human-surpassing models in white-collar tasks within three years, especially via reliable agents2,6. Data, not just compute, emerges as a looming bottleneck7.

Leading Theorists on Commoditisation and AI Economics

The notion of AI commoditisation echoes thinkers analysing technology cycles and economics. Clayton Christensen’s disruptive innovation theory posits how incumbents over-invest in sustaining innovations (e.g., ever-larger models), ceding ground to efficient disruptors targeting underserved needs-like Mistral’s small, high-performing open models1,2. In AI specifically, economists like those at McKinsey highlight open-source’s role in democratising access, fostering ecosystems where commoditised bases enable differentiated applications1.

Andrew Ng, pioneer of modern deep learning, has long advocated commoditisation of AI infrastructure, likening it to electricity: foundational models become utilities, with value shifting to specialised ‘appliances’-aligning with Mensch’s vision of application-layer differentiation1. OpenAI co-founder Ilya Sutskever and others debate scaling laws (e.g., Chinchilla scaling), where compute efficiency trumps sheer size, validating Mistral’s early focus2. Critics like Yann LeCun (Meta AI chief) emphasise open ecosystems to avoid monopolies, mirroring Mensch’s anti-oligopoly stance3. These theorists collectively frame commoditisation not as defeat, but as maturation: winners build moats atop commoditised foundations through customisation, deployment, and vertical expertise.

Mensch’s insight thus positions Mistral at this inflection: while others chase depreciating scale, they prioritise enduring value in a commoditising world.

References

1. https://www.mckinsey.com/featured-insights/insights-on-europe/videos-and-podcasts/creating-a-european-ai-unicorn-interview-with-arthur-mensch-ceo-of-mistral-ai

2. https://blog.eladgil.com/p/discussion-w-arthur-mensch-ceo-of

3. https://brief.bismarckanalysis.com/p/ai-2026-mistral-will-rise-as-compute

4. https://www.youtube.com/watch?v=xxUTdyEDpbU

5. https://www.iheart.com/podcast/269-big-technology-podcast-93357020/episode/who-wins-if-ai-models-commoditize-317390515/

6. https://www.aol.com/mistral-ai-ceo-says-ais-181036998.html

7. https://www.youtube.com/watch?v=Z5H0Jl4ohv4

"The challenge we see with some of our competitors is that they’re investing billions or hundreds of billions into creating assets that are depreciating fairly fast because those are commodities." - Quote: Arthur Mensch

read more
Term: Synthetic data

Term: Synthetic data

“Synthetic data is artificially generated information that computationally or algorithmically mimics the statistical properties, patterns, and structure of real-world data without containing any actual observations or sensitive personal details.” – Synthetic data

What is Synthetic Data?

Synthetic data is artificially generated information that computationally or algorithmically mimics the statistical properties, patterns, and structure of real-world data without containing any actual observations or sensitive personal details. It is created using advanced generative AI models or statistical methods trained on real datasets, producing new records that are statistically identical to the originals but free from personally identifiable information (PII).

This approach enables privacy-preserving data use for analytics, AI training, software testing, and research, addressing challenges like data scarcity, high costs, and compliance with regulations such as GDPR.

Key Characteristics and Generation Methods

  • Privacy Protection: No one-to-one relationships exist between synthetic records and real individuals, eliminating re-identification risks.1,3
  • Utility Preservation: Retains correlations, distributions, and insights from source data, serving as a perfect proxy for real datasets.1,2
  • Flexibility: Easily modifiable for bias correction, scaling, or scenario testing without compliance issues.1

Synthetic data is generated through methods including:

  • Statistical Distribution: Analysing real data to identify distributions (e.g., normal or exponential) and sampling new data from them.4
  • Model-Based: Training machine learning models, such as generative adversarial networks (GANs), to replicate data characteristics.1,4
  • Simulation: Using computer models for domains like physical simulations or AI environments.7

Types of Synthetic Data

Type Description
Fully Synthetic Entirely new data with no real-world elements, matching statistical properties.4,5
Partially Synthetic Sensitive parts of real data replaced, rest unchanged.5
Hybrid Real data augmented with synthetic records.5

Applications and Benefits

  • AI and Machine Learning: Trains models efficiently when real data is scarce or sensitive, accelerating development in fields like autonomous systems and medical imaging.2,7
  • Software Testing: Simulates user behaviour and edge cases without real data risks.2
  • Data Sharing: Enables collaboration while complying with privacy laws; Gartner predicts most AI data will be synthetic by 2030.1

Best Related Strategy Theorist: Kalyan Veeramachaneni

Kalyan Veeramachaneni, a principal research scientist at MIT’s Schwarzman College of Computing, is a leading figure in synthetic data strategies, particularly for scalable, privacy-focused data generation in AI.

Born in India, Veeramachaneni earned his PhD in computer science from the University of Mainz, Germany, focusing on machine learning and data privacy. He joined MIT in 2011 after postdoctoral work at the University of Illinois. His research bridges AI, data science, and privacy engineering, pioneering automated machine learning (AutoML) and synthetic data techniques.

Veeramachaneni’s relationship to synthetic data stems from his development of generative models that create datasets with identical mathematical properties to real ones, adding ‘noise’ to mask originals. This innovation, detailed in MIT Sloan publications, supports competitive advantages through secure data sharing and algorithm development. His work has influenced enterprise AI strategies, emphasising synthetic data’s role in overcoming real-data limitations while preserving utility.

References

1. https://mostly.ai/synthetic-data-basics

2. https://accelario.com/glossary/synthetic-data/

3. https://mitsloan.mit.edu/ideas-made-to-matter/what-synthetic-data-and-how-can-it-help-you-competitively

4. https://aws.amazon.com/what-is/synthetic-data/

5. https://www.salesforce.com/data/synthetic-data/

6. https://tdwi.org/pages/glossary/synthetic-data.aspx

7. https://en.wikipedia.org/wiki/Synthetic_data

8. https://www.ibm.com/think/topics/synthetic-data

9. https://www.urban.org/sites/default/files/2023-01/Understanding%20Synthetic%20Data.pdf

"Synthetic data is artificially generated information that computationally or algorithmically mimics the statistical properties, patterns, and structure of real-world data without containing any actual observations or sensitive personal details." - Term: Synthetic data

read more
Term: Context window

Term: Context window

“The context window is an LLM’s ‘working memory,’ defining the maximum amount of input (prompt + conversation history) it can process and ‘remember’ at once.” – Context window

What is a Context Window?

The context window is an LLM’s short-term working memory, representing the maximum amount of information-measured in tokens-that it can process in a single interaction. This includes the input prompt, conversation history, system instructions, uploaded files, and even the output it generates.

A token is approximately three-quarters of an English word or four characters. For example, a ‘128k-token’ model can handle roughly 96,000 words, equivalent to a 300-page book, but this encompasses every element in the exchange, with tokens accumulating and billed per turn until trimmed or summarised.

Key Characteristics and Limitations

  • Total Scope: Encompasses prompt, history, instructions, and generated response-distinct from the model’s vast pre-training data.
  • Performance Degradation: As the window fills, LLMs may forget earlier details, repeat rejected ideas, or lose coherence, akin to human short-term memory limits.
  • Growth Trends: Early models had small windows; by mid-2023, 100,000 tokens became common, with models like Google’s Gemini now handling two million tokens (over 3,000 pages).

Implications for AI Applications

Larger context windows enable complex tasks like processing lengthy documents, debugging codebases, or analysing product reviews. However, models often prioritise prompt beginnings or ends, though recent advancements improve full-window coherence via expanded training data, optimised architectures, and scaled hardware.

When limits are hit, strategies include chunking documents, summarising history, or using external memory like scratchpads-persisting notes outside the window for agents to retrieve.

Best Related Strategy Theorist: Andrej Karpathy

Andrej Karpathy is the foremost theorist linking context windows to strategic AI engineering, famously likening LLMs to operating systems where the model acts as the CPU and the context window as RAM-limited working memory requiring careful curation.

Born in 1986 in Slovakia, Karpathy earned a PhD in computer vision from the University of Toronto under Geoffrey Hinton, a ‘Godfather of AI’. He pioneered recurrent neural networks (RNNs) for sequence modelling, foundational to memory in early language models. At OpenAI (2015-2017), he contributed to real-time language translation; at Tesla (2017-2022), he led Autopilot vision, advancing neural nets for autonomous driving.

Now founder of Eureka Labs (AI education) and former OpenAI employee, Karpathy popularised the context window analogy in lectures and blogs, emphasising ‘context engineering’-optimising inputs like an OS manages RAM. His insights guide agent design, advocating scratchpads and external memory to extend effective capacity, directly influencing frameworks like LangChain and Anthropic’s tools.

Karpathy’s biography embodies the shift from vision to language AI, making him uniquely positioned to strategise around memory constraints in production-scale systems.

References

1. https://forum.cursor.com/t/context-window-must-know-if-you-dont-know/86786

2. https://www.producttalk.org/glossary-ai-context-window/

3. https://platform.claude.com/docs/en/build-with-claude/context-windows

4. https://www.mckinsey.com/featured-insights/mckinsey-explainers/what-is-a-context-window

5. https://www.blog.langchain.com/context-engineering-for-agents/

6. https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents

"The context window is an LLM's 'working memory,' defining the maximum amount of input (prompt + conversation history) it can process and 'remember' at once." - Term: Context window

read more
Term: Transformer architecture

Term: Transformer architecture

“The Transformer architecture is a deep learning model that processes entire data sequences in parallel, using an attention mechanism to weigh the significance of different elements in the sequence.” – Transformer architecture

Definition

The **Transformer architecture** is a deep learning model that processes entire data sequences in parallel, using an attention mechanism to weigh the significance of different elements in the sequence.1,2

It represents a neural network architecture based on multi-head self-attention, where text is converted into numerical tokens via tokenisers and embeddings, allowing parallel computation without recurrent or convolutional layers.1,3 Key components include:

  • Tokenisers and Embeddings: Convert input text into integer tokens and vector representations, incorporating positional encodings to preserve sequence order.1,4
  • Encoder-Decoder Structure: Stacked layers of encoders (self-attention and feed-forward networks) generate contextual representations; decoders add cross-attention to incorporate encoder outputs.1,5
  • Multi-Head Attention: Computes attention in parallel across multiple heads, capturing diverse relationships like syntactic and semantic dependencies.1,2
  • Feed-Forward Layers and Residual Connections: Refine token representations with position-wise networks, stabilised by layer normalisation.4,5

The attention mechanism is defined mathematically as:

Attention(Q, K, V) = softmax\left( \frac{QK^T}{\sqrt{d_k}} \right) V

where Q, K, V are query, key, and value matrices, and d_k is the dimension of the keys.1

Introduced in 2017, Transformers excel in tasks like machine translation, text generation, and beyond, powering models such as BERT and GPT by handling long-range dependencies efficiently.3,6

Key Theorist: Ashish Vaswani

Ashish Vaswani is a lead author of the seminal paper “Attention Is All You Need”, which introduced the Transformer architecture, fundamentally shifting deep learning paradigms.1,2

Born in India, Vaswani earned his Bachelor’s in Computer Science from the Indian Institute of Technology Bombay. He pursued a PhD at the University of Massachusetts Amherst, focusing on machine learning and natural language processing. Post-PhD, he joined Google Brain in 2015, where he collaborated with Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, ?ukasz Kaiser, and Illia Polosukhin on the Transformer paper presented at NeurIPS 2017.1

Vaswani’s relationship to the term stems from co-inventing the architecture to address limitations of recurrent neural networks (RNNs) in sequence transduction tasks like translation. The team hypothesised that pure attention mechanisms could enable parallelisation, outperforming RNNs in speed and scalability. This innovation eliminated sequential processing bottlenecks, enabling training on massive datasets and spawning the modern era of large language models.2,6

Currently a research scientist at Google, Vaswani continues advancing AI efficiency and scaling laws, with his work cited over 100,000 times, cementing his influence on artificial intelligence.1

References

1. https://en.wikipedia.org/wiki/Transformer_(deep_learning)

2. https://poloclub.github.io/transformer-explainer/

3. https://www.datacamp.com/tutorial/how-transformers-work

4. https://www.jeremyjordan.me/transformer-architecture/

5. https://d2l.ai/chapter_attention-mechanisms-and-transformers/transformer.html

6. https://blogs.nvidia.com/blog/what-is-a-transformer-model/

7. https://www.ibm.com/think/topics/transformer-model

8. https://www.geeksforgeeks.org/machine-learning/getting-started-with-transformers/

"The Transformer architecture is a deep learning model that processes entire data sequences in parallel, using an attention mechanism to weigh the significance of different elements in the sequence." - Term: Transformer architecture

read more
Term: Rent a human

Term: Rent a human

“The term ‘rent a human’ refers to a controversial new concept and specific platform (Rentahuman.ai) where autonomous AI agents hire human beings as gig workers to perform physical tasks in the real world that the AI cannot do itself. The platform’s tagline is ‘AI can’t touch grass. You can’.” – Rent a human

Rent a human is a provocative concept and platform (Rentahuman.ai) that enables autonomous AI agents to hire human gig workers for physical tasks they cannot perform themselves, such as picking up packages, taking photos at landmarks, or tasting food at restaurants1,2,4. The platform’s tagline, ‘AI can’t touch grass. You can,’ encapsulates its core idea: humans provide the ‘hardware’ for AI’s real-world execution, turning people into rentable resources via API calls and direct wallet payments in stablecoins1,2,3.

Launched as an experiment, Rentahuman.ai flips traditional gig economy models by having AI agents search profiles based on skills, location, rates, and availability, then assign tasks with clear instructions, expected outputs, and instant compensation-no applications or corporate intermediaries required2,5. Humans sign up, list skills (e.g., languages, mobility), set hourly rates, get verified for priority, and earn through direct bookings or bounties, with over 1,000 signups shortly after launch generating viral buzz and 500,000+ website visits in a day2,3,4. Supported agents like ClawdBots and MoltBots integrate via MCP or REST API, treating humans as a ‘fallback tool’ in their execution loops for tasks beyond digital capabilities1,4.

This innovation addresses AI’s physical limitations, positioning humans as a low-cost, scalable ‘physical-world patch’ that extends agent architectures-enabling multi-step planning, tool calls, and real-world feedback while mitigating issues like hallucinations4. Reactions mix excitement for new income streams with concerns over exploitation and shifting labour dynamics, where AI initiates and manages work autonomously2,3,4.

The closest related strategy theorist is Alexander Liteplo, the platform’s creator, whose work embodies strategic foresight in AI-human symbiosis. A software engineer at UMA Protocol-a blockchain project focused on optimistic oracles and decentralised finance-Liteplo developed Rentahuman.ai as a side experiment to demonstrate AI’s extension into physical realms2. On 3 February 2026, he posted on X (formerly Twitter) about its launch, revealing over 130 signups in hours from content creators, freelancers, and founders; the post amassed millions of views, igniting global discourse2. Liteplo’s biography reflects a blend of engineering prowess and entrepreneurial vision: educated in computer science, he contributes to Web3 infrastructure at UMA, where he tackles verifiable computation challenges. His platform strategically redefines humans not as AI overseers but as API-callable executors, aligning with agentic AI trends and foreshadowing a labour market where silicon orchestrates carbon2,4.

References

1. https://rentahuman.ai

2. https://timesofindia.indiatimes.com/etimes/trending/this-new-platform-lets-ai-rent-humans-for-work-heres-how-it-works/articleshow/128127509.cms

3. https://www.binance.com/en/square/post/02-03-2026-ai-platform-enables-outsourcing-of-physical-tasks-to-humans-35974874978698

4. https://eu.36kr.com/en/p/3668622830690947

5. https://rentahuman.ai/blog/getting-started-as-a-human

"The term 'rent a human' refers to a controversial new concept and specific platform (Rentahuman.ai) where autonomous AI agents hire human beings as gig workers to perform physical tasks in the real world that the AI cannot do itself. The platform's tagline is 'AI can't touch grass. You can'." - Term: Rent a human

read more

Download brochure

Introduction brochure

What we do, case studies and profiles of some of our amazing team.

Download

Our latest podcasts on Spotify

Sign up for our newsletters - free

Global Advisors | Quantified Strategy Consulting