ARTIFICIAL INTELLIGENCE
An AI-native strategy firmGlobal Advisors: a consulting leader in defining quantified strategy, decreasing uncertainty, improving decisions, achieving measureable results.
A Different Kind of Partner in an AI World
AI-native strategy
consulting
Experienced hires
We are hiring experienced top-tier strategy consultants
Quantified Strategy
Decreased uncertainty, improved decisions
Global Advisors is a leader in defining quantified strategies, decreasing uncertainty, improving decisions and achieving measureable results.
We specialise in providing highly-analytical data-driven recommendations in the face of significant uncertainty.
We utilise advanced predictive analytics to build robust strategies and enable our clients to make calculated decisions.
We support implementation of adaptive capability and capacity.
Our latest
Thoughts
Podcast – The Real AI Signal from Davos 2026
While the headlines from Davos were dominated by geopolitical conflict and debates on AGI timelines and asset bubbles, a different signal emerged from the noise. It wasn’t about if AI works, but how it is being ruthlessly integrated into the real economy.
In our latest podcast, we break down the “Diffusion Strategy” defining 2026.
3 Key Takeaways:
- China and the “Global South” are trying to leapfrog: While the West debates regulation, emerging economies are treating AI as essential infrastructure.
- China has set a goal for 70% AI diffusion by 2027.
- The UAE has mandated AI literacy in public schools from K-12.
- Rwanda is using AI to quadruple its healthcare workforce.
- The Rise of the “Agentic Self”: We aren’t just using chatbots anymore; we are employing agents. Entrepreneur Steven Bartlett revealed he has established a “Head of Experimentation and Failure” to use AI to disrupt his own business before competitors do. Musician will.i.am argued that in an age of predictive machines, humans must cultivate their “agentic self” to handle the predictable, while remaining unpredictable themselves.
- Rewiring the Core: Uber’s CEO Dara Khosrowshahi noted the difference between an “AI veneer” and a fundamental rewire. It’s no longer about summarising meetings; it’s about autonomous agents resolving customer issues without scripts.
The Global Advisors Perspective: Don’t wait for AGI. The current generation of models is sufficient to drive massive value today. The winners will be those who control their “sovereign capabilities” – embedding their tacit knowledge into models they own.
Read our original perspective here – https://with.ga/w1bd5
Listen to the full breakdown here – https://with.ga/2vg0z

Strategy Tools
Strategy Tools: The 7S Framework – A Comprehensive Guide
By John Khova Global Advisors digital consultant Introduction The McKinsey 7S Framework is one of the most enduring and widely recognised management models in strategic consulting and organisational design. It posits that organisational effectiveness depends not on...
Fast Facts
Fast Fact: Great returns aren’t enough
Key insights
It’s not enough to just have great returns – top-line growth is just as critical.
In fact, S&P 500 investors rewarded high-growth companies more than high-ROIC companies over the past decade.
While the distinction was less clear on the JSE, what is clear is that getting a balance of growth and returns is critical.
Strong and consistent ROIC or RONA performers provide investors with a steady flow of discounted cash flows – without growth effectively a fixed-income instrument.
Improvements in ROIC through margin improvements, efficiencies and working-capital optimisation provide point-in-time uplifts to share price.
Top-line growth presents a compounding mechanism – ROIC (and improvements) are compounded each year leading to on-going increases in share price.
However, without acceptable levels of ROIC, the benefits of compounding will be subdued and share price appreciation will be depressed – and when ROIC is below WACC value will be destroyed.
Maintaining high levels of growth is not as sustainable as maintaining high levels of ROIC – while both typically decline as industries mature, growth is usually more affected.
Getting the right balance between ROIC and growth is critical to optimising shareholder value.
Selected News
Term: Model density
“Model density” in AI, particularly regarding LLMs, is a performance-efficiency metric defined as the ratio of a model’s effective capability (performance) to its total parameter size.” – Model density
Model density represents a fundamental shift in how we measure artificial intelligence performance, moving beyond raw computational power to assess how effectively a model utilises its parameters. Rather than simply counting the number of parameters in a neural network, model density quantifies the ratio of effective capability to total parameter count, revealing how intelligently a model has been trained and architected.3
The Core Concept
At its essence, model density answers a critical question: how much useful intelligence does each parameter contribute? This metric emerged from the recognition that newer models achieve superior performance with fewer parameters than their predecessors, suggesting that progress in large language models stems not merely from scaling size, but from improving architecture, training data quality, and algorithmic efficiency.3
The concept can be understood through what researchers call capability density, formally defined as the ratio of a model’s effective parameter count to its actual parameter count.3 The effective parameter count is estimated by fitting scaling laws to existing models and determining how large a reference model would need to be to match current performance. When this ratio exceeds 1.0, it indicates that a model performs better than expected for its size-a hallmark of efficient design.
Information Compression and the “Great Squeeze”
Model density becomes particularly illuminating when examined through the lens of information compression. Modern large language models achieve remarkable density through what has been termed “the Great Squeeze”-the process of compressing vast training datasets into mathematical representations.1
Consider the Llama 3 family as a concrete example. During training, the model encountered approximately 15 trillion tokens of information. If stored in a traditional database, this would require 15 to 20 terabytes of raw data. The resulting Llama 3 70B model, however, contains only 70 billion parameters with a final weight of roughly 140 gigabytes-representing a 100:1 reduction in physical size.1 This translates to a squeeze ratio where each parameter has “seen” over 200 different tokens of information during training.1
The smaller Llama 3 8B model demonstrates even more extreme density, compressing 15 trillion tokens into 8 billion parameters-a ratio of nearly 1,875 tokens per parameter.1 This extreme over-training paradoxically enables superior reasoning capabilities, as the higher density of learned experience per parameter allows the model to extract more nuanced patterns from its training data.
Semantic Density and Output Reliability
Beyond parameter efficiency, model density extends to the quality and consistency of outputs. Semantic density measures the confidence level of an LLM’s response by analysing how probable and semantically consistent the generated answer is.2 This metric evaluates how well each answer aligns with alternative responses and the query’s overall context, functioning as a post-processing step that requires no retraining or fine-tuning.2
High semantic density indicates strong understanding of a topic and internal consistency, resulting in more reliable outputs.2 This proves particularly valuable given that LLMs lack built-in confidence measures and can produce outputs that sound authoritative even when incorrect or misleading.5 By generating multiple responses and computing confidence scores between 0 and 1, semantic density identifies responses located in denser regions of output semantic space-and therefore more trustworthy.5
Intelligence Density in Practical Application
Beyond parameter ratios, practitioners increasingly focus on intelligence density as the amount of useful intelligence produced per unit of time or computational resource.4 This reframing acknowledges that once models achieve sufficient peak intelligence for their intended tasks, the primary constraint shifts from maximum capability to the density of intelligence they can produce.4 In customer support and similar domains, this means optimising the quantity of intelligence produced per second becomes more valuable than pursuing ever-higher peak performance.4
This principle reveals that high-enough peak intelligence is necessary but not sufficient; once achieved, value creation moves towards latency and density optimisation, where significant opportunities for differentiation remain under-explored and are cheaper to capture.4
The Exponential Progress Trend
Research indicates that the best-performing models at each time point show rising capability density, with newer models achieving given performance levels with fewer parameters than older models.3 This trend appears approximately exponential over time, suggesting that progress in large language models is fundamentally about improving efficiency rather than simply scaling up.3 This observation underscores that tracking parameter efficiency is essential for understanding future directions in natural language processing and machine learning.
Related Theorist: Ilya Sutskever and Scaling Laws
The theoretical foundations of model density connect deeply to the work of Ilya Sutskever, Chief Scientist at OpenAI and a pioneering researcher in understanding how neural networks scale. Sutskever’s research on scaling laws-particularly his work demonstrating predictable relationships between model size, data size, and performance-provided the mathematical framework upon which modern density metrics rest.
Born in 1986 in Yegoryevsk, Russia, Sutskever emigrated to Canada as a child and developed an early passion for artificial intelligence. He completed his PhD at the University of Toronto under Geoffrey Hinton, one of the founding figures of deep learning, where he focused on understanding the principles governing neural network training and optimisation.
Sutskever’s seminal work on scaling laws, conducted whilst at OpenAI alongside researchers including Jared Kaplan, revealed that model performance follows predictable power-law relationships with respect to compute, data, and model size.3 These discoveries fundamentally changed how the field approaches model development. Rather than viewing larger models as inherently better, Sutskever’s work demonstrated that the efficiency with which a model uses its parameters matters profoundly.
His research established that progress in AI is not merely about building bigger models, but about understanding and optimising the relationship between parameters and capability-the very essence of model density. Sutskever’s theoretical contributions directly enabled the concept of capability density, as researchers could now quantify how much “effective” capacity a model possessed relative to its actual parameter count. His work demonstrated that architectural innovations, superior training algorithms, and higher-quality data could yield models that achieve better performance with fewer parameters, validating the principle that density-not size-drives progress.
Sutskever’s influence extends beyond scaling laws to shaping how the entire field conceptualises model efficiency. His emphasis on understanding the mathematical principles underlying neural network training rather than pursuing brute-force scaling has become increasingly relevant as computational costs and environmental concerns make parameter efficiency paramount. In this sense, model density represents the practical realisation of Sutskever’s theoretical insights: the recognition that intelligent design and efficient parameter utilisation outweigh raw computational scale.
References
1. https://dentro.de/ai/blog/2025/12/20/the-great-squeeze—understanding-llm-information-density/
2. https://www.geekytech.co.uk/semantic-density-and-its-impact-on-llm-ranking/
3. https://research.aimultiple.com/llm-scaling-laws/
4. https://fin.ai/research/we-dont-need-higher-peak-intelligence-only-more-intelligence-density/
5. https://www.cognizant.com/us/en/ai-lab/blog/semantic-density-demo
6. https://www.educationdynamics.com/ai-density-in-search-marketing/
7. https://pub.towardsai.net/the-generative-ai-model-map-fff0b6490f77

Polls
What determines your success?
We need your help!
We’re testing how people think about their successes and failures. It would be great if you would take 2 minutes to give a simple multiple choice answer and share the poll with your friends.
Take the poll – “What determines your success?”
Then read So You Think You’re Self Aware?
Services
Global Advisors is different
We help clients to measurably improve strategic decision-making and the results they achieve through defining clearly prioritised choices, reducing uncertainty, winning hearts and minds and partnering to deliver.
Our difference is embodied in our team. Our values define us.
Corporate portfolio strategy
Define optimal business portfolios aligned with investor expectations
BUSINESS UNIT STRATEGY
Define how to win against competitors
Reach full potential
Understand your business’ core, reach full potential and grow into optimal adjacencies
Deal advisory
M&A, due diligence, deal structuring, balance sheet optimisation
Global Advisors Digital Data Analytics
14 years of quantitative and data science experience
An enabler to delivering quantified strategy and accelerated implementation
Digital enablement, acceleration and data science
Leading-edge data science and digital skills
Experts in large data processing, analytics and data visualisation
Developers of digital proof-of-concepts
An accelerator for Global Advisors and our clients
Join Global Advisors
We hire and grow amazing people
Consultants join our firm based on a fit with our values, culture and vision. They believe in and are excited by our differentiated approach. They realise that working on our clients’ most important projects is a privilege. While the problems we solve are strategic to clients, consultants recognise that solutions primarily require hard work – rigorous and thorough analysis, partnering with client team members to overcome political and emotional obstacles, and a large investment in knowledge development and self-growth.
Get In Touch
16th Floor, The Forum, 2 Maude Street, Sandton, Johannesburg, South Africa
+27114616371
