Solutions
SOLUTIONS
INSIGHTS
NEWS AND EVENTS
RESEARCH | 3 POINT PERSPECTIVE
Contributors: Tom O'Malley, Ross Sandler, Trevor Young
09 Dec 2024
GO TO SECTION
Since ChatGPT’s debut in November 2022, AI has experienced unprecedented growth: ChatGPT and Meta AI have now gained more than 300 million weekly active users. Other ‘frontier’ or cutting-edge models have also shown promise.
Our Research team sees three distinct eras emerging. While chatbots and copilots may still feel new, the text-based Chatbot Era (2023-2024) is about to be superseded: the next wave of AI product innovation is nearly here.
It’s important to note that while AI may be making the headlines, broader adoption remains limited, and enterprise use is largely confined to niche applications such as code writing and customer service. The industry has experimented with various business models, but a clear ‘killer app’ has yet to emerge. Most financial gains so far have gone to hardware providers and foundational AI infrastructure.
While chatbots and copilots should continue to grow, the next AI period is poised to begin. The Agent Era (2025-2026) is likely to be defined by AI systems that can complete end-to-end tasks autonomously, in contrast to the previous chatbot and copilot stage of AI. These agents are likely to bring about meaningful consumer and enterprise AI adoption with far more accurate and powerful responses. Agents are expected to boost enterprise productivity and help consumers with everyday tasks such as completing transactions.
As the quality and performance of AI applications increases by embedding agents, our Research team predicts that consumer AI adoption should hit one billion daily active users (DAU) by 2026. However, significant challenges remain if this target is to be achieved, including reducing errors and improving memory. There are some concerns that the concept of ‘scaling laws’, which posits that AI models improve with more compute and data, may be plateauing, potentially signaling leveling out in the growth and pace of AI advancement. However, AI insiders dispute this perspective.
Looking further ahead, the AI Employee and Consumer Robot Era (2027-2028) envisions AI evolving into digital employees for enterprises and humanoid robots for consumers. These advancements could reshape industries by reducing human involvement in tasks and increasing efficiency. Digital employees and humanoid robots are still in early development and likely years away from widespread adoption.
ChatGPT and other consumer text-based tools have achieved significant user adoption in a short time, and AI is starting to proliferate in industries and sectors where a clear return on investment exists, such as marketing, analytics and engineering
As we move on to the next AI eras, Barclays Research expects huge growth. While consumer AI use should continue to grow rapidly, enterprise is where the action is likely to be. Our analysts estimate that there are around 7 billion tasks that AI agents could execute independently, enabling entire teams and organisations to increase system-wide productivity.
While it will take time for AI systems to gain ground, the pace of adoption is likely to accelerate rapidly: starting with just 2.5% of enterprise tasks in 2025, AI could capture 5% of this market by 2026 and could add an additional 250 to 500 basis points of task adoption annually.
As AI adoption grows, so should the demand for computational resources such as graphics processing units (GPUs) and application-specific integrated circuits (ASICs). These are essential for training models and putting them into operation to make predictions or draw conclusions based on unfamiliar data (a process known as inference).
Our Research team expects demand for GPUs and ASICs will be 250% higher than consensus forecasts in 2025 and rise to 14 times current levels by 2027. This surge will be partly driven by the proliferation of AI agents, which require significant compute power, as each human query is autonomously passed through an AI platform multiple times. For example, even a modest adoption of AI agents for 2% of tasks in 2025 could increase inference compute needs by 15%.
While demand is increasing rapidly, costs are falling fast. Inference unit costs per 1 million tokens (the smallest unit of data that is processed by algorithms) have fallen by 90% in the past 18 months and are expected to continue dropping.
Lower AI costs are driven by factors such as algorithmic improvements, competition and a shift from GPUs to custom ASICs for inference workloads. Major AI players like Amazon, Meta, Google and Microsoft, known as hyperscalers, are developing expensive custom ASICs for training and AI inference which could outperform GPUs for specific tasks on price/performance.
Barclays Research estimates that GPUs currently handle 80% of the inference market, but their share is expected to drop to around 50% by 2028 as hyperscalers’ custom ASICs gain market share.
"As the cost of compute declines, AI consumption picks up. This is somewhat counterintuitive relative to economic theory. Thus far, the reduction in unit cost has not led to over-capacity, but much greater usage. Cheaper compute leads to more products and faster adoption. "
The capital investment required for AI remains immense. Chip-related capital expenditure for consumer AI inference alone is expected to approach $120 billion in 2026 and exceed $1.1 trillion by 2028.
However, falling costs are expected to significantly improve AI's unit economics. According to our Research team’s analysis, some leading AI companies are already profitable on a per-model basis (excluding future model development costs). As model development slows in the next 5–10 years and inference costs decrease, profits could grow substantially. This suggests current concerns about losses and high burn rates may be overstated.
What's Next in AI? A Framework for Thinking About Inference Compute
About the experts
Ross Sandler
Managing Director, Senior Equity Research Analyst
Tom O'Malley
Director, Semiconductor Research Analyst
Trevor Young, CFA
Director, Internet Research Analyst
* We acknowledge and agree for Barclays to collect, use and otherwise process our/the Relevant Individual's Information in accordance with the Notice, other effective privacy terms and information processing terms agreed by ourselves/the Relevant Individual with Barclays, for the purposes set out therein, respectively.
* We acknowledge and agree that Barclays may disclose to any third party described in the Notice as a potential recipient of data outside mainland China our.the Relevant Individual's Information in accordance with the Notice, other effective privacy terms and personal information processing terms agreed by ourselves/the Relevant Individual with Barclays, and for the purposes set out therein, respectively.
I consent to my email address being used by Barclays to provide me with personalized advertisements on third-party websites and social media platforms, as described in our Privacy Notice.
An email was sent to you at the address provided. Complete your subscription by clicking the link provided to verify your email address.
Sorry there was a problem. Unfortunately your subscription to our newsletter has encountered an error.
In addition to the cookies we use on our website, we also use cookies and similar technologies in some emails and push notifications. These help us to understand whether you have opened the email and how you have interacted with it. If you have enabled images, cookies may be set on your computer or mobile device. Cookies will also be set if you click on any link within the email.
Please review and manage your email cookie settings below. For more information, please read our Cookie Policy. Please select 'Save and Subscribe' below to remember your email cookie preferences and subscribe to the newsletter.