- Huang's Announcement: A Market That Didn't Exist Yet
- AI agents need dedicated hardware for several reasons, primarily related to the computational demands of artificial intelligence models: * **Processing Power:** AI models, especially deep learning models, require massive amounts of parallel processing. This is needed for tasks like matrix multiplications, which are fundamental to neural networks. General-purpose CPUs are not optimized for this type of highly parallel computation. * **Specialized Architectures:** Graphics Processing Units (GPUs) were initially designed for rendering graphics, which involves a lot of parallel processing. Their architecture proved highly effective for the parallel computations required by AI. More recently, specialized hardware like Tensor Processing Units (TPUs) and Neural Processing Units (NPUs) have been developed specifically for AI workloads, offering even greater efficiency and speed. * **Memory Bandwidth:** AI models often involve large datasets and complex computations that require fast access to memory. Dedicated AI hardware typically has higher memory bandwidth, allowing for quicker data transfer between the processing units and memory, which speeds up training and inference. * **Energy Efficiency:** Performing complex AI calculations on general-purpose hardware can be very energy-intensive. Specialized AI hardware is designed to perform these computations more efficiently, consuming less power and generating less heat. This is crucial for everything from large data centers to mobile devices. * **Real-time Performance:** For applications like autonomous vehicles, robotics, or real-time AI assistants, the ability to process information and make decisions instantaneously is critical. Dedicated hardware can significantly reduce latency and enable these real-time capabilities. * **Scalability:** As AI models become larger and more complex, they require increasingly powerful hardware. Dedicated AI hardware is often designed with scalability in mind, allowing users to combine multiple units to handle more demanding tasks. In essence, dedicated hardware is built to accelerate the specific types of computations that AI workloads require, making AI agents faster, more efficient, and capable of handling more complex tasks than they could on standard computer hardware.
- Immediate impact on the cloud market and providers
- What should Italian B2B SMEs do now
- The work in progress: what Nvidia still hasn't said
- 18-Month Outlook: What to Expect by the End of 2027
Jensen Huang, the CEO of Nvidia, has publicly announced that he has identified a completely new market: CPUs designed specifically for AI agents. The estimate is $200 billion. This is a strong signal for the entire tech industry.
However, the news is not just about the big players. In fact, Italian B2B SMEs considering investments in AI infrastructure need to understand how this paradigm shift will affect costs, availability, and architectures in the next 18-24 months. Consequently, the infrastructural choices made today could prove to be premature or, conversely, strategically advantageous.
In this article, we at SHM Studio Let's analyze what has changed with this announcement, what immediate impact is expected on the market, and what operational moves are advisable for medium-sized Italian companies operating in the B2B or advanced retail sector. Therefore, this reading is recommended for those who manage technology budgets or oversee corporate digitalization projects.
Huang's Announcement: A Market That Didn't Exist Yet
During a public event in May 2026, Jensen Huang stated that he had identified a «completely new» market for Nvidia. This is not about GPUs, the company's traditional playing field. It's about CPUs specifically designed to support AI agents. The estimated figure communicated is $200 billion. According to TechCrunch, Huang described this segment as distinct and complementary to Nvidia's current offering.
Therefore, the statement is not just a financial projection. It is a precise strategic signal: AI agents require a different type of processing compared to pure generative models. They need CPUs optimized for low latency, sequential reasoning, and the orchestration of autonomous tasks.
Furthermore, this positioning pushes Nvidia to compete directly with Intel and AMD on previously unclaimed territory. The enterprise CPU market is already worth hundreds of billions. Adding the AI agent component completely redefines it.
AI agents need dedicated hardware for several reasons, primarily related to the computational demands of artificial intelligence models: * **Processing Power:** AI models, especially deep learning models, require massive amounts of parallel processing. This is needed for tasks like matrix multiplications, which are fundamental to neural networks. General-purpose CPUs are not optimized for this type of highly parallel computation. * **Specialized Architectures:** Graphics Processing Units (GPUs) were initially designed for rendering graphics, which involves a lot of parallel processing. Their architecture proved highly effective for the parallel computations required by AI. More recently, specialized hardware like Tensor Processing Units (TPUs) and Neural Processing Units (NPUs) have been developed specifically for AI workloads, offering even greater efficiency and speed. * **Memory Bandwidth:** AI models often involve large datasets and complex computations that require fast access to memory. Dedicated AI hardware typically has higher memory bandwidth, allowing for quicker data transfer between the processing units and memory, which speeds up training and inference. * **Energy Efficiency:** Performing complex AI calculations on general-purpose hardware can be very energy-intensive. Specialized AI hardware is designed to perform these computations more efficiently, consuming less power and generating less heat. This is crucial for everything from large data centers to mobile devices. * **Real-time Performance:** For applications like autonomous vehicles, robotics, or real-time AI assistants, the ability to process information and make decisions instantaneously is critical. Dedicated hardware can significantly reduce latency and enable these real-time capabilities. * **Scalability:** As AI models become larger and more complex, they require increasingly powerful hardware. Dedicated AI hardware is often designed with scalability in mind, allowing users to combine multiple units to handle more demanding tasks. In essence, dedicated hardware is built to accelerate the specific types of computations that AI workloads require, making AI agents faster, more efficient, and capable of handling more complex tasks than they could on standard computer hardware.
An AI agent is not a chatbot. It is an autonomous system capable of planning, executing, and correcting actions in sequence. Therefore, the computational load is structurally different from that of a model that responds to single prompts.
GPUs excel at massive parallelism. However, AI agents often work serially: they read context, decide, act, and verify. This flow requires CPUs with large caches, low latency, and efficient management of context memory. In particular, architectures like those described in recent reports from Gartner on enterprise AI confirm that 2026 will mark a turning point in the adoption of agents in structured business processes.
Consequently, Nvidia is anticipating demand that B2B SMEs will begin to express in the next 12-18 months: cloud or on-premise optimized hardware not for training models, but for running agents in production.
Immediate impact on the cloud market and providers
The announcement has direct implications for major cloud providers. AWS, Google Cloud, and Microsoft Azure will need to update their compute instance offerings to include CPUs optimized for agent workloads. This process takes time. Therefore, in the short term, SMEs already operating on public clouds will not see immediate changes.
However, prices for AI-oriented instances may face downward pressure in the medium term. In fact, Nvidia's entry into this segment increases competition. Similarly, hybrid infrastructure vendors will need to update their product roadmaps.
According to the analysis of McKinsey on the AI market, companies that invest in appropriate AI infrastructure see operational returns that are 20–30% higher than those that adopt generic solutions. This figure becomes even more significant when it comes to deploying agents at scale.
What should Italian B2B SMEs do now
First of all, it is necessary to distinguish between two business scenarios. The first concerns SMEs that are still evaluating whether to adopt AI tools. The second concerns those that have already started pilot projects or production deployments.
For companies in the first group, the Nvidia announcement suggests not rushing into autonomous hardware investments. In fact, the ecosystem is being redefined. Relying on managed cloud solutions remains the most flexible choice for the next 12 months. Services from AI Consulting by SHM Studio include a thorough evaluation of the most suitable architecture for the specific business context.
For companies in the second group, however, it is advisable to actively monitor cloud provider roadmaps. Following the Nvidia announcement, AWS and Google Cloud will update their agent-optimized instance offerings. Therefore, those with active projects should plan for an infrastructural review by the end of 2026.
In addition, B2B SMEs that use CRM, ERP, or marketing automation platforms should check whether their vendors are integrating agent-based capabilities. This directly impacts strategies for digital marketing and of LinkedIn campaign based on advanced automation.
The work in progress: what Nvidia still hasn't said
Huang's announcement is intentionally high-level. No product names, launch dates, or technical specifications were shared. Therefore, the estimate of 200 billion remains a total addressable market projection, not a stated revenue plan.
Several questions remain open. How will this new CPU line position itself against the ARM architecture, which currently dominates AI inference chips? What will the licensing models be for cloud providers? To what extent will SMEs be able to access these resources without enterprise intermediaries?
Nevertheless, the strategic message is clear. Nvidia is building a vertically integrated ecosystem that spans from training GPUs to agent execution CPUs. This vertical integration is comparable to what Apple has achieved with its M-series chips in the consumer segment. The implications for the enterprise market are significant.
For those managing projects web development with built-in AI components, or campaigns Google Ads optimized by predictive algorithms, understanding the evolution of underlying hardware is not an academic exercise. It is a concrete operational variable.
18-Month Outlook: What to Expect by the End of 2027
By the first half of 2027, we can reasonably expect Nvidia to make its first concrete product announcements in this segment. Similarly, major cloud providers will integrate CPU-agent instances into their managed offerings. The market will begin to differentiate pricing between generative workloads and agent-based workloads.
For Italian SMEs, this means that infrastructural decisions made today will have a direct impact on competitiveness in 2027-2028. In particular, those who begin now to structure workflows based on AI agents—even in a simple form, through no-code or low-code platforms—will have an advantage when dedicated hardware becomes available at affordable costs.
Finally, it's worth remembering that the adoption of AI agents isn't just about IT. It impacts the SEO strategy, the production of contents, the management of advertising campaigns, and even website design. At SHM Studio We are already guiding small and medium-sized businesses through this strategic adaptation process. Those who wish to learn more can consult our section blog or contact us directly from the page contacts.
Related articles
Discover other articles that explore similar topics in depth, selected to give you a more complete and stimulating view. Each piece of content is carefully chosen to enrich your experience.